SKILLS
Feature engineering, cleaning, and transformation datasets with Pandas, NumPy, Matplotlib.
Traditional algorithms (RF, KNN, SVM, XT) and dimensionality reduction (PCA, t-SNE). Sklearn models (XGB, LGBM, Cat) and competition techniques like blending, stacking.
PyTorch, common models (CNN, RNN, ResNet, LSTM), graph neural network (GCN, GAT, building graph with PyG), and including pre-training, transfer learning.
Hands-on experience with LLMs (Llama3, BERT) Text Classification and Generation .
Knowledge of PEFT and SFT, utilize quantization and fine-tune (QLoRA) LLMs for acceleration.
NLP (NLTK, HMM, Word2Vec), image processing (detection, segmentation).
ARIMA/DARIMA, differencing, ACF, and PACF analysis for Time Series forecasting.
Familiar with Linux systems, Git, MySQL.
Built online services with Django, HTML, CSS, and JavaScript.
WORK EXPERIENCE
Data analyst
SQL, PowerBI, EXCEL

Dec 2023 - Pres.
- Process, analyze, and summarize raw data to extract meaningful insights.
- Collaborate with multiple teams to improve data-driven decision-making.
- Assist team leads in root cause analysis to identify and resolve key business issues.
Management Trainee
Project Management

Aug 2020 - Nov 2022
- Conduct market analysis to identify target and trends.
- Plan, implement, and evaluate programs.
Internship in Financial department
Financial Modeling

Dec 2019 - Feb 2020
- Evaluate monthly pension reports, manage daily documents, and maintain financial records.
PROJECT
GCN, Llama-3-8B, BERT
# Github Stars: 0
Jul 2024
- Enhancing Text Classification with Meta-Llama-3-8B-Augmented BertGCN.
- Hugging Face's bitsandbytes and transformers for model quantization enhances efficiency.
- Address data scarcity and boost robustness.
Rank 820/3858
#Score:0.96436
Dec 2024
- Enhanced 2% performance through feature selection, interaction features, and categorical encoding.
- Ensemble CatBoost, LightGBM, and XGBoost, utilize hyperparameter optimization, 8% improve.
- Applied Blending and Stacking to improve generalization, achieving 2% performance gain.
EDUCATION
M.S. Computer Science
Oct 2022 - Oct 2024.GPA: 3/4.0

M.S. Computer Science
GPA: 3/4.0

BSc Mathematics - ESI Top 1%
Sep 2016 - Jul 2020GPA: 3/4.0

BSc Mathematics - ESI Top 1%
GPA: 3/4.0

Visiting student
Winter semester, 2018GPA: 3/4.0

Visiting student
GPA: 3/4.0

SKILLS
Feature engineering, cleaning, and transformation datasets with Pandas, NumPy, Matplotlib.
Traditional algorithms (RF, KNN, SVM, XT) and dimensionality reduction (PCA, t-SNE). Sklearn models (XGB, LGBM, Cat) and competition techniques like blending, stacking.
PyTorch, common models (CNN, RNN, ResNet, LSTM), graph neural network (GCN, GAT, building graph with PyG), and including pre-training, transfer learning.
Hands-on experience with LLMs (Llama3, BERT) Text Classification and Generation .
Knowledge of PEFT and SFT, utilize quantization and fine-tune (QLoRA) LLMs for acceleration.
NLP (NLTK, HMM, Word2Vec), image processing (detection, segmentation).
ARIMA/DARIMA, differencing, ACF, and PACF analysis for Time Series forecasting.
Familiar with Linux systems, Git, MySQL.
Built online services with Django, HTML, CSS, and JavaScript.
WORK EXPERIENCE
Data analyst
SQL, PowerBI, EXCEL

Dec 2023 - Pres.
- Process, analyze, and summarize raw data to extract meaningful insights.
- Collaborate with multiple teams to improve data-driven decision-making.
- Assist team leads in root cause analysis to identify and resolve key business issues.
Management Trainee
Project Management

Aug 2020 - Nov 2022
- Conduct market analysis to identify target and trends.
- Plan, implement, and evaluate programs.
Internship in Financial department
Financial Modeling

Dec 2019 - Feb 2020
- Evaluate monthly pension reports, manage daily documents, and maintain financial records.
PROJECT
GCN, Llama-3-8B, BERT
# Github Stars: 0
Jul 2024
- Enhancing Text Classification with Meta-Llama-3-8B-Augmented BertGCN.
- Hugging Face's bitsandbytes and transformers for model quantization enhances efficiency.
- Address data scarcity and boost robustness.
Rank 820/3858
#Score:0.96436
Dec 2024
- Enhanced 2% performance through feature selection, interaction features, and categorical encoding.
- Ensemble CatBoost, LightGBM, and XGBoost, utilize hyperparameter optimization, 8% improve.
- Applied Blending and Stacking to improve generalization, achieving 2% performance gain.
EDUCATION
M.S. Computer Science
Oct 2022 - Oct 2024.GPA: 3/4.0

M.S. Computer Science
GPA: 3/4.0

BSc Mathematics - ESI Top 1%
Sep 2016 - Jul 2020GPA: 3/4.0

BSc Mathematics - ESI Top 1%
GPA: 3/4.0

Visiting student
Winter semester, 2018GPA: 3/4.0

Visiting student
GPA: 3/4.0
