SKILLS

Feature engineering, cleaning, and transformation datasets with Pandas, NumPy, Matplotlib.
Traditional algorithms (RF, KNN, SVM, XT) and dimensionality reduction (PCA, t-SNE). Sklearn models (XGB, LGBM, Cat) and competition techniques like blending, stacking.
PyTorch, common models (CNN, RNN, ResNet, LSTM), graph neural network (GCN, GAT, building graph with PyG), and including pre-training, transfer learning.
Hands-on experience with LLMs (Llama3, BERT) Text Classification and Generation .
Knowledge of PEFT and SFT, utilize quantization and fine-tune (QLoRA) LLMs for acceleration.
NLP (NLTK, HMM, Word2Vec), image processing (detection, segmentation).
ARIMA/DARIMA, differencing, ACF, and PACF analysis for Time Series forecasting.
Familiar with Linux systems, Git, MySQL.
Built online services with Django, HTML, CSS, and JavaScript.

WORK EXPERIENCE

Data analyst
SQL, PowerBI, EXCEL
Capgemini, Warsaw
Dec 2023 - Pres.
  • Process, analyze, and summarize raw data to extract meaningful insights.
  • Collaborate with multiple teams to improve data-driven decision-making.
  • Assist team leads in root cause analysis to identify and resolve key business issues.
Management Trainee
Project Management
Yili Group, Beijing
Aug 2020 - Nov 2022
  • Conduct market analysis to identify target and trends.
  • Plan, implement, and evaluate programs.
Internship in Financial department
Financial Modeling
Chunguang Real Estate Development, Hebei
Dec 2019 - Feb 2020
  • Evaluate monthly pension reports, manage daily documents, and maintain financial records.

PROJECT

GCN, Llama-3-8B, BERT
# Github Stars: 0
Jul 2024
  • Enhancing Text Classification with Meta-Llama-3-8B-Augmented BertGCN.
  • Hugging Face's bitsandbytes and transformers for model quantization enhances efficiency.
  • Address data scarcity and boost robustness.
#Score:0.96436
Dec 2024
  • Enhanced 2% performance through feature selection, interaction features, and categorical encoding.
  • Ensemble CatBoost, LightGBM, and XGBoost, utilize hyperparameter optimization, 8% improve.
  • Applied Blending and Stacking to improve generalization, achieving 2% performance gain.

EDUCATION

M.S. Computer Science
GPA: 3/4.0
University of Łódź
Oct 2022 - Oct 2024.
BSc Mathematics - ESI Top 1%
GPA: 3/4.0
Tiangong University
Sep 2016 - Jul 2020
Visiting student
GPA: 3/4.0
Politechnika Krakowska
Winter semester, 2018