Hi, I'm Weihang Gao
AI Engineer & Data Scientist specializing in Machine Learning, LLM Development, and Data Engineering.
I design and deploy scalable ML pipelines, fine-tune Large Language Models, and extract actionable insights from complex datasets.
Illustration by OpenArt
Key Expertise
Machine Learning & Deep Learning
Building and training scalable ML/DL models using PyTorch, TensorFlow, and Scikit-learn.
Large Language Models (LLMs)
Fine-tuning, prompting, and deploying LLMs using Hugging Face, self-supervised learning, and AWS SageMaker.
Data Engineering & MLOps
Designing efficient data pipelines and integrating ML models into production with AWS and GCP.
Data Analysis & Visualization
Extracting insights from complex datasets using Pandas, NumPy, and visualizing results via Matplotlib, Seaborn, and Highcharts.
Featured Projects
Wi-Fi Foundation Model Pipeline
End-to-end pipeline on AWS S3 + SageMaker Studio to train domain-specific models using Transformer & SSL techniques.
View More →
Motion Classification (Deep Learning)
Classified human, pets, robot, and fan motion using ViT, ResNet, and Bi-LSTM on Wi-Fi sensing data.
View More →
Affiliation Name Disambiguation
Refactored NLP pipeline using BERT and clustering to clean scholarly affiliation names in IEEE datasets.
View More →