Wi-Fi Foundation Model Pipeline
Designed and implemented a large-scale data management pipeline on AWS S3 and SageMaker Studio to support training of a Wi-Fi domain-specific foundation model. The model uses a Transformer architecture trained with self-supervised objectives, including a contrastive loss and input masking. Led data ingestion, quality control, versioning, and orchestration of large-scale training jobs. Refactored the pipeline, improving reproducibility and reducing training costs.
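
For readers unfamiliar with the two self-supervised objectives named above, here is a minimal, dependency-free sketch. It is illustrative only and is not the project's code: the function names, the `MASK` placeholder, the masking ratio, and the temperature are all assumptions, and the contrastive objective is shown in its common InfoNCE form, which may differ from what was actually used.

```python
import math
import random

MASK = None  # hypothetical placeholder marking a hidden position


def mask_sequence(seq, mask_ratio, rng):
    """Hide a random subset of positions; return the masked copy and the hidden indices."""
    n_mask = max(1, round(len(seq) * mask_ratio))
    hidden = sorted(rng.sample(range(len(seq)), n_mask))
    masked = list(seq)
    for i in hidden:
        masked[i] = MASK
    return masked, hidden


def cosine(u, v):
    """Cosine similarity of two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm


def info_nce(anchors, positives, temperature=0.1):
    """Mean InfoNCE loss: anchors[i] should match positives[i] and no other row."""
    losses = []
    for i, anchor in enumerate(anchors):
        logits = [cosine(anchor, p) / temperature for p in positives]
        top = max(logits)  # subtract the max for a numerically stable log-sum-exp
        log_denom = top + math.log(sum(math.exp(x - top) for x in logits))
        losses.append(log_denom - logits[i])
    return sum(losses) / len(losses)
```

In a masking objective the model is asked to predict the values at the hidden indices from the remaining context. In the contrastive objective the loss is low when each anchor embedding is closest to its own positive and high when it is closer to another sample's.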