Work and Service

Work Experience

AI Research Internship, Data & Data Science Group, Sanofi,S.A., 06/2023 ~ 09/2024, Boston, MA

Mentor: Wei Zhao and Bozhao Qi

  • Endoscopic Scoring and Localization in Unconstrained Clinical Trial Videos (Accepted at WACV2025)
    • Collected a comprehensive clinical endoscopy dataset through human-in-the-loop active learning, combining video cleaning and weak labeling by doctors to produce a large and balanced dataset (~49.9 hours, ~86,423 clips) for endoscopic mayo scoring
    • Developed an end-to-end foundational model for endoscopic scoring and localization, incorporating a ViT-based scoring model for clip-by-clip assessment and a trajectory model for precise localization, also serving as a vision input for medical multimodal models
    • Introduced a novel personalized Cumulative Disease Score (CDS) based on movement, surpassing the limitations of prior time-based CDS
  • Data-driven Neural ODE (DN-ODE) Modeling for Clinical Trails Breast Cancer Tumor Dynamics (Accepted at Neurips DLDE, CBM)
    • Developed a breast cancer pharmacodynamics model using neural ODEs, trained on phase 1/2 Amcenestrant clinical trials and validated in phase 3, which predicts tumor size and progression-free survival (PFS) from initial patient status
    • Offered interpretable results and visualizations of tumor dynamics, serving as a robust reference for clinical trial design by identifying potential responders and patients at risk of cancer progression
  • Generative AI & Retrieval-Augmented Generation (RAG) web-based Backend
    • Built a chat-based tool for researchers and doctors to review clinical trial documents and interactively find related answers.
    • Developed AWS Lambda backends integrating OpenAI’s ChatGPT API for query embeddings and Pinecone for similarity search.
    • Implemented automated tests with SonarCloud checks, ensuring high-quality code through GitHub Actions CI/CD pipelines.
  • Data-Driven Neural-ODE Modeling for Breast Cancer Tumor Dynamics and Progression-Free Probability Predictions
    • Managed sparse data from early phases of clinical trials and predicted later phases with only a single observation
    • Created visualizations depicting various tumor dynamics patterns and achieved accurate predictions (R2 exceeding 0.9)
    • Provided a robust reference for clinical trial design by identifying potential responders and progressed patients
  • Unpaired Tissue Segmentation & Virtual Restaining
    • Proposed an end-to-end unpaired segmentation GAN and virtual staining framework that mimics real staining processes
    • Supervised prior works and presented a promising substitute for traditional histopathology methods for clinical trial recruitment

Machine Learning Engineer Internship, Kernel Lab, 01/2021 ~ 08/2021, Seattle, WA.

Mentor: Dennis Meng, Lead Data Scientist at Kernel Labs Inc

  • Gender classifier: Design a model to classify a speaker’s gender.
  • Echo challenge: Design a Dual-signal Transformation LSTM Network to solve the acoustic echo cancellation (AEC) problems.

Teaching Experience

Teaching Assistant, University of Washington, 01/2020 ~ Now, Seattle, WA.

  • EE 241 Python Programming
  • EEP 596 Recommender Systems, Instructor: Karthik Mohan
  • ME/EE 547 Linear Control Theory, Instructor: Xu Chen

Services

  • Conference Reviews: CIKM, KDD, NeurIPS, AAAI, MICCAI, CLAI, ICLR, ICML, CVPR, WACV, ICMD, ISKE, SAM
  • Workshop: NeurIPS AFT, NeurIPS AID4, NeurIPS DLDE, NeurIPS AI4D
  • Journal Reviews: PLOS ONE, Data Science and Management (DSM), BICM, TNNLS
  • Other Services: Graduate Applicant Support Program (GASP)