Education

Current

University of California, Berkeley

Master of Engineering, EECS · 09/2025–05/2026

Focus: Data Science & Machine Learning Systems

Graduated 2025

University of California, San Diego

B.S. Data Science · B.S. Mathematics–Computer Science · Minor: Cognitive Science

GPA 3.94 · Provost Honors · Graduated in 3 years

Industry Experience

Software Engineer

Incoming
Uber · 2026

Joining Uber's engineering team as a full-time software engineer.

Backend · Distributed Systems

Software Engineer Intern

Past
Venture Shares · 06/2024–10/2024

Built streaming data pipelines (Kafka, FastAPI, asyncio) processing 1M+ financial records/day. Developed RESTful APIs reducing response latency by 60%. Engineered MongoDB feature store restoring sub-second query performance on millions of records.

Kafka · FastAPI · asyncio · MongoDB · Python

Software Developer Intern

Past
Triton Funds · 06/2023–09/2023

Developed Python ETL pipeline processing 500+ SEC filings per batch. Integrated GPT-based NLP achieving 99% entity extraction accuracy. Built Flask + AWS (EC2, RDS) REST API serving 50+ QPS with p95 latency <200ms.

Python · Flask · AWS · NLP · ETL

Research

ML Engineer · MEng Capstone

Current
Berkeley Teaching AI (TAI) · 08/2025–Present

Building LLM-based file reorganization agents and VLM-augmented slide QA systems for the Berkeley Teaching AI platform.

LLM · VLM · RAG · Python

Research Assistant

Current
Trustworthy Data Management and Analysis Lab · 07/2024–Present

Co-designed Matryoshka, an ML-centric dataset discovery and feature selection system scaling to 100M+ row data lakes using DuckDB and PostgreSQL.

Python · DuckDB · PostgreSQL · ML

Research Assistant

Past
Courchesne-Krak Lab · 06/2024–04/2025

Trained ensemble ML models (Ridge, LSTM) on multi-terabyte ABCD datasets for longitudinal adolescent health prediction; achieved AUC >0.80.

PyTorch · SHAP · Optuna · Plotly

Engineering Research Assistant

Past
Power Transformation Lab · 06/2023–09/2025

Built ML forecasting platform processing 50M+ energy rows across 22 provinces and NLP pipeline analyzing 1,000+ policy documents for China's mid-century power analysis.

Python · XGBoost · BERT · PostgreSQL

Research Assistant

Past
Chinese Academy of Sciences · 06/2023–06/2024

Co-developed Re-NeRF, optimizing high-resolution NeRF training with deformable convolution; improved rendering quality in 4K scenes (PSNR/SSIM).

PyTorch · NeRF · Computer Vision

Teaching

UCSD Math Department

Tutor · 4 quarters

UCSD SPIS

Mentor

Extracurricular

Triple-C Club

Team Member

Technologies I Frequently Use