Hi, I am Yang

Yang Hu

Graduate student at New York University

I am a MS Computer Science student at the Tandon School of Engineering at New York University. I hold a BS from UCSB with a double major in Mathematics and Statistics & Data Science.

With a foundation in mathematical and statistical modeling, my research interests lie in leveraging machine learning and deep learning models for innovative solutions. I am experienced in modeling and extracting meaningful insights from complex, large-scale datasets across a diverse range of domains, including business, finance, sports, environmental science, and game industry, among others.

I am actively looking for 2026 New Graduate opportunities in Machine Learning, Software Engineering, or Quantitative Research roles!

Skills

Experiences

1
Abama Private Fund Investment Management Co., Ltd.

July. 2025 - Aug. 2025

Shanghai, China

Quantitative Research Intern

July. 2025 - Aug. 2025

Responsibilities:
  • Designed a parallelized data pipeline to transform raw event-level financial data into 300+ engineered featuresto feed high-frequency trading ML models, cutting data preparation from 8 h to 15 min.
  • Delivered a live-updating factor store database powering large-scale model training and backtests; increased experiment throughput by ∼ 10× and enabled apples-to-apples model comparison during strategy reviews.
  • Fine-tuned Transformers for trading signal prediction, improving annualized return by 3% in backtests.
  • Built an automated feature-discovery framework that iteratively surfaces noise-robust, signal-strengthening factors, accelerating model iteration and reducing manual screening.

Unity

May. 2024 - Aug. 2024

Shanghai, China

R&D Software Engineering Intern

May. 2024 - Aug. 2024

Responsibilities:
  • Scaled the RAG knowledge base for Unity’s AI assistant via community and forum ingestion and LLM-based quality filtering; expanded answer coverage for Editor/API issues by ∼ 10×, improving first-response resolution.
  • Built MuseBench, an evaluation platform that pairs an LLM-as-a-judge with a curated benchmark of Q&A pairs, enabling consistent scoring of RAG results across different model versions and cutting evaluation time by 95%.
  • Deployed a local-LLM pipeline to parse 100k+-line Unity Cloud Build logs and extract root-cause signals; reduced triage from hours to minutes and accelerated time-to-fix for recurring build failures.
2

3
The Waves Lab at UCSB

Jun. 2023 - Mar. 2024

Santa Barbara, California

Advisor: Prof. Kelly Caylor | Graduate Advsior: Anna Boser

Undergraduate Researcher

Sep. 2023 - Mar. 2024

Responsibilities:
  • Developed a novel Segment-Then-Classify Strategy leveraging the Segment Anything Model and Vision Transformer for instance segmentation in remote sensing, reducing manual labeling and training costs.
  • Applied multiple time series classification models on Sentinel satellite imagery to achieve pixel-level semantic segmentation, enhancing the detection accuracy of smallholder irrigation fields.
Bren Leaders and Internship Program

Jun. 2023 - Sep. 2023

Responsibilities:
  • Implemented an instance segmentation model for automated mapping of center-pivot irrigation systems in Sub-Saharan Africa for a deeper understanding of irrigation adoption and its impacts in the region.

Data Science & Engineering School at East China Normal University

Jun. 2022 - Jun. 2023

Shanghai, China

Advisor: Prof. Ding-jiang Huang

Undergraduate Research Assistant

Aug. 2022 - Jun. 2023

Responsibilities:
  • Co-authored the ACM MM ‘24 paper :’Compacter: A Lightweight Transformer for Image Restoration’, achieving state-of-the-art PSNR performance across different Image Restoration tasks with ~ 50% - 65% fewer parameters.
  • Co-designed Compact Adaptive Self-Attention, enabling omnidirectional spatial–channel information flow through cross-modulation of global context to strengthen long-range dependencies while preserving local detail.
  • Proposed a Dual Selective Gated Module that dynamically injects global context into each pixel for contextadaptive aggregation, amplifying informative features and suppressing noise.
  • Built the PyTorch training/benchmarking pipeline and ran ablation experiments, enabling reproducible results and efficient comparison to baselines.
Summer Intern

Jun. 2022 - Aug. 2022

Responsibilities:
  • Conducted literature reviews on Image Super-Resolution deep learning models.
4

Education

M.S. in Computer Science
GPA: 3.9 out of 4
Taken Courses:
  • Design and Analysis of Algorithms
  • Web Search Engines
  • Computer Networking
  • Operating Systems
  • Principles of Database Systems
  • Software Engineering
  • Blockchain and Distributed Ledger Technology
  • Big Data
  • Network Security
B.S. in Mathematics / Statistics and Data Science
Taken Courses:
  • CS | Data Structures and Algorithms
  • CS | Algorithms Engineering
  • CS | Deep Learning
  • DS | Statistical Machine Learning
  • DS | Statistical Data Science
  • DS | Big Data Analytics
  • STAT | Probability and Statistics A-B-C
  • STAT | Time Series
  • STAT | Regression Analysis
  • STAT | Applied Stochastic Processes
  • STAT | Design and Analysis of Experiments
  • MATH | Linear Algebra A-B
  • MATH | Real Analysis A-B
  • MATH | Differential Geometry
  • MATH | Non-euclidean Geometry
  • MATH | Abstract Algebra A-B-C
  • MATH | Number Theory
  • FINANCE | Financial Mathematics
  • FINANCE | Mathematics of Fixed Income Markets
Extracurricular Activities:
  • Chinese Students and Scholars Association (CSSA) Lead event planner
  • Data Science Club
  • Snow Club
  • Intramural soccer player
No. 2 High School Attached to East China Normal University
2017 - 2020
Secondary School

Projects

VocationalNYC
Jan. 2025 - May 2025
Information Retrieval System & Retrieval-Augmented Generation
Sep. 2024 – Dec. 2024
Semantic Segmentation by Pixel-level Time Series Classification
Nov. 2023 - Mar. 2024
Few-shot Instance Segmentation for Remote Sensing
Few-shot Instance Segmentation for Remote Sensing
First author Jun. 2023 - Oct. 2023
Time Series Forecasting of U.S. Candy Production
Time Series Forecasting of U.S. Candy Production
Time Series Course Project Sep. 2023 - Nov. 2023
Efficient Visual Attention Design for Image Super-Resolution
Efficient Visual Attention Design for Image Super-Resolution
AAAI 2024 submission Mar. 2022 - May 2023
Digitizing Handwritten Data with OCR
Oct. 2023 - Nov. 2023
Soccer player transfer market value prediction
Soccer player transfer market value prediction
Machine Learning Course Project Sep. 2022 - Jan. 2023
Exploring SO(3) through the lens of Orbifolds
Exploring SO(3) through the lens of Orbifolds
UCSB MATH Directed Reading Program Jan. 2023 - Mar. 2023
Differential Geometry Calculator