Chang Gao

🍎 [2024-Now] Foundation Models, Apple Intelligence

🚖 [2020-24] Tech Lead and Senior Engineer, Waymo Perception

🏫 [2014-19] MS in Computer Vision, CMU; BEng in CS, HKU

GitHubInstagramLinkedInEmail

I obtained my B.Eng in Computer Science at the University of Hong Kong and M.S. in Computer Vision at Carnegie Mellon University. My past undergraduate and graduate research focused on Artifical Intelligence, Computer Vision, Deep Learning, and Computer Graphics, with paper published at SIGGRAPH, IEEE GRSL (feat. on cover, 2018), and ACCV (Best Application Paper Award, 2018).

From early 2020 to late 2024, I was a Tech Lead and Senior Software Engineer at Waymo Perception team, shipping multimodal spatial-temporal algorithms and machine learning models to both the autonomous vehicles and the remote cloud services. I also led a part of the Perception system that utilizes large vision language models (VLMs) to improve Waymo's human-robot interaction and safety. I obtained 10+ spot bonuses by launching multiple impactful products and being a strong and caring team player.

In late 2024, I joined Apple. I have been researching, developing and deploying multimodal large language models (MLLM) on various Apple services and devices, also known as Apple Intelligence.

Although I mostly code for closed source projects (400k+ lines of code at Waymo), I also enjoy publishing open source code on GitHub, such as nanoGPT.jax and nanoDiffusion. I have been reviewing AI conference papers at AAAI, WACV, BMVC, etc. I provide LeetCode coaching for free.