Build real‑time automatic speech recognition (ASR) and intent recognition systems for Tesla products, integrating speech into embodied agents and on‑device AI. Role sits within Tesla's Core AI group.
Responsibilities
Develop robust, real‑time ASR systems for noisy, dynamic environments
Train and fine‑tune end‑to‑end speech models tailored for embodied agents
Build intent recognition pipelines that connect audio input to semantic understanding and control
Collaborate with perception and control teams to integrate speech into the sensorimotor stack
Deploy models on custom Tesla hardware, optimizing for latency, memory, and robustness
Benefits
Competitive Tesla compensation; Glassdoor data suggests total comp for ML Engineers in Palo Alto commonly in the ~$184K–$261K range depending on levelStandard Tesla benefits (health, equity, etc.; details not fully specified in this posting)