Infrastructure‑focused role on Perplexity’s AI team, responsible for large‑scale deployment and optimization of LLM inference (Python/Rust/C++, PyTorch, Triton, CUDA, Kubernetes), building APIs and platforms that serve real‑time queries for the answer engine and agents.
Category
MLOps / AI Infrastructure
Posted
11/5/2025