Research engineer on the Pre-training team, pushing next-generation language models by improving architectures, data pipelines and large-scale training infrastructure.
Responsibilities
Design and run experiments on model architectures, data, and optimizers
Scale training jobs to thousands of GPUs and improve reliability
Optimize throughput of attention and other model components
Build and maintain large-scale data processing and ETL pipelines
Develop tools and visualizations to inspect model internals
Collaborate with alignment, product and infrastructure teams on shared models
Benefits
Base salary range approximately 340,000–425,000 USD plus equityComprehensive health, dental and vision coverageOptional equity donation matchingGenerous vacation and parental leaveHybrid work with expectation of regular in-office time
Category
Applied Scientist
Ready to Apply?
Applications go directly to Anthropic's career portal