Principle AI Researcher/Engineer - (LLM reinforcement learning)

AMD|Helsinki, FinlandRemote

$99k - $142kEURVerified

Apply Now

AMD Deals Company Profile

Quick Insights

Level

Principal

Pace

Fast research cycles with external deadlines from open‑source and partner ecosystems.

Job Description

You join AMD’s Silo AI Base Models team to lead post‑training for large language models. The focus is turning strong base checkpoints into assistant‑grade models using supervised training, reward models, and reinforcement learning at scale, with a mandate to open‑source code, data, and recipes.

Responsibilities

Design and tune post‑training methods such as SFT, DPO/PPO/GRPO, and related RL variants on large clusters.
Build high‑throughput synthetic‑data pipelines with clear evaluation metrics.
Partner with evaluation teams to define metrics that actually track model quality.
Publish open‑source code, datasets, and training recipes; upstream improvements to TRL and similar frameworks.
Coordinate with pre‑training, infra, and OpenEuroLLM partners on checkpoints, data mix, and long‑context plans.
Shape the roadmap to improve multilingual and low‑resource performance in production models.

Benefits

Remote role based in Finland with hybrid flexibility noted in tags.Indicative salary tags around €99,050 – €141,500 per year.Access to AMD’s broader benefits; exact package referenced as "AMD benefits at a glance" in the posting.

Ready to Apply?

Applications go directly to AMD's career portal

Apply on AMD

View All AI Jobs

Apply Now