Research scientist/engineer focusing on LLM post-training (SFT, RLHF, reward modeling) for Scale’s partners and internal products. Role centers on data curation, evaluation, and novel methods to improve alignment and generalization of large-scale generative models.
Category
AI Research Scientist