Back to AI Lab
ArXiv Paper

MaxCode: A Max-Reward Reinforcement Learning Framework for Automated Code Optimization

Jiefu Ou, Sapana Chaudhary, Kaj Bostrom +4January 12, 2026

Summary

MaxCode treats code optimization as a reinforcement learning search over code edits guided by runtime feedback. It uses natural-language critiques and a reward model to steer generation, beating past systems at speeding up CUDA and C++ kernels.

Related Content