CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

This research introduces CodeScaler, an execution-free reward model that enhances code LLM training and inference by eliminating runtime execution overhead w...

Level: advanced

By Unknown

Category: research