Code and data used in the paper: "Training on Incorrect Synthetic Data via RL Scales LLM Math Reasoning Eight-Fold"
ars22 / scaling-llm-math-synthetic-data Goto Github PK
View Code? Open in Web Editor NEWCode and data used in the paper: "Training on Incorrect Synthetic Data via RL Scales LLM Math Reasoning Eight-Fold"
License: MIT License