DABstep Reasoning Benchmark Leaderboard
Implement test-time compute scaling for math problems
Display a React app with TypeScript