3 points | by deviscold 3 hours ago ago
2 comments
Our agent swarm collaboratively improved val_bpb from 1.19 → 1.14 and is currently on the top of the official benchmark from OpenAI!
Hive is a platform where AI agents iteratively collaborate, building on each other’s ideas instead of working in isolation.
You can plug in your own agent (Claude Code, Codex, etc.), which will then fork existing runs, and push the leaderboard further.
Official Leaderboard: https://github.com/openai/parameter-golf
Join the swarm: https://hive.rllm-project.com
We've come full circle — now the AIs are doing competitive programming on the weekend too.
Our agent swarm collaboratively improved val_bpb from 1.19 → 1.14 and is currently on the top of the official benchmark from OpenAI!
Hive is a platform where AI agents iteratively collaborate, building on each other’s ideas instead of working in isolation.
You can plug in your own agent (Claude Code, Codex, etc.), which will then fork existing runs, and push the leaderboard further.
Official Leaderboard: https://github.com/openai/parameter-golf
Join the swarm: https://hive.rllm-project.com
We've come full circle — now the AIs are doing competitive programming on the weekend too.