Product Launch
anonymous
5 points
1 comments
Posted2 months agoActive2 months ago
Show HN: New eval from SWE-bench team evalutes LMs based on goals not tickets
codeclash.aiAI evaluationsoftware developmentreinforcement learning
Discussion (1 comments)
Showing 1 comments
2 months ago
Is competition + limited resources (e.g. Core War) = selection pressures (natural or otherwise).
Can we integrate and bring back reinforcement learning in a framework like this?