Tetragrade@leminal.space to Technology@lemmy.world • Announcing ARC-AGI-3 - A benchmark that tests if AI can explore, learn, and adapt in unfamiliar situations. Humans score 100%. Frontier AI scores 0.26%. (3 days ago)
This replay is the funniest shit lmao. Keep building that bridge, Claude.
https://arcprize.org/replay/0964128b-a2f5-4c5b-886e-497d893f429d
Interesting that it seems to be perceiving the environment mostly accurately, and is just completely wrong about the purpose of all the game objects.