Seems like a good benchmark for AGI. Start with things that are easy for humans ... | Hacker News


robryan 5 months ago | on: GPT-5

Seems like a good benchmark for AGI. Start with things that are easy for humans but hard for LLMs currently.

mustaphah 5 months ago

But they have access to tools (though I'm not sure why they're not using them in this case).

Ask it to count using a coding tool, and it will always give you the right answer. Just as humans use tools to overcome their limits, LLMs should do the same.
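To illustrate, here is a minimal sketch of the kind of snippet a coding tool might run for a counting task. The thread doesn't say exactly what was being counted, so this assumes it's letters in a word; `count_letter` and the example word are hypothetical, not taken from the original post.

```python
def count_letter(text: str, letter: str) -> int:
    """Count case-insensitive occurrences of a single character in text."""
    if len(letter) != 1:
        raise ValueError("letter must be a single character")
    # str.count does an exact, deterministic scan of the string,
    # so the result doesn't depend on how a model tokenizes the word.
    return text.lower().count(letter.lower())


# Hypothetical example input (assumption, not from the thread).
print(count_letter("blueberry", "b"))
```

The counting here is done by the interpreter rather than by the model, which is the whole point of handing the task to a tool.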
