
> As the complexity of the game state grew and the screenshots were filled with more entities, the models got even more confused and started hallucinating directions, entities, etc., or weren't capable of troubleshooting factories with apparent mistakes (e.g. a missing transport belt or a wrongly rotated inserter). We think it's because [...]

I think you just described a research paper that would advance SOTA. Less describing why, more describing how. (Assuming it's not just "we fine-tuned the model and it worked perfectly.")



Sounds almost like a visual "needle in a haystack" type of work, which could be quite interesting!


A Where’s Waldo test for VLMs.



