Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Try MinerU 2.5 with two-step parsing. It gives good results with bounding boxes per block. Not sure if you can get it to do more detailed such as word or character level.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: