You could! But as others have mentioned, the performance gain would be negligible. If you really want a bigger boost from pretraining, try building a larger corpus to train on, either by generating synthetic data from your material or by finding information adjacent to it. Here's a good paper about it: <https://arxiv.org/abs/2409.07431>
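
To make the synthetic-data idea a bit more concrete, here's a rough sketch of one way to expand a small document into many generation prompts. It's my own illustration and not necessarily what the paper does. The function name `build_augmentation_prompts`, the entity list, and the prompt wording are all made up, and you would still need to send each prompt to whatever model you use to get the actual synthetic text.

```python
from itertools import combinations


def build_augmentation_prompts(document: str, entities: list[str]) -> list[str]:
    """Build one prompt per unordered pair of entities that appear in the document.

    Hypothetical helper: the entity list is supplied by hand here, and the
    prompt template is only an example.
    """
    lowered = document.lower()
    # Keep only entities that are actually mentioned in the source text.
    present = [e for e in entities if e.lower() in lowered]

    prompts = []
    for a, b in combinations(present, 2):
        prompts.append(
            f"Using only the text below, explain how {a} relates to {b}.\n\n"
            f"Text:\n{document}"
        )
    return prompts


doc = "Ada Lovelace wrote notes on the Analytical Engine designed by Charles Babbage."
candidates = ["Ada Lovelace", "Charles Babbage", "Analytical Engine", "Alan Turing"]
prompts = build_augmentation_prompts(doc, candidates)
print(len(prompts))  # 3 entities are present, so 3 pairs
```

The number of prompts grows quadratically with the number of entities, which is the point: a short document can yield a much larger and more varied training corpus than the original text alone.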

