Hacker Newsnew | past | comments | ask | show | jobs | submit | desideratum's submissionslogin
1.Finetuning GPT-OSS with Axolotl (github.com/axolotl-ai-cloud)
3 points by desideratum 5 months ago | past
2.Accelerate ND-Parallel: A Guide to Efficient Multi-GPU Training (huggingface.co)
3 points by desideratum 5 months ago | past
3.Training LLMs with GRPO and Interpreter Feedback Using WebAssembly (huggingface.co)
3 points by desideratum 9 months ago | past
4.Training Large Language Models with Interpreter Feedback Using WebAssembly (huggingface.co)
1 point by desideratum 9 months ago | past
5.DeepSeek-V3-0324 (huggingface.co)
5 points by desideratum 9 months ago | past | 1 comment
6.Training Process Reward Models in Axolotl (axolotlai.substack.com)
2 points by desideratum 10 months ago | past
7.Torchtune – a native PyTorch library for fine-tuning LLMs (github.com/pytorch)
2 points by desideratum on Oct 8, 2024 | past
8.(Deep Learning Based) Opportunistic Screening to Improve Statin Rates (ahajournals.org)
1 point by desideratum on April 15, 2024 | past
9.The theory of Proximal Policy Optimisation implementations (salmanmohammadi.github.io)
1 point by desideratum on April 11, 2024 | past
10.Ask HN: Feel like I'm being lowballed by founders. Where do I go from here?
5 points by desideratum on Dec 31, 2020 | past | 13 comments

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: