
If you want to train/sample large models, then use what the rest of the industry uses.

My use case is different. I want something that I can run quickly on one GPU without worrying about whether it is supported or not.

I am interested in convenience, not in squeezing out the last bit of performance from a card.



You wildly misunderstand PyTorch.


What is there to misunderstand? It doesn't even install properly most of the time on my machine. You have to use a specific Python version.

I gave up on all tools that depend on it for inference. llama.cpp compiles cleanly on my system for Vulkan; I want the same simplicity for testing model training.
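For context, the Vulkan build is a two-liner like this (a sketch against a recent checkout; in older trees the flag was LLAMA_VULKAN=ON rather than GGML_VULKAN=ON):

    cmake -B build -DGGML_VULKAN=ON
    cmake --build build --config Release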


PyTorch is as easy as you are going to find for your exact use case. If you can't handle the requirement of a specific version of Python, you are going to struggle in software land. ChatGPT can show you the way.
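For what it's worth, pinning the interpreter is one venv away; a sketch, assuming 3.11 sits in the supported range of the torch build you want:

    python3.11 -m venv .venv
    . .venv/bin/activate
    pip install torch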


I have been doing this for 25 years and no longer have the patience to deal with stuff like this. I am never going to install Arch from scratch by hand-building the configuration ever again. The same goes for PyTorch and ROCm.

Getting them to work and recognize my GPU without passing arcane flags was a problem. I could at least avoid the pain with llama.cpp because of its Vulkan support. PyTorch apparently doesn't have a Vulkan backend, so I decided to roll my own with wgpu-py.
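To give a flavor of it, here is a minimal wgpu-py compute sketch. This is not the actual backend, just the pattern, and it assumes wgpu-py's compute_with_buffers helper (pip install wgpu):

    # Double an array on the GPU through wgpu-py (Vulkan/Metal/DX12 underneath).
    import ctypes
    from wgpu.utils.compute import compute_with_buffers

    shader = """
    @group(0) @binding(0) var<storage, read> xs: array<f32>;
    @group(0) @binding(1) var<storage, read_write> ys: array<f32>;

    @compute @workgroup_size(1)
    fn main(@builtin(global_invocation_id) gid: vec3<u32>) {
        let i = gid.x;
        ys[i] = xs[i] * 2.0;
    }
    """

    n = 16
    xs = (ctypes.c_float * n)(*range(n))

    # One dispatch per element; returns a dict of memoryviews keyed by binding.
    out = compute_with_buffers({0: xs}, {1: ctypes.c_float * n}, shader, n=n)
    print(out[1].tolist())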


FWIW, I've been experimenting with LLMs for the last couple of years, and have exclusively built everything I do around llama.cpp exactly because of the issues you highlight. "gem install hairball" has gone way too far, and I appreciate shallow dependency stacks.


Fair enough, I guess. I think you'll find the relatively minor headache worth it; PyTorch brings a lot to the table.


I suspect the OP's issues are mostly with the ROCm build of PyTorch. AMD still can't get this right.
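When it does work, it is usually some variant of this (all hedged: match the rocmX.Y index URL to your driver series, HSA_OVERRIDE_GFX_VERSION=10.3.0 is the usual spoof for officially unsupported RDNA2 consumer cards, and train.py is a placeholder for your own script):

    pip install torch --index-url https://download.pytorch.org/whl/rocm6.1
    HSA_OVERRIDE_GFX_VERSION=10.3.0 python train.py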


Probably, but the answer is to avoid ROCm, not PyTorch.


Avoiding ROCm means buying a new Nvidia GPU. Some people would like to keep using the hardware they already have.


The cost of dealing with ROCm exceeds the cost of a consumer Nvidia GPU by orders of magnitude.



