Currently I am chugging along with a single RTX 3090 FE in my workstation. Qwen 27B is nice, but 32B is already rather awkward and I have to go headless to get a halfway usable context window. I considered adding a second 3090, but decided against it for various reasons.
Right now I have my eyes set on the Qwen3-next-80B model at NVFP4 or Q5. But maybe that's just me dreaming and 32B would already be fine for my (non-work related) needs?
With the latest price hikes I am just unwilling to throw even more money at Anthropic or OpenAI. They are still burning through money, and my personal opinion is that they will need to raise prices massively. I also do not think we will see any meaningful price reductions after the bubble bursts. Imho, it will be quite the opposite, as all AI addicts will scramble to secure whatever hardware they can to keep going.
The most expensive option would be to shell out nearly 10k€ for an Nvidia Pro 6000 Blackwell with 97GB VRAM. The 72GB Pro 5000 model could work as well, but it still requires RAM offloading, which can lead to erratic TPS while still costing around 7200€. Right below that sits the M3 Ultra Mac Studio with 128GB unified memory, but it currently is not available in Europe. The 96GB model feels somewhat like a trap to me.
The more affordable options for me are:
I would settle for this: It is OK if it takes upwards of two minutes from the first prompt on a larger software project, as long as subsequent prompts are reasonably fast. It is not for work, but for ambitious home projects.
Right now I have my eyes set on the Qwen3-next-80B model at NVFP4 or Q5. But maybe that's just me dreaming and 32B would already be fine for my (non-work related) needs?
With the latest price hikes I am just unwilling to throw even more money at Anthropic or OpenAI. They are still burning through money, and my personal opinion is that they will need to raise prices massively. I also do not think we will see any meaningful price reductions after the bubble bursts. Imho, it will be quite the opposite, as all AI addicts will scramble to secure whatever hardware they can to keep going.
The most expensive option would be to shell out nearly 10k€ for an Nvidia Pro 6000 Blackwell with 97GB VRAM. The 72GB Pro 5000 model could work as well, but it still requires RAM offloading, which can lead to erratic TPS while still costing around 7200€. Right below that sits the M3 Ultra Mac Studio with 128GB unified memory, but it currently is not available in Europe. The 96GB model feels somewhat like a trap to me.
The more affordable options for me are:
- Nvidia Pro 5000 48GB — Even with NVFP4 and RAM offloading it probably would be unbearably slow for Qwen-next-80B.
- DGX Spark — On paper this seems more potent than a Ryzen AI Max+ system, but user experiences do not really seem to support that. The dev environment also appears to be rather beta right now.
- Ryzen AI Max+ mini PCs — Probably the most budget-friendly option, but they seem sluggish with larger models due to the mediocre memory bandwidth. They are also not dramatically cheaper than a DGX Spark.
I would settle for this: It is OK if it takes upwards of two minutes from the first prompt on a larger software project, as long as subsequent prompts are reasonably fast. It is not for work, but for ambitious home projects.