"one small step at a time, and one giant leap, together."
I didn't like this part:
5090 for $2000, about $500 more than the 4090 cost when it was announced.
They didn't mention the VRAM amount though, and I doubt it's more than 24GB. If the Apple M4 Ultra gets close to the 5090's 1.8 TB/s of bandwidth, it'll crush GeForce once and for all.
Also nitpick: the opening video said tokens are responsible for all AI, but that only applies to a subset of AI models...
When the retail price is so far below the "street" price, the card just becomes harder to obtain and scalpers take a bigger cut. Raising the price to something more normal at least gives you more of a chance at the big-box store.
Or scalpers won’t be dissuaded and the street price for a 5090 will be $3200 or more. $1500 was already an insane price for scalpers to pay, but they did it anyway.
The scalpers are trying to arbitrage the gap between the price when buying directly from retailers and the price on the open secondary market.
Increasing the retail price doesn't increase the price on the secondary market; it just lowers the scalpers' margin.
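To put numbers on it, a minimal sketch (the street price here is purely hypothetical):

    # Scalper margin = street price - retail price.
    street_price = 3200               # hypothetical street price, not a real figure
    old_retail, new_retail = 1500, 2000
    print(street_price - old_retail)  # 1700: margin at the old MSRP
    print(street_price - new_retail)  # 1200: margin shrinks; the street price itself doesn't move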
Well for one thing it’s a lot easier to communicate expectations to consumers at CES.
“Coming soon to auction at a price set by users, no idea what that will be though, good luck!” is much less compelling for consumers trying to plan their purchases in advance.
Being able to decide the price and who you sell your product to is huge leverage. Nvidia can go to a retailer stocking something they don't like side by side on the shelf: hey, ditch this and we'll make you a price. It's never that overt, of course, and it can play out geopolitically too: hey government, you want chips? We have chips, and it would be a shame if the market grabbed them before you. BTW, don't forget my tax cut.
If anyone in this thread had watched the linked video or even read a summary, they ought to be at least talking about the DIGITS announcement.
128GB of unified memory for $3000.
Slow? Yes. It isn't meant to compete with the datacenter chips; it's just a way to stop the embarrassment of being beaten at HPC workstations by Apple. But it does the job.
Jensen didn't talk about it. Maybe he knows it's embarrassingly low. Everyone knows Nvidia won't give us more VRAM, to avoid cannibalizing their enterprise products.
The official specs for the 5090 have been out for days on nvidia.com, and they explicitly state it's 32GB of GDDR7 with a 512-bit bus, for a total of 1.8TB/s of bandwidth.
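For anyone who wants to sanity-check that, the implied per-pin rate falls straight out of those two numbers (the 28 Gbps figure is my arithmetic, not a quoted spec):

    # Per-pin data rate implied by a 512-bit bus at 1.8 TB/s total.
    bus_width_bits = 512
    total_gbit_per_s = 1.8 * 1000 * 8          # 1.8 TB/s -> 14400 Gbit/s
    print(total_gbit_per_s / bus_width_bits)   # 28.125 Gbps per pin, plausible for GDDR7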
This feels like a weird complaint, given you started by saying it was 24GB, and then argued that the person who told you it was actually 32GB was making that up.
All else equal, this means that price per GB of VRAM stayed the same. But in reality, other things improved too (like the bandwidth) which I appreciate.
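Quick napkin math with the launch prices mentioned upthread ($1500 and $2000 are the figures other commenters used, not prices I'm vouching for):

    print(1500 / 24)  # 62.5 $/GB for the 4090 (24GB)
    print(2000 / 32)  # 62.5 $/GB for the 5090 (32GB) -- identical, as claimed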
I just think that for home AI use, 32GB isn't that helpful. In my experience and especially for agents, models at 32B parameters just start to be useful. Below that, they're useful only for simple tasks.
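Rough napkin math on weight memory alone, assuming common quantization levels (the bit widths are illustrative, not a claim about any particular model):

    # Memory for the weights of a 32B-parameter model at various precisions.
    params = 32e9
    for bits in (16, 8, 4):
        print(f"{bits}-bit: {params * bits / 8 / 1e9:.0f} GB")
    # 64, 32 and 16 GB respectively -- before any KV cache or activations

So 32GB buys you a 4-bit 32B model with some room for context, but nothing bigger.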
Yes, home / hobbyist LLM users are not overly excited about this, but
a) they are never going to be happy,
b) it's actually a significant step up given the bulk are dual-card users anyway, so this bumps them from (at the high end of the consumer segment) 48GB to 64GB of VRAM, which _is_ pretty significant given the prevalence of larger models / quants in that space, and
c) vendors really don't care terribly much about the home / hobbyist LLM market, no matter how much people in that market wish otherwise.
Supposedly, this image of an Inno3D 5090 box leaked, revealing 32GB of VRAM. It seems like the 5090 will be more of a true halo product, given the pricing of the other cards.