I am using the 32Gb distilled model on my local 3090 with Continue in VSCode. It... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		itissid 6 months ago \| parent \| context \| favorite \| on: The Illustrated DeepSeek-R1 I am using the 32Gb distilled model on my local 3090 with Continue in VSCode. It beats everything out of the water.

dontwearitout 6 months ago | [–]

How many tokens/s do you get on a 3090? With the extra tokens for the internal monologue, is it still performant enough for smooth VSCode integration?

MarcelOlsz 6 months ago | [–]

Any idea how to use a cloud hosted version with cursor?

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact