AFAIK they used half-precision (Float16) | Hacker News


skdotdan on May 29, 2020 | on: GPT-3: Language Models Are Few-Shot Learners

AFAIK they used half-precision (Float16)

cs702 on May 29, 2020

Thanks. I should have written "if using Float32," which is what I meant -- instead of "with Float32," which in hindsight reads a bit ambiguous. But regardless of which floating-point representation is used, the number of weights is still in the hundreds of billions... which is insane.
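
For a rough sense of scale, here is a back-of-the-envelope sketch of the weight storage under each representation. It assumes the 175 billion parameter count reported for the largest GPT-3 model and counts only the raw weights (no gradients, optimizer state, or activations). The function name `weight_memory_gb` is made up for illustration.

```python
# Bytes needed to store one weight in each floating-point format.
BYTES_PER_PARAM = {"float32": 4, "float16": 2}


def weight_memory_gb(num_params: int, dtype: str) -> float:
    """Return raw weight storage in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


# Assumption: parameter count of the largest GPT-3 model, 175 billion.
NUM_PARAMS = 175_000_000_000

for dtype in ("float32", "float16"):
    print(f"{dtype}: {weight_memory_gb(NUM_PARAMS, dtype):.0f} GB")
# float32: 700 GB
# float16: 350 GB
```

Half precision halves the footprint, but either way the weights alone are far beyond the memory of any single accelerator available at the time.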
