They used the 6b GPT4-J, not 20B. That's what's interesting, it's a smallish lar... | Hacker News


michaelhartm on March 24, 2023 | on: LoRA: Low-Rank Adaptation of Large Language Models

They used the 6B GPT4-J, not the 20B model. That's what's interesting: it's a smallish large language model :).

dragonwriter on March 24, 2023

GPT-J, not GPT4-J.
