Hacker News
michaelhartm
on March 24, 2023
| on:
LoRA: Low-Rank Adaptation of Large Language Models
They used the 6B GPT4-J, not the 20B. That's what's interesting: it's a smallish large language model :).
dragonwriter
on March 24, 2023
GPT-J, not GPT4-J.