
Today Databricks announced [0] a 6B-parameter model from EleutherAI fine-tuned on the Alpaca dataset. According to their CEO [1], training took 3 hours and cost $30. They didn't release details on how it was trained, but it was likely done with LoRA.

[0] https://www.databricks.com/blog/2023/03/24/hello-dolly-democ... [1] https://twitter.com/alighodsi/status/1639251347777388544
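For anyone curious what that kind of fine-tune looks like in practice, here's a rough sketch using Hugging Face transformers + peft. To be clear, this is just my guess at the general shape of it, not Databricks' actual recipe -- the hyperparameters, prompt format, and dataset id (tatsu-lab/alpaca) are all illustrative.

    # Sketch: LoRA fine-tune of GPT-J 6B on Alpaca-style instruction data.
    # Hyperparameters, prompt template, and dataset id are assumptions.
    from transformers import (AutoModelForCausalLM, AutoTokenizer,
                              TrainingArguments, Trainer,
                              DataCollatorForLanguageModeling)
    from peft import LoraConfig, get_peft_model
    from datasets import load_dataset

    model_name = "EleutherAI/gpt-j-6B"
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    tokenizer.pad_token = tokenizer.eos_token  # GPT-J has no pad token
    model = AutoModelForCausalLM.from_pretrained(model_name)

    # Wrap the base model with low-rank adapters; only these small
    # matrices are trained, which is why it's so cheap.
    lora_config = LoraConfig(
        r=8,
        lora_alpha=16,
        target_modules=["q_proj", "v_proj"],  # attention projections in GPT-J
        lora_dropout=0.05,
        task_type="CAUSAL_LM",
    )
    model = get_peft_model(model, lora_config)
    model.print_trainable_parameters()  # typically well under 1% of the 6B weights

    # Alpaca-style records have instruction / input / output fields.
    dataset = load_dataset("tatsu-lab/alpaca", split="train")

    def tokenize(example):
        prompt = (f"### Instruction:\n{example['instruction']}\n\n"
                  f"### Response:\n{example['output']}")
        return tokenizer(prompt, truncation=True, max_length=512)

    tokenized = dataset.map(tokenize, remove_columns=dataset.column_names)

    trainer = Trainer(
        model=model,
        args=TrainingArguments(
            output_dir="gptj-alpaca-lora",
            per_device_train_batch_size=4,
            num_train_epochs=1,
            fp16=True,
            logging_steps=50,
        ),
        data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
        train_dataset=tokenized,
    )
    trainer.train()
    model.save_pretrained("gptj-alpaca-lora")

The point is that only the adapter matrices get gradients, so a single A100 for a few hours is plausible for 6B, which would line up with the ~$30 claim.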



Interesting. I wonder what the training cost was for:

https://huggingface.co/EleutherAI/gpt-neox-20b

Perhaps it’s in the paper…


They used the 6B GPT4-J, not the 20B. That's what's interesting: it's a smallish large language model :).


GPT-J, not GPT4-J.


There are also some LLaMA LoRAs that are trained on the Anthropic dataset specifically for chat:

https://huggingface.co/serpdotai

I haven't done any formal tests on this yet, but with llama-13b, the overall structure of its responses definitely becomes much more ChatGPT-like. It would be very interesting to see how the 65B model performs.
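In case it's useful, loading one of those adapters on top of the base model is only a few lines with peft. The adapter id below is a placeholder, not an actual serpdotai repo name, and the base-weights repo is also just an assumption:

    # Sketch: applying a chat LoRA adapter on top of llama-13b for inference.
    # Repo ids below are placeholders / assumptions.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer
    from peft import PeftModel

    base_model = "huggyllama/llama-13b"        # assumed source of base weights
    adapter = "some-org/llama-13b-chat-lora"   # placeholder adapter id

    tokenizer = AutoTokenizer.from_pretrained(base_model)
    model = AutoModelForCausalLM.from_pretrained(
        base_model, torch_dtype=torch.float16, device_map="auto"
    )
    model = PeftModel.from_pretrained(model, adapter)  # attaches the LoRA weights

    prompt = "Human: How do I fine-tune a small language model cheaply?\n\nAssistant:"
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output = model.generate(**inputs, max_new_tokens=200,
                            do_sample=True, temperature=0.7)
    print(tokenizer.decode(output[0], skip_special_tokens=True))

Swapping adapters in and out like this is cheap, so comparing the 13B and 65B versions side by side shouldn't be much work once the base weights are on disk.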


Let the revolution begin



