One finding in the LLaMA paper [1] is that current large models are undertrained: a smaller model trained on more tokens can match or beat a much larger one. LLaMA with 13B params outperforms GPT-3 175B (the base model, not ChatGPT) on most benchmarks, and an "instruct" version, finetuned from the 65B model, also did quite well.

[1] https://arxiv.org/pdf/2302.13971.pdf
