Considering that LLaMA shows us how capable a smaller model can be when trained on an extremely large dataset, this feels like a no-brainer.
I await the actual announcement.