
This is the company behind RedPajama, which is a VERY big deal.

It's the most realistic open-source attempt at recreating the Facebook LLaMA LLM from scratch, in a way that supports commercial usage.

They released their full 2.6TB training dataset last month, and it's significant: https://simonwillison.net/2023/Apr/17/redpajama-data/

They've also started releasing new, commercially-usable openly licensed LLM models trained on that data. You can try one of those out here: https://huggingface.co/togethercomputer/RedPajama-INCITE-Ins...




I would also check out their 3B model. I tested it on launch with LoRA fine-tuning and found it to be surprisingly capable despite its size. I think a lot of people are skipping past testing it because it only has 3B params.

Edit: https://huggingface.co/togethercomputer/RedPajama-INCITE-Bas...
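For anyone curious what LoRA fine-tuning actually changes: the pretrained weights stay frozen and only a small low-rank update is trained, which is why it's cheap enough to run on a 3B model. Here's a minimal NumPy sketch of the idea (the dimensions, rank, and `alpha` here are made-up toy values, not RedPajama's actual config):

```python
import numpy as np

# LoRA: freeze the pretrained weight W, train only a low-rank update B @ A.
# Trainable params drop from d_out*d_in to r*(d_in + d_out).
rng = np.random.default_rng(0)
d_out, d_in, r, alpha = 8, 8, 2, 16  # toy sizes for illustration

W = rng.normal(size=(d_out, d_in))     # frozen pretrained weight
A = rng.normal(size=(r, d_in)) * 0.01  # trainable, small random init
B = np.zeros((d_out, r))               # trainable, zero init

def lora_forward(x):
    # Base path plus the scaled low-rank adapter path.
    return W @ x + (alpha / r) * (B @ (A @ x))

x = rng.normal(size=d_in)
# Because B starts at zero, the adapted layer initially matches the base model.
assert np.allclose(lora_forward(x), W @ x)
```

In practice you'd use a library like Hugging Face PEFT rather than hand-rolling this, but the zero-init of B is the key trick: fine-tuning starts exactly at the pretrained model and only drifts as the adapter learns.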


Well, MPT-7B is also commercially usable and openly licensed: https://www.mosaicml.com/blog/mpt-7b


Yeah, it's really promising. It's partially trained on that RedPajama data.




