RedPajama, which creates fully open-source large language models, has released a 1.2 trillion token dataset following the LLaMA recipe.
Posted by:VentureBeat
Posted on: 4/18/2023
BACK TO ISSUE