Hacker News posts about RedPajama
- SlimPajama: A 627B token cleaned and deduplicated version of RedPajama (www.cerebras.net)
- RedPajama-Incite-7B-Instruct Outperforms LLaMA on MMLU (twitter.com)
- RedPajama-Data-v2: 30T tokens filtered and de-duplicated (twitter.com)
- Lessons from fine-tuning RedPajama LLM on Slack data (www.union.ai)
- RedPajama-Data-v2: An open dataset with 30T tokens (2023) (www.together.ai)
- Show HN: finetune LLMs via the Finetuning Hub (github.com)