SmartNews
Interests
Topics
Top Domains
History
About
Topics
RedPajama
Hackernews posts about RedPajama
RedPajama
v2 Open Dataset with 30T Tokens for Training LLMs
(together.ai)
236 points by
programd
almost 2 years ago
|
60 comments
New Dataset:
RedPajama
Dynamic Topic Modeling, 100K Docs W Topic Heirarchies
(huggingface.co)
5 points by
IEatPrompts
11 months ago
|
discuss
RedPajama
-Data-v2: 30T tokens filtered and de-duplicated
(twitter.com)
4 points by
leumassuehtam
almost 2 years ago
|
discuss
Fine-Tuning Insights: Lessons from Experimenting with
RedPajama
on Slack Data
(www.union.ai)
2 points by
swiftlyTyped
over 1 year ago
|
discuss
RedPajama
-Data-v2: An open dataset with 30T tokens (2023)
(www.together.ai)
1 points by
tosh
over 1 year ago
|
discuss
Show HN: The fastest way to run Mixtral 8x7B on Apple Silicon Macs
18 points by
woadwarrior01
over 1 year ago
|
22 comments
Show HN: A Comprehensive AI Data Quality Evaluation Tool
(github.com)
4 points by
e06084
3 months ago
|
discuss