Hackernews posts about LLaMA
LLaMA is an AI model developed by Facebook that generates human-like text responses to user input.
- Update on Llama adoption (ai.meta.com)
- Hermes 3: The First Fine-Tuned Llama 3.1 405B Model (lambdalabs.com)
- The AI Hype Fraud of Reflection-Llama-70B (twitter.com)
- Reflection-Llama-3.1-70B is Llama-3 with LoRA (old.reddit.com)
- Llamafile v0.8.13 (and Whisperfile) (simonwillison.net)
- cerebras: 450 tokens/sec llama 3.1 70B (www.theregister.com)
- Show HN: I built a website where you can easily fine-tune Llama 3.1 models (www.tunellama.com)
- A Llama 70B finetune that has reflection baked into it's weights (huggingface.co)
- AI Uncensored: a fine tune of Llama 405B – no more PC nonsense (www.aiuncensored.info)
- Fine-tuning Llama3-70B for GradPilot – easily beats Claude Sonnet (www.alignedhq.ai)
- Llama3 Just Got Ears (homebrew.ltd)
- Open-weight Llama3-based LLMs for 5 African languages (jacarandahealth.org)
- The Mamba in the Llama: Distilling and Accelerating Hybrid Models (www.together.ai)
- Nvidia and LlamaIndex Developer Contest (developer.nvidia.com)
- Cerebras reaches 1800 tokens/s for 8B Llama3.1 (www.forbes.com)
- Llama3 Just Got Ears (homebrew.ltd)
- AI Uncensored: a fine tune of Llama 405B to remove political bias (www.aiuncensored.info)
- ChatGPT 4o Mini – A Free AI Chatbot Based on Llama-3.1 (chatgpt4omini.net)
- Up to 1.9X Higher Llama 3.1 Performance with Medusa (developer.nvidia.com)
- Prune and Distill Llama-3.1 8B to an Nvidia Llama-3.1-Minitron 4B (developer.nvidia.com)
- Reflection Llama-3.1 70B: Testing and Summary of What We Know (www.datacamp.com)