Hackernews posts about Llama 2
Llama 2 is a large language model AI designed to replace earlier models like GPT-3.5/4 and provide advanced natural language processing capabilities.
- Sagence emerges from stealth promising Llama 2 at 10 percent power (spectrum.ieee.org)
- Show HN: Open-Source Alternative for GitHub Copilot (marketplace.visualstudio.com)
- Show HN: Senti – Offline LLM Voice Chat (testflight.apple.com)
- Cerebras Trains Llama Models to Leap over GPUs (www.nextplatform.com)
- GH200 Superchip Accelerates Inference by 2x in Multiturn Interactions with Llama (developer.nvidia.com)
- Meta to let US national security agencies and defense contractors use Llama AI (www.theguardian.com)
- Scale AI unveils 'Defense Llama' large language model (defensescoop.com)
- On Device Llama 3.1 with Core ML (machinelearning.apple.com)
- Show HN: Web App that looks at your resume and matches you to jobs (www.rocketjobs.app)
- Open-source AI must reveal its training data, per new OSI definition (www.theverge.com)
- LLama3.2-vision as almost perfect OCR? (demo.doctractor.com)
- Llama3.2-Vision on Ollama (ollama.com)
- Llama 2 (ai.meta.com)
- Run Llama 2 uncensored locally (ollama.ai)
- Llama2.c: Inference llama 2 in one file of pure C (github.com)
- Guide to running Llama 2 locally (replicate.com)
- LLaMA2 Chat 70B outperformed ChatGPT (tatsu-lab.github.io)
- Fast and Portable Llama2 Inference on the Heterogeneous Edge (www.secondstate.io)
- Fine-Tuning Llama-2: A Comprehensive Case Study for Tailoring Custom Models (www.anyscale.com)
- A simple guide to fine-tuning Llama 2 (brev.dev)
- JetMoE: Reaching LLaMA2 performance with 0.1M dollars (research.myshell.ai)
- Fast Llama 2 on CPUs with Sparse Fine-Tuning and DeepSparse (neuralmagic.com)
- Accessing Llama 2 from the command-line with the LLM-replicate plugin (simonwillison.net)
- WebLLM: Llama2 in the Browser (webllm.mlc.ai)
- Llama 2 on ONNX runs locally (github.com)
- Show HN: Llama2 Embeddings FastAPI Server (github.com)
- What's new in Llama 2 and how to run it locally (agi-sphere.com)
- Understanding Llama 2 and the New Code Llama LLMs (magazine.sebastianraschka.com)
- Llama 2 Long (arxiv.org)
- SEQUOIA: Exact Llama2-70B on an RTX4090 with half-second per-token latency (infini-ai-lab.github.io)
- Cookbook: Finetuning Llama 2 in your own cloud environment, privately (blog.skypilot.co)
- LoRA Fine-Tuning Efficiently Undoes Safety Training from Llama 2-Chat 70B (www.lesswrong.com)