Hacker News posts about Llama 3
Llama 3 is Meta's family of openly released large language models, published in several sizes and successive versions (3.1, 3.2, 3.3). The posts below collect Hacker News discussion of the models: local inference, from-scratch reimplementations, fine-tunes, and hosting. A minimal local-inference sketch follows the list.
- Run Llama 3.3 70B Q40 on a $1516 GPU setup at 3.3 tok/s (github.com)
- Show HN: Tako, a Knowledge Search API (trytako.com)
- Show HN: AsianMOM – WebGPU Vision-LLM app that roasts you like ur mom in-browser (asianmom.kuber.studio)
- Show HN: KawaiiWatch – Making two LLMs fall in love in realtime (kawaii-watch.nightly.pw)
- With Its Llama API Service, Meta Platforms Becomes a Cloud (www.nextplatform.com)
- Ollama violating llama.cpp license for over a year (github.com)
- Show HN: Mirror AI – LLM agent that takes action, not just chat (themirrorai.com)
- Meta Llama 3 (llama.meta.com)
- Llama3 implemented from scratch (github.com)
- Llama 3 implemented in pure NumPy (docs.likejazz.com)
- Llama 3-V: Matching GPT4-V with a 100x smaller model and 500 dollars (aksh-garg.medium.com)
- Llama 3.1 (llama.meta.com)
- Llama-3.3-70B-Instruct (huggingface.co)
- Llama 3.1 Omni Model (github.com)
- Cost of self hosting Llama-3 8B-Instruct (blog.lytix.co)
- llama-fs: A self-organizing file system with llama 3 (github.com)
- Llama 3.1 in C (github.com)
- Run llama3 locally with 1M token context (ollama.com)
- Show HN: Llama 3.3 70B Sparse Autoencoders with API access (www.goodfire.ai)
- Ollama v0.1.33 with Llama 3, Phi 3, and Qwen 110B (github.com)
- Show HN: Tune LLaMa3.1 on Google Cloud TPUs (github.com)
- Llama 3 8B is almost as good as Wizard 2 8x22B (huggingface.co)
- Implementing LLaMA3 in 100 Lines of Pure Jax (saurabhalone.com)
- Longwriter – Increase llama3.1 output to 10k words (github.com)
- Hermes 3: The First Fine-Tuned Llama 3.1 405B Model (lambdalabs.com)
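Several of the posts above (the Ollama releases and the local-hosting write-ups) are about running Llama 3 on your own machine. For orientation, here is a minimal sketch of querying a locally served Llama 3 model through the Ollama Python client; the `llama3` model tag and the prompt are illustrative assumptions, and it presumes the Ollama daemon is already running with the model pulled.

```python
# Minimal sketch: query a locally served Llama 3 model via the Ollama Python
# client (pip install ollama). Assumes the Ollama daemon is running and a
# Llama 3 model has been pulled beforehand, e.g. `ollama pull llama3`.
import ollama

response = ollama.chat(
    model="llama3",  # illustrative tag; any pulled Llama 3 variant works
    messages=[
        {
            "role": "user",
            "content": "Summarize what is new in Llama 3 in two sentences.",
        },
    ],
)

# Print the assistant's reply text.
print(response["message"]["content"])
```

The same request can be made against Ollama's local HTTP API or, for the Hugging Face checkpoints listed above (e.g. Llama-3.3-70B-Instruct), through the `transformers` library, at the cost of much larger hardware requirements.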