Hackernews posts about LLaMA
LLaMA is an AI model developed by Facebook that generates human-like text responses to user input.
- Llama.cpp now has an official website: llama.app (twitter.com)
- Llama.cpp now has an official website: llama.app (llama.app)
- WebGPU support in llama.cpp (reeselevine.github.io)
- Show HN: Run Llama.cpp In-Process from Java with Project Panama FFM (deemwar-products.github.io)
- Llamas on the Web (reeselevine.github.io)
- Show HN: Will It Fit? – Opinionated Normal People Llama.cpp VRAM Estimator (hypfer.github.io)
- Tracing tokens through Llama 3.1 8B inference on H100s (krithik.xyz)
- The Winamp Skin Museum whips the Llama's ass (2020) (www.rockpapershotgun.com)
- WebGPU back end in llama.cpp/ggml (twitter.com)
- Show HN: Best setup local LLM found for a 5090 (llama.cpp fork + turboquant) (local-llm.utop.workers.dev)
- Run Llama.cpp on a Mac Pro 6,1 with Dual FirePro D700 GPUs on Ubuntu (matthewgribben.com)
- I ditched LM Studio for llama.cpp and my local LLM doesn't feel like a downgrade (www.xda-developers.com)
- Show HN: Llama CPU Benchmarks (deemwar-products.github.io)
- Benchmarking llama.cpp's new MTP support on Strix Halo (calebcoffie.com)
- Llama.cpp b9180: MTP support landed (github.com)
- ZML: Between Jax and Llama.cpp (jaco-bro.github.io)
- Find bugs in YOUR code using OpenCode, Llama.cpp and Qwen3.6 (wtarreau.blogspot.com)