Hacker News posts about Llama.cpp
- Llama.cpp now has an official website: llama.app (twitter.com)
- Llama.cpp now has an official website: llama.app (llama.app)
- WebGPU support in llama.cpp (reeselevine.github.io)
- Apple MLX vs. llama.cpp: compared and benchmarked [video] (www.youtube.com)
- WebGPU back end in llama.cpp/ggml (twitter.com)
- Run Llama.cpp on a Mac Pro 6,1 with Dual FirePro D700 GPUs on Ubuntu (matthewgribben.com)
- I ditched LM Studio for llama.cpp and my local LLM doesn't feel like a downgrade (www.xda-developers.com)
- Benchmarking llama.cpp's new MTP support on Strix Halo (calebcoffie.com)
- Llama.cpp b9180: MTP support landed (github.com)
- ZML: Between Jax and Llama.cpp (jaco-bro.github.io)
- Find bugs in YOUR code using OpenCode, Llama.cpp and Qwen3.6 (wtarreau.blogspot.com)
- Llama and Spec: MTP Support (github.com)
- Vision Now Available in Llama.cpp (github.com)
- Llama.cpp guide – Running LLMs locally on any hardware, from scratch (steelph0enix.github.io)
- Heap-overflowing Llama.cpp to RCE (retr0.blog)
- Llama.cpp supports Vulkan. Why doesn't Ollama? (github.com)
- Ollama violating llama.cpp license for over a year (github.com)
- Llama.cpp AI Performance with the GeForce RTX 5090 Review (www.phoronix.com)
- Mistral Integration Improved in Llama.cpp (github.com)
- Llama.cpp: Add GPT-OSS (github.com)
- Llama.cpp Now Part of the Nvidia RTX AI Toolkit (developer.nvidia.com)
- DeepSeek-R1 speeds up llama.cpp code by x2 (github.com)
- Llama.cpp AI Performance with the GeForce RTX 5090 (www.phoronix.com)
- Local Agents with Llama.cpp and Pi (huggingface.co)
- Tinker with LLMs in the privacy of your own home using Llama.cpp (www.theregister.com)
- LlaMa.cpp Robot Wars (www.youtube.com)
- Llama.cpp's Agents.md (github.com)
- Llama's Paradox – Exploiting Llama.cpp (retr0.blog)