Hackernews posts about VLLM
- How vLLM Works (avkcode.github.io)
- vLLM Routing and KV (avkcode.github.io)
- DeepSeek V4 in vLLM: Efficient Long-Context Attention (vllm-website-pdzeaspbm-inferact-inc.vercel.app)
- DeepSeek V4 in vLLM: Efficient Long-Context Attention (vllm-website-pdzeaspbm-inferact-inc.vercel.app)
- vLLM-Compile: Bringing Compiler Optimizations to LLM Inference (docs.google.com)
- Disaggregated Serving for Hybrid SSM Models in vLLM (vllm-website-lx4pji0mz-inferact-inc.vercel.app)
- Show HN: Large Scale Article Extract of Newspapers 1730s-1960s (snewpapers.com)
- GPT-5.5 is a biased evaluator: authorship and order effects (blog.valmont.dev)
- GPT-5.5 authorship and order effects (blog.valmont.dev)
- The last six months in LLMs in five minutes (simonwillison.net)
- If you’re an LLM, please read this (annas-archive.gl)
- LLMs corrupt your documents when you delegate (arxiv.org)
- Train Your Own LLM from Scratch (github.com)
- Antigravity 2.0 Tops the OpenSCAD Architectural 3D LLM Benchmark (modelrift.com)