Hackernews posts about VLLM
- VLLM: Anatomy of a High-Throughput LLM Inference System (www.aleksagordic.com)
- Disaggregated Inference at Scale with PyTorch and VLLM (pytorch.org)
- vLLM with torch.compile: Efficient LLM inference on PyTorch (blog.vllm.ai)
- VLLM: Anatomy of a High-Throughput LLM Inference System (www.aleksagordic.com)
- Show HN: RealTimeX – Local‑first private AI agents (realtimex.ai)
- Igniting VLMs Toward the Embodied Space (arxiv.org)
- The Guide to Visual Language Action Models (VLAM) (jdsemrau.substack.com)
- Testing VLMs and LLMs for robotics with the Jetson Thor devkit [video] (www.youtube.com)
- Show HN: Dayflow – A git log for your day (github.com)
- Comprehension debt: A ticking time bomb of LLM-generated code (codemanship.wordpress.com)
- Defeating Nondeterminism in LLM Inference (thinkingmachines.ai)
- Sampling and structured outputs in LLMs (parthsareen.com)
- SpikingBrain 7B – More efficient than classic LLMs (github.com)
- 25L Portable NV-linked Dual 3090 LLM Rig (www.reddit.com)
- VaultGemma: The most capable differentially private LLM (research.google)