Hacker News posts about Llama 2
Llama 2 is a family of open-weight large language models released by Meta in July 2023, available in 7B, 13B, and 70B parameter sizes, positioned as an open alternative to proprietary models such as GPT-3.5/4 for natural language processing tasks.
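Several posts below deal with running or fine-tuning the chat variants, which expect Meta's published `[INST]`/`<<SYS>>` prompt template. A minimal sketch of building a single-turn prompt in that format (exact handling of the `<s>` BOS token varies by runtime, so it is left to the tokenizer here):

```python
# Markers from Meta's Llama 2 chat prompt format.
B_INST, E_INST = "[INST]", "[/INST]"
B_SYS, E_SYS = "<<SYS>>\n", "\n<</SYS>>\n\n"

def llama2_chat_prompt(system: str, user: str) -> str:
    """Wrap a system message and a user message in Llama 2 chat markers."""
    return f"{B_INST} {B_SYS}{system}{E_SYS}{user} {E_INST}"

prompt = llama2_chat_prompt(
    "You are a helpful assistant.",
    "Summarize the Llama 2 paper in one sentence.",
)
print(prompt)
```

The resulting string can be passed to any Llama 2 chat runtime (llama.cpp, Ollama, or `transformers`); instruct-tuned models respond poorly if these markers are missing.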
- From llama.vim to Qt Creator using AI (cristianadam.eu)
- D2F – We made dLLMs 2.5x faster than LLaMA3 (arxiv.org)
- A Python RAG tutorial with Pinecone and Ollama 3.2 with a code example (blog.yasuflores.me)
- Show HN: OWhisper – Ollama for realtime speech-to-text (docs.hyprnote.com)
- Show HN: My Agentic Newsletter Project (iliareingold.com)
- Fast and Portable Llama2 Inference on the Heterogeneous Edge (www.secondstate.io)
- JetMoE: Reaching LLaMA2 performance with 0.1M dollars (research.myshell.ai)
- Fast Llama 2 on CPUs with Sparse Fine-Tuning and DeepSparse (neuralmagic.com)
- WebLLM: Llama2 in the Browser (webllm.mlc.ai)
- Understanding Llama 2 and the New Code Llama LLMs (magazine.sebastianraschka.com)
- Llama 2 Long (arxiv.org)
- SEQUOIA: Exact Llama2-70B on an RTX4090 with half-second per-token latency (infini-ai-lab.github.io)
- LoRA Fine-Tuning Efficiently Undoes Safety Training from Llama 2-Chat 70B (www.lesswrong.com)
- Run Llama 2 uncensored locally (ollama.com)
- Cloudflare and Meta Collaborate to Make Llama 2 Available Globally (www.cloudflare.com)
- A poor man's guide to fine-tuning Llama 2 (duarteocarmo.com)
- GPT 3.5 vs. Llama 2 fine-tuning: A Comprehensive Comparison (ragntune.com)
- Show HN: Finetune Llama-3 2x faster in a Colab notebook (colab.research.google.com)
- Llama 2 70B on M2 Max at 7 tokens/sec (twitter.com)
- Yi-34B, Llama 2, and common practices in LLM training (blog.eleuther.ai)
- EagleX 1.7T: Soaring past LLaMA 7B 2T in both English and Multi-lang evals (substack.recursal.ai)