Hacker News posts about Llama 3.1 70B
- Reflection-Llama-3.1-70B is Llama-3 with LoRA (old.reddit.com)
- Nvidia releases weights for Llama-3.1-Nemotron-70B-Instruct (huggingface.co)
- Nvidia Llama-3.1-Nemotron-70B-Instruct (build.nvidia.com)
- Llama 3.1 70B compressed by 6.4x using AQLM-PV, now released (huggingface.co)
- Cerebras: 450 tokens/sec on Llama 3.1 70B (www.theregister.com)
- Reflection Llama-3.1 70B: Testing and Summary of What We Know (www.datacamp.com)
- HuggingChat: Chat with Llama 3.1 (70B and 405B) (huggingface.co)
- Llama-3.1-Nemotron-70B-Instruct Model (build.nvidia.com)
- Reflection Llama 3.1 70B (artificialanalysis.ai)
- Show HN: Open-source study to measure end user satisfaction levels with LLMs (open-llm-initiative.com)
- Show HN: I built a website where you can easily fine-tune Llama 3.1 models (www.tunellama.com)
- Show HN: Mixlayer – code and deploy LLM prompts using JavaScript (www.mixlayer.com)
- Open-source 70B model surpasses GPT-4o and Claude 3.5 on Arena Hard (huggingface.co)
- Nvidia Nemotron (huggingface.co)