Hacker News posts about GPT-2
- Every attention weight matrix in GPT-2, visualized (amanvir.com)
- Show HN: Kernel-level LLM inference via /dev/llm0 (github.com)
- Reproducing GPT-2 in llm.c (github.com)
- A ChatGPT clone, in 3000 bytes of C, backed by GPT-2 (2023) (nicholas.carlini.com)
- The Illustrated GPT-2: Visualizing Transformer Language Models (2019) (jalammar.github.io)
- Show HN: Fully client-side GPT2 prediction visualizer (perplexity.vercel.app)
- Build and train GPT-2 from scratch using PyTorch (differ.blog)
- C++ GPT-2 inference engine (github.com)
- Fast GPT-2 inference written in Fortran (github.com)
- Compare how GPT-2, 3, 3.5 and 4 answer the same questions (theaidigest.org)
- Spreadsheets are all you need: Understanding GPT2 and Transformers (spreadsheets-are-all-you-need.ai)
- Full forward pass of GPT-2 in one file of pure CUDA (github.com)
- Let's reproduce GPT-2 (124M) [video] (www.youtube.com)
- GPT2-Chatbot removed from LMSYS (lmsys.org)
- What will GPT-2030 look like? (www.lesswrong.com)
- Why didn't we get GPT-2 in 2005? (dynomight.net)
- Let's reproduce GPT-2 (124M) (twitter.com)
- Steering GPT-2-XL by adding an activation vector (www.lesswrong.com)
- WebGPT: Run GPT-2 in the browser with WebGPU (github.com)
- Zig GPT-2 inference engine (github.com)
- GPT-2B-001 (huggingface.co)