Top Hacker News posts from arxiv.org
- A New Coefficient of Correlation (arxiv.org)
- Othello Is Solved? (arxiv.org)
- Crypto Wash Trading (arxiv.org)
- Electronic Structure of LK-99 (arxiv.org)
- Phi-3 Technical Report (arxiv.org)
- TimeGPT-1 (arxiv.org)
- OpenVoice: Versatile Instant Voice Cloning (arxiv.org)
- Origin of correlated isolated flat bands in LK99 (arxiv.org)
- Bringing GNU Emacs to Native Code (2020) (arxiv.org)
- Player of Games (arxiv.org)
- RWKV: Reinventing RNNs for the Transformer Era (arxiv.org)
- Website Fingerprinting on Early QUIC Traffic (arxiv.org)
- QLoRA: Efficient Finetuning of Quantized LLMs (arxiv.org)
- Beyond A*: Better Planning with Transformers (arxiv.org)
- What if an SQL statement returned a database? (arxiv.org)
- “I’ll Finish It This Week” and Other Lies (arxiv.org)
- Chameleon: Meta’s New Multi-Modal LLM (arxiv.org)
- Exponentially faster language modelling (arxiv.org)
- Information Theory: A Tutorial Introduction (arxiv.org)
- σ-GPTs: A new approach to autoregressive models (arxiv.org)
- MusicLM: Generating music from text (arxiv.org)
- How is ChatGPT's behavior changing over time? (arxiv.org)
- faulTPM: Exposing AMD fTPMs' Deepest Secrets (arxiv.org)
- The Modern Mathematics of Deep Learning (arxiv.org)
- Conway's Game of Life is omniperiodic (arxiv.org)
- A formula for the nth digit of 𝜋 and 𝜋^n (arxiv.org)
- Mistral 7B (arxiv.org)
- Llemma: An Open Language Model for Mathematics (arxiv.org)
- Textbooks are all you need (arxiv.org)
- An Introduction to Graph Theory (arxiv.org)
- Transformers as Support Vector Machines (arxiv.org)
- Python type hints are Turing complete (arxiv.org)
- Grokked Transformers Are Implicit Reasoners (arxiv.org)
- Ultra Fast Bert (arxiv.org)
- Multiplying Matrices Without Multiplying (arxiv.org)
- Scaling Transformers to 1B Tokens (arxiv.org)
- Thermodynamic Linear Algebra (arxiv.org)
- Matrix multiplication using only addition (arxiv.org)
- Catala: A Programming Language for the Law (arxiv.org)
- MemGPT: Towards LLMs as Operating Systems (arxiv.org)
- Neural Network Diffusion (arxiv.org)
- The Principles of Deep Learning Theory (arxiv.org)
- Stealing Part of a Production Language Model (arxiv.org)
- How to fit any dataset with a single parameter (arxiv.org)