Top Hackernews posts from arxiv.org

Successful room temperature ambient-pressure magnetic levitation of LK-99 (arxiv.org)

2408 points by spekcular over 2 years | 1199 comments
The first room-temperature ambient-pressure superconductor? (arxiv.org)

1690 points by Akronymus over 2 years | 877 comments
The Era of 1-bit LLMs: ternary parameters for cost-effective computing (arxiv.org)

1040 points by fgfm about 2 years | 447 comments
Observation of zero resistance above 100 K in Pb₁₀₋ₓCuₓ(PO₄)₆O (arxiv.org)

779 points by segfaultbuserr over 2 years | 372 comments
Possible Meissner effect near room temperature: copper-substituted lead apatite (arxiv.org)

729 points by zaikunzhang about 2 years | 318 comments
A New Coefficient of Correlation (arxiv.org)

682 points by malshe about 4 years | 158 comments
Is this the simplest (and most surprising) sorting algorithm? (arxiv.org)

621 points by ColinWright over 4 years | 318 comments
Semiconducting Transport in LK99 reproduction attempt (arxiv.org)

613 points by spekcular over 2 years | 255 comments
Othello Is Solved? (arxiv.org)

607 points by Tepix over 2 years | 268 comments
Crypto Wash Trading (arxiv.org)

572 points by paulpauper over 4 years | 299 comments
Electronic Structure of LK-99 (arxiv.org)

551 points by spekcular over 2 years | 432 comments
Room-Temperature Ambient-Pressure Superconductor LK-99 preprint revision 2 (arxiv.org)

476 points by lnyan over 2 years | 290 comments
OPT: Open Pre-trained Transformer Language Models (arxiv.org)

461 points by MasterScrat almost 4 years | 224 comments
Is the emergence of life an expected phase transition in the evolving universe? (arxiv.org)

451 points by harscoat about 2 years | 380 comments
Mathematical Introduction to Deep Learning: Methods, Implementations, and Theory (arxiv.org)

450 points by Anon84 about 2 years | 154 comments
Surveilling the masses with wi-fi-based positioning systems (arxiv.org)

444 points by belter almost 2 years | 142 comments
Phi-3 Technical Report (arxiv.org)

411 points by varunvummadi almost 2 years | 130 comments
TimeGPT-1 (arxiv.org)

411 points by PaulHoule over 2 years | 132 comments
OpenVoice: Versatile Instant Voice Cloning (arxiv.org)

399 points by saeedesmaili about 2 years | 192 comments
Origin of correlated isolated flat bands in LK99 (arxiv.org)

397 points by lawrenceyan over 2 years | 203 comments
Generative Agents: Interactive Simulacra of Human Behavior (arxiv.org)

391 points by mmq almost 3 years | 252 comments
Better Call GPT: Comparing large language models against lawyers [pdf] (arxiv.org)

389 points by vinnyglennon about 2 years | 264 comments
Fair coins tend to land on the same side they started (arxiv.org)

377 points by fbartos over 2 years | 265 comments
Simple tasks showing reasoning breakdown in state-of-the-art LLMs (arxiv.org)

375 points by tosh almost 2 years | 380 comments
Pen and paper exercises in machine learning (2021) (arxiv.org)

371 points by beefman over 3 years | 55 comments
Bringing GNU Emacs to Native Code (2020) (arxiv.org)

369 points by textread almost 5 years | 130 comments
Player of Games (arxiv.org)

364 points by vatueil over 4 years | 231 comments
An Empirical Study and Evaluation of Modern CAPTCHAs (arxiv.org)

362 points by vincent_s over 2 years | 329 comments
Mixtral 8x7B: A sparse Mixture of Experts language model (arxiv.org)

359 points by ignoramous about 2 years | 150 comments
RWKV: Reinventing RNNs for the Transformer Era (arxiv.org)

358 points by ianbutler almost 3 years | 171 comments
Are Open-Source Large Language Models Catching Up? (arxiv.org)

342 points by rkwz over 2 years | 212 comments
Mission to reach and operate at the focal region of the solar gravitational lens (arxiv.org)

340 points by WithinReason over 3 years | 141 comments
Ferromagnetic half levitation of LK-99-like synthetic samples (arxiv.org)

339 points by platz over 2 years | 314 comments
Website Fingerprinting on Early QUIC Traffic (arxiv.org)

338 points by pueblito about 5 years | 123 comments
GPT detectors are biased against non-native English writers (arxiv.org)

338 points by giuliomagnifico almost 3 years | 274 comments
Unlimiformer: Long-Range Transformers with Unlimited Length Input (arxiv.org)

335 points by shishy almost 3 years | 101 comments
Statistical Analysis shows Echos process voice to serve ads (arxiv.org)

318 points by BeniBoy almost 4 years | 118 comments
StarCoder and StarCoderBase: 15.5B parameter models with 8K context length (arxiv.org)

317 points by belter almost 3 years | 162 comments
Why do tree-based models still outperform deep learning on tabular data? (arxiv.org)

315 points by isolli over 3 years | 139 comments
QLoRA: Efficient Finetuning of Quantized LLMs (arxiv.org)

315 points by Garcia98 almost 3 years | 107 comments
Beyond A*: Better Planning with Transformers (arxiv.org)

313 points by jonbaer about 2 years | 120 comments
SATAn: Air-Gap Exfiltration Attack via Radio Signals from SATA Cables (arxiv.org)

312 points by PaulHoule over 3 years | 122 comments
Orca 2: Teaching Small Language Models How to Reason (arxiv.org)

310 points by fgfm over 2 years | 80 comments
What if an SQL statement returned a database? (arxiv.org)

309 points by matt_d over 2 years | 159 comments
Hallucination is inevitable: An innate limitation of large language models (arxiv.org)

308 points by louthy about 2 years | 474 comments
“I’ll Finish It This Week” and Other Lies (arxiv.org)

306 points by lnwlebjel almost 5 years | 115 comments
Infinite Photorealistic Worlds Using Procedural Generation (arxiv.org)

306 points by cpeterso almost 3 years | 76 comments
Chameleon: Meta’s New Multi-Modal LLM (arxiv.org)

304 points by gabrielbirnbaum almost 2 years | 40 comments
Better and Faster Large Language Models via Multi-Token Prediction (arxiv.org)

302 points by jasondavies almost 2 years | 128 comments
Automated Unit Test Improvement Using Large Language Models at Meta (arxiv.org)

301 points by mfiguiere about 2 years | 188 comments
Exponentially faster language modelling (arxiv.org)

301 points by born-jre over 2 years | 137 comments
Information Theory: A Tutorial Introduction (arxiv.org)

297 points by teleforce over 4 years | 26 comments
σ-GPTs: A new approach to autoregressive models (arxiv.org)

293 points by mehulashah almost 2 years | 93 comments
MusicLM: Generating music from text (arxiv.org)

291 points by georgehill about 3 years | 107 comments
DeepMind achieves SOTA image recognition with 8.7x faster training (arxiv.org)

291 points by highfrequency about 5 years | 83 comments
How is ChatGPT's behavior changing over time? (arxiv.org)

289 points by tim_sw over 2 years | 178 comments
More Agents Is All You Need: LLMs performance scales with the number of agents (arxiv.org)

288 points by TaurenHunter almost 2 years | 206 comments
faulTPM: Exposing AMD fTPMs' Deepest Secrets (arxiv.org)

287 points by kerm1t almost 3 years | 262 comments
Graph of Thoughts: Solving Elaborate Problems with Large Language Models (arxiv.org)

283 points by jonbaer over 2 years | 47 comments
Mixture-of-Depths: Dynamically allocating compute in transformers (arxiv.org)

281 points by milliondreams almost 2 years | 83 comments
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking (arxiv.org)

280 points by hackerlight about 2 years | 264 comments
Scaling Transformer to 1M tokens and beyond with RMT (arxiv.org)

277 points by panabee almost 3 years | 132 comments
Large language models lack deep insights or a theory of mind (arxiv.org)

277 points by mnode over 2 years | 261 comments
The Modern Mathematics of Deep Learning (arxiv.org)

276 points by tims457 almost 5 years | 70 comments
Planting Undetectable Backdoors in Machine Learning Models (arxiv.org)

275 points by belter almost 4 years | 59 comments
Recursively summarizing enables long-term dialogue memory in LLMs (arxiv.org)

273 points by PaulHoule over 2 years | 152 comments
Conway's Game of Life is omniperiodic (arxiv.org)

272 points by sohkamyung over 2 years | 100 comments
A formula for the nth digit of 𝜋 and 𝜋^n (arxiv.org)

268 points by georgehill about 3 years | 133 comments
Do not rug on me: Zero-dimensional Scam Detection (arxiv.org)

267 points by churchill over 3 years | 154 comments
Mistral 7B (arxiv.org)

267 points by fgfm over 2 years | 123 comments
Llemma: An Open Language Model for Mathematics (arxiv.org)

267 points by AlphaWeaver over 2 years | 46 comments
Gamification affects software developers: Cautionary evidence from GitHub (arxiv.org)

264 points by edward over 3 years | 304 comments
Bytes are all you need: Transformers operating directly on file bytes (arxiv.org)

263 points by pmoriarty almost 3 years | 96 comments
Textbooks are all you need (arxiv.org)

256 points by foobarqux over 2 years | 106 comments
An Introduction to Graph Theory (arxiv.org)

255 points by Anon84 over 2 years | 26 comments
Transformers as Support Vector Machines (arxiv.org)

251 points by fofoz over 2 years | 156 comments
Python type hints are Turing complete (arxiv.org)

246 points by nemoniac over 3 years | 135 comments
Bluesky and the AT Protocol: Usable decentralized social media (arxiv.org)

245 points by lawgimenez about 2 years | 276 comments
HuggingGPT: Solving AI tasks with ChatGPT and its friends in HuggingFace (arxiv.org)

243 points by r_singh almost 3 years | 267 comments
ChatGPT outperforms crowd-workers for text-annotation tasks (arxiv.org)

240 points by georgehill almost 3 years | 205 comments
Factoring 2048 RSA integers in 177 days with 13436 qubits and a multimode memory (arxiv.org)

240 points by athul7744 almost 5 years | 144 comments
Grokked Transformers Are Implicit Reasoners (arxiv.org)

239 points by jasondavies almost 2 years | 61 comments
LLMs cannot find reasoning errors, but can correct them (arxiv.org)

239 points by koie over 2 years | 142 comments
Ultra Fast Bert (arxiv.org)

237 points by gyre007 over 2 years | 1 comments
Multiplying Matrices Without Multiplying (arxiv.org)

235 points by moinnadeem over 4 years | 122 comments
Regularized Newton Method with Global $O(1/k^2)$ Convergence (arxiv.org)

235 points by ColinWright over 4 years | 103 comments
Scaling Transformers to 1B Tokens (arxiv.org)

234 points by mottiden over 2 years | 68 comments
Thermodynamic Linear Algebra (arxiv.org)

234 points by aifer4 over 2 years | 55 comments
New attention mechanisms that outperform standard multi-head attention (arxiv.org)

233 points by snats almost 2 years | 49 comments
Matrix multiplication using only addition (arxiv.org)

233 points by daniel-cussen over 2 years | 108 comments
Catala: A Programming Language for the Law (arxiv.org)

232 points by todsacerdoti almost 5 years | 126 comments
Maximum Flow and Minimum-Cost Flow in Almost-Linear Time (arxiv.org)

230 points by tarxzvf almost 4 years | 40 comments
MemGPT: Towards LLMs as Operating Systems (arxiv.org)

225 points by belter over 2 years | 106 comments
Neural Network Diffusion (arxiv.org)

223 points by vagabund about 2 years | 86 comments
The Principles of Deep Learning Theory (arxiv.org)

221 points by Anon84 almost 4 years | 139 comments
Toolformer: Language Models Can Teach Themselves to Use Tools (arxiv.org)

220 points by jasondavies about 3 years | 45 comments
Wikidata, with 12B facts, can ground LLMs to improve their factuality (arxiv.org)

219 points by raybb over 2 years | 84 comments
Accidentally quadratic: When Python is faster than C++ (arxiv.org)

218 points by mehrdadn about 5 years | 213 comments
Stealing Part of a Production Language Model (arxiv.org)

218 points by alphabetting about 2 years | 51 comments
How to fit any dataset with a single parameter (arxiv.org)

217 points by tambourine_man over 4 years | 146 comments