Hacker News
Provable scaling laws of feature emergence from learning dynamics of grokking (arxiv.org)
29 points by sva_ 5 days ago | past | discuss
How to inject knowledge efficiently? Knowledge infusion scaling law for LLMs (arxiv.org)
104 points by PaulHoule 5 days ago | past | 35 comments
Recursive self-aggregation unlocks deep thinking in large language models (arxiv.org)
1 point by ivan_ah 5 days ago | past | 1 comment
Scaling Test Time Compute (arxiv.org)
2 points by math-llm-agi 5 days ago | past | discuss
Physics of Learning: A Lagrangian perspective to different learning paradigms (arxiv.org)
3 points by Anon84 5 days ago | past | discuss
The Missing Link Between the Transformer and Models of the Brain (arxiv.org)
2 points by dominik-m 5 days ago | past | discuss
Characterizing Realistic Workloads on a Commercial Compute-in-SRAM Device (arxiv.org)
7 points by PaulHoule 6 days ago | past | 2 comments
Pretraining Large Language Models with NVFP4 (arxiv.org)
1 point by aportnoy 6 days ago | past | discuss
AegisShield: Democratizing Cyber Threat Modeling with Generative AI (arxiv.org)
1 point by PaulHoule 6 days ago | past | discuss
Pretraining Under Infinite Compute (arxiv.org)
3 points by jedharris 6 days ago | past | 1 comment
The AI Productivity Index (Apex) (arxiv.org)
1 point by paulpauper 6 days ago | past | discuss
Who's Advertising to Your AI? (arxiv.org)
1 point by zerolayers 6 days ago | past | 1 comment
Aristotle: IMO-Level Automated Theorem Proving (arxiv.org)
3 points by jasondavies 6 days ago | past | discuss
xLSTM Scaling Laws: Competitive Performance with Linear Time-Complexity (arxiv.org)
1 point by lairv 6 days ago | past | discuss
Thoughtbubbles: An Unsupervised Method for Parallel Thinking in Latent Space (arxiv.org)
4 points by shetaye 6 days ago | past | 1 comment
A Pipeline for Continual Learning Without Catastrophic Forgetting in LLMs (arxiv.org)
2 points by PaulHoule 6 days ago | past | discuss
Room-Temperature Superconductivity at High-Pressure Conditions (arxiv.org)
3 points by P_qRs 6 days ago | past | 1 comment
Universal Gradient Methods in Nonlinear Optimization (arxiv.org)
5 points by fofoz 6 days ago | past | discuss
Delta-Code: How Does RL Unlock and Transfer New Programming Algorithms in LLMs? (arxiv.org)
1 point by sonabinu 6 days ago | past | discuss
Redshifted civilizations, galactic empires, and the Fermi paradox (arxiv.org)
8 points by arbesman 6 days ago | past | discuss
1-Bit RIS-Aided Index Modulation with Quantum Annealing (arxiv.org)
2 points by donutloop 6 days ago | past | discuss
Folding lattice proteins confined on minimal grids using a quantum encoding (arxiv.org)
2 points by donutloop 6 days ago | past | discuss
Comparing Quantum Annealing and BF-DCQO (arxiv.org)
2 points by donutloop 6 days ago | past | discuss
Security Degradation in Iterative AI Code Generation (arxiv.org)
1 point by chillax 6 days ago | past | discuss
Measurement and Patient Modeling for Model-Mediated Tele-Ultrasound (arxiv.org)
2 points by PaulHoule 7 days ago | past | discuss
Shelby: Decentralized hot storage protocol competitive with AWS S3 performance (arxiv.org)
3 points by todsacerdoti 7 days ago | past | discuss
Dragon Hatchling: The Missing Link Between the Transformer and Models of the Brain (arxiv.org)
6 points by polskibus 7 days ago | past | discuss
Implementing OpenMP for Zig to enable its use in HPC context (arxiv.org)
7 points by cyber1 7 days ago | past | discuss
Empirical Study of Pull Requests on GitHub (arxiv.org)
1 point by nkko 7 days ago | past | discuss
"We Have No Idea How Models Will Behave in Production Until Production": ML Ops (arxiv.org)
6 points by todsacerdoti 7 days ago | past | discuss
