Submissions from arxiv.org

		Provable scaling laws of feature emergence from learning dynamics of grokking (arxiv.org)
		29 points by sva_ 5 days ago \| past \| discuss
		How to inject knowledge efficiently? Knowledge infusion scaling law for LLMs (arxiv.org)
		104 points by PaulHoule 5 days ago \| past \| 35 comments
		Recursive self-aggregation unlocks deep thinking in large language models (arxiv.org)
		1 point by ivan_ah 5 days ago \| past \| 1 comment
		Scaling Test Time Compute (arxiv.org)
		2 points by math-llm-agi 5 days ago \| past \| discuss
		Physics of Learning: A Lagrangian perspective to different learning paradigms (arxiv.org)
		3 points by Anon84 5 days ago \| past \| discuss
		The Missing Link Between the Transformer and Models of the Brain (arxiv.org)
		2 points by dominik-m 5 days ago \| past \| discuss
		Characterizing Realistic Workloads on a Commercial Compute-in-SRAM Device (arxiv.org)
		7 points by PaulHoule 6 days ago \| past \| 2 comments
		Pretraining Large Language Models with NVFP4 (arxiv.org)
		1 point by aportnoy 6 days ago \| past \| discuss
		AegisShield: Democratizing Cyber Threat Modeling with Generative AI (arxiv.org)
		1 point by PaulHoule 6 days ago \| past \| discuss
		Pretraining Under Infinite Compute (arxiv.org)
		3 points by jedharris 6 days ago \| past \| 1 comment
		The AI Productivity Index (Apex) (arxiv.org)
		1 point by paulpauper 6 days ago \| past \| discuss
		Who's Advertising to Your AI? (arxiv.org)
		1 point by zerolayers 6 days ago \| past \| 1 comment
		Aristotle: IMO-Level Automated Theorem Proving (arxiv.org)
		3 points by jasondavies 6 days ago \| past \| discuss
		xLSTM Scaling Laws: Competitive Performance with Linear Time-Complexity (arxiv.org)
		1 point by lairv 6 days ago \| past \| discuss
		Thoughtbubbles: An Unsupervised Method for Parallel Thinking in Latent Space (arxiv.org)
		4 points by shetaye 6 days ago \| past \| 1 comment
		A Pipeline for Continual Learning Without Catastrophic Forgetting in LLMs (arxiv.org)
		2 points by PaulHoule 6 days ago \| past \| discuss
		Room-Temperature Superconductivity at High-Pressure Conditions (arxiv.org)
		3 points by P_qRs 6 days ago \| past \| 1 comment
		Universal Gradient Methods in Nonlinear Optimization (arxiv.org)
		5 points by fofoz 6 days ago \| past \| discuss
		Delta-Code: How Does RL Unlock and Transfer New Programming Algorithms in LLMs? (arxiv.org)
		1 point by sonabinu 6 days ago \| past \| discuss
		Redshifted civilizations, galactic empires, and the Fermi paradox (arxiv.org)
		8 points by arbesman 6 days ago \| past \| discuss
		1-Bit RIS-Aided Index Modulation with Quantum Annealing (arxiv.org)
		2 points by donutloop 6 days ago \| past \| discuss
		Folding lattice proteins confined on minimal grids using a quantum encoding (arxiv.org)
		2 points by donutloop 6 days ago \| past \| discuss
		Comparing Quantum Annealing and BF-DCQO (arxiv.org)
		2 points by donutloop 6 days ago \| past \| discuss
		Security Degradation in Iterative AI Code Generation (arxiv.org)
		1 point by chillax 6 days ago \| past \| discuss
		Measurement and Patient Modeling for Model-Mediated Tele-Ultrasound (arxiv.org)
		2 points by PaulHoule 7 days ago \| past \| discuss
		Shelby: Decentralized hot storage protocol competitive with AWS S3 performance (arxiv.org)
		3 points by todsacerdoti 7 days ago \| past \| discuss
		Dragon Hatchling: The Missing Link Between the Transformer and Models of the Brain (arxiv.org)
		6 points by polskibus 7 days ago \| past \| discuss
		Implementing OpenMP for Zig to enable its use in HPC context (arxiv.org)
		7 points by cyber1 7 days ago \| past \| discuss
		Empirical Study of Pull Requests on GitHub (arxiv.org)
		1 point by nkko 7 days ago \| past \| discuss
		"We Have No Idea How Models Will Behave in Production Until Production": ML Ops (arxiv.org)
		6 points by todsacerdoti 7 days ago \| past \| discuss