Submissions from arxiv.org

		Just-in-time and distributed task representations in language models (arxiv.org)
		1 point by PaulHoule 9 days ago \| past \| discuss
		The Illusion of Readiness: Stress Testing Frontier Models on Medical Benchmarks (arxiv.org)
		6 points by mellosouls 9 days ago \| past \| discuss
		Report on the 63rd Annual International Mathematical Olympiad (arxiv.org)
		1 point by bikenaga 9 days ago \| past \| discuss
		A fast, strong, topologically meaningful and fun knot invariant (arxiv.org)
		52 points by bikenaga 9 days ago \| past \| 7 comments
		Quantized LLMs in Biomedical Natural Language Processing (arxiv.org)
		1 point by PaulHoule 9 days ago \| past \| discuss
		Ransomware 3.0: Self-Composing and LLM-Orchestrated (arxiv.org)
		1 point by PaulHoule 9 days ago \| past \| discuss
		Multi-Modal vs. Text-Based: Benchmarking LLM Strategies for Invoice Processing (arxiv.org)
		1 point by PaulHoule 9 days ago \| past \| discuss
		LIMI: Less Is More for Agency (arxiv.org)
		1 point by pella 10 days ago \| past \| discuss
		Design, analysis, and manufacturing of microstructured blade-like geometries (arxiv.org)
		2 points by PaulHoule 10 days ago \| past \| discuss
		Fill probability estimates in institutional bond trading with quantum computers (arxiv.org)
		2 points by polrjoy 10 days ago \| past \| 2 comments
		Weak Memory Model Formalisms: Introduction and Survey (arxiv.org)
		2 points by matt_d 10 days ago \| past \| discuss
		Why Language Models Hallucinate (arxiv.org)
		1 point by ummonk 10 days ago \| past \| discuss
		GPU Implementation of Second-Order Linear and Nonlinear Programming Solvers (arxiv.org)
		1 point by adgjlsfhk1 10 days ago \| past \| 1 comment
		Bluffing in Scrabble (arxiv.org)
		8 points by fanf2 10 days ago \| past \| discuss
		Opal: An Operator Algebra View of RLHF (arxiv.org)
		2 points by P_qRs 10 days ago \| past \| discuss
		Effects of the entropy source on Monte Carlo simulations (arxiv.org)
		2 points by bob1029 10 days ago \| past \| discuss
		Enabling an Ecosystem of Personalized and Interoperable Social Applications (arxiv.org)
		2 points by sportdeath 10 days ago \| past \| discuss
		Space Mission Options for Reconnaissance and Mitigation of Asteroid 2024 YR4 [pdf] (arxiv.org)
		2 points by croes 10 days ago \| past \| discuss
		Discrete Diffusion in Large Language and Multimodal Models: A Survey (arxiv.org)
		2 points by NeoInHacker 10 days ago \| past \| discuss
		Personalised Pricing: The Demise of the Fixed Price? (arxiv.org)
		2 points by Hard_Space 10 days ago \| past \| discuss
		OpenFake: An Open Dataset and Platform Toward Large-Scale Deepfake Detection (arxiv.org)
		4 points by pykello 11 days ago \| past \| discuss
		Space Mission Options for Mitigation of Asteroid 2024 YR4 (arxiv.org)
		4 points by geox 11 days ago \| past \| discuss
		DeepMind Paper on Virtual Agent Economies (arxiv.org)
		2 points by nanfinitum 11 days ago \| past \| discuss
		Seeing Is Deceiving: Mirror-Based Lidar Spoofing for Autonomous Vehicle Deception (arxiv.org)
		1 point by bikenaga 11 days ago \| past \| discuss
		The Illusion of Diminishing Returns: Measuring Long Horizon Execution in LLMs (arxiv.org)
		1 point by mathattack 11 days ago \| past \| discuss
		Are elites meritocratic and efficiency-seeking? Evidence from MBA students (arxiv.org)
		103 points by bikenaga 11 days ago \| past \| 73 comments
		Pre-training under infinite compute (arxiv.org)
		3 points by jonbaer 11 days ago \| past \| discuss
		Hyb Error: A Hybrid Metric Combining Absolute and Relative Errors (2024) (arxiv.org)
		19 points by ncruces 11 days ago \| past \| 2 comments
		The illusion of diminishing returns in LLM progress (arxiv.org)
		3 points by SCEtoAux 12 days ago \| past \| discuss
		Learn Your Way: Towards an AI-Augmented Textbook, Google Research (arxiv.org)
		3 points by walterbell 12 days ago \| past \| discuss