Hacker Newsnew | past | comments | ask | show | jobs | submit | fromlogin
Just-in-time and distributed task representations in language models (arxiv.org)
1 point by PaulHoule 9 days ago | past | discuss
The Illusion of Readiness: Stress Testing Frontier Models on Medical Benchmarks (arxiv.org)
6 points by mellosouls 9 days ago | past | discuss
Report on the 63rd Annual International Mathematical Olympiad (arxiv.org)
1 point by bikenaga 9 days ago | past | discuss
A fast, strong, topologically meaningful and fun knot invariant (arxiv.org)
52 points by bikenaga 9 days ago | past | 7 comments
Quantized LLMss in Biomedical Natural Language Processing (arxiv.org)
1 point by PaulHoule 9 days ago | past | discuss
Ransomware 3.0: Self-Composing and LLM-Orchestrated (arxiv.org)
1 point by PaulHoule 9 days ago | past | discuss
Multi-Modal vs. Text-Based: Benchmarking LLM Strategies for Invoice Processing (arxiv.org)
1 point by PaulHoule 9 days ago | past | discuss
LIMI: Less Is More for Agency (arxiv.org)
1 point by pella 10 days ago | past | discuss
Design, analysis, and manufacturing of microstructured blade-like geometries (arxiv.org)
2 points by PaulHoule 10 days ago | past | discuss
Fill probability estimates in institutional bond trading with quantum computers (arxiv.org)
2 points by polrjoy 10 days ago | past | 2 comments
Weak Memory Model Formalisms: Introduction and Survey (arxiv.org)
2 points by matt_d 10 days ago | past | discuss
Why Language Models Hallucinate (arxiv.org)
1 point by ummonk 10 days ago | past | discuss
GPU Implementation of Second-Order Linear and Nonlinear Programming Solvers (arxiv.org)
1 point by adgjlsfhk1 10 days ago | past | 1 comment
Bluffing in Scrabble (arxiv.org)
8 points by fanf2 10 days ago | past | discuss
Opal: An Operator Algebra View of RLHF (arxiv.org)
2 points by P_qRs 10 days ago | past | discuss
Effects of the entropy source on Monte Carlo simulations (arxiv.org)
2 points by bob1029 10 days ago | past | discuss
Enabling an Ecosystem of Personalized and Interoperable Social Applications (arxiv.org)
2 points by sportdeath 10 days ago | past | discuss
Space Mission Options for Reconnaissance and Mitigation of Asteroid 2024 YR4 [pdf] (arxiv.org)
2 points by croes 10 days ago | past | discuss
Discrete Diffusion in Large Language and Multimodal Models: A Survey (arxiv.org)
2 points by NeoInHacker 10 days ago | past | discuss
Personalised Pricing: The Demise of the Fixed Price? (arxiv.org)
2 points by Hard_Space 10 days ago | past | discuss
OpenFake: An Open Dataset and Platform Toward Large-Scale Deepfake Detection (arxiv.org)
4 points by pykello 11 days ago | past | discuss
Space Mission Options for Mitigation of Asteroid 2024 YR4 (arxiv.org)
4 points by geox 11 days ago | past | discuss
DeepMind Paper on Virtual Agent Economies (arxiv.org)
2 points by nanfinitum 11 days ago | past | discuss
Seeing Is Deceiving:Mirror-Based Lidar Spoofing for Autonomous Vehicle Deception (arxiv.org)
1 point by bikenaga 11 days ago | past | discuss
The Illusion of Diminishing Returns: Measuring Long Horizon Execution in LLMs (arxiv.org)
1 point by mathattack 11 days ago | past | discuss
Are elites meritocratic and efficiency-seeking? Evidence from MBA students (arxiv.org)
103 points by bikenaga 11 days ago | past | 73 comments
Pre-training under infinite compute (arxiv.org)
3 points by jonbaer 11 days ago | past | discuss
Hyb Error: A Hybrid Metric Combining Absolute and Relative Errors (2024) (arxiv.org)
19 points by ncruces 11 days ago | past | 2 comments
The illusion of diminishing returns in LLM progress (arxiv.org)
3 points by SCEtoAux 12 days ago | past | discuss
Learn Your Way: Towards an AI-Augmented Textbook, Google Research (arxiv.org)
3 points by walterbell 12 days ago | past | discuss

Consider applying for YC's Winter 2026 batch! Applications are open till Nov 10

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: