Hacker News new | past | comments | ask | show | jobs | submit | from login
A (Long) Peek into Reinforcement Learning (lilianweng.github.io)
160 points by Brysonbw 29 days ago | past | 16 comments
Diffusion Models for Video Generation (lilianweng.github.io)
2 points by ydnyshhh 3 months ago | past
Reward Hacking in Reinforcement Learning (lilianweng.github.io)
1 point by samanthasu 4 months ago | past | 1 comment
Reward Hacking in Reinforcement Learning (lilianweng.github.io)
3 points by tosh 4 months ago | past
Reward Hacking in Reinforcement Learning (lilianweng.github.io)
4 points by swyx 4 months ago | past
Lil’Log from OpenAI VP (lilianweng.github.io)
1 point by acmerfight 5 months ago | past
Extrinsic Hallucinations in LLMs (lilianweng.github.io)
1 point by mooreds 5 months ago | past
What Are Diffusion Models? (lilianweng.github.io)
1 point by Anon84 6 months ago | past
Some Math Behind Neural Tangent Kernel (lilianweng.github.io)
2 points by reqo 7 months ago | past
Extrinsic Hallucinations in LLMs (lilianweng.github.io)
1 point by gregzeng95 8 months ago | past
Extrinsic Hallucinations in LLMs (lilianweng.github.io)
1 point by luu 9 months ago | past
Extrinsic Hallucinations in LLMs (lilianweng.github.io)
1 point by sebg 9 months ago | past
Extrinsic Hallucinations in LLMs (lilianweng.github.io)
3 points by RevoGen 9 months ago | past
Diffusion Models for Video Generation (lilianweng.github.io)
1 point by moks 12 months ago | past
Diffusion Models for Video Generation (lilianweng.github.io)
1 point by alexmolas on April 17, 2024 | past
Diffusion Models for Video Generation (lilianweng.github.io)
2 points by TheAlchemist on April 17, 2024 | past
Thinking about high-quality human data (lilianweng.github.io)
103 points by tim_sw on Feb 9, 2024 | past | 4 comments
Meta-Learning: Learning to Learn Fast (lilianweng.github.io)
3 points by jxmorris12 on Feb 8, 2024 | past
Exploration Strategies in Deep Reinforcement Learning (2020) (lilianweng.github.io)
1 point by rzk on Jan 31, 2024 | past
Attention Mechanism Explained (lilianweng.github.io)
2 points by ashvanth on Nov 17, 2023 | past | 1 comment
Adversarial Attacks on LLMs (lilianweng.github.io)
1 point by georgehill on Nov 10, 2023 | past
Controllable Neural Text Generation (2021) (lilianweng.github.io)
1 point by typicalHNuser on Oct 30, 2023 | past
Attention? Attention (lilianweng.github.io)
1 point by todsacerdoti on Sept 20, 2023 | past
LLM Powered Autonomous Agents (lilianweng.github.io)
285 points by DanielKehoe on June 27, 2023 | past | 176 comments
Prompt Engineering: Steer a large pretrained language model to do what you want (lilianweng.github.io)
190 points by sebg on March 20, 2023 | past | 49 comments
How to train large models on many GPUs? (2021) (lilianweng.github.io)
216 points by eternalban on Feb 11, 2023 | past | 33 comments
The Transformer Family Version 2.0 (lilianweng.github.io)
3 points by lostConnection on Jan 29, 2023 | past
The Transformer Family (lilianweng.github.io)
254 points by alexmolas on Jan 29, 2023 | past | 46 comments
The Transformer Family Version 2.0 (lilianweng.github.io)
2 points by sadiq on Jan 28, 2023 | past
Large Transformer Model Inference Optimization (lilianweng.github.io)
136 points by headalgorithm on Jan 20, 2023 | past | 20 comments

Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: