| | A (Long) Peek into Reinforcement Learning (lilianweng.github.io) |
|
160 points by Brysonbw 29 days ago | past | 16 comments
|
| | Diffusion Models for Video Generation (lilianweng.github.io) |
|
2 points by ydnyshhh 3 months ago | past
|
| | Reward Hacking in Reinforcement Learning (lilianweng.github.io) |
|
1 point by samanthasu 4 months ago | past | 1 comment
|
| | Reward Hacking in Reinforcement Learning (lilianweng.github.io) |
|
3 points by tosh 4 months ago | past
|
| | Reward Hacking in Reinforcement Learning (lilianweng.github.io) |
|
4 points by swyx 4 months ago | past
|
| | Lil’Log from OpenAI VP (lilianweng.github.io) |
|
1 point by acmerfight 5 months ago | past
|
| | Extrinsic Hallucinations in LLMs (lilianweng.github.io) |
|
1 point by mooreds 5 months ago | past
|
| | What Are Diffusion Models? (lilianweng.github.io) |
|
1 point by Anon84 6 months ago | past
|
| | Some Math Behind Neural Tangent Kernel (lilianweng.github.io) |
|
2 points by reqo 7 months ago | past
|
| | Extrinsic Hallucinations in LLMs (lilianweng.github.io) |
|
1 point by gregzeng95 8 months ago | past
|
| | Extrinsic Hallucinations in LLMs (lilianweng.github.io) |
|
1 point by luu 9 months ago | past
|
| | Extrinsic Hallucinations in LLMs (lilianweng.github.io) |
|
1 point by sebg 9 months ago | past
|
| | Extrinsic Hallucinations in LLMs (lilianweng.github.io) |
|
3 points by RevoGen 9 months ago | past
|
| | Diffusion Models for Video Generation (lilianweng.github.io) |
|
1 point by moks 12 months ago | past
|
| | Diffusion Models for Video Generation (lilianweng.github.io) |
|
1 point by alexmolas on April 17, 2024 | past
|
| | Diffusion Models for Video Generation (lilianweng.github.io) |
|
2 points by TheAlchemist on April 17, 2024 | past
|
| | Thinking about high-quality human data (lilianweng.github.io) |
|
103 points by tim_sw on Feb 9, 2024 | past | 4 comments
|
| | Meta-Learning: Learning to Learn Fast (lilianweng.github.io) |
|
3 points by jxmorris12 on Feb 8, 2024 | past
|
| | Exploration Strategies in Deep Reinforcement Learning (2020) (lilianweng.github.io) |
|
1 point by rzk on Jan 31, 2024 | past
|
| | Attention Mechanism Explained (lilianweng.github.io) |
|
2 points by ashvanth on Nov 17, 2023 | past | 1 comment
|
| | Adversarial Attacks on LLMs (lilianweng.github.io) |
|
1 point by georgehill on Nov 10, 2023 | past
|
| | Controllable Neural Text Generation (2021) (lilianweng.github.io) |
|
1 point by typicalHNuser on Oct 30, 2023 | past
|
| | Attention? Attention (lilianweng.github.io) |
|
1 point by todsacerdoti on Sept 20, 2023 | past
|
| | LLM Powered Autonomous Agents (lilianweng.github.io) |
|
285 points by DanielKehoe on June 27, 2023 | past | 176 comments
|
| | Prompt Engineering: Steer a large pretrained language model to do what you want (lilianweng.github.io) |
|
190 points by sebg on March 20, 2023 | past | 49 comments
|
| | How to train large models on many GPUs? (2021) (lilianweng.github.io) |
|
216 points by eternalban on Feb 11, 2023 | past | 33 comments
|
| | The Transformer Family Version 2.0 (lilianweng.github.io) |
|
3 points by lostConnection on Jan 29, 2023 | past
|
| | The Transformer Family (lilianweng.github.io) |
|
254 points by alexmolas on Jan 29, 2023 | past | 46 comments
|
| | The Transformer Family Version 2.0 (lilianweng.github.io) |
|
2 points by sadiq on Jan 28, 2023 | past
|
| | Large Transformer Model Inference Optimization (lilianweng.github.io) |
|
136 points by headalgorithm on Jan 20, 2023 | past | 20 comments
|