you mean this http://karpathy.github.io/2012/10/22/state-of-computer-visio...?
Very funny to revisit. It's astounding how primitive our tools were compared to now. It feels like the first flight of the Wright Brothers vs a jetliner. ImageNet was the new frontier. Simpler times...
I think the interesting thing here is the very, very surprising result that LLMs turned out to be capable of abstracting the things in the second-to-last paragraph purely from amalgamated written human descriptions of experience.
It's the thing most people, even in this thread, don't seem to realize has emerged from research over the past year.
Give a Markov chain a lot of text about fishing and it will tell you about fish. Give GPT a lot of text about fishing and it turns out that it will probably learn how to fish.
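To make the Markov-chain half of that contrast concrete, here's a toy bigram chain (the corpus and function names are invented for illustration). All it stores is which word follows which, so it can only recombine surface text; nothing in it could represent fishing itself:

```python
import random
from collections import defaultdict

def train(text):
    # Bigram table: for each word, the list of words observed after it.
    words = text.split()
    table = defaultdict(list)
    for a, b in zip(words, words[1:]):
        table[a].append(b)
    return table

def generate(table, start, n=10, seed=0):
    # Walk the table: each next word depends only on the current word.
    random.seed(seed)
    out = [start]
    for _ in range(n):
        followers = table.get(out[-1])
        if not followers:
            break
        out.append(random.choice(followers))
    return " ".join(out)

corpus = "cast the line wait for a bite reel in the fish cast the line again"
table = train(corpus)
print(generate(table, "cast"))
```

Every word it emits was seen verbatim in the training text, and the only "knowledge" is one-step word adjacency. That's the gap the comment is pointing at: the surprise is that transformer LMs trained on next-token prediction appear to go beyond this kind of surface statistics.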
World model representations are occurring in GPT. And people really need to start realizing there's already published research demonstrating that, as it goes a long way toward explaining why the multimodal parts work.