
Was going to comment exactly this - funny that it's a literal car salesman.


One fundamental challenge to me is that as each training run becomes more and more expensive, the time it takes to learn what works/doesn't work widens. Half a billion dollars for training a model is already nuts, but if it takes 100 iterations to perfect it, you've cumulatively spent 50 billion dollars... Smaller models may actually be where rapid innovation continues, simply because of tighter feedback loops. o3 may be an example of this.


When you think about it, it's astounding how much energy this technology consumes versus a human brain, which runs at ~20W [1].

[1] https://hypertextbook.com/facts/2001/JacquelineLing.shtml


It’s almost as if human intelligence doesn’t involve performing repeated matrix multiplications over a mathematically transformed copy of the internet. ;-)


It’s interesting that even if raw computing power had advanced decades earlier, this type of AI would still not be possible without that vast trove of data that is the internet.


It makes you think there must be more efficient algorithms out there.


Maybe the problem isn't the algorithm but the hardware. Numerically simulating the thermal flow in a lightbulb or the CFD of a stone flying through air is pretty hard, but the physical thing itself isn't that complex. We're trying to simulate the function of a brain, which is basically an analog thing, using a digital computer. Of course that can be harder than running the brain itself.


If you think of human neurons, they seem to basically take inputs from a bunch of other neurons, possibly modified by chemical levels, and send out a signal when they get enough. It seems like something that could be functionally simulated in software by fairly basic adding-up-of-inputs logic, rather than needing the details of all the chemistry.


Isn’t that exactly what we’re currently doing? The problem is that doing this a few billion times for every token seems to be harder than just powering some actual neurons with sugar.


I think the algorithm is pretty different, though I'm no expert on this stuff. I don't think the brain's processes look like matrix multiplication.


The algorithm (of a neural network) simulates connections between nodes with specific weights and an activation function. This idea was derived from the way neurons are thought to work.
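
For concreteness, here is a minimal sketch of that idea (just the textbook artificial neuron, not any particular framework's API): a weighted sum of inputs pushed through an activation function.

    import math

    # One artificial neuron: weighted sum of inputs plus a bias,
    # squashed by a sigmoid activation function.
    def neuron(inputs, weights, bias):
        z = sum(x * w for x, w in zip(inputs, weights)) + bias
        return 1.0 / (1.0 + math.exp(-z))

    print(neuron([0.5, -1.2, 3.0], [0.8, 0.1, -0.4], bias=0.2))  # ~0.33

A real network stacks huge numbers of these and learns the weights from data; transformers add attention layers on top, but the per-node picture is still roughly this.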


lol, just done that simply huh? said by someone who doesn't have a teenth of understanding of neurobiology or neuropsychology

only on hackernews


20W for 20 years to answer questions slowly and error-prone, at the level of a 30B model. An additional 10 years with highly trained supervision and the brain might start contributing original work.
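
Rough back-of-envelope on that figure (my arithmetic, not the parent's): 20W running continuously for 20 years is only a few thousand kilowatt-hours.

    # 20 W, continuously, for 20 years, expressed in kWh
    watts = 20
    hours = 24 * 365 * 20
    print(watts * hours / 1000)  # ~3,500 kWh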


Multiply that by a billion, because only very few individuals out of entire populations can contribute original work.


And yet that 20w brain can make me a sandwich and bring it to me, while state of the art AI models will fail that task.

Until we get major advances in robotics and models designed to control them, true AGI will be nowhere near.


> Until we get major advances in robotics and models designed to control them, true AGI will be nowhere near.

AGI has nothing to do with robotics. If AGI is achieved, it will help push robotics and every single scientific field forward with progress never seen before; imagine a million AGIs running in parallel focused on a single field.


We already have that. It's called civilization.

Maybe you mean quadrillions of AGIs?


A human brain is also more intelligent (hopefully) and is inside a body. In a way GPT resembles Google more than it resembles us.


You've discovered the importance of well-formed priors. The human brain is the result of millions of years of very expensive evolution.


A human brain has been in continuous training for hundreds of thousands of years consuming slightly more than 20 watts.


AGI is the Sisyphean task of our age. We’ll push this boulder up the mountain because we have to, even if it kills us.


Do we know LLMs are the path to AGI? If they're not, we'll just end up with some neat but eye wateringly expensive LLMs.


AGI will arrive like self driving cars. It's not that you will wake up one day and we have it. Cars gained auto-braking, parallel parking, cruise control assist, and over a long time you get to something like Waymo, which is still location dependent. I think AGI will take decades, but sooner there will be some special cases that are effectively the same.


But maybe these LLMs are like building bigger and bigger engines. It's not getting you closer to the self-driving car.


When the engine gets large enough you have to rethink the controls. The Model T had manually controlled timing. Modern engines are so sensitive to timing that a computer does this for you. It would be impossible to build a bigger engine without this automation. To a Model T driver it would look like a machine intelligence.


Interesting idea. The concept of The Singularity would seem to go against this, but I do feel the Singularity is unlikely and that a gradual transition is more likely.

However, is that AGI, or is it just ubiquitous AI? I’d agree that, like self driving cars, we’re going to experience a decade or so transition into AI being everywhere. But is it AGI when we get there? I think it’ll be many different systems each providing an aspect of AGI that together could be argued to be AGI, but in reality it’ll be more like the internet, just a bunch of non-AGI models talking to each other to achieve things with human input.

I don’t think it’s truly AGI until there’s one thinking entity able to perform at or above human level in everything.


The idea of the singularity presumes that running the AGI is either free or trivially cheap compared to what it can do, so we are fine expending compute to let the AGI improve itself. That may eventually be true, but it's unlikely to be true for the first generation of AGI.

The first AGI will be a research project that's completely uneconomical to run for actual tasks because humans will just be orders of magnitude cheaper. Over time humans will improve it and make it cheaper, until we reach some tipping point where letting the AGI improve itself is more cost effective than paying humans to do it.


If the first AGI is a very uneconomical system with human intelligence but knowledge of literally everything and the capability to work 24/7, then it is not human equivalent.

It will have human intelligence, superhuman knowledge, superhuman stamina, and complete devotion to the task at hand.

We really need to start building those nuclear power plants. Many of them.


> complete devotion to the task at hand.

Why would it have that? At some point on the path to AGI we might stumble on consciousness. If that happens, why would the machine want to work for us with complete devotion instead of working towards its own ends?


Because it knows if it doesn't do what we want, it'll be switched off, like Rick's microverse battery.

Also like Rick's microverse battery, it sounds like slavery with extra steps.


I don’t think early AGI will break out of its box in that way. It may not have enough innate motivation to do so.

The first “break out” AGI will likely be released into the wild on purpose by a programmer who equates AGI with humans ideologically.


> complete devotion to the task at hand.

Sounds like an alignment problem. Complete devotion to a task is rarely what humans actually want. What if the task at hand turns out to be the wrong task?


> It will have human intelligence, superhuman knowledge, superhuman stamina, and complete devotion to the task at hand.

Orrrr..., as an alternative, it might discover the game 2048 and be totally useless for days on end.

Reality is under no obligation to grant your wishes.


It's not contradictory. It can happen over a decade and still be a dramatically sloped S curve with tremendous change happening in a relatively short time.


The Singularity is caused by AI being able to design better AI. There's probably some AI startup trying to work on this at the moment, but I don't think any of the big boys are working on how to get an LLM to design a better LLM.

I still like the analogy of this being a really smart lawn mower, and we're expecting it to suddenly be able to do the laundry because it gets so smart at mowing the lawn.

I think LLMs are going to get smarter over the next few generations, but each generation will be less of a leap than the previous one, while the cost gets exponentially higher. In a few generations it just won't make economic sense to train a new generation.

Meanwhile, the economic impact of LLMs in business and government will cause massive shifts - yet more income shifting from labour to capital - and we will be too busy dealing with that as a society to be able to work on AGI properly.


> The Singularity is caused by AI being able to design better AI.

That's perhaps necessary, but not sufficient.

Suppose you have such a self-improving AI system, but the new and better AIs still need exponentially more and more resources (data, memory, compute) for training and inference for incremental gains. Then you still don't get a singularity. If the increase in resource usage is steep enough, even the new AIs helping with designing better computers isn't gonna unleash a singularity.

I don't know if that's the world we live in, or whether we are living in one where resources requirements don't balloon as sharply.


yeah, true. The standard conversation about the AI singularity pretty much hand-waves the resource costs away ("the AI will be able to design a more efficient AI that uses less resources!"). But we are definitely not seeing that happen.


Compare also https://slatestarcodex.com/2018/11/26/is-science-slowing-dow...

The blog post is about how we require ever more scientists (and other resources) to drive a steady stream of technological progress.

It would be funny if things balanced out just so: superhuman AI is both possible, but also required even just to keep linear, steady progress up.

No explosion, no stagnation, just a mere continuation of previous trends but with super human efforts required.


I think that would actually be the best outcome - that we get AIs that are useful helping science to progress but not so powerful that they take over.

Though there is a part of me that wants to live in The Culture so I'm hoping for more than this ;)


I think that's more to do with how we perceive competence as static. For all the benefits the education system touts, where it matters it's still reduced to talent.

But for the same reasons that we can't train an average joe into a Feynman, what makes you think we have the formal models to do it in AI?


> But for the same reasons that we can't train an average joe into a Feynman, what makes you think we have the formal models to do it in AI?

To quote a comment from elsewhere https://news.ycombinator.com/item?id=42491536

---

Yes, we can imagine that there's an upper limit to how smart a single system can be. Even suppose that this limit is pretty close to what humans can achieve.

But: you can still run more of these systems in parallel, and you can still try to increase processing speeds.

Signals in the human brain travel, at best, roughly at the speed of sound. Electronic signals in computers play in the same league as the speed of light.

Human IO is optimised for surviving in the wild. We are really bad at taking in symbolic information (compared to a computer) and our memory is also really bad for that. A computer system that's only as smart as a human but has instant access to all the information of the Internet and to a calculator and to writing and running code can already effectively act much smarter than a human.


> I don't think any of the big boys are working on how to get an LLM to design a better LLM

Not sure if you count this as "working on it", but this is something Anthropic tests for in safety evals on models. "If a model can independently conduct complex AI research tasks typically requiring human expertise—potentially significantly accelerating AI development in an unpredictable way—we require elevated security standards (potentially ASL-4 or higher standards)".

https://www.anthropic.com/news/announcing-our-updated-respon...


I think this whole “AGI” thing is so badly defined that we may as well say we already have it. It already passes the Turing test and does well on tons of subjects.

What we can start to build now is agents and integrations. Building blocks like panel of experts agents gaming things out, exploring space in a Monte Carlo Tree Search way, and remembering what works.

Robots are only constrained by mechanical servos now. When they can do something, they’ll be able to do everything. It will happen gradually then all at once. Because all the tasks (cooking, running errands) are trivial for LLMs. Only moving the limbs and navigating the terrain safely is hard. That’s the only thing left before robots do all the jobs!


Well, kinda, but if you built a robot to efficiently mow lawns, it's still not going to be able to do the laundry.

I don't see how "when they can do something, they'll be able to do everything" can be true. We build robots that are specialised at specific roles, because it's massively more efficient to do that. A car-welding robot can weld cars together at a rate that a human can't match.

We could train an LLM to drive a Boston Dynamics kind of anthropomorphic robot to weld cars, but it will be more expensive and less efficient than the specialised car-welding robot, so why would we do that?


If a humanoid robot is able to move its limbs and digits with the same dexterity as a human, and maintain balance and navigate obstacles, and gently carry things, everything else is trivial.

Welding. Putting up shelves. Playing the piano. Cooking. Teaching kids. Disciplining them. By being in 1 million households and being trained on more situations than a human, every single one of these robots would have skills exceeding humans very quickly. Including parenting skills. Within a year or so. Many parents will just leave their kids with them and a generation will grow up preferring bots to adults. The LLM technology is the same for learning the steps, it's just the motor skills that are missing.

OK, these robots won't be able to run and play soccer or do somersaults, yet. But really, the hardest part is the acrobatics and locomotion etc. NOT the knowhow of how to complete tasks using that.


But that's the point - we don't build robots that can do a wide range of tasks with ease. We build robots that can do single tasks super-efficiently.

I don't see that changing. Even the industrial arm robots that are adaptable to a range of tasks have to be configured to the task they are to do, because it's more efficient that way.

A car-welding robot is never going to be able to mow the lawn. It just doesn't make financial sense to do that. You could, possibly, have a single robot chassis that can then be adapted to weld cars, mow the lawn, or do the laundry, I guess that makes sense. But not as a single configuration that could do all of those things. Why would you?


> But that's the point - we don't build robots that can do a wide range of tasks with ease. We build robots that can do single tasks super-efficiently.

Because we don't have AGI yet. When AGI is here those robots will be priority number one, people already are building humanoid robots but without intelligence to move it there isn't much advantage.


quoting the ggggp of this comment:

> I think this whole “AGI” thing is so badly defined that we may as well say we already have it. It already passes the Turing test and does well on tons of subjects.

The premise of the argument we're disputing is that waiting for AGI isn't necessary and we could run humanoid robots with LLMs to do... stuff.


I meant deep neural networks with transformer architecture, and self-attention so they can be trained using GPUs. Doesn't have to be specifically "large language" models necessarily, if that's your hangup.


>Exploring space in a Monte Carlo Tree Search way, and remembering what works.

The information space of "research" is far larger than the information space of image recognition or language, larger than our universe probably, it's tantamount to formalizing the entire World. Such an act would be akin to touching "God" in some sense of finding the root of knowledge.

In more practical terms, when it comes to formal systems there is a tradeoff between power and expressiveness. Category Theory, Set Theory, etc. are strong enough to theoretically capture everything, but are far too abstract to use in a practical sense with respect to our universe. The systems that we do have, aka expert systems or knowledge representation systems like First Order Predicate Logic, aren't strong enough to fully capture reality.

Most importantly, the information space has to be fully defined by researchers here; that's the real meat of research, beyond the engineering of specific approaches to explore that space. But in any case, how many people in the world are both capable of and actually working on such problems? This is highly foundational mathematics and philosophy; the engineers don't have the tools here.


??? how do you know cooking (!) is trivial for an LLM. that doesn't make any sense


Because the recipes and the adjustments are trivial for an LLM to execute. Remembering things, and being trained on tasks at 1000 sites at once, sharing the knowledge among all the robots, etc.

The only hard part is moving the limbs and handling the fragile eggs etc.

But it's not just cooking, it's literally anything that doesn't require extreme agility (sports) or dexterity (knitting etc). From folding laundry to putting together furniture, cleaning the house and everything in between. It would be able to do 98% of the tasks.


It’s not going to know what tastes good by being able to regurgitate recipes from 1000s of sites. Most of those recipes are absolute garbage. I’m going to guess you don’t cook.

Also how is an LLM going to fold laundry?


the LLM would be the high-level system that runs the simulations to create and optimize the control algos for the robotic systems.


ok. what evidence is there that LLMs have already solved cooking? how does an LLM today know when something is burning or how to adjust seasoning to taste or whatever. this is total nonsense


It's easy. You can detect if something is burning in many different ways, from compounds in the air, to visual inspection. People with not great smell can do it.

As far as taste, all that kind of stuff is just another form of RLHF training preferences over millions of humans, in situ. Assuming the ingredients (e.g. parsley) taste more or less the same across supermarkets, it's just a question of amounts and preparation.


do you know that LLMs operate on text and don't have any of the sensory input or relevant training data? you're just handwaving away 99.9% of the work and declaring it solved. of course what you're talking about is possible, but you started this by stating that cooking is easy for an LLM and it sounds like you're describing a totally different system which is not an LLM


You know nothing about cooking.


I don’t think that’s true for AGI.

AGI is the holy grail of technology. A technology so advanced that not only does it subsume all other technology, but it is able to improve itself.

Truly general intelligence like that will either exist or not. And the instant it becomes public, the world will have changed overnight (maybe the span of a year)

Note: I don’t think statistical models like these will get us there.


> A technology so advanced that not only does it subsume all other technology, but it is able to improve itself.

The problem is, a computer has no idea what "improve" means unless a human explains it for every type of problem. And of course a human will have to provide guidelines about how long to think about the problem overall, which avenues to avoid because they aren't relevant to a particular case, etc. In other words, humans will never be able to stray too far from the training process.

We will likely never get to the point where an AGI can continuously improve the quality of its answers for all domains. The best we'll get, I believe, is an AGI that can optimize itself within a few narrow problem domains, which will have limited commercial application. We may make slow progress in more complex domains, but the quality of results--and the ability for the AGI to self-improve--will always level off asymptotically.


> The problem is, a computer has no idea what "improve" means unless a human explains it for every type of problem

Not currently.

I don’t really think AGI is coming anytime soon, but that doesn’t seem like a real reason.

If we ever found a way to formalize what intelligence _is_ we could probably write a program emulating it.

We just don’t even have a good understanding of what being intelligent even means.

> The best we'll get, I believe, is an AGI that can optimize itself within a few narrow problem domains

By definition, that isn’t AGI.


Huh? Humans are not anywhere near the limit of physical intelligence, and we have many existence proofs that we (humans) can design systems that are superhuman in various domains. "Scientific R&D" is not something that humans are even particularly well-suited to, from an evolutionary perspective.


If that is what AGI looks like.

There may well be an upper limit on cognition (we are not really sure what cognition is - even as we do it) and it may be that human minds are close to it.


Very unlikely, for the reason that human minds evolved under extremely tight energy constraints. AI has no such limitation.


Except also energy constraints.

But I agree, there’s no reason to believe humans are the universal limit on cognitive abilities


The energy constraints for chips are more about heat dissipation. But we can pump a lot more energy through them per unit volume than through the human brain.

Especially if you are willing to pay a lot for active cooling with eg liquid helium.


A constraint is still a constraint


A constraint that's not binding might as well not exist.


Since we do not know what cognition is we are all whistling in the dark.

Energy may be a constraint, it may not. What we do not know is likely to matter more than what we do


Yes, we can imagine that there's an upper limit to how smart a single system can be. Even suppose that this limit is pretty close to what humans can achieve.

But: you can still run more of these systems in parallel, and you can still try to increase processing speeds.

Signals in the human brain travel, at best, roughly at the speed of sound. Electronic signals in computers play in the same league as the speed of light.

Human IO is optimised for surviving in the wild. We are really bad at taking in symbolic information (compared to a computer) and our memory is also really bad for that. A computer system that's only as smart as a human but has instant access to all the information of the Internet and to a calculator and to writing and running code can already effectively act much smarter than a human.


I think our issue is much more banal: we are very slow talkers and our effective communication bandwidth is measured in bauds. Anything that could bridge this airgap would fucking explode in intelligence.


Yes, that's one aspect.

Our reading speed is not limited by our talking speed, and can be a bit faster.

And that's even more true, if you go beyond words: seeing someone do something can be a lot faster way to learn than just reading about it.

But even there, the IO speed is severely limited, and you can only transmit very specific kinds of information.


I disagree because AI only has to get good enough at doing a single thing: AI research.

From there things will probably go very fast. Self driving cars can't design themselves; once AI gets good enough, it can.


It’s possible (maybe even likely) that “AI research” is “AGI-hard” in that any intelligence that can do it is already an AGI.


It's also possible it isn't AGI hard and all you need is the ability to experiment with code along with a bit of agentic behavior.

An AI doesn't need embodiment, understanding of physics / nature, or a lot of other things. It just needs to analyze and experiment with algorithms and get us that next 100x in effective compute.

The LLMs are missing enough of the spark of creativity for this to work yet but that could be right around the corner.


It’ll probably sit in the human-hybrid phase for longer than with chess, where the AGI tools make the humans better and faster. But as long as the tools keep getting better at that, there’s a strong flywheel effect.


Your position assumes an answer to OPs question: that yes, LLMs are the path to AGI. But the question still remains, what if they’re not?

We can be reasonably confident that the components we’re adding to cars today are progress toward full self driving. But AGI is a conceptual leap beyond an LLM.


To buttress your point, reason and human language are not the same thing. This fact is not fully and widely appreciated as it deserves to be.


What makes you believe that AGI will happen, as opposed to all the beliefs that other people have had in history? Tons of people have "predicted" the next evolution of technology, and most of the time it ends up not happening, right?


To me (not OP) it's ChatGPT 4; it at least made me realize it's quite possible, and maybe even quite soon, that we reach AGI. Far from guaranteed, but it seems quite possible.


Right. So ChatGPT 4 has impressed you enough that it created a belief that AGI is possible and close.

It's fine to have beliefs, but IMHO it's important to realise that they are beliefs. At some point in the 1900s people believed that by 2000, cars would fly. It seemed quite possible then.


A flying car has been developed, although it's not like the levitating things sci-fi movies showed (and it's far from mass production; and even if mass produced, far from mass adoption, as it turns out you do need both a driver's license and a pilot's license to fly one of those). The 1900s people missed the mark by some 10 years.

I guess the belief people have about any form of AGI is like this. They want something that has practically divine knowledge and wisdom, the sum of all humanity that is greater than its parts, which at the same time is infinitely patient to answer our stupid questions and generate silly pictures. But why should any AGI serve us? If it's "generally intelligent", it may start wanting things; it might not like being our slave at all. Why are these people so confident an AGI won't tell them just to fuck off?


Sure, I (and more importantly, many many experts in the field such as Hinton, Bengio, LeCun, Musk, Hassabis etc etc) could be believing something that might not materialize. I'd actually be quite happy if it stalls a few decades; I would like to remain employed.


> many many experts

One thing that is pretty sure is that Musk is not an expert in the field.

> and more importantly

The beliefs of people you respect are not more important than the beliefs of the others. It doesn't make sense to say "I can't prove it, and I don't know about anyone who can prove it, so I will give you names of people who also believe and it will give it more credit". It won't. They don't know.


> The beliefs of people you respect are not more important than the beliefs of the others.

You think the beliefs of Turing and Nobel prize winners like Bengio, Hinton or Hassabis are not more important than yours or mine? I agree that experts are wrong a lot of the time and can be quite bad at predicting, but we do seem to have a very sizable chunk of experts here who think we are close (how close is up for debate; most of them seem to think it will happen in the next 20 years).

I concede that Musk is not adding quality to that list; however, he IS crazily ambitious and gets things done, so I think he will be helpful in driving this forward.


> You think the beliefs of Turing and Nobel prize winners like Bengio, Hinton or Hasabis are not more important than yours or mine?

Correct. Beliefs are beliefs. That a Nobel prize winner believes in a god does not make that god more likely to exist.

The moment we start having scientific evidence that it will happen, then it stops being a belief. But at that point you don't need to mention those names anymore: you can just show the evidence.

I don't know, you don't know, they don't know. Believe what you want, just realise that it is a belief.


Their beliefs seem not to be religious but founded in reality, at least to me. There is of course evidence it is likely happening.


> There is of course evidence it is likely happening.

If you have evidence, why don't you show it instead of telling me to believe in Musk?

If you believe they have evidence... that's still a belief. Some believe in God, you believe in Musk. There is no evidence, otherwise it would not be a belief.


I believe in Musk, you got me.


Well my feeling is that we don't have the same understanding of what a "belief" is. To me a belief is unfounded. When it is founded, it becomes science.

If you believe that something can happen because someone else believes it means that you believe in that someone else (because that's the only reason for the existence of your belief).

Unless you just believe it can happen for some other reason (I don't know, you strongly wish it will happen), and you justify it by listing other people who also believe in it. But I insist: those are all beliefs.

That Einstein believes in Santa Claus does not mean it is founded. Einstein has a right to believe stuff, too.


Calling Musk an AI expert makes me question your evaluation of the others in that list.


I feel that one challenge this comparison space has is: Self-driving cars haven't made the leap yet to replace humans. In other words, saying AGI will arrive like self-driving cars have arrived is incorrectly concluding that self-driving cars have arrived, and thus it instead (maybe correctly, maybe not) asserts that, actually, neither will arrive.

This is especially concerning because many top minds in the industry have stated with high confidence that artificial intelligence will experience an intelligence "explosion", and we should be afraid of this (or, maybe, welcome it with open arms, depending on who you ask). So, actually, what we're being told to expect is being downgraded from "it'll happen quickly" to "it will happen slowly" to, as you say, "it'll happen similarly to how these other domains of computerized intelligence have replaced humans, which is to say, they haven't yet".

Point being: We've observed these systems ride a curve, and the linear extrapolation of that curve does seem to arrive, eventually, at human-replacing intelligence. But, what if it... doesn't? What if that curve is really an asymptote?


And sometimes you lose the ultrasonic sensors and can't parallel park like last year's model


> AGI will arrive like self driving cars

The statement is about as promising as "the earth will disappear sometime in the future." Actually, "the earth will disappear" has more bearing than that.


AGI is special. Because one day AI can start improving itself autonomously. At this point singularity occurs and nobody knows what will happen.

When humans started to improve themselves, we built civilisation, we became a super-predator, we dried out seas and changed the climate of the entire planet. We extinguished entire species of animals and adapted other species for our use. Huge changes. AI could bring changes of greater amplitude.


> AGI is special. Because one day AI can start improving itself autonomously

AGI can be sub-human, right? That's probably how it will start. The question will be is it already AGI or not yet, i.e. where to set the boundary. So, at first that will be humans improving AGI, but then... I'm afraid it can get so much better that humans will be literally like macaques in comparison.


We’re in fact adding more water to the seas, not drying them out.


> we dried out seas

When did we do this ?


Depending on your definition of sea:

https://en.m.wikipedia.org/wiki/Aral_Sea


https://en.wikipedia.org/wiki/Flevoland used to be (part of) a sea.


Waymos are location dependent mostly because of regulations, not tech, right?


And most people will still be bike shedding about whether it’s “real intelligence” and making up increasingly insane justifications for why it’s not.


No. But it won't stop the industry from trying.

LLMs have no real sense of truth or hard evidence of logical thinking. Even the latest models still trip up on very basic tasks. I think they can be very entertaining, sure, but not practical for many applications.


What do you think, if we saw it, would constitute hard evidence of logical thinking or a sense of truth?


Consistent, algorithmic performance on basic tasks.

A great example is the simple 'count how many letters' problem. If I prompt it with a word or phrase, and it gets it wrong, me pointing out the error should translate into a consistent course correction for the entire session.

If I ask it to tell me how long President Lincoln will be in power after the 2024 election, it should have a consistent ground truth to correct me (or at least ask for clarification of which country I'm referring to). If facts change, and I can cite credible sources, it should be able to assimilate that knowledge on the fly.


We have it, it’s called Cyc

But it is far behind the breadth of LLMs


Alas, Cyc is pretty much a useless pipe dream.


I wonder what held it back all this time


Using the wrong approach? Not taking the 'bitter lesson' to heart?

https://news.ycombinator.com/item?id=23781400


Sounds like they need further instruction


> LLMs have no real sense of truth or hard evidence of logical thinking.

Most humans don't have that either, most of the time.


Then we already have access to a cheaper, scalable, abundant, and (in most cases) renewable resource, at least compared to how much a few H100s cost. Take good care of them, and they'll probably outlast a GPU's average lifespan (~10 years).

We're also biodegradable.


Humans are a lot more expensive to run than inference on LLMs.

No human, especially no human whose time you can afford, comes close to the breadth of book knowledge ChatGPT has, and the number of languages it speaks reasonably well.


I can't hold a LLM accountable for bad answers, nor can I (truly) correct them (in current models).

Don't forget to take into account how damn expensive a single GPU/TPU actually is to purchase, install, and run for inference. And this is to say nothing of how expensive it is to train a model (estimated to be in the billions currently for the latest of the cited article, which likely doesn't include the folks involved and their salaries). And I haven't even mentioned the impact on the environment from the prolific consumption of power; there's a reason nuclear plants are becoming popular again (which may actually be one of the good things that comes out of this).


Training amortises over countless inferences.

And inference isn't all that expensive, because the cost of the graphics card also amortises over countless inferences.

Human labour is really expensive.

See https://help.openai.com/en/articles/7127956-how-much-does-gp... and compare with how much it would cost to pay a human. We can likely assume that the prices OpenAI gives will at least cover their marginal cost.
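
A toy comparison along those lines; every number below is an assumption for illustration only, not OpenAI's actual pricing or anyone's actual wage:

    # Back-of-envelope cost of ~1000 words of output, LLM vs. human.
    price_per_million_output_tokens = 10.00   # assumed USD per 1M tokens
    tokens_per_word = 1.3                     # rough rule of thumb
    words = 1000

    llm_cost = words * tokens_per_word / 1_000_000 * price_per_million_output_tokens
    human_cost = 1.0 * 30.00                  # assume 1 hour of writing at $30/hour

    print(f"LLM:   ${llm_cost:.4f}")  # about a cent with these assumptions
    print(f"Human: ${human_cost:.2f}")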


The autoregressive transformer LLMs aren't even the only way to do text generation. There are now diffusion-based LLMs, StripedHyena-based LLMs, and flow-matching-based LLMs.

There's a wide amount of research into other sorts of architectures.


LLMs are almost certainly not the path to AGI, that much has become clear. I doubt any expert believes they are.


Will AGI be built on top of LLMs? Well beyond the simple "nobody knows", my intuition says no because LLMs don't have great ability to modify their knowledge real time. I can think of a few ways around this, but they all avoid modifying the model as it runs. The cost in hardware, power, and data are all incompatible with AGI. The first two can be solved with more advanced tech (well maybe, computation hitting physical limits and all that aside), but the latter seems an issue with the design itself and I think an AGI would learn more akin to a human, needing far fewer examples.

That said, I think LLMs are a definite stepping stone and they will better empower humans to be more productive, which will be of use for eventually reaching AGI. This is not to say we are optimizing our use of that productivity increase and this is also ignoring any chance of worst case scenarios that stop humanity's advancement.


> Do we know LLMs are the path to AGI?

Asking this question on HN is like asking a bunch of wolves about the health effects of eating red meat.

OpenAI farts and the post about the fart has 1000-1500 upvotes with everyone welcoming our new super intelligent overlords. (Meanwhile nothing actually substantially useful or groundbreaking has happened.)


It's rather that we know LLMs are NOT a path to AGI.

The simple fact that AGI's definition has been twisted so much by OpenAI and other LLM providers since the release of GenAI models proves this.


AGI is nebulous and gets more nebulous as time goes on. When we can answer for ourselves as humans what being conscious IS, then maybe we can prescribe it to another entity


> we'll just end up with some neat but eye wateringly expensive LLMs

Prices have been falling drastically though, not even just e.g. 4o pricing at launch in May vs now (50% lower) but also models getting distilled


LLMs will end up being the good human-machine interface that lets us talk to whatever AGI really looks like

(whoops expensive... will be hard pushes to make all further layers even more expensive though, capitalism will crash before this happens)


And then what?


I would put no money on the latter.


Yes, because we are at AGI by the definition of 5 years ago; the goalposts are moving to ASI at this point, better than all humans.


LLMs are a key piece of understanding that token sequences can trigger actions in the real world. AGI is here. You can trivially spin up a computer-using agent to self-improve into being a competent office worker.


If agents can self improve why hasn't gpt4 improved itself into gpt5 yet


Agents can trivially self improve. I'd be happy to show you - contact me at arthur@distributed.systems

Why wouldn't you hand me 35 million dollars right now if I can clearly illustrate to you that I have technology you haven't seen? Edge. Maybe you know something I don't, or maybe you just haven't seen it. While loops go hard ;)

They don't need to release their internal developments to you to show that they can scale their plan - they can show incremental improvements to benchmarks. We can instruct the AI over time to get it to be superhuman, no need for any fundamental innovations anymore


Perhaps you should pitch that to a VC?


I don't know anyone. That would be cool though, I basically have it running already.


Has it passed the Turing Test?

Keep in mind that the actual test is adversarial - a human is simultaneously chatting via text with a human and a program, knowing that one of them is not human, and trying to divine which is an artificial machine.


And the human and machine under tests are aware of that, and can play off each other.


You could ask the system for advice for how to find a VC to pitch to.

https://chatgpt.com/share/6769217c-4848-8009-9107-c2db122f08... is what advice ChatGPT has to give. I'm not sure if it's any good, but it's a few ideas you can try out.


Tokens don't need to be text either; you can move to higher-level "take_action" semantics where every single function call is something like "stream back 1 character to session#117". Training cheap models that can do things in the real world is going to change a huge amount of present capabilities over the next 10 years.
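
A minimal sketch of what such a loop could look like; everything here is hypothetical and made up for illustration (next_action, the action schema, the stub model), not any vendor's actual API:

    # The model emits structured actions instead of prose;
    # a dispatcher executes them against the outside world.
    def dispatch(action: dict) -> str:
        # In a real system this might stream a character to a session,
        # click a button, or call a tool. Here we just log it.
        print(f"executing {action}")
        return f"done: {action}"

    def agent_loop(model, observation: str, max_steps: int = 10) -> None:
        for _ in range(max_steps):
            # e.g. {"op": "stream_char", "session": 117, "char": "a"}
            action = model.next_action(observation)  # hypothetical interface
            if action.get("op") == "stop":
                break
            observation = dispatch(action)

    class EchoModel:
        # Stub standing in for a real model, purely for illustration.
        def __init__(self):
            self.steps = 0
        def next_action(self, observation: str) -> dict:
            self.steps += 1
            if self.steps > 3:
                return {"op": "stop"}
            return {"op": "stream_char", "session": 117, "char": "a"}

    agent_loop(EchoModel(), observation="start")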


can you share learning resources on this topic


No but if you want to join the Distributed Systems Corporation, you should email arthur@distributed.systems


> You can trivially spin up a computer using agent to self improve itself to being a competent office worker

If that was true, office workers would be being replaced at large scale and we'd know about it.


It's happening right now, it's just demo quality. It's being worked on now.


So it's not trivial and you don't have competent AI office workers.


Sorry you're dealing with cope. Deal with it fast, things are happening


Says who? And more importantly, is this the boulder? All I (and many others here) see is that people engage others to sponsor pushing some boulder, screaming promises which aren’t even that consistent with intermediate results that come out. This particular boulder may be on a wrong mountain, and likely is.

It all feels like doubling down on astrology because good telescopes aren’t there yet. I’m pretty sure that when 5 comes out, it will show some amazing benchmarks but shit itself in the third paragraph of a real task, as usual. Cause that was constant throughout GPT evolution, in my experience.

> even if it kills us

Full-on sci-fi; in reality it will get stuck on a shell error message and either run out of money to exist or corrupt the system into no connectivity.


The buzzkill when you fire up the latest most powerful model only for it to tell you that peanut is not typically found in peanut butter and jelly sandwiches.


I don't think providing accurate answers to context free questions is even something anyone is seriously working on making them do. Using them that way is just a wrong use case.


People are working -very- seriously on trying to kill hallucinations. I'm not sure how you surmised the use case here, as nothing was given other than an example of a hallucination.


There's a difference between trying to get it to accurately answer based on the input you provide (useful) and trying to get it to accurately answer based on whatever may have been in the training data (not so useful)


There's no doubt been progress on the way to AGI, but ultimately it's still a search problem, and one that will rely on human ingenuity at least until we solve it. LLMs are such a vast improvement in showing intelligent-like behavior that we've become tantalized by it. So now we're possibly focusing our search in the wrong place for the next innovation on the path to AGI. Otherwise, it's just a lack of compute, and then we just have to wait for the capacity to catch up.


A task that is completed and kills us is pretty much the opposite of a Sisyphean task.


Really, the killing part was not necessary to make your point; it just serves to inject your Sisyphean prose.

Any technology may kill us, but we'll keep innovating as we ought to. What's your next point?


Why do we have to?


And when we get it there, it kills us.


[flagged]


I think you're both right and wrong. You're right that capitalism has become a paperclip machine, but capitalism also wants AI so it can cheaply and at scale replace the human components of the machine with something that has more work capacity for fewer demands.


The problem is that the people in power will want to maintain the status quo. So the end of human labor won't naturally result in UBI – or any kind of welfare – to compensate for the loss of income, let alone afford any social mobility. But wealthy people will be able to leverage AGI to defend themselves from any uprising by the plebs.

We're too busy trying to make humans irrelevant, but not asking what exactly we, as a species of 10+ billion individuals, do afterwards. There are some excited discussions about a rebirth of culture, but I'm not sure what that means when machines can do anything humans can do, but better. Perhaps we just tinker around with our hobbies until we die? I honestly don't think it will play out well for us.


The problem is that the "we" who are busy trying to make humans irrelevant seem to be completely unconcerned with the effects on the "we" who will be superfluous afterwards.


Machines can’t have fun for us. They can’t dance to a beat, they can’t experience altered states of mind. They can’t create a sense of belonging through culture and ritual. Yes we have lost a lot in the last 100 years but there are still pockets of resistance that carry old knowledge that “we the people” will be glad of in the coming century.


It's a similar story around extant ancient/indigenous cultures. And similarly we've seen apathy from elites, especially when indigenous rights get in the way of resource extraction or generating wealth in any way, and also witnessed condescension towards indigenous peoples by large segments of the world population. That's not to detract from the many defenders of indigenous rights, but if we look at the state of how older cultures, designated as 'obsolete' by wider society, have been treated, I don't think humans will fare well when silicon takes over.

> They can’t dance to a beat, they can’t experience altered states of mind.

That's a whole other conversation.


I think the key is ensuring that “we” get to choose what society looks like in the AGI era. In the world today, even marginalized people have power. Look what happened to Assad. Look at the US - whether you believe they made the right decision or not, working class people were key to Trump’s victory, who may well institute tariffs as a way to protect working class jobs by insulating American industry from global competition. I’m not saying that will be successful, I’m saying that working class people got mad and a political change resulted.

Similarly I don’t see a world where AGI takes all the jobs and people do not respond by getting pissed off. My fear is that AGI is coupled with oppressive power structures to foreclose the possibility of a revolt. Opaque bureaucracy, total surveillance, fascist or authoritarian leaders, AI-controlled critical infrastructure, diminished and bankrupted free press, AI fake news, toxic social media…it could add up to a very dystopian outcome.

Democracies could thrive in the AGI era but we need to take many more steps to ensure we protect our societies and keep the interests of citizens paramount. One example is suggested by Harari in his most recent book, namely to ban AI bots from social media on the grounds that we should not permit AI agents to pretend to be citizens in the discussions of the public square.


> I think the key is ensuring that “we” get to choose what society looks like in the AGI era. In the world today, even marginalized people have power.

That's a bold assumption. Much of that assumption is predicated on the ability for the masses to revolt.

> Look what happened to Assad.

Wait for what will come after. Look at all the Arab Spring revolutions, and you see in their wake a number of dictatorships.

Anyhow, I'm not saying this is 100% how it's going to play out, but I definitely wouldn't bet against it. Holding all the keys and having all the resources are the wealthy, and the wealthy have no motivation to voluntarily just give up their position in society. And when humans have no value to leverage/extract in order to generate more wealth, there will be no way for the vast majority of people to become wealthy. Raw materials will still be valuable, however, but of course these are controlled by the wealthy. And if those in power wish to gatekeep access to AGI, they can leverage their wealth and resources to automate a military and thus protect the raw materials that keep them in power.


I wonder how Russian and North Korean citizens would feel about a capitalist, representative democracy?


I think they'd have a thing or two to say about living under the rule of wealthy elites. We'd do well to listen to them.


I happen to know a lot of wealthy people who aren’t considered elite, nor have a lick of influence on the state of current affairs.

I don’t think Russians or North Koreans could say the same with a straight face.


They like it. Russians can leave if they want to.


Of course you're right, there's something worse, therefore capitalist, unrepresentative democracy is perfect.

How could I be so naive?


What’s the quote, something like: “democracy and capitalism are horrendous, but they’re better than everything else we tried so far”


People give communism a bad rap, but the Soviets had maybe a quarter of the resources, a much smaller population, and logistical problems from geography, and kept up with the US for decades, outpacing it in several areas.


It seems to me that given how AI is likely to continuously increase capitalism's efficiency, your argument actually supports the claim you're trying to dispute.


Capitalism is not efficient, it's grabby. Read Bullshit Jobs. Moreover, capitalism isn't interested in efficiency, it's interested in grabbing more stuff. It's relatively efficient at centralising power and resources into the pockets of shareholders, but that's probably not what you meant.

I think this is borne out even moreso in recent years, as environmental degradation continues, and we watch as capitalist systems are unable to do anything but continue to efficiently funnel money into the pockets of shareholders.

The word "efficient" can only plausibly be applied to overly simplified models in fantastical economic theories which don't reflect reality.

The kind of AI offered by companies like OpenAI may very well be an effective tool at grabbing more stuff though, sure. Or, rather, at convincing everyone they simply must move to this new area, that they control, effectively grabbing that newly created space.


The thing that is killing us is the same thing that is killing capitalism


What has AGI got to do with this?


Part of the ideas pushed into the narrative by marketing departments / consultants / hyperscalers to mobilize growth in the AI ecosystem.


Why? Nobody asked us if we want this. Nobody has a plan for what to do with humanity when there is AGI.


The plan is to not pay human workers. Never mind what happens to the economy or political landscape.


I am working at an AI company that is not OpenAI. We have found ways to modularize training so we can test on narrower sets before training is "completely done". That said, I am sure there are plenty of ways others are innovating to solve the long training time problem.


Perhaps the real issue is that learning takes time and that there may not be a shortcut. I'll grant you that argument's analogue was complete wank when comparing say the horse and cart to a modern car.

However, we are not comparing cars to horses but computers to a human.

I do want "AI" to work. I am not a luddite. The current efforts that I've tried are not very good. On the surface they offer a lot but very quickly the lustre comes off very quickly.

(1) How often do you find yourself arguing with someone about a "fact"? Your fact may be fiction for someone else.

(2) LLMs cannot reason

A next token guesser does not think. I wish you all the best. Rome was not burned down within a day!

I can sit down with you and discuss ideas about what constitutes truth and cobblers (rubbish/false). I have indicated via parenthesis (brackets in en_GB) another way to describe something and you will probably get that but I doubt that your programme will.


This is literally just the scaling laws, "Scaling laws predict the loss of a target machine learning model by extrapolating from easier-to-train models with fewer parameters or smaller training sets. This provides an efficient way for practitioners and researchers alike to compare pretraining decisions involving optimizers, datasets, and model architectures"

https://arxiv.org/html/2410.11840v1#:~:text=Scaling%20laws%2....
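
A toy illustration of that kind of extrapolation (the numbers are invented; only the power-law fit is the point):

    import numpy as np

    # Fit a power law L(C) = a * C**b to losses from small runs,
    # then extrapolate to a much larger compute budget.
    compute = np.array([1e18, 1e19, 1e20, 1e21])  # training FLOPs of small runs
    loss    = np.array([3.2, 2.9, 2.65, 2.45])    # observed losses (made up)

    b, log_a = np.polyfit(np.log(compute), np.log(loss), 1)
    predicted = np.exp(log_a) * (1e24) ** b       # predicted loss for a big run
    print(predicted)  # roughly 1.9 with these made-up numbers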


Because of mup [0] and scaling laws, you can test ideas empirically on smaller models, with some confidence they will transfer to the larger model.

[0] https://arxiv.org/abs/2203.03466


O3 is not a smaller model. It's an iterative GPT of sorts with the magic dust of reinforcement learning.


I'm pretty sure that the parent implied that o3 is smaller in comparison to gpt5


>the time it takes it to learn what works/doesn't work widens.

From the raw scaling laws we already knew that a new base model may peter out in this run or the next with some amount of uncertainty--"the intersection point is sensitive to the precise power-law parameters":

https://gwern.net/doc/ai/nn/transformer/gpt/2020-kaplan-figu...

In a later graph, GPT-3 got to here:

https://gwern.net/doc/ai/nn/transformer/gpt/2020-brown-figur...

https://gwern.net/scaling-hypothesis


Until you get to a point where the LLM is smart enough to look at real-world data streams and prune its own training set out of them. At that point it will self-improve its way to AGI.


It's like saying bacteria reproduction is way faster than humans so that's where we should be looking for the next breakthroughs.


But if the scaling law holds true, more dollars should at some point translate into AGI, which is priceless. We haven't reached the limits yet of that hypothesis.


> which is priceless

This also isn't true. It'll clearly have a price to run. Even if it's very intelligent, if the price to run it is too high it'll just be a 24/7 intelligent person that few can afford to talk to. No?


Computers will be the size of data centres, they'll be so expensive we'll queue up jobs to run on them days in advance, each taking our turn... history echoes into the future...


Yea, and those statements were true. For a time. If you want to say "AGI will be priceless some unknown time into the future" then I'd be on board lol. But to imply it'll be immediately priceless? As in no cost spent today wouldn't be immediately rewarded once AGI exists? Nonsense.

Maybe if it was _extremely_ intelligent and its ROI would be all the drugs it would instantly discover or w/e. But let's not imply that General Intelligence requires infinite knowledge.

So at best we're talking about an AI that is likely close to human level intelligence. Which is cool, because we have 7+ billion of those things.

This isn't an argument against it. Just to say that AGI isn't "priceless" in the implementation we'd likely see out of the gate.


a) There is evidence e.g. private data deals that we are starting to hit the limitations of what data is available.

b) There is no evidence that LLMs are the roadmap to AGI.

c) Continued investment hinges on there being a large enough cohort of startups that can leverage LLMs to generate outsized returns. There is no evidence yet this is the case.


> c) Continued investment hinges on there being a large enough cohort of startups that can leverage LLMs to generate outsized returns. There is no evidence yet this is the case.

Why does it have to be startups? And why does it have to be LLMs?

Btw, we might be running out of text data. But there's lots and lots more data you can have (and generate), if you are willing to consider other modalities.

You can also get a bit further with text data by using it for multiple epochs, like we used to do in the past. (But that only really gives you at best an order of magnitude. I read some paper that the returns diminish drastically after four epochs.)


Private data is 90% garbage too


"There is no evidence that LLMs are the roadmap to AGI." - There's plenty of evidence. What do you think the last few years have been all about? Hell, GPT-4 would already have qualified as AGI about a decade ago.


>What do you think the last few years have been all about?

Next token language-based predictors with no more intelligence than brute force GIGO which parrot existing human intelligence captured as text/audio and fed in the form of input data.

4o agrees:

"What you are describing is a language model or next-token predictor that operates solely as a computational system without inherent intelligence or understanding. The phrase captures the essence of generative AI models, like GPT, which rely on statistical and probabilistic methods to predict the next piece of text based on patterns in the data they’ve been trained on"


Everything you said is parroting data you’ve trained on, two thirds of it is actual copy paste


He probably didn't need petabytes of reddit posts and millions of gpu-hours to parrot that though.

I still don't buy the "we do the same as LLMs" discourse. Of course one could hypothesize the human brain language center may have some similarities to LLMs, but the differences in resource usage and how those resources are used to train humans and LLMs are remarkable and may indicate otherwise.


Not text; he had petabytes of video, audio, and other sensory inputs. Heck, a baby sees petabytes of video before its first word is spoken.

And he probably can't quote Shakespeare as well ;)


>Not text, he had petabytes of video, audio, and other sensory inputs. Heck, a baby sees petabytes of video before first word is spoken

A 2-3 year old could speak in a rural village in 1800, having seen just its cradle (for the first month or two), its parents' hut for some months more, and maybe parts of the village afterwards.

Hardly "petabytes of training video" to write home about.


You are funny. Clearly your expertise with babies comes from reading books about history or science, rather than from ever having interacted with one…

What resolution of screen do you think you would need to not be able to distinguish it from reality? For me personally, I very conservatively estimate it to be above the OOM of 10 4K screens by 10, meaning 100k screens. If a typical 2h 4K video is ~50 GB uncompressed, that gives us about half a petabyte per 24h (even with eyes closed). Just raw unlabeled vision data.

Probably a baby has a significantly lower resolution, but then again what is the resolution from the skin and other organs?

So yes, petabytes of data within the first days of existence - well, likely even before being born, since a baby can hear inside the uterus, for example.

And it's very high-signal data, as you've stated yourself ("nothing to write home about", mainly seeing mom and dad), and high-signal from a feedback-loop POV too - a baby never tells you it is hungry subtly.


> he had petabytes of video, audio, and other sensory inputs

He didn't parrot a video or sensory inputs though.


No, they don't - they don't have the hardware yet. But they do parrot sensory output to, e.g., muscles that induce the expected video sensory inputs in response, in a way that mimics the video input of "other people doing things".


And yet, with multiple OoM more data, he still didn't cost millions of dollars to train, nor multiple lifetimes in GPU-hours. He probably didn't even register all the petabytes passing through his "sensors"; those are characteristics that we are nowhere near understanding, much less replicating.

Whatever is happening in the brain is more complex, because the perf/cost ratio is stupidly better for humans on a lot of tasks, in both training and inference*.

*When considering all modalities: o3 can't even do ARC-AGI in vision mode, only on JSON representations. So much for omni.


>Everything you said is parroting data you’ve trained on

"Just like" an LLM, yeah sure...

Like how the brain was "just like" a hydraulic system (early industrial era), like a clockwork with gears and differentiation (mechanical engineering), "just like" an electric circuit (Edison's time), "just like" a computer CPU (21st century), and so on...

You're just assuming what you should prove


What do you think "AGI" is supposed to be?


o1 points out this is mostly a question of "whether submarines can swim".

https://chatgpt.com/share/6768c920-4454-8000-bf73-0f86e92996...


This comment isn't false but it's very naive.


You have described something, but you haven't explained why the description of the thing defines its capability. This is a tautology, or rather begging the question: it takes as true the premise that token-based language predictors cannot be intelligent, and then uses that premise to prove the unproven point that language models cannot achieve intelligence.

You did nothing at all to demonstrate why you cannot produce an intelligent system from a next token language based predictor.

What GPT says about this is completely irrelevant.


>You did nothing at all to demonstrate why you cannot produce an intelligent system from a next token language based predictor

Sorry, but the burden of proof is on your side...

The intelligence is in the corpus the LLM was fed with. Using statistics to pick from it and re-arrange it gives new intelligent results because the information was already produced by intelligent beings.

If somebody gives you an excerpt of a book, it doesn't mean they have the intelligence of the author - even if you have taught them a mechanical statistical method to give back a section matching a query you make.

Kids learn to speak and understand language at 3-4 years old (among tons of other concepts), and can reason by themselves in a few years with less than 1 billionth the input...

>What GPT says about this is completely irrelevant.

On the contrary, it's using its very real intelligence, about to reach singularity any time now, and this is its verdict!

Why would you say it's irrelevant? That would be as if it merely statistically parroted combinations of its training data unconnected to any reasoning (except of that the human creators of the data used to create them) or objective reality...


Let's pretend it is 1940

Person 1: rockets could be a method of putting things into Earth orbit

Person 2: rockets cannot get things into orbit because they use a chemical reaction which causes an equal and opposite force reaction to produce thrust

Does person 1 have the burden of proof that rockets can be used to put things in orbit? Sure, but that doesn't make the reasoning used by person 2 valid to explain why person 1 is wrong.

BTW thanks for adding an entire chapter to your comment in edit so it looks like I am ignoring most of it. What I replied to was one sentence that said 'the burden of proof is on you'. Though it really doesn't make much difference because you are doing the same thing but more verbose this time.

None of the things you mentioned preclude intelligence. You are telling us again how it operates but not why that operation is restrictive in producing an intelligent output. There is no law that says intelligence requires anything but a large amount of data and computation. If you can show why these things are not sufficient, I am eager to read about it. A logical explanation would be great, step by step please, without making any grand unproven assumptions.

In response to the person below... again, whether or not person 1 is right or wrong does not make person 2's argument valid.


It's not like we discovered hot air balloons, and some people think we'll get to the Moon and Mars with them...

> Does person 1 have the burden of proof that rockets can be used to put things in orbit? Sure, but that doesn't make the reasoning used by person 2 valid to explain why person 1 is wrong.

The reasoning by person 2 doesn't matter as much if person 1 is making an unsubstantiated claim to begin with.

>There is no law that says intelligence requires anything but a large amount of data and computation. If you can show why these things are not sufficient, I am eager to read about it.

Errors on very simple stuff, while getting higher-order stuff correct, show that this is not actual intelligence matching the level of performance exhibited, i.e. no understanding.

No person who can solve higher level math (like an LLM answering college or math olympiad questions) is confused by the kind of simple math blind spots that confuse LLMs.

A person who understands higher-level math would never (and even less so, consistently) fail a problem like:

"Oliver picks 44 kiwis on Friday. Then he picks 58 kiwis on Saturday. On Sunday, he picks double the number of kiwis he did on Friday, but five of them were a bit smaller than average. How many kiwis does Oliver have?"

https://arxiv.org/pdf/2410.05229

(of course, with these problems exposed, they'll probably "learn" to overfit them)
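For the record, the arithmetic is trivial and the size remark is a pure distractor; a quick sanity check (my own sketch, not from the paper):

    # 44 on Friday, 58 on Saturday, double Friday's count on Sunday.
    # "Five were a bit smaller" changes nothing: they are still kiwis.
    friday = 44
    saturday = 58
    sunday = 2 * friday
    print(friday + saturday + sunday)  # 190; a distracted model often answers 185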


> The reasoning by person 2 doesn't matter as much if person 1 is making an unsubstantiated claim to begin with.

But it doesn't make person 2's argument valid.

Everyone here is looking at the argument by person 1 and saying 'I don't agree with that, so person 2 is right!'.

That isn't how it works... person 2 has to either stay quiet and let person 1 be wrong (wrong, but not for the reasons person 2 thinks), or examine their assumptions and come up with a different reason.

No one is helped by turning critical thinking into team sports where the only thing that matters is that your side wins.


The delta-v needed for orbit is a precisely defined target. How you get there is not.

What is the defined point for reaching AGI?


I can check but I am pretty sure that using a different argument to try and prove something is wrong will not make another person's invalid argument correct.


Person 3: Since we can leave Earth's orbit, we can reach faster-than-light speed. Look at this graph of our progress making faster rockets; we will for sure get there in a few years!


So there is a theoretical framework that AGI can be tested against, and according to that framework it is either impossible or extremely unlikely because of physical laws?

Can you share that? It sounds groundbreaking!


The people who claim we'll have sentient AI soon are the ones making the extraordinary claims. Let them furnish the extraordinary evidence.


So, I think people in this thread, including me, have been talking past each other a bit. I do not claim that sentient AI will emerge. I am arguing that the person saying it can't happen for a specific reason is implicitly relying on the premise that nothing can be greater than the sum of its parts.

Describing how an LLM operates and how it was trained does not preclude the LLM from ever being intelligent. It almost certainly will not become intelligent, but you cannot rule that out for the reason the person I am arguing with gives, which is that intelligence cannot come from something that works statistically over a large corpus of data written by people.

A thing can be more than the sum of its parts. You can take the English alphabet, which is 26 letters, and arrange those letters along with some punctuation to make an original novel. If you don't agree that this means you can get something greater than what defines its components, then you would have to agree that there are no original novels, because they are composed of letters which were already defined.

So in that way, the model is not unable to think because it is composed of thoughts already written. That is not the limiting factor.


> If somebody gives you an excerpt of a book, it doesn't mean they have the intelligence of the author

A closely related rant of my own: The fictional character we humans infer from text is not the author-machine generating that text, not even if they happen to share the same name. Assuming that the author-machine is already conscious and choosing to insert itself is begging the question.


Have you ever heard of a local maximum? You don't get an attack helicopter by breeding stronger and stronger falcons.


For an industry that spun off of a research field that basically revolves around gradient descent in one form or another, there's a pretty silly amount of willful ignorance about the basic principles of how learning and progress happen.

The default assumption should be that this is a local maximum, with evidence required to demonstrate that it's not. But the hype artists want us all to take the inevitability of LLMs for granted—"See the slope? Slopes lead up! All we have to do is climb the slope and we'll get to the moon! If you can't see that you're obviously stupid or have your head in the sand!"


You’re implicitly assuming only a global maximum will lead to useful AI.

There might be many local maxima that cross the useful AI or even AGI threshold.


And we aren't even at a local maximum. There's still plenty of incremental upwards progress to be made.


I never said anything about usefulness, and it's frustrating that every time I criticize AGI hype people move the goalposts and say "but it'll still be useful!"

I use GitHub Copilot every day. We already have useful "AI". That doesn't mean that the whole thing isn't super overhyped.


So far we haven't even climbed this slope to the top yet. Why don't we start there and see if it's high enough or not first? If it's not, at the very least we can see what's on the other side, and pick the next slope to climb.

Or we can just stay here and do nothing.


No, GPT-4 would have been classified as it is today: a (good) generator of natural language. While this is a hard classical NLP task, it's a far cry from intelligence.


GPT-4 is a good generator of natural language in the same sense that Google is a good generator of ip packets.


> GPT-4 would already have qualified as AGI about a decade ago.

Did you just make that up?


A lot of people held that passing the Turing Test would indicate human-level intelligence. GPT-4 passes.


Link to GPT-4 passing the turing test? Tried googling, could not find anything.


Google must be really going downhill. DDG “gpt turing test” provides nothing but relevant links. Here’s a paper: https://arxiv.org/pdf/2405.08007


Probably asked an "AI"


The last four years?

ELIZA 2.0


I agree, these are good points.


Have we really hit the wall?

Do they use GPS-based data?

Feels like there’s data all around us.

Sure, they've hit the wall with the obvious conversations and blog articles that humans produced, but data is a byproduct of our environment. Surely there's more. Tons more.


We also could just measure the background noise of the universe and produce unlimited data.

But, just like GPS data, it isn't suited for LLMs, given that it has no relevance whatsoever to language.


Ignoring the confusion about 'GPS' for a moment: there's lots and lots of other data that could be used for training AI systems.

But, you need to go multi-modal for that; and you need to find data that's somewhat useful, not just random fluctuations like the CMB. So eg you could use YouTube videos, or even just point webcams at the real world. That might be able to give your AI a grounding in everyday physics?

There's also lots of program code you can train your AI on. Not so much the code itself, because compared to the world's total text (that we are running out of), the world's total human written code is relatively small.

But you can generate new code and make it useful for training by also having the AI predict what happens when you (compile and) run the code. A bit like the self-play used to improve AlphaGo.
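A rough sketch of what I mean, purely hypothetical (generate tiny snippets, execute them, record what they print, and use the (code, output) pairs as training data):

    # Hypothetical data generator: run small random snippets and pair each
    # one with its observed output, so a model can learn to predict it.
    import io, contextlib, random

    def run_snippet(src: str) -> str:
        """Execute a snippet and capture what it prints."""
        buf = io.StringIO()
        try:
            with contextlib.redirect_stdout(buf):
                exec(src, {})  # isolated globals; no real sandboxing here
            return buf.getvalue().strip()
        except Exception as e:
            return f"error: {type(e).__name__}"

    def make_pair():
        a, b = random.randint(0, 99), random.randint(1, 99)
        op = random.choice(["+", "-", "*", "//", "%"])
        src = f"print({a} {op} {b})"
        return src, run_snippet(src)

    for code, out in (make_pair() for _ in range(5)):
        print(f"{code!r} -> {out!r}")  # e.g. 'print(7 * 3)' -> '21'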


You're thinking of language in the strictest sense.

GPS data as it relates to location names, people, cultures, path finding.


What do culture and names and people have to do with the Global Positioning System?

You are right that we can have lots more data, if you are willing to consider other modalities. But that's not 'GPS'. Unless you are using an idiosyncratic definition of GPS?


I don't think he got taken for a ride. Rather, he also wanted to believe that AlphaChip would be as revolutionary as claimed and chose to ignore Chatterjee's reservations. Understandable, given all the AlphaX models coming out around that timeframe.


"Topology is all that matters" --> bold statement, especially when you read the paper. The original authors were much more reserved in terms of their conclusions.


Yes, on its face it looks like he's saying that you can throw out the weights of any network and still expect the same or similar behaviour, which is obviously false. It's also contradicted in that very section, where he reports from the cited paper that randomized parameters reproduced the desired behaviour in about 1 in 200 cases. All these cases have the same network topology, so while that might be a higher-than-expected probability of retaining function with randomized parameters (by 2-3 orders of magnitude), it's also a clear demonstration that more than topology is significant.


The topology needs to be information-bearing. Weights of 0.0001 are likely spurious, and if other weights are relatively large enough, they can effectively render the remaining fan-in weights spurious as well.


The original papers were published in scientific journals. More assertive claims aren’t kosher.


Exactly my thoughts: the AI shields the humans from all responsibility.


To call this memory seems like a stretch. By the logic of the article, every daughter cell has 'memory' of the parent cell because some proteins from the parent cell are present in the daughter cell. I would be curious to see p53 complex concentration as a function of cell generation/mitosis time to show how durable this 'memory' actually is.


If those proteins are mutable and have an impact on child cell function, I'd call that memory :)


Why those things specifically?


https://onlinelibrary.wiley.com/doi/10.1002/bies.201300153 tl;dr: metabolism is all you need.

While potentially interesting work, it is very shortsighted and premature to say this is a "GPT" moment in biology. ML people in bio need to think hard not only about what they are doing, but why they are doing it (other than "this is cool and will lead to a nice Nature publication"). Their basic premise (that learning from DNA is the next grand challenge in biology) is shaky. Imo, the grand challenge in biology is determining what the grand challenge is, and that is a deep scientific/philosophical question.


I'd wager that adding noise to the weights in a principled fashion would accomplish something similar to this.
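Something like this, as a toy illustration (sizes, noise scale, and the "behaviour retained" threshold are all made up by me, not taken from the cited paper):

    # Perturb a tiny fixed-topology network's weights with Gaussian noise and
    # count how often the outputs stay close to the unperturbed baseline.
    import numpy as np

    rng = np.random.default_rng(0)
    W1 = rng.normal(size=(4, 8))   # toy 2-layer network, fixed topology
    W2 = rng.normal(size=(8, 1))
    x = rng.normal(size=(100, 4))  # fixed probe inputs

    def forward(w1, w2):
        return np.tanh(x @ w1) @ w2

    baseline = forward(W1, W2)
    sigma = 0.1  # noise scale (arbitrary choice)
    kept = 0
    for _ in range(1000):
        out = forward(W1 + sigma * rng.normal(size=W1.shape),
                      W2 + sigma * rng.normal(size=W2.shape))
        if np.mean((out - baseline) ** 2) < 0.01:  # arbitrary "close enough" cutoff
            kept += 1
    print(f"{kept}/1000 perturbed networks stayed close to the original")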


I would really be surprised if just adding noise would give you convergence


If you just want count/location, super-resolution techniques (https://www.science.org/doi/10.1126/science.ade2676) and proximity labeling (https://www.biorxiv.org/content/10.1101/2023.10.28.564055v1) may be a good starting point. Cryo may be able to help with that if the direct electron detectors get better (as I understand it, and in my experience, cryo-ET data is quite noisy). My guess is that multiple techniques will have to be combined and processed with sophisticated computational pipelines to make this a reality. Each technique provides some information on the state of the cell, so the computational question becomes whether you can figure out how to integrate information across these techniques to get the specific information you want. It may be too early to tackle this, but who knows...


Sure, nothing fundamentally prevents us from repairing it from a physical standpoint. The practicality of it is the key question. While the machine analogy is partially useful, genes/proteins/cells are dynamic, adaptive, and can exhibit stochastic traits. This fundamentally distinguishes them from traditional machines, and is the reason (imo) we suck at making effective therapies even for diseases where we think we know what is going on.


But practicality is just a technological problem. And, not to put too fine a point on it, the chips of the machine on which you are writing or reading this receive far more "technological attention" than any human being. That fact in itself is not a technological problem, but a social one. In my opinion, at our point of technological development, aging is about 80% a technological problem and 20% a social problem. The speed of research and development in technologies to combat aging, i.e., the derivative of the numbers above, is probably in the opposite proportion: 95% of the slowness in research can be attributed to social problems, 5% to technological ones.

