Ask HN: Pull the curtain back on Nvidia's CES keynote please
54 points by btbt 7 months ago | 15 comments
I've spent 15 years building a career in robotics/software engineering along the "traditional path": writing software to solve problems, mastering fundamentals, and architecting complex systems. I keep up with cutting-edge technologies to stay relevant, but I’ve mostly seen AI/LLMs as powerful tools for augmenting specific tasks rather than fundamentally reshaping engineering as a discipline.

However, NVIDIA’s recent CES keynote was another data point in a growing list challenging that view. While it was, yes, primarily a sales pitch, the vision they presented suggests:

- AI can solve any problem across modalities—just feed it data.

- The future of engineering focuses on data gathering and AI integration, moving away from first-principles problem-solving.

While the presentation was compelling, my consumer experience with AI/LLMs has been vastly different:

- Great for suggestive tasks but unreliable for authoritative answers.

- Good for prototyping and scaffolding, but major rewrites are often needed for production-ready code.

- Image generation impresses but often includes noticeable flaws.

- Hallucinations across all modalities remain a major issue.

So, there’s this gap between the vision of AI/LLMs as a revolutionary paradigm and my practical experiences. I lack the resources to test these claims in depth, so I’m turning to HN for firsthand insights—not sales pitches or market hype.

If you’re using NVIDIA’s AI tools (like NeMo, Omniverse, COSMOS, etc.) or broadly working with LLMs/multi-modal AI in your daily work or business, I’d love to know:

- What are you actually achieving with them?

- What are their limitations? Where do they fall short?

- What's the reliability of the results? Demo-worthy, or deployment-ready for high-stakes environments?

- Are the challenges you encounter just a matter of “more compute/money,” or are they fundamental barriers?

- Do these tools help you reach production faster compared to traditional methods?

- Anything else you think may be interesting.

I’m looking for honest, real-world insights from those at the front lines of these technologies.



Agree with all your points on the real-world consumer experience.

* I would never assume the AI answer to a consequential problem to be authoritative, unless it shows me the source and I can click on the link to verify the source and the data presented (search engine use case).

* Rewrites with AI are bug-prone, and the bugs are often hard to trace because they look superficially correct. Generating the scaffolding works super well.

* Images are often too smooth, videos too robotic and rhythmic, water too shiny, etc. Trained eyes can easily distinguish between AI and real.

* Hallucinations are commonplace.


> it shows me the source and I can click on the link to verify the source ... (search engine use case)

To me, this is exactly a search engine. I type my query into 2005-2015 Google, I scan the page summaries under the links to see the answer, and click the best-looking result to confirm or read the details. Occasionally you need to re-word your query to get the answer you're looking for. Sometimes I don't bother clicking through because the answer is right there.

I don't really care that I can use plainer English with an "AI"; I'd be happier if I just got 2010 Google back. But sadly, it's gone.


> Images are often too smooth, videos too robotic and rhythmic, water too shiny, etc. Trained eyes can easily distinguish between AI and real.

That's likely to get better. Last year, consistently getting fingers and arms right was tough. This year, there are AI-generated videos of violin playing.

> I would never assume the AI answer to a consequential problem to be authoritative, unless it shows me the source and I can click on the link to verify the source and the data presented (search engine use case).

That remains the elephant in the room - the tendency to make up fake answers. Until that's fixed, LLMs are only useful for problems where the cost of such errors is an externality, dumped on the consumer.


> > I would never assume the AI answer to a consequential problem to be authoritative, unless it shows me the source and I can click on the link to verify the source and the data presented (search engine use case).

> That remains the elephant in the room - the tendency to make up fake answers. Until that's fixed, LLMs are only useful for problems where the cost of such errors is an externality, dumped on the consumer.

That’s one of my fears. The general public and politicians alike will trust AI without scrutiny. We’ve already seen examples of judges relying on flawed software, with devastating outcomes for innocent people. With the rapid push and widespread enthusiasm for AI, a darker future looms if these problems aren’t addressed.


I don't think the elephant can be solved by a tweak to LLMs. Producing a statistically-likely continuation of a pattern is what they do; there is no encoding of the world, just an encoding of language and image data.

A general crossing of that gap is, dare I say it, a problem requiring real intelligence.


> I don't think the elephant can be solved by a tweak to LLMs.

I doubt that too. But solving it is essential to the valuations of OpenAI and NVidia.


Wondering whether we will see some combination with Cyc at some point (which tried to solve the "encoding of the world" problem).


After Doug's recent passing, I sincerely doubt it.


Related ongoing thread:

Jensen Huang keynote at CES 2025 [video] - https://news.ycombinator.com/item?id=42618595 - Jan 2025 (65 comments)


From the consumer perspective, DLSS upscaling and frame prediction both introduced a lot of artefacts that made them less than ideal, even though they do improve performance quite a bit. This generation improves their accuracy and leans more heavily on those technologies to continue the performance gains. AMD is making the same investments in its silicon, devoting considerable area to AI cores and ray tracing and not much at all to traditional compute or rasterisation.

Either they are right, and it's just a matter of more data and more compute thrown at the problem until the result becomes indistinguishable from traditionally rendered pixels, or they are going to waste considerable silicon on a problem that never becomes convincing. Even though it has problems, DLSS 3 has been quite popular with gamers, and less-than-perfect results are fine in these cases because the errors aren't very consequential.

I don't know where this goes. I do know each generation of AI has improved quite a lot, and we no longer talk about the Turing test; we definitely took a jump, but there remain a lot of hard engineering problems in every domain of AI before it functions the way we want it to. It feels to me like a lot of these generators are in the uncanny valley: they make the sort of errors that are weird and creepy, but the thing about that valley is that it hides a lot of the progress being made.


I worked as an applied ML researcher for a while, so I'll give this a shot.

"- AI can solve any problem across modalities—just feed it data." - a large chunk of my time in ML is spent on data. I can't emphasize this enough - obtaining large amounts of quality data is a primary challenge with any sort of ML task. This might get easier with time, but will remain a challenge.

The corollary is that niche applications (and thus good fundamentals) are still important.

"- Are the challenges you encounter just a matter of “more compute/money,” or are they fundamental barriers?" - Well, there's a spectrum. Hallucinations are inherent to ML models - I don't think anybody has cracked ML model confidence estimation, and plenty have tried.

A slew of current limitations around LLMs stem from limited context windows. That is "only" inherent to the Transformer architecture (and there is some ongoing work on alternatives such as Mamba).
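
To make the context-window point concrete: vanilla self-attention scores every token against every other token, so an n-token context produces an n-by-n matrix per head and cost grows quadratically with n. A rough back-of-the-envelope sketch in Go (the head count and fp16 score size are illustrative assumptions, and real systems use tricks like FlashAttention to avoid materializing the full matrix):

    // Rough memory for raw fp16 attention scores per layer as context grows.
    // bytesPerScore and heads are illustrative assumptions, not any real model.
    package main

    import "fmt"

    func main() {
        const bytesPerScore = 2 // fp16
        const heads = 32        // illustrative head count
        for _, n := range []int{4096, 32768, 131072} {
            bytes := float64(n) * float64(n) * bytesPerScore * heads
            fmt.Printf("context %6d -> %7.1f GiB of attention scores per layer\n",
                n, bytes/(1<<30))
        }
    }

The exact numbers don't matter; the quadratic growth is why "just make the window bigger" isn't free for a stock Transformer.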

I think that “agents” and deep integration with computer interfaces will produce some interesting automations.


I'd add to your list of problems: publicly offered AIs are tuned to present a puritanical, sexless, inoffensive view of the world, aligned with "the man" and kowtowing to corporate America's rules.


Nvidia's robotics tools were somewhat convoluted last time I checked them, a couple of years ago. I've seen people train robotic control systems with their sim, but it seems to me that the right way is to treat them as reference designs and reimplement them in an open-source setting with sustained community interest. Nvidia hardware and software reminds me of military-grade stuff in its engineering approach, which may be sound for robotics. World models and simulators are still at a stage that doesn't require multibillion-dollar budgets to make progress in an open setting.


It's interesting to hear you've had a similar experience.

I used Isaac Sim for research work a few years ago (https://github.com/qcr/benchbot) and, although its capabilities were head and shoulders above the rest, it was amazingly convoluted to use. Even simple things turned into week-long dives of blind digging through undocumented behaviour.

That's part of why I'm so interested in the experiences of people actually using these tools for something productive.


I am making software for myself to learn, and to help my kids. I am using AI to essentially make A LOT of language exercises. It's really, really good at that. And learning is a lot more fun if you're creative with prompts.

I made JavaScript for a range of question types (things like fill-in, multiple-choice, ...) and have the AI use that to e.g. generate short stories where you have to complete the verbs, or replace some English words with German ones ... that sort of stuff.

Oh, and any time I need tests, or need to do something to a large range of variables, I do what needs to be done to the first variable, copy over (using the reference) the list of fields, comment out that list, and ask the AI to suggest what to do for the rest. It usually needs only one or two changes.

And yes, I've noticed the hallucination. If you ask AI to correctly do scatter-gather parallel processing in Go ... it's incredible how many errors it makes, and it's infuriating how you have to explain every one of its errors again and again. You have it output a basic structure, because that's still fast, and then rewrite the whole thing. I think it still gains me a bit of time ... but I see the point.
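
For what it's worth, by scatter-gather I mean roughly the sketch below (the function and type names are placeholders, not my actual code); the WaitGroup and channel-closing coordination is exactly the kind of detail that tends to come out wrong:

    // Minimal scatter-gather sketch: fan each item out to its own goroutine,
    // gather results over a channel. processItem/Result are placeholders.
    package main

    import (
        "fmt"
        "sync"
    )

    type Result struct {
        Input, Output int
    }

    func processItem(n int) Result {
        return Result{Input: n, Output: n * n} // stand-in for real work
    }

    func scatterGather(items []int) []Result {
        results := make(chan Result, len(items))
        var wg sync.WaitGroup

        // Scatter: one goroutine per item (a bounded worker pool is the usual refinement).
        for _, n := range items {
            wg.Add(1)
            go func(n int) {
                defer wg.Done()
                results <- processItem(n)
            }(n)
        }

        // Close the results channel only after every worker is done.
        go func() {
            wg.Wait()
            close(results)
        }()

        // Gather: drain until the channel is closed.
        var out []Result
        for r := range results {
            out = append(out, r)
        }
        return out
    }

    func main() {
        fmt.Println(scatterGather([]int{1, 2, 3, 4}))
    }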



