I have the same observation. I've been able to improve things I just didn't have the energy to do for a while. But if you're gonna be lazy, it will multiply the bad.
Trying to get some third-party hardware working with a raspi.
The vendor ships 2 separate code bases with separate documentation but only supports the latest one.
I literally had to force-feed the newer code base into ChatGPT, and then feed in working example code to get it going, otherwise it constantly referenced the wrong methods.
If I had just kept looping code / output / repeat it might eventually have stumbled on the answer, but it was way off.
This is one of several shortcomings I’ve encountered in all major LLMs. The LLM has consumed multiple versions of SDKs from the same manufacturer and cannot tell them apart, mixing up APIs, methods, macros, etc. Worse, for more esoteric code with fewer samples, or with more wrong than right answers in the corpus, you always get broken code. I had this issue working on some NXP embedded code.
Human sites are also really bad about mixing content from old and new versions. To this day SO still does not have a version field you can use to filter results or target questions more precisely.
I see those as carefully applied bandaids. But maybe that’s how we need to use AI for now. I mean we’re burning a lot of tokens to undo mistakes in the weights. That can’t be the right solution because it doesn’t scale. IMO.
Yesterday I searched how to fix some Windows issue, and Google's AI told me to create a new registry key as a 32-bit value and write a certain string in there. Which is impossible: a 32-bit DWORD value can't hold a string.
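For anyone wondering why that advice is self-contradictory: a registry "32-bit value" (REG_DWORD) is a 4-byte unsigned integer, so there is physically nowhere to put a string; text goes in a REG_SZ value instead. A quick sketch of the size constraint (the value name in the comment is made up for illustration):

```python
import struct

# A REG_DWORD is a 4-byte unsigned integer; pack() shows exactly how
# much data one can carry.
packed = struct.pack("<I", 0xDEADBEEF)
print(len(packed))  # 4 bytes, period

# Treating arbitrary text (e.g. a hypothetical "EnableSomeFeature"
# setting string) as a 32-bit integer fails immediately:
try:
    struct.pack("<I", int("EnableSomeFeature"))
except ValueError:
    print("not a 32-bit integer; this belongs in a REG_SZ value")
```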