Hacker Newsnew | past | comments | ask | show | jobs | submit | more netdevphoenix's commentslogin

How come that OpenAI and Anthropic both released their models pretty much at the same time? Does anyone know if the timing is coincidental?


I would bet to be ready before the Superbowl ads


> are there modes of thinking that fundamentally require something other than what current LLM architectures do?

Possibly. There are likely also modes of thinking that fundamentally require something other than what current humans do.

Better questions are: are there any kinds of human thinking that cannot be expressed in a "predict the next token" language? Is there any kind of human thinking that maps into token prediction pattern such that training a model for it would not be feasible regardless of training data and compute resources?

At the end of the day, the real world value is utility, some of their cognitive handicaps are likely addressable. Think of it like the evolution of flight by natural selection, flight is usefulness to make it worth it adapt the whole body to make flight not just possible but useful and efficient. Sleep falls in this category too imo.

We will likely see similar with AI. To compensate for some of their handicaps, we might adapt our processes or systems so the original problem can be solved automatically by the models.


Waiting until the moment they get good enough is not a smart thing to do either. If you are a farmer and know it is going to snow, at some point in the next 5 months, you make plans NOW, you don't wait until the temperatures drop and you see the snow falling. Right now, people are waiting for the snowfall before moving their proverbial chickens indoors


Top AI researchers like Yann LeCunn have said that LLMs are a dead end.

It seems to me that LLM performance is plateuing and not improving exponentially anymore. This recent hubbub about rewriting a worse GCC for $20,000 is another example of overhype and regurgitating training data.

You don't know for sure if it is going to "snow" (AI reaches general intelligence) Snow happens frequently, AI reaching general intelligence has never happened. If it ever happens, 99% of jobs are gone and there is really nothing you can do to prepare for this other than maybe buy guns and ammo, and even that might not do anything to robotic soldiers.

People were worried about AI taking their jobs 60 years ago when perceptrons came out, and anyone who avoided a tech career because of that back then would have lost out majorly.


There is no reason why an AI model capable of pushing a significant chunk of devs into lower paid and highly competitive dev jobs as a result of automation needs to be a general artificial intelligence. There is a lack of nuance that comes with thinking that either AI is dumb or it has human level general intelligence. As much as devs hate to admit it, you don't need that much of what we understand as general intelligence to write software. Only a portion of your intelligence is needed and arguably not all of it at the same time.

While general purpose models might be plateauing soon (arguably they have for a while). Highly specialised models (especially for programming) haven't necessarily plateaud yet. And anyway, existing functionality seem like a good foundation to build upon systems that remove the need of hiring as many devs. It's not the "being out of a job" that should worry you. Open up your binary thinking and consider that facing a 08 job market for the rest of your career is not the same permanent unemployment but it is not a market you would like to have.

That is the real concern.


You don't need to be a genius or rocket scientist to write code, but llm don't even reach the bar for anything but the most simple things. Take a look at the video I posted earlier for an example.

And specialised models for programming HAVE plateaued.

https://livebench.ai/#/?sort=Agentic+Coding+Average

From Claude 4.1 to 4.5 was only an 18% gain, and from 4.5 to 4.6 it even DECLINED. Codex 5.1 to 5.2 also shows a decline.


https://arxiv.org/abs/2510.26787

Testing the top llms on wework, the highest performing one only succeeded with a rate of 2.5%

Can you imagine not being fired when you can only do 2.5% of all tasks?

This study is dated October 30th, very recent.


> Can you imagine not being fired when you can only do 2.5% of all tasks?

You are not competing against LLMs though. You are competing against people (who in a pre-LLM world wouldn't be in tech) using LLMs tools to beat you in terms of value. In the new world, you either are a top 1% dev or you beat everyone in race to the bottom pricewise. The middle will become vanishingly small. Think of manufacturing in developed countries.


> Or I fixed a bug in a linux scanner driver. None of these I could have done properly (within an acceptable time frame) without AI. But also none of there I could have done properly without my knowledge and experience, even with AI

There are some things here that folks making statements like yours often omit and it makes me very sus about your (over)confidence. Mostly these statements talk in a business short-term results oriented mode without mentioning any introspective gains (see empirically supported understanding) or long-term gains (do you feel confident now in making further changes _without_ the AI now that you have gained new knowledge?).

1. Are you 100% sure your code changes didn't introduce unexpected bugs?

1a. If they did, would you be able to tell if they where behaviour bugs (ie. no crashing or exceptions thrown) without the AI?

2. Did you understand why the bug was happening without the AI giving you an explanation?

2a. If you didn't, did you empirically test the AI's explanation before applying the code change?

3. Has fixing the bug improved your understanding of the driver behaviour beyond what the AI told you?

3a. Have you independently verified your gained understanding or did you assume that your new views on its behaviour are axiomatically true?

Ultimately, there are 2 things here: one is understanding the code change (why it is needed, why that particular change implementation is better relative to others, what future improvements could be made to that change implementation in the future) and skill (has this experience boosted your OWN ability in this particular area? in other words, could you make further changes WITHOUT using the AI?).

This reminds me of people that get high and believe they have discovered these amazing truths. Because they FEEL it not because they have actual evidence. When asked to write down these amazing truths while high, all you get in the notes are meaningless words. While these assistants are more amenable to get empirically tested, I don't believe most of the AI hypers (including you in that category) are actually approaching this with the rigour that it entails. It is likely why people often think that none of you (people writing software for a living) are experienced in or qualified to understand and apply scientific principles to build software.

Arguably, AI hypers should lead with data not with anecdotal evidence. For all the grandiose claims, the lack of empirical data obtained under controlled conditions on this particular matter is conspicuous by its absence.


It's incredible that within two minutes after posting this comment is already grayed out whereas it makes a number of excellent points.

I've been playing with various AI tools and homebrew setups for a long time now and while I see the occasional advantage it isn't nearly as much of a revolution as I've been led to believe by a number of the ardent AI proponents here.

This is starting to get into 'true believer' territory: you get these two camps 'for and against' whereas the best way forward is to insist on data rather than anecdotes.

AI has served me well, no doubt about that. But it certainly isn't a passe-partout and the number of times it has caused gross waste of time because it insisted on chasing some rabbit simply because it was familiar with the rabbit adds up to a considerable loss in productivity.

The scientific principle is a very powerful tool in such situations and anybody insisting on it should be applauded. It separates fact from fiction and allows us to make impartial and non-emotional evaluations of both theories and technologies.


> (...) you get these two camps 'for and against' whereas the best way forward is to insist on data rather than anecdotes.

I think that's an issue with online discussions. It barely happens to me in the real world, but it's huge on HN.

I'm overall very positive about AI, but I also try to be measured and balanced and learn how to use it properly. Yet here on HN, I always get the feeling people responding to me have decided I am a "true believer" and respond to the true believer persona in their head.


Thanks for pointing these things out. I always try to learn and understand the generated code and changes. Maybe not so deep for the android app (since it's just my own pet project). But especially for every pull request to a project. Everyone should do this out of respect to the maintainers who review the change.

> Are you 100% sure your code changes didn't introduce unexpected bugs?

Who is this ever? But I do code reviews and I usually generate a bunch of tests along with my PRs (if the project has at lease _some_ test infrastructure).

Same applies for the rest of the points. But that's only _my_ way to do these things. I can imagine that others do it a different way and that the points above are more problematic then.


> I always try to learn and understand the generated code and changes

Not to be pedantic but, do you _try_ to understand? Or do you _actually_ understand the changes? This suggests to me that there are instances where you don't understand the generated code on projects others than your own, which is literally my point and that of many others. And even if you did understand it, as I pointed out earlier, that's not enough. It is a low bar imo. I will continue to keep my mind open but yours isn't a case study supporting the use of these assistants but the opposite.

In science, when a new idea is brought forward, it gets grilled to no end. The greater the potential the harder the grilling. Software should be no different if the builders want to lay a claim on the name "engineer". It is sad to see a field who claims to apply scientific principles to the development of software not walking the walk.


Why would you ever, outside flight and medical software, care about being 100% sure that the change did not introduce any bugs?


Because bugs are bad. Fixing one bug but accidentally introducing three more is such a pattern it should have a name.


They are. And we have processes to minimize them - tests, code review, staging/preprod envs - but they are nowhere close to being 100% sure that code is bug free - that's just way too high bar for both AI and purely human workflows outside of few pretty niche fields.


When you use AI to 'fix' something you don't actually understand the chances of this happening go up tremendously.


I propose "the whack-a-hydra" pattern


Hehe, yes, very apt. It immediately gives the right mental image.


Because why would you make something broken when you could make something not broken?


Because it's way too high bar to be 100% sure outside of few niche fields.


> 1. Are you 100% sure your code changes didn't introduce unexpected bugs?

How often have you written code and been 100% your code didn't introduce ANY bugs?

Seriously, for most of the code out there who cares? If it's in a private or even public repo, it doesn't matter.


>Arguably, AI hypers should lead with data not with anecdotal evidence

This reminds me of people who get sad when they realize they haven’t discovered anything amazing.

I am pedantic and “people that” → “people who” (for people, who is preferred).


>using that tool isn't using AI

It is though. App is using AI underneath to generate audio snippets. That's literally its purpose


Creating those snippets don't require knowing how to make a proper recording, how to edit it down, or how to direct the voice actor for the line.


> The second group is one that thinks talking to a chatbot will replace senior developer

No one is going to replace senior developers. But senior developer pay WILL decrease relative to its historical values.


Surely making use of a new tool that makes you more productive would increase your value rather than decreasing it? Especially when, knowing the kinds of mistakes AI could make that would affect your codebase negatively in terms of maintainability, security etc would require significant experience.


> Surely making use of a new tool that makes you more productive would increase your value rather than decreasing it?

Think wider. You, sharperguy, are not and will not the only person with access to these tools. Therefore, your productivity increase will likely be the same as everyone else's. If you are as good as everyone else, why would YOU get paid more? Have you ever seen a significant number of companies outside FAANG permanently boost everyone's salary just because they they did well on a given year?

A company's goal is to the shareholders not to you. Your value exists relative to that of others.


> If you are as good as everyone else, why would YOU get paid more?

If every coal miner could suddenly produce 10x the amount of goal, do people say "well now we can just hire one coal miner instead of 10". Or do they say "now thousands of new project which were not economically viable due to the high price of coal are now viable, meaning we actually need to increase our total output beyond even 10x of what it was previously."


Coal miners are cheap, easy to replace and have little negotiation power. Devs are expensive, harder to replace and have some leverage. Coal miners can take a pick or an axe with them when they leave. Devs can take away with them valuable operational knowledge with them that can bring to a competitor. Not comparable.

Plus, look at the job market. Every single tech company out there has been laying off devs in the last 3 years. If maximising productivity above expenses was so valuable, every tech company out there would be hiring like crazy because senior devs are cheap as chips nowadays. But they aren't, devs might be cheap but money itself isn't right now so they are prioritising lower expenses over increased productivity. Because that makes shareholders happy. And that's what every company aims for.

Maximising productivity is only an absolute goal in the minds of devs not in the minds of executives.


Not really. If pay decreases it's because you're not required anymore or less, which is contrary to what has been shown. IF educating and enabling juniors etc. is not handled correctly, then senior pay will explode, because whilst they are much more efficient, their inherent knowledge is required to produce sustainable results.


> If pay decreases it's because you're not required anymore or less

Not necessarily, there are many factors at play here which are downplayed. The first one is education: LLMs are going to significantly improve skill training. Arguably, it is already happening. So the gap between you and a middev will get narrower. At the same time, candidates who can be as good as you will increase.

While you can argue that you possess specialised skills that not many do, you are unlikely to prove that under pressure within a couple of hours and certainly not to the level where you can have late 10s level of negotiating power imo.

At the end of the day, the market can stay irrational longer than you can continue refuse to accept a lower offer imo. I believe there will be winners. But pure technical skill isn't the moat you think it is. Not anymore.


> I think software is about to become disposable and that’s uncharted territory.

I agree that most software will likely head that way. I wonder what this means for the economics of the open source ecosystem most software depends on. In a future where most software is made by the successor of LLMs can a human dev grab a tutorial and write software or will it be too unintelligible for a human to do?


Neither do I. But with Windows slipping badly, Google could start encroaching on their core tech.


Linux seems to be gaining a lot of traction, both with the fall of windows and gaming being more than feasible.

It makes sense for the tech savvy option to succeed, now that personal computing is disappearing. Average folks won’t use a windows/macbook, they’ll use phones and tablets.

My only concern is ending in a macOS+asahi situation where supporting a single device requires mountains of effort.


The fall of windows and Linux gaining traction.

I've seen that written on here, Reddit, /., digg, hell even on usenet back in the day. . . .


Yes, but have you seen you real life non tech friends move to Linux?

I’m seeing it now, and this is new.


Yes, and I also have seen they come back to Windows, when they got into issues sharing software or files with friends, or local goverment requirements, and didn't had a relative to do their IT support for free.


And yet it's undeniable that 2025 had some of the biggest Linux hype in recent times:

- Windows 10 went EOL and triggered a wave of people moving to Linux to escape Windows 11 - DHH's adventures in Linux inspired a lot of people (including some popular coding streamers/YouTubers) to try Linux - Pewdiepie made multiple videos about switching to Linux and selfhosting - Bazzite reported serving 1 PB of downloads in one month - Zorin reported 1M downloads of ZorinOS 18 in one month and crossed the 2M threshold in under 3 months - I personally recall seeing a number of articles from various media outlets of writers trying Linux and being pretty impressed with how good it was - And don't forget Valve announced the Steam Machine and Steam Frame, which will both run Linux and have a ton of hype around them

In fact, I think that we will look back in 5 or 10 years and point at 2025 as the turning point for Linux on the desktop.


The cycling speech since Window XP Toy's R US L&F days, unfortunely.

Less fragmentation, more focus, OEM support on devices selling on regular stores is needed, otherwise we won't get away from the yearly meme.


> otherwise we won't get away from the yearly meme

What's different in the last decade is that Windows is on an undeniable quality downward spiral, it's simply not important anymore for Microsoft.

E.g. desktop Linux doesn't even need to improve, it just needs to wait for Windows to become worse ;)


Unless it becomes available for normies to buy laptops with it pre-installed at Saturn, Media Market, FNAC, Cool Blue, and co, it won't matter.

They aren't going to buy them from Tuxedo.


"Normies" buy smartphones and maybe a tablet, neither of those has Windows preinstalled either.


Available at Saturn, Media Market, FNAC, Cool Blue show floor.


...and a market share of 0.02% ;)

https://gs.statcounter.com/os-market-share/tablet/worldwide

(I guess those 'Windows tablets' are running under convertible laptops or something...)


Where do you think normies that don't live in cities with Apple stores, or with salaries unable to afford Apple tax, get their smartphones and tablets?

I have made zero mentions of Windows tablets, that market died with Windows 8, replaced by 2-1 laptops.


So just don't use windows? The only reason I use android to begin with is because the mobile centric distros I looked into didn't appear to be to the point I would want to daily drive them yet. If and when that changes I'll switch.

The only real issue is sourcing good mobile hardware that isn't locked down. At least for the time being the pixel line satisfies that.


Fitness correlates with health though. Just because you don't have any conditions does not mean that you are healthy. And inability to meet certain fitness tests is correlated with lower health.


Site Reliability Engineering. It is the role that, among other things, ensures that a service uptime is optimal. It's the closest thing we have nowadays to the system admin role


Seems like that would only be relevant to web development, not software engineering in general.


True, but since the vast majority of software engineering is web engineering and the title is clearly about web, it seems fit to mention that.


IMO, that isn't true, nor is the vast majority of software engineering related to the web.

Every industry has been undergoing digital transformation for decades. There are SREs ensuring service levels for everything, from your electrical meter, to satellite navigation systems. Someone wrote the code that boots your phone and starts your car. Somebody's wireless code is passing through your body as you read this, while an SRE ensures the packet loss isn't too high.


Your point doesn't really change what I said. There are many languages in the world but English is the most common one. Those two facts are true at the same time. This is the same, there are many types of software engineering out there but the most common software engineering job relates to building web applications. If you don't believe me, hit your regular job board and count.


Thank you!


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: