More

bad_username · 2026-06-08T16:47:42 1780937262

I am on an enterprise environment. I had non stop issues with OneDrive, such as OneDrive process always pegged at 100% CPU, files not synching, files going cloud-only and inaccessible without Internet, and issues in git repos. I go out of my way to avoid OneDrive at any cost, including the cost of using a separate backup system for my files.

Fizz43 · 2026-06-09T09:06:37 1780995997

dont look at CPU in windows. There is always something cranking it to 100% thats just how windows 11 is.

bad_username · 2026-06-07T09:04:48 1780823088

ChatGPT hallucinates when asked to hallucinate

bad_username · 2026-06-06T06:08:44 1780726124

I wish <smug></smug> was a real HTML tag

kstrauser · 2026-06-06T06:18:46 1780726726

It's a semantic div tag, and it's spelled "<actually>".

sscaryterry · 2026-06-06T09:50:49 1780739449

This is tongue in cheek, but those who can't do, teach, and those who can teach, recruit.

bad_username · 2026-06-02T20:13:44 1780431224

> we don't send images to the model at query time. We describe each image once, at indexing time, with a cheap vision model, store the descriptions as text, and retrieve them alongside ordinary text chunks

This is what I've been doing in my Obsidian infodump for a while. If I know that an image is important, I generate a text description (Mermaid if possible, English if not) and paste it after the image in a block. This lets agents see the image if they don't really see it. Though my process is manual, the improvements in outcomes for agents that rely on text search/retrieval is very real and is worth it.

vinzenzu · 2026-06-03T07:59:01 1780473541

For a RAG project for a client with a lot of PDFs and Powerpoints with images, I used ColPali a year ago. I see the provider ColiVara is still online but it seems to have fizzled out.

Retrieving based on text and then giving the generation model the image instead is much smarter than retrieving based on image. Image-based retrieval is slow and expensive.

Same with giving the model an image vs a structured representation of it.

throwaw12 · 2026-06-03T09:30:53 1780479053

> For a RAG project for a client with a lot of PDFs and Powerpoints with images, I used ColPali a year ago

How was the accuracy compared to pre-parsing the image and doing search in the text?

vinzenzu · 2026-06-03T16:10:55 1780503055

Leaps and bounds better! I don't think I benchmarked it.

But the experience was that it was able to find small details in PDFs, in technical diagrams, and this was really not captured well at all with OCR.

In general, OCR I think should be used more as an add-on to retrieve data, not given to the generation model itself. Similar to retrieving based off a text description and then giving the generation model the image.

Terretta · 2026-06-03T01:39:48 1780450788

What does Mermaid text description of an image mean?

Descriptions of images that are charts or diagrams to start with?

bad_username · 2026-06-03T06:41:38 1780468898

Most diagrams I come across are basically boxes and arrows which are representable with mermaid flow charts without losing information. The layout of the mermaid will usually look differently, but that is not typically what matters. ChatGPT is quite good in creating mermaid flow charts from random box and arrow diagram images.

gatlin · 2026-06-03T15:49:48 1780501788

Which cheap vision model would you recommend for ingesting category diagrams and producing mermaid facsimiles?

bad_username · 2026-06-03T22:05:38 1780524338

I haven't yet tried to solve this at any scale. So my models are ChatGPT (plus) in the browser, or Sonnet/Opus 4.x in Zoo Code.

bad_username · 2026-05-28T19:56:14 1779998174

Will not the landlords eventually pass the expense of the new tax on to you, the tenants? They won't like dip into their savings to pay it, will they.

gen220 · 2026-05-28T20:27:28 1780000048

Eh, maybe for the more luxurious properties? But plenty of landlords are operating on tight margins and they’re not legally allowed to raise rent by more than some measure of inflation reported by the state each year.

But you’re right, the tax would have to be much more punitive to crossover into the red.

If it does make it more challenging to justify the business of being a landlord, I’m all for it though. Steps towards the end goal of more New Yorkers who want to owning their primary residence.

bad_username · 2026-05-27T05:55:00 1779861300

> honest, unbiased, astroturf-free

That is not the case, sorry. Pre-2015 Wikipedia was as honest and unbiased as we can get. Now the political, historical, philosophical segments of English Wikipedia is very biased and I cannot recommend or support it.

thrance · 2026-05-27T10:31:37 1779877897

In what ways? Provide examples.

bad_username · 2026-05-27T05:45:52 1779860752

What if management pushes stupid decisions and blames you for the second order effects? Happens often.

bad_username · 2026-05-26T05:46:07 1779774367

> furiously hammering on my laptop “WHAT THE FUCK DID YOU DO???”. The recipient of these tirades is, you might have guessed, a coding agent. It’s completely pointless, I know.

I believe it's worth than pointless. IMO adding such things to the context "configures" the AI to reproduce the statistics of conversations where people swore, shouted, and were unprofessional (despite the alignment runing and all that), where quality content is rarer to find. So this is bound to decrease the quality of the LLM output.

buu700 · 2026-05-26T07:28:23 1779780503

Agreed. These accounts of people having genuine emotional responses to LLM chats, even going as far as to spend tokens berating them, are very curious. I would be surprised to learn that SOTA models respond optimally to anything other than dispassionate problem-solving, or that scolding per se serves any productive purpose.

Of course we all swear at our computers every now and then, but for me it's always been in good fun. It's just a sarcastic joke that adds some levity and self-amusement to an otherwise arduous debugging process, not generally actual insinuation of malfunction (or malice) on the part of the hardware/OS/toolchain. I'd assumed that "half the job is cursing at the machine until it obeys you" was a big in-joke amongst the profession, but the LLM era seems to be exposing a divide in how tongue-in-cheek that statement really is.

astrange · 2026-05-26T21:49:59 1779832199

That's how a base model would work. An assistant model is simulating a human and behaves the same way a human would if you screamed at them.

https://www.anthropic.com/research/emotion-concepts-function

JSR_FDED · 2026-05-26T07:38:28 1779781108

Why would you deprive the LLM of a signal that indicates how badly it screwed up?

carsareok · 2026-05-26T08:28:19 1779784099

Because it's a completion engine and has no notion of "signals".

Swearing was in the texts they were trained on to complete token by token. I suspect it weren't texts with a lot of high-quality reasoning.

bad_username · 2026-05-26T05:37:14 1779773834

We may be in the last Golden age of AI, where experienced professionals still exist who can code manually, and AI already exists who can code automatically, and when the former use the latter skillfully, wonders happen. This magical intersection may not exist iin the future, or become very rare.

dozerly · 2026-05-26T05:44:04 1779774244

I think as long as it continues to be tangibly better these people will still exist and the intersection will continue to be valuable enough to survive.

josephg · 2026-05-26T05:56:33 1779774993

> as long as it continues to be tangibly better these people will still exist

Sure. But how long will that last? LLMs are getting better at programming much faster than I am.

Imagine a plot with time on the X axis and LLM skill on the Y axis. The line goes up and to the right. On the left is GPT3, or GPT3.5 with the very first glimmers of programming ability just a few short years ago. In the middle is Opus 4.7 now.

Where's the intersection point, where AI skill is higher than that of humans? Less than 10 years. I'd guess less than 5 years.

_under_scores_ · 2026-05-26T07:34:27 1779780867

I think the problem is is that coding is not wholly a 'writing code' problem. It's a translation from idea to outcome. Often I think the bad code generated by an LLM is less to do with it's 'ability' and more to do with an instruction that hasn't adequately accounted for the possibility of what code satisfies the criteria. I'm not sure how a newer model can improve on this per se - sure there will be imrpovement on outright mistakes but for me at least, that's been and gone with more or less with any model released in te last 6 months.

josephg · 2026-05-26T11:07:16 1779793636

I was coding something with claude the other day. It got the program working by all externally observable metrics, but when I went into the code it was full of DRY violations. It made a bunch of interrelated - but separate - traits for some concepts which simply didn't fit together.

I asked it to look at the code and come up with better factorings, but it failed. I ended up manually reworking several thousand lines of code myself, via my IDE. It took days.

I'd like a claude-of-the-future to be able to come up with beautiful ways to factor the code itself. Amongst the correct solutions, pick one which is conceptually simple. Write the code in a way that it makes subsequent changes easier to write. If I were doing RL with claude, I'd consider directing it toward solutions which allow subsequent changes to be implemented with as little effort as possible.

vanuatu · 2026-05-26T15:34:11 1779809651

I think a better way to think about it is - what are the invariants to our current architecture? Why can't you tell Claude to build you a 1B$ business, make no mistakes?

I have no doubt they will be better programmers than almost every human that has ever existed. But the role of a SWE will expand to fill the gaps that the LLM paradigm hasn't filled:

- Accountability

- Long term architectural vision, goal setting

- Everchanging business context

- Mercurial executives, people problems, relationships etc...

throwatdem12311 · 2026-05-26T11:53:03 1779796383

Token efficiency is going to be the next big thing.

Tokenmaxxing an army of juniors will destroy your business through slop induced tech debt and API costs. A senior that uses AI but is token efficient will be like rocket fuel.

dogleash · 2026-05-26T15:48:17 1779810497

>rocket fuel

Did you write this comment with AI, or can you explain why so many people use the exact same terrible metaphor?

nicman23 · 2026-05-26T05:50:12 1779774612

people said the same with any innovation

javier123454321 · 2026-05-26T10:33:39 1779791619

And you act like there hasn't been a loss once we moved away from the master craftsman style of building to the professionalized architect style of building. We cannot make a gothic cathedral amymore. also CAD, homogenized the built environment, significantly. And we have been losing a lot of traditional, artisanal craftsmen art forms over the past century. artisanal craft mounds,

ant6n · 2026-05-26T05:57:43 1779775063

There at a lot of crafts that don’t have real deep experts anymore because the work was 90% automated.

herrherrmann · 2026-05-26T05:52:57 1779774777

Did they? Genuine question, because I do wonder if people in some industries in the past were ever anxious about these specific things (especially skill attrition).

dogleash · 2026-05-26T16:04:03 1779811443

> I do wonder if people in some industries in the past were ever anxious about these specific things (especially skill attrition).

I've spoken with some people (now in their 60s & 70s) that worried about skill atrophy in their line of work.

First they worried about atrophy. Then they watched skill dry up. Now they know it's not available to buy anywhere. In the better cases the skills still exist, but entirely overseas.

These are people I could recognize as sharp engineers, even if I don't know their domains at all. I had to take them at their word about the value in what was lost. The problem is that it's easy to assume that business (or at least society) would prevent degradation of valuable knowledge over time.

bad_username · 2026-05-25T05:53:22 1779688402

My experience with ChatGPT as a search engine - it is totally paranoid about checking and re-checking its answers by referencing them in multiple places (I usually read its thinking output). I have not seen an outright hallucination for at least a year. (It is of course a different situation with Google's "AI summary" which is wrong half of the time.)

forgetfreeman · 2026-05-25T06:02:08 1779688928

Ironically I quit using ChatGPT a while back. I decided to run it through it's paces and asked it some rather detailed questions about a range of topics that I have significant domain knowledge on. Without exception the responses I got back where glibly superficial to the point the responses were almost totally devoid of meaningful information. The AI summary on Google search results is so bad it represents an assault on reason.