More

conjectures · 2025-04-25T14:08:52 1745590132

'Let them eat cake.'

conjectures · 2025-04-16T16:54:09 1744822449

Common pattern where a bright spark asks, 'why you all so complicated?' Proceeds to assume we're dealing with a finite graph / set.

All the complication is needed to handle the fact that the state can be a random vector of real numbers in a possibly varying dimensional space. It's not jerking off on jargon for its own sake.

Sure, there are simple cases - doesn't make the general case 'bullshit'.

conjectures · 2025-03-13T17:18:46 1741886326

It does apply to people? When you read a copy of a book, you can't be sued for making a copy of the book in the synapses of your brain.

Now, if you have eidetic memory and write out large chunks of the book from memory and publish them, that's what you could be sued for.

tsimionescu · 2025-03-13T17:47:24 1741888044

This is not about memory or training. The LLM training process is not being run on books streamed directly off the internet or from real-time footage of a book.

What these companies are doing is:

1. Obtain a free copy of a work in some way.

2. Store this copy in a format that's amenable to training.

3. Train their models on the stored copy, months or years after step 1 happened.

The illegal part happens in steps 1 and/or 2. Step 3 is perhaps debatable - maybe it's fair to argue that the model is learning in the same sense as a human reading a book, so the model is perhaps not illegally created.

But the training set that the company is storing is full of illegally obtained or at least illegally copied works.

What they're doing before the training step is exactly like building a library by going with a portable copier into bookshops and creating copies of every book in that bookshop.

visarga · 2025-03-13T18:31:21 1741890681

But making copies for yourself, without distributing them, is different than making copies for others. Google is downloading copyrighted content from everywhere online, but they don't redistribute their scraped content.

Even web browsing implies making copies of copyrighted pages, we can't tell the copyright status of a page without loading it, at which point a copy has been made in memory.

tsimionescu · 2025-03-13T19:32:36 1741894356

Making copies of an original you don't own/didn't obtain legally is not fair use. Also, this type of personal copying doesn't apply to corporations making copies to be distributed among their employees (it might apply to a company making a copy for archival, though).

codedokode · 2025-03-14T01:48:02 1741916882

> But making copies for yourself, without distributing them,

If this was legal, nobody would be paying for software.

triceratops · 2025-03-13T17:39:50 1741887590

> When you read a copy of a book

They're not talking about reading a book FFS. You absolutely can be sued for illegally obtaining a copy of the book.

conjectures · 2025-03-07T22:57:52 1741388272

Not from what I've seen. The compiler is slow af which plays badly with how fussy the thing is.

It's easy to have no defects in functionality you never got around to writing because you ran out of time.

rescbr · 2025-03-08T01:36:05 1741397765

So it makes you think before typing and compiling the code?

Doesn’t look like a con to me :)

conjectures · 2025-03-11T10:40:01 1741689601

Here's a great life hack for you. Set a 15s delay whenever you type in an editor. That way you'll think more before writing code.

conjectures · 2025-03-07T22:51:50 1741387910

Ah, the good ole Rube Goldberg machine.

conjectures · 2025-02-20T15:49:26 1740066566

It's not so hard. One of the interview stages I did somewhere well known used this.

Here's the neural net model your colleague sent you. They say it's meant to do ABC, but they found limitation XYZ. What is going on? What changes would you suggest and why?

Was actually a decent combined knowledge + code question.

CharlieDigital · 2025-02-20T16:04:42 1740067482

There are so many interesting ways to use code reviews like subtly introducing defects and bugs and see if people can follow the logic, read the code, find where the reasoning comes up short.

I wrote up 7 general strategies for teams that are interested: https://coderev.app/blog/7-strategies-for-using-code-reviews...

conjectures · 2025-02-20T15:37:51 1740065871

> Ironically, vegetarianism really only started to become popular in the Western world once people lost their connection to farms

As did dental care and cars. Correlation is not causation.

conjectures · 2025-02-03T09:29:30 1738574970

It's a bad question. What is actually being tested here is whether the candidate can reel off an 'acceptable' motivation. Whether it is their motivation or not. This is asking questions that incentivize disingenuous answers (boo) and then reacting with pikachu shock when the obvious outcome happens.

conjectures · 2025-01-27T11:55:56 1737978956

I used to work for a historical records org. As of 10 years back, OCR was getting humans to transcribe such work. So whatever the limitations of genai, my prior is against there being a perfectly good old fashioned OCR solution to the 'obscure hisotrical handwriting' problem.

conjectures · 2025-01-06T16:44:38 1736181878

Hi Le Bot, say potato?