Hacker News | RadiozRadioz's comments

Many of them do, e.g. Java. Why do some mainstream languages like Python not support it? Entirely a design preference, usually because they want less emphasis on the type system.

Java supports it? Are you conflating method overloading (which is statically resolved) with multimethods?

These are good general tips, applicable to other languages too. I strongly dislike when code returns errors as arbitrary strings rather than classes, as it makes errors extremely difficult to handle; one would presumably want to handle an HTTP 502 differently from a 404, but if a programmer returns that in a string, I have to do some wonky regex instead of checking the type of the error class (or pulling a property from it). I've commonly found JS and Go code particularly annoying in this respect, as they tend to use strings, as the author mentioned.

An additional thing that is useful here would be a stack trace. So even when you catch, wrap & rethrow the error, you'll be able to see exactly where the error came from. The alternative is searching in the code for the string.
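In Python, for instance, exception chaining gives you exactly this for free. A minimal sketch (the `ConfigError` name is hypothetical):

```python
class ConfigError(Exception):
    """Domain-level error that wraps a lower-level cause."""

def load_config(path):
    try:
        with open(path) as f:
            return f.read()
    except OSError as e:
        # "raise ... from e" chains the original exception, so the traceback
        # shows exactly where the underlying error came from -
        # no searching the code for the message string.
        raise ConfigError(f"could not load config: {path}") from e

try:
    load_config("/nonexistent/app.conf")
except ConfigError as e:
    assert isinstance(e.__cause__, OSError)  # original error is preserved
```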

For the hate they seem to get, checked exceptions with error classes do give you a lot of stuff for free.


I couldn't agree more. I was surprised by the default Go error handling when I switched to the language a few years ago. Any meaningful REST API implementation, as you say, needs to know what to return to the user. Perhaps there is an error for the user, and then an error for the logs. With the default Go mechanism, it's too easy to return system information to the user, potentially revealing database schemas.
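One way to keep the two audiences separate is an error type that carries a sanitized user-facing message alongside the internal detail. A sketch in Python (the `APIError` and `handle` names are hypothetical):

```python
class APIError(Exception):
    """Carries a safe, user-facing message separately from internal detail."""
    def __init__(self, public_message, internal_detail, status=500):
        super().__init__(internal_detail)
        self.public_message = public_message
        self.internal_detail = internal_detail
        self.status = status

def handle(err):
    # Log the full detail; return only the sanitized message to the client.
    log_line = f"{err.status}: {err.internal_detail}"
    response = {"error": err.public_message}
    return log_line, response

err = APIError("Something went wrong saving your order.",
               'duplicate key violates unique constraint "orders_pkey"',
               status=500)
log_line, response = handle(err)
assert "orders_pkey" not in response["error"]  # schema details stay in the logs
```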

I think what you want are not dedicated classes but error codes.

If you find yourself needing to branch on error classes it may mean error handling is too high up.

PS: personally I always prefer string error codes, e.g. "not-found", as opposed to numeric ones like 404.


No, I want dedicated classes. Be they thrown or returned as a value. Error codes are limiting and serve a different purpose.

Error codes convey only the type of error that occurred and cannot carry any more data. With an error class you can provide context: if a 400 happened when making a request, which URL was hit? What did the server say? Which fields in our request were incorrect? From a code perspective, if an error happens I want to know as much detail as possible about it, and that simply cannot be summarised by an error code.

If I want to know the type of an error and do different things based on its type, I can think of no better tool to use than my language's type system handling error classes. I could invent ways to switch on error codes (I hope I'm using a language like Rust that would assert that my handling of the enum of errors is exhaustive), but that doesn't seem very well-founded. For example, using error enums, how do I describe that an HTTP_404 is a type of REQUEST_ERROR, but not a type of NETWORK_CONN_ERROR? It's important to know if the problem is with us or the network. I could write some one-off code to do it, or I could use error classes and have my language's typing system handle the polymorphism for me.

Not that error codes are not useful. You can include an error code within an error class. Error codes are useful for presenting to users so they can reference an operator manual or provide it to customer support. Present the user with a small code that describes the exact scenario instead of an incomprehensible stack trace, and they have a better support experience.

Side note: please don't use strings for things that have discrete values that you switch on. Use enums.
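That side note can be sketched in Python with `enum`; the `ErrorCode` members here are illustrative:

```python
from enum import Enum

class ErrorCode(Enum):
    NOT_FOUND = "not-found"
    RATE_LIMITED = "rate-limited"

def describe(code: ErrorCode) -> str:
    # A typo like ErrorCode.NOT_FOUD fails at attribute lookup,
    # whereas a misspelled string would silently fall through.
    if code is ErrorCode.NOT_FOUND:
        return "resource does not exist"
    if code is ErrorCode.RATE_LIMITED:
        return "retry after a delay"
    raise ValueError(code)

assert describe(ErrorCode.NOT_FOUND) == "resource does not exist"
# The enum still round-trips cleanly to/from the wire format:
assert ErrorCode("not-found") is ErrorCode.NOT_FOUND
```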


You need some kind of grouping, otherwise any simple action, e.g. one involving a single IO operation, would require handling dozens of different classes.

Yes of course. That's why I mentioned polymorphism. A FileNotFoundException and NoDiskSpaceException can inherit from IOException for example. With polymorphic error classes, a caller can decide if they want to handle the different cases individually, or just catch the overarching IOException.

All this flexibility comes for free when you use your language's type system, whereas with plain error codes you would have to implement grouping yourself manually with some kind of lookup table.
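Python's built-in exception hierarchy already works this way: `FileNotFoundError` is a subclass of `OSError`, so a caller can be as specific or as general as they like:

```python
def read_file(path):
    with open(path) as f:
        return f.read()

specific = general = False

# Handle the specific case individually...
try:
    read_file("/no/such/file.txt")
except FileNotFoundError:
    specific = True

# ...or just catch the overarching base class.
try:
    read_file("/no/such/file.txt")
except OSError:
    general = True

assert specific and general
assert issubclass(FileNotFoundError, OSError)  # grouping is free via inheritance
```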


Agree on error codes, but I disagree on branching. An API request failing with 429 is retryable (after a period), but a 401 isn't. A 401 might require refreshing an authorisation token, but a 400 maybe needs the user to change their input.

> I always prefer string error codes

My parent company provides an API for us to use for “admin-y” things, but they use stringy errors in the response body. Except they’re in Mandarin, and you’d be surprised (or maybe not) at how many tools barf at that. Getting them to localise the error codes is as likely to happen as us fixing the Referer header. The really nice thing about status codes is that there’s only a fixed number of them, so you don’t get two slightly different responses from two different endpoints (not-found vs not_found), and there are no locale issues involved.


I didn't say not to branch on error code.

An error _code_ is code; it shouldn't be localized.


I generated 5 jokes and didn't understand any of them.

Here's an example Python programming joke as it was presented:

> Why did the Python go to therapy? Because it had too many "issues" with its "exceptions"!


Can you explain what is odd about this? I think it seems perfectly logical.

The tuple type definition _looks_ like how the tuple will look. The nullable indicator is a symbol with a delightfully appropriate semantic meaning.


Typically I get tired of a side project when I stop learning from it. There's a honeymoon period at the start when you're rapidly prototyping things and figuring it out. Then you get to the point where you've explored the problem domain, figured out what works, and have a clear path ahead - the rest of the work being to just implement the vision. This is the part where I get bored.

Something switches in my brain when I've figured out how to solve a problem, it starts to seek a new problem because it considers that one "done", even though most of the work remains. Perhaps it's because that work is more "painting by numbers" after the problem is figured out. Does anyone else experience this?


That's similar to how it is with me for my personal projects. But I always focus on the technical aspect or the subject matter alone without attempting to make money from it.

For some income I'd be fine with maintaining something long term, I'm just not the person to bootstrap a project into that.


I've found that having users and fixing/building stuff for them can get the passion back, even if it's minor things where I don't learn anything technical.

Just getting the feedback, implementing it, deploying the next day and sending "hey, I loved your feedback, so I implemented your suggestion, lemme know what you think".

I personally enjoy this.


The same thing happens to me. I wonder if we could create a market for "micro exits" like what OP wants.

I feel the same way. Projects are only fun as long as you are challenged and learning new things. Once it becomes routine, the fun is gone.

I have not found a solution to this apart from just getting done with the boring things as quickly as possible.


You're describing the difference between a hobby and a job. And it's perfectly OK to have a hobby. It's desirable to have a job.

In other words, a job offers you money for adding value to society. Most of us need this to live. Adding value is usually "work" because it involves many things, most parts of which are not fun. (If they were fun, there'd be little value; people would just do it themselves.)

A hobby however is the "fun" part without the "work" part. The value added is usually marginal. For example I've been doing ceramics as a hobby. It's lots of fun and the rubbish monstrosities I create are not really valuable.

If I applied myself, I could churn out bowls that are sellable. But frankly, where's the fun in that? Making 4 identical vases for sale would rob all the joy from it.

Understanding the difference is key to your long term satisfaction. My work gets me paid, so I need to do all the boring bits, and the better I do that the more I get paid. I get satisfaction from doing the job well, and having happy customers, but there's a lot of grind involved. Maybe 80% grind to 20% fun.

My hobby is the reverse. 80% fun, 20% grind. I get satisfaction from pushing my skills to the next level, even if the results are far from perfect, and frankly not sellable. (I'll give away pieces to someone if they like it, but I won't sell it.)

So, to answer your question, yes everyone experiences this. It is quite literally the definition of "work".


Story of my goddamn life right here

If you reframe it slightly, it can make sense. Those x hundred engineers are working on y hundred features / integrations. Do you need all of those? Do you want all of those? I bet a handful of those engineers are currently working on a brand new UI redesign that will move all the buttons you're used to. One of the engineers is adding a new cookie popup & enforcing SMS 2FA as we speak.

One of the things I dislike about modern software is the constant bloat and churn, because there are so many customers and so many different incentives for software companies to keep pushing features ad infinitum. In contrast, home-grown software like this has one customer and they know exactly what they want. It doesn't matter that a theoretical home-grown app doesn't integrate with the 10 social networks the user doesn't use, because it integrates perfectly with the one they do use.

This person isn't rebuilding the entirety of Obsidian, they're rebuilding the subset of parts they actually use and get value from, which is a much smaller project. By intelligently narrowing your scope like this, making stuff yourself is totally viable. Reframe "limiting" as "targeted".


This comment appears frequently and always surprises me. Do people just... not know regex? It seems so foreign to me.

It's not like it's some obscure thing, it's absolutely ubiquitous.

Relatively speaking it's not very complicated, it's widely documented, has vast learning resources, and has some of the best ROI of any DSL. It's funny to joke that it looks like line noise, but really, there is not a lot to learn to understand 90% of the expressions people actually write.

It takes far longer to tell an AI what you want than to write a regex yourself.


"It takes far longer to tell an AI what you want than to write a regex yourself."

My experience is the exact opposite. Writing anything but the simplest regex by hand still takes me significant time, and I've been using them for decades.

Getting an LLM to spit out a regex is so much less work. Especially since an LLM already knows the details of the different potential dialects of regex.

I use them to write regexes in PostgreSQL, Python, JavaScript, ripgrep... they've turned writing a regex from something I expect to involve a bunch of documentation diving to something I'll do on a whim.

Here's a recent example - my prompt included a copy of a PostgreSQL schema and these instructions:

  Write me a SQL query to extract
  all of my images and their alt
  tags using regular expressions.
  In HTML documents it should look
  for either <img .* src="..." .*
  alt="..." or <img alt="..." .*
  src="..." (images may be self-
  closing XHTML style in some 
  places). In Markdown they will
  always be ![alt text](url)
I ended up with 100 lines of SQL: https://gist.github.com/simonw/5b44a662354e124e33cc1d4704cdb...

The markdown portion of that is a good example of the kind of regex I don't enjoy writing by hand, due to the need to remember exactly which characters to escape and how:

  (REGEXP_MATCHES(commentary,
  '!\[([^\]]*)\]\(([^)]*)\)', 'g'))[2] AS src,
  (REGEXP_MATCHES(commentary,
  '!\[([^\]]*)\]\(([^)]*)\)', 'g'))[1] AS alt_text
Full prompt and notes here: https://simonwillison.net/2025/Apr/28/dashboard-alt-text/


Perhaps Perl has given me Stockholm Syndrome, but when I look at your escaped regex example, it's extremely natural for me. In fact, I'd say it's a little too simple, because the LLM forgot to exclude unnecessary whitespace:

  (REGEXP_MATCHES(commentary,
  '!\[\s*([^\]]*?)\s*\]\(\s*([^)]*?)\s*\)', 'g'))[2] AS src,
  (REGEXP_MATCHES(commentary,
  '!\[\s*([^\]]*?)\s*\]\(\s*([^)]*?)\s*\)', 'g'))[1] AS alt_text
That is just nitpicking a one-off example though, I understand your wider point.

I appreciate the LLM is useful for problems outside one's usual scope of comfort. I'm mainly saying that I think it's a skill where the "time economics" really are in favor of learning it and expanding your scope. As in, it does not take a lot learning time before you're faster than the LLM for 90% of things, and those things occur frequently enough that your "learning time deficit" gets repaid quickly. Certainly not the case for all skills, but I truly believe regex is one of them due to its small scope and ubiquitous application. The LLM can be used for the remaining 10% of really complicated cases.

As you've been using regex for decades, there is already a large subset of problems where you're faster than the LLM. So that problem space exists, it's all about how to tune learning time to right-size it for the frequency the problems are encountered. Regex, I think, is simple enough & frequent enough where that works very well.


> As in, it does not take a lot learning time before you're faster than the LLM for 90% of things, and those things occur frequently enough that your "learning time deficit" gets repaid quickly.

It doesn't matter how fast I get at regex, I still won't be able to type any but the shortest (<5 characters) patterns out quicker than an LLM can. They are typing assistants that can make really good guesses about my vaguely worded intent.

As for learning deficit: I am learning so much more thanks to heavy use of LLMs!

Prior to LLMs the idea of using a 100 line PostgreSQL query with embedded regex to answer a mild curiosity about my use of alt text would have finished at the idea stage: that's not a high value enough problem for me to invest more than a couple of minutes, so I would not have done it at all.


Good points. Also, looking at your original example, I noticed that humans can not only explain the regularities they expect in many different ways (correcting along the way), they can also ask the LLM to base the result on a reference. In your example you provided a template with an img tag and brackets showing the different attribute orderings. But one can also just ask for an HTML-style tag, as I did with "Please create a regex for extracting image file names when an HTML-style img tag appears in a text" (not posting it here, but "src" is clearly visible in the answer). So "knowledge" from other domains is applied to the regex creation.


I know regex. But I use it so sparingly that every time I need it I've forgotten again the character for word boundary, or the character for whitespace, or the exact incantation for negative lookahead. Is it >!? Who knows.

A shortcut to type in natural language and get something I can validate in seconds is really useful.


How do you validate it if you don’t know the syntax? Or are you saying that looking up syntax –> semantics is significantly quicker than semantics –> syntax? Which I don’t find to be the case. What takes time is grokking the semantics in context, which you have to do in both cases.


In my case most of my regex is for utility scripts for text processing. That means that I just run the script, and if it does what I want it to do I know I'm done.

LLMs have been amazing in my experience putting together awk scripts or shell scripts in general. I've also discovered many more tools and features I wouldn't have otherwise.



That doesn’t answer the question. By “validate”, I mean “prove to yourself that the regular expression is correct”. Much like with program code, you can’t do that by only testing it. You need to understand what the expression actually says.


Testing something is the best way to prove that it behaves correctly in all the cases you can think of. Relying on your own (fallible) understanding is dangerous.

Of course, there may be cases you didn't think of where it behaves incorrectly. But if that's true, you're just as likely to forget those cases when studying the expression to see "what it actually says". If you have tests, fixing a broken case (once you discover it) is easy to do without breaking the existing cases you care about.

So for me, getting an AI to write a regex, and writing some tests for it (possibly with AI help) is a reasonable way to work.
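That workflow can be as lightweight as a pattern plus a table of cases; the ISO-date pattern below is just a stand-in for whatever the LLM produced:

```python
import re

# Suppose an LLM produced this pattern for matching ISO dates (YYYY-MM-DD).
iso_date = re.compile(r"^\d{4}-\d{2}-\d{2}$")

# A small table of cases pins down the behaviour we actually care about;
# if the pattern is later regenerated or tweaked, the cases hold it in place.
cases = [
    ("2024-01-31", True),
    ("2024-1-31", False),    # month must be two digits
    ("20240131", False),     # separators required
    ("2024-01-31 ", False),  # trailing whitespace rejected
]
for text, expected in cases:
    assert bool(iso_date.fullmatch(text)) == expected, text
```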


I don’t believe this is true. That’s why we do mathematical proofs, instead of only testing all the cases one can think of. It’s important to sanity-check one’s understanding with tests, but mere black-box testing is no substitute for the understanding.


Code is not perfect like math imho

libraries some times make weird choices

in theory theory and practice are the same, in practice not really

in the context of regex, you have to know which dialect and which language's flavour of regex you're targeting, for example. it's not really universal how all libs/languages work

thus the need to test


Notice that site has a very usable reference list you can consult for all those details the GP forgets.


I was using perl in the late 90s for sysadmin stuff, have written web scrapers in python and have a solid history with regex. That being said, AI can still write really complex lookbehind/lookahead/nested extraction code MUCH faster and with fewer bugs than me, because regex is easy to make small mistakes with even when proficient.


I respectfully disagree. Thankfully, I don't need to write regex much, so when I do it's always like it's the first time. I don't find the syntax particularly intuitive and I always rely on web-based or third party tools to validate my regex.

Whenever I have worked on code smells (performance issues, flaky test failures, etc.), regex was third only to poorly written SQL queries and network latency.

All-in-all, not a good experience for me. Regex is the one task that I almost entirely rely on GitHub Copilot in the 3-4 times a year I have to.


IME it's not just longer, but also more difficult to tell the LLM precisely what you want than to write it yourself if you need a somewhat novel RegExp, which won't be all over the training data.

I needed one to do something with Markdown which was a very internal BigCo thing to need to do, something I'd never have written without weird requirements in play. It wasn't that tricky, but going back trying to get LLMs to replicate it after the fact from the same description I was working from, they were hopeless. I need to dig that out again and try it on the latest models.


There's often a bunch of edge cases that people overlook. And you also get quadratic behaviour for some fairly 'simple' looking regexes that few people seem aware of.
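A sketch of the classic trap, here with exponential rather than merely quadratic blow-up; the patterns are illustrative:

```python
import re

# Nested quantifiers like (a+)+ are a backtracking trap: when the overall
# match FAILS, the engine tries every way of splitting the run of "a"s
# between the inner and outer quantifier, which grows ~2^n with input length.
evil = re.compile(r"^(a+)+b$")
# Without the redundant outer group, the same strings match in linear time.
safe = re.compile(r"^a+b$")

assert evil.match("aaab") and safe.match("aaab")  # identical on success

text = "a" * 18  # no trailing "b", so both patterns fail to match
assert safe.match(text) is None  # fails fast
assert evil.match(text) is None  # already slow; much larger n would hang
```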


I use regex as an alternative to wildcards in various apps like notepad++ and vscode. The format is different in each app. And the syntax is somewhat different. I have to research it each time. And complex regex is a nightmare.

Which is why I would ask an AI to build it if it could.


I personally didn’t really understand how to write regex until I understood “regular languages” properly, then it was obvious.

I’ve found that the vast majority of programmers today do not have any foundation in formal languages and/or the theory of computation (something that 10 years ago was pretty common to assume).

It used to be pretty safe to assume that everyone from perl hackers to computer science theorists understood regex pretty well, but I’ve found it’s increasingly a rare skill. While it used to be common for all programmers to understand these things, even people with a CS background view that as some annoying course they forgot as soon as the exam was over.


The first language I used to solve real problems was perl, where regex is a first class citizen. In python less so; most of my python scripts don't use it. I love regex but know several developers who avoid it like the plague. You don't know what you don't know, and there's nothing wrong with that. LLMs are super helpful for getting up to speed on stuff.


Regex, especially non standard (and non regular) extensions can be pretty tricky to grok.

http://alf.nu/RegexGolf?world=regex&level=r00


/foo/

took me 25.75 seconds, including learning how the website worked. I actually solved it in ~15 seconds, but I hadn't realized I got the correct answer because it was far too simple.

This website is much better https://regexcrossword.com/challenges/experienced/puzzles/e9...


Great, except you misunderstood the problem and wrote the exact opposite solution here.

Also this is the easiest starter puzzle, once you solve it you can click through to the next ones with increasing difficulty.


It's something you use so sparingly that it never sticks around.


A cheat sheet is just a web search away.


So is an LLM.



Since you know so much regex, why don't you write an HTML parser in regex? /s


That just sounds irresponsible. The correct choice for prod isn't "the cool new trendy thing that will solve all our problems once it hits 1.0", the correct choice is "the boring stable thing that has existed long enough for everyone to understand its shortcomings".


I think their lunch sandwiches are perfectly acceptable; I used to eat a Sainsbury's meal deal every day. Anecdotally I find the UK's prepackaged supermarket sandwiches are much nicer than the equivalents on the continent.


Appears to be a fediverse social network like Mastodon, there's a demo on their homepage: https://bonfirenetworks.org/


But what kind? Microblogging, macroblogging, link aggregation, forum, imageboard, etc.?


There are plugins for all of this and more: kanban boards, tools for open science (peer review and the like), collaboration features, etc.

