GNU core utils is 134 lines of code, not 50, so the Rust version is even slightl...

3eb7988a1663 · 2025-05-27T05:55:05 1748325305

That reddit thread has some amazing benchmarks.

The GNU-yes

  $ yes | pv > /dev/null
  ... [10.2GiB/s] ...

The way I (not a C programmer) would have written it

  #include <unistd.h>

  int main(void) {
      while (write(1, "y\n", 2) > 0); // 1 is stdout
      return 1; // only reached if write fails
  }

  $ gcc yes.c -o yes
  $ ./yes | pv > /dev/null
  ... [6.21 MiB/s] ...

pona-a · 2025-05-27T08:23:18 1748334198

As a non-system-programmer, here's my attempt in Odin.

  yes | pv > /dev/null
  0:00:15 [1.12GiB/s]
  build/yes | pv > /dev/null
  0:00:20 [1.03GiB/s]


  package main
  
  import "core:sys/linux"
  import "core:os"
  import "core:strings"
  
  main :: proc() {
    msg := "y" if len(os.args) == 1 else os.args[1]
    msg = strings.concatenate({msg, "\n"})
  
    buf := transmute([]u8) strings.repeat(msg, 8192)
    for {
      linux.write(linux.STDOUT_FILENO, buf)
    }
  }

forgotpwd16 · 2025-05-27T08:50:48 1748335848

Replace `write(..)` with `puts("y")` and you'll be an order of magnitude faster. This is because `puts` (and `printf` too) is buffered: data isn't written to the terminal or file immediately but held in memory until the buffer fills or is flushed. Improving on this further (as seen in the reddit thread) gets you to GNU yes.

GoblinSlayer · 2025-05-27T09:01:32 1748336492

It's line buffered when it prints to terminal.

jesprenj · 2025-05-27T11:38:37 1748345917

One rarely needs yes' output to be a terminal.

throwawaymaths · 2025-05-27T09:35:59 1748338559

Don't you actively want it to flush asap since you're usually piping into another program?

I suspect what you suggest creates a more voluminous dump but is slower in the desired use case

aequitas · 2025-05-27T17:36:22 1748367382

  yes &

A few times is still my favorite way to push a CPU to max temperature for testing. I used it a lot to detect faulty Core 2 Duo MacBooks back in the day. They would short-circuit some CPU sensor due to thermal expansion or melting of the wire insulation. `yes` was an easy way to get the CPUs hot enough.

Macha · 2025-05-27T10:39:05 1748342345

If you compile your variant with -O3, I imagine it will be much faster? IIRC, the default for GCC is to not optimise.

imurray · 2025-05-27T12:17:21 1748348241

No, it will be about the same. The algorithm is wrong (calling write repeatedly) and -O3 isn't sufficient to rewrite that.

phh · 2025-05-27T07:43:49 1748331829

Which implies you get pretty much 3M syscalls per second. That's a good order of magnitude to know.

nasretdinov · 2025-05-27T08:53:34 1748336014

I don't believe puts is performing unbuffered I/O though. It's a libc function, not a direct syscall. Correct me if I'm wrong of course

lieks · 2025-05-27T11:38:03 1748345883

The write(2) libc function is just a C wrapper for the syscall. It's the functions from stdio.h that are buffered.

nasretdinov · 2025-05-27T12:26:19 1748348779

Ah ok, sorry, I got confused by the comment nesting level. I thought we were talking about the OpenBSD version, which uses puts.

nasretdinov · 2025-05-27T06:43:54 1748328234

In this case the OpenBSD version does a much better job imo (although I don't agree with the lack of braces). The performance of such a tool does not matter at all, and a larger implementation is not only unnecessary, but it can actually introduce bugs in otherwise completely straightforward code.

mrweasel · 2025-05-27T09:18:57 1748337537

The OpenBSD version of true is also amazing: https://cvsweb.openbsd.org/cgi-bin/cvsweb/~checkout~/src/usr...

The GNU version of true/false is more interesting. All the logic is in true.c; false.c just redefines EXIT_STATUS and includes all of true.c. https://github.com/coreutils/coreutils/blob/master/src/false...

johnisgood · 2025-05-27T11:22:42 1748344962

For reference, here is true.c: https://github.com/coreutils/coreutils/blob/master/src/true.....

AlecSchueler · 2025-05-27T13:39:51 1748353191

So basically they also introduced the complexity of respecting --help and --version.

johnisgood · 2025-05-27T14:55:27 1748357727

They did, because it is written somewhere that all GNU programs must conform to having --help and --version. I forgot where I read it.

1718627440 · 2025-05-29T11:51:12 1748519472

The GNU Coding Standard: https://www.gnu.org/prep/standards/standards.html#Command_00...

layer8 · 2025-05-27T10:16:00 1748340960

> This is as simple as it gets

It unnecessarily duplicates the for loop. I would have written something like:

    char *what = argc > 1 ? argv[1] : "y";
    for (;;)
        puts(what);

williamdclt · 2025-05-27T11:25:56 1748345156

What a waste of 8 bytes! :)

layer8 · 2025-05-27T11:32:43 1748345563

It’s not about bytes, it’s about duplicating logic that should inherently be the same. If you change something about the loop or the puts, you now have to take care to change it identically in two places to be consistent. That’s a situation that should be avoided, and is what makes it not “as simple as it gets”.

williamdclt · 2025-05-27T12:21:33 1748348493

I was being humorous, but tbh it’s not so clear cut!

In 99% of cases, yes of course you’re right, factor this loop.

In this specific case? This is trivial code, that will likely _never_ change. If it does change, it’s extremely unlikely that the two loops would accidentally diverge (the dev would likely not miss one branch, tests would catch it, reviewers would catch it). So if you get any upside by keeping the two loops, it might be worth it.

Here you get 8 bytes back. I honestly can’t see how that would ever matter, but hey it’s _something_, and of course this is a very old program that was running on memory-constrained machines.

So it’s a trade-off of (minor) readability versus (minor) runtime optimisation. I think it’s the better choice (although it’s very minor).

Or maybe there’s a better reason they chose this pattern… can’t imagine the compiler would generate worse code, but maybe it did back in the day?

layer8 · 2025-05-27T12:39:02 1748349542

I agree that it’s borderline pedantic for this simple code, but I also find it an obvious code smell, contradicting the “as simple as it gets”.

If you consistently deduplicate code that is supposed to do the same and evolve the same, then any duplicated code sticks out as a statement of “this isn’t the same”, and in the present case it then makes you wonder what is supposed to be different about both cases. In other words, such code casts doubt on one’s own understanding, raising the question whether one might be overlooking an important conceptual reason for why the code is being kept duplicated. So in that sense I disagree that the duplicated version is more readable, because it immediately raises unanswered questions.

About possible performance reasons, those need an explanatory comment, exactly for the above reason. And also, if performance reasons warrant complicating the code, then it isn’t “as simple as it gets” any more. I was commenting because I disagreed with that latter characterization.

williamdclt · 2025-05-27T16:03:14 1748361794

I agree it's a code _smell_. But a "smell" doesn't mean that something is necessarily wrong, just that there's a clue that it might be wrong.

> in that sense I disagree that the duplicated version is more readable

I didn't say it is, I agreed it's _less_ readable. I said it's trading off readability for 8 bytes of memory at runtime.

> If you consistently deduplicate code that is supposed to do the same and evolve the same, then [...]

I agree with all this. I'm not saying to consistently go for the deduplicated approach (I don't think anyone would say that), I'm saying it's a reasonable trade-off in this specific case (each branch is still trivial, and the code won't evolve much if at all).

> About possible performance reasons, those need an explanatory comment, exactly for the above reason.

Agreed.

> if performance reasons warrant complicating the code, then it isn’t “as simple as it gets” any more. I was commenting because I disagreed with that latter characterization.

Also agreed.

johnisgood · 2025-05-27T12:58:12 1748350692

To some, the current way is more readable than yours, though. It is much more explicit, and that often is a good thing.

layer8 · 2025-05-27T13:22:51 1748352171

I disagree that it’s more explicit. The case distinction and the loop and the puts are exactly the same in both code variants. You can replace the ternary operator with an `if` if that’s bothering you; that wasn’t the point of the change. The point is to first determine what should be output repeatedly, and then to output it, because the output logic is independent from what is being output (in particular, `yes` and `yes y` should be guaranteed to have identical behavior). I don’t really see what’s non-explicit about that. Rather to the contrary, it makes it explicit that the output logic is intended to be independent from what is being output.

johnisgood · 2025-05-27T13:40:46 1748353246

How about:

  for (;;) {
    if (argc > 1)
      puts(argv[1]);
    else
      puts("y");
  }

?

You said "it’s about duplicating logic that should inherently be the same", but that is exactly how it is more explicit, by having this duplication. I assume your problem is with the two "puts()"?

layer8 · 2025-05-27T14:01:51 1748354511

Your proposal is a bit better than the original, although it still duplicates the puts (imagine a variant where you’d want to handle I/O errors), and some will be bothered by the fact that the same unchanging condition is being retested in each loop iteration (the compiler may even warn about it).

But still, I don’t see why you wouldn’t first name what you want to output before starting the outputting. If anything, I’d place the whole output loop in a separate function and have two calls to that function. Nevertheless, it’s even better to express in code the fact that the program doesn’t want to make a distinction between a literal “y” and an argument “y”, by consolidating them into the same variable.

Another way to do this would be to have a static default argument array containing the “y”, and for example having:

    if (argc <= 1) { argv = default_argv; }
    for (;;) { puts(argv[1]); }

This would make explicit the fact that the argument-less invocation is merely a shortcut for an invocation with an argument and doesn’t otherwise provide any new or different behavior.

Though I think the separate variable (what) is clearly preferable.

johnisgood · 2025-05-27T14:16:45 1748355405

I have made the decision to use your "what" method many times before, but in this particular case I do not see the reason to do that, and perhaps this is what I have an issue with. There are many cases in which I would definitely use "what".

williamdclt · 2025-05-27T15:58:05 1748361485

That's a whole lot less efficient than both other options: the condition will be checked on every loop iteration, and that's not cheap.

johnisgood · 2025-05-27T16:06:11 1748361971

Oh, I know. :P