To achieve that it is enough to hash inputs, and cache resulting outputs. Repeat...

klysm · 2025-03-26T19:25:35 1743017135

Outputs are used as inputs later. If everything is deterministic, you can actually cache everything by hash

mschuster91 · 2025-03-26T18:38:14 1743014294

> To achieve that it is enough to hash inputs, and cache resulting outputs.

Thing is, inputs can be nondeterministic too - some programs (used to) embed the current git commit hash into the final binary so that a `./foo --version` gives a quick and easy way for bug triage to check if the user isn't using a version from years ago.

telotortium · 2025-03-26T19:52:45 1743018765

Adding the Git hash is reproducible, assuming you build from a clean tree (which the build script can check). Embedding the current date and time is the canonical cause of non-reproducibility, but that can be worked around in most cases by embedding the commit and/or author date of the commit instead.

layer8 · 2025-03-26T18:50:40 1743015040

This is only a problem if those nondeterministic inputs are actually included in the hash. This is often not the case, because the values are included implicitly in the build rather than explicitly.

(Just playing devil’s advocate here.)