More

bjoli · 2026-04-21T20:23:52 1776803032

They compose. And can be passed around and be completely oblivious to how they will be reduced. With conj or sum or whatever they want. And you can extend them at any point at any end.

They are like map, filter and friends, but they compose. I think of iterators as an iterator protocol and transducers as a streaming protocol. An iterator just describes how to iterate over a collection. Transducers are transformations that can be plugged into any point where data goes in one direction.

css_apologist · 2026-04-22T03:26:07 1776828367

js iterators work over lazy streams

bjoli · 2026-04-22T14:20:23 1776867623

As I said, it is a protocol for iteration or data access. You cant take an iterator and hand it as a filter to a file reader. If I make a rot13 transducer I can hand it to a transduce function that transforms a collection. I can give it to a file reader as a transformer on any char.

Transducers are way to express transformations.

bjoli · 2026-04-21T16:13:14 1776787994

I made srfi-171 [0], transducers for scheme. If you have any questions about them in general I can probably answer them. My version is pretty similar to the clojure version judging by the talks Rich Hickey gave on them.

I know a lot of people find them confusing.

0: https://srfi.schemers.org/srfi-171/srfi-171.html

matrix12 · 2026-04-21T21:21:38 1776806498

thanks. this is going in my scheme.

bjoli · 2026-04-19T19:23:31 1776626611

Optimization level 2 in chez scheme does about 100 KLOC/s in my pretty modest machine, while also producing code that is pretty darn fast.

bjoli · 2026-04-19T06:24:09 1776579849

Hah! I wrote a unit converter for Android recently and that is one of the criticism I get. "Why does my conversion end up in becquerel?" It is usually because people forgot to divide by time, where they write something like "(31l/m2)/1min in mm" when they should have have written something like "(31l/m2)/1min in mm/h". Anyway, check it out here:

https://github.com/bjoli/Umits

I am about 6 days away from publishing and open beta (currently in mandatory closed testing). If you want to join the closed test, you can do so by mailing me at the email at the top of the readme.

bananaflag · 2026-04-19T07:09:33 1776582573

I think your interface is a bit inconsistent, this is why people ask that question.

If you have

65mi in 12mi/h -> 19500s

then instead of

12h in s -> 43200s

you should have

12h in s -> 43200

Then a unit at the end should mean that not all dimensions have been reduced.

In the same vein, in the README, the "weird results" section should come after the "dimension removal" section. The way it is now, the apparent "bug" comes before the feature.

bjoli · 2026-04-19T07:47:22 1776584842

You are right about this being confusing. I have thought about whether to adopt in as strict division or whether to be strict about in UNIT to have to produce UNIT. The first one will not resolve the issue of Umits selecting becquerel or Hz to represent N/s, but the second is not as much fun.

bananaflag · 2026-04-19T08:51:03 1776588663

I think the behaviour is good as it is, it is just the output display that should be consistent (as I suggested in my example).

bjoli · 2026-04-19T12:15:31 1776600931

Yes. But treating in a strict division isn't really what people expect. Then 12mi/h in km/h becomes "19.3xyz" not "19.3xyz km/h".

The least surprising thing would be to enforce unit output. If I say I want "in km/h" the output should be in km/h or show an error. It is however less fun. Getting becquerel when you forget a unit along the way is the kind of spice that makes life fun.

Treating "in" as strict division also doesnt solve the surprise of getting Bq or Hz when you accidentally end up with something that is N/s

bjoli · 2026-04-17T05:56:25 1776405385

And, in some ways, PyPy. I still think it is the sanest way to implement Python.

It makes me sad that I have to write C to make any meaningful changes to Python. Same goes for ruby. Rubinius was such a nice project.

Hacking on schemes and lisps made me realize how much more fun it is when the language is implemented in the language itself. It also makes sure you have the right abstractions for solving a bunch of real problems.

actionfromafar · 2026-04-17T09:07:37 1776416857

Well, one could rewrite Python (perhaps piece by piece?) in Shedskin.

Shedskin is very nearly Python compatible, one could say it is an implementation of Python.

anitil · 2026-04-17T05:57:09 1776405429

> And, in some ways, PyPy

What do you mean by that? I'm not familiar with PyPy

nxpnsv · 2026-04-17T06:01:24 1776405684

PyPy is python implemented in python. It is fast.

notpushkin · 2026-04-17T06:32:41 1776407561

https://pypy.org/

It lags behind CPython in features and currently only supports Python versions up to 3.11. There was a big discussion a month ago: https://news.ycombinator.com/item?id=47293415

But you can help! https://pypy.org/howtohelp.html

https://opencollective.com/pypy

Doxin · 2026-04-17T06:31:19 1776407479

PyPy is python implemented in RPython, which is technically a python subset. It's so restricted it might as well be a different language though.

bjoli · 2026-04-17T08:11:17 1776413477

It is restricted in a way that you would restrict yourself to write high speed software in most languages, and I found it is not that restrictive compared to C that you would have to use if you were to write a fast Python library.

Doxin · 2026-04-17T08:57:05 1776416225

oh for sure, but I still feel like telling people pypy is written in python is misleading. it's written in something significantly like python, but it's not python.

mjmas · 2026-04-17T10:15:36 1776420936

> technically a python subset

So it can just run under CPython? If so, then that isn't too misleading.

bjoli · 2026-04-17T11:15:16 1776424516

Yes. It can run under Cpython (2.7).

nxpnsv · 2026-04-18T04:33:03 1776486783

PyRPy is just less catchy sounding

wyldfire · 2026-04-17T13:02:05 1776430925

The fact that it's written in python is often brought up in order to explain its name. But really, it's much less interesting than the fact that it has a tracing JIT. If it were called PyJIT I'd bet it would be clearer and more obvious that it's fast. And people would prob get less hung up on the distinction between python/rpython.

bjoli · 2026-04-15T09:29:20 1776245360

And Hickey himself said he adapted ideas from Bagwell's HAMTs. And tries are 60 years old.

I have always thought Hickeys main contribution was making it default in a coherent way, and proved it could be done. Before clojure most peoplle still thought immutable data structures were too I practical.

swannodette · 2026-04-15T10:58:15 1776250695

That's a big contribution, also the original HAMTs are not a functional data structure. See Section 3.4.1 in https://docdrop.org/download_annotation_doc/3386321-trk2f.pd...

bjoli · 2026-04-15T13:24:27 1776259467

No, but persistent bit partitioned tries were pretty well known in the late 90s (I first met them in standard ML in 2005)

panick21_ · 2026-04-15T09:40:50 1776246050

I think the Clojure version does have some actual improvements over the Bagwell version, and some implementation tricks improvements as well. But I don't remember all the details.

bjoli · 2026-04-15T13:22:23 1776259343

Well, sure. But it is not like Hickey invented the 5bit partitioned trie (there is work in sml and Haskell before that), nor did he invent functional tries.

He took what was a research topic and made it standard. There were no other 5bit partitioned tries in (wide) use. I think he did that in a way that signals a fantastic sense of taste, and if you are implementing a programming language you need taste.

bjoli · 2026-04-15T09:00:05 1776243605

I know people using ref counting to support using allocation arenas for immutable structures. For some workloads that gives a pretty crazy performance boost.

Just pre-allocating leaf nodes can reduce iteration overhead by 40%.

bjoli · 2026-04-15T08:57:58 1776243478

He went on to implement https://github.com/hypirion/c-rrb Which are just like clojures vectors but has fast insertions/deletes and merges.

I semi-ported it to c# here: https://github.com/bjoli/RrbList/tree/main/src/Collections

It is faster than clojures vectors (running on the JVM, so apples and cucumbers) in all cases, and mostly beats scala's vectors except for splitting which is crazy fast in scala's vectors).

panick21_ · 2026-04-15T09:38:41 1776245921

Oh god, I remember, I tried to implemented this in Dylan once long ago. I didn't get very far but I really liked the data-structure:

https://github.com/nickik/RRB-Vector-in-Dylan/blob/master/RR...

bjoli · 2026-04-15T13:23:22 1776259402

I tried in scheme first, but failed miserably. Doing it in c# was easier since I could more directly compare code.

panick21_ · 2026-04-15T13:31:23 1776259883

Dylan wasn't the issue I failed, I found it nice to work with.

bjoli · 2026-04-15T08:50:56 1776243056

Those are not really the same. Those are N=32 finger trees which have extra benefits (quick slices, for example, quicker insertions).

bjoli · 2026-04-14T10:03:31 1776161011

AEPD are well known, even in the rest of the world. They have a different strategy compared to other countries. Ireland's DPC are also heavy handed, but focus on large companies mostly.

France's CNIL is also not bad. They are particularly hard against things like "you accidentally sign up for x y z services when only wanting to sign up to service A".

Gdpr in the EU is also miles ahead of what the US has, or at least what it has been enforcing for a long time.

rsynnott · 2026-04-14T12:08:04 1776168484

> Ireland's DPC are also heavy handed, but focus on large companies mostly.

Also, generally, very, very, VERY slow. The massive fines you hear about are usually for behaviour _years_ ago.

fakedang · 2026-04-14T10:19:56 1776161996

Is the CCPA anywhere near?