The general version of this is called *inverse transform sampling* [0], which us...

tmoertel · 2025-03-13T22:01:26 1741903286

For the particular case of the exponential distribution we can go further. By taking advantage of the theory of Poisson processes, we can take samples using a parallel algorithm. It even has a surprisingly succinct SQL translation:

    SELECT *
    FROM Population
    WHERE weight > 0
    ORDER BY -LN(1.0 - RANDOM()) / weight
    LIMIT 100  -- Sample size.

Notice our exponentially distributed random variable on prominent display in the ORDER BY clause.

If you're curious, I explore this algorithm and the theory behind it in https://blog.moertel.com/posts/2024-08-23-sampling-with-sql....

lkuty · 2025-03-14T09:30:26 1741944626

Quite off-topic, but do you know when you'll write the article about CPS, if ever?

tmoertel · 2025-03-14T11:48:03 1741952883

Oops. I had quite forgotten that I need to write about that. I said I would over a decade ago, so that's a long time for you to wait. Sorry about that.

I mainly write for myself, so I need the time and the motivation. Until recently, my job at G took up my time and also provided an internal community where I could scratch the writing itch, which reduced the motivation for public writing on my blog. But now that I'm semi-retired, I'll try to write more frequently.

Thanks for the accountability!

evanb · 2025-03-13T23:33:00 1741908780

Inverse transform sampling is a special case of normalizing flow where we don't need to learn anythin.g

https://en.wikipedia.org/wiki/Flow-based_generative_model