jmpeax · 2026-01-06T23:08:33 1767740913

I like the surface dots as they are. They give me two points of reference at the poles and add intuition for how long it takes to go around the sphere.

jmpeax · 2026-01-04T23:32:58 1767569578

From that Wikipedia article, delta is the ratio of y variance to x variance. If x variance is tiny compared to y variance (often the case in practice), won't we get an ill-conditioned model due to the large delta?

kevmo314 · 2026-01-05T07:24:27 1767597867

If you take the limit of delta -> infinity then you will get beta_1 = s_xy / s_xx which is the OLS estimator.

In the wiki page, factor out delta^2 from the sqrt and take delta to infinity and you will get a finite value. Apologies for not detailing the proof here, it's not so easy to type math...
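The limit described above can be checked numerically. The sketch below assumes the usual closed form for the Deming slope, beta_1 = (s_yy - delta*s_xx + sqrt((s_yy - delta*s_xx)^2 + 4*delta*s_xy^2)) / (2*s_xy), and uses made-up sample data; the function names are illustrative only.

```python
import math

def sample_moments(x, y):
    """Return (s_xx, s_yy, s_xy) using the n-1 denominator."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    s_xx = sum((xi - mx) ** 2 for xi in x) / (n - 1)
    s_yy = sum((yi - my) ** 2 for yi in y) / (n - 1)
    s_xy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / (n - 1)
    return s_xx, s_yy, s_xy

def ols_slope(x, y):
    """Ordinary least squares slope: s_xy / s_xx."""
    s_xx, _, s_xy = sample_moments(x, y)
    return s_xy / s_xx

def deming_slope(x, y, delta):
    """Deming slope for a given delta (ratio of y-error to x-error variance)."""
    s_xx, s_yy, s_xy = sample_moments(x, y)
    a = s_yy - delta * s_xx
    return (a + math.sqrt(a * a + 4 * delta * s_xy * s_xy)) / (2 * s_xy)

# Made-up data, for illustration only.
x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
y = [1.2, 1.9, 3.2, 3.8, 5.1, 6.1]

for delta in (1.0, 1e2, 1e4, 1e6):
    print(delta, deming_slope(x, y, delta), ols_slope(x, y))
```

As delta grows, the printed Deming slope settles on the OLS value rather than blowing up. For much larger delta, this direct formula subtracts two nearly equal numbers, so a rearranged form would be safer in floating point.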

jmpeax · 2025-12-18T20:31:54 1766089914

Don't get me started on "software architect".

tremon · 2025-12-19T00:21:13 1766103673

On classic big waterfall projects, you can find actual architects. Those are the ones drafting interfaces and delineating components/teams before the first source file is even committed.

jmpeax · 2025-12-19T11:17:15 1766143035

Actual architects design buildings.

tremon · 2025-12-19T23:23:22 1766186602

I'm sorry. My fault for engaging you, I guess.

9rx · 2025-12-18T21:05:57 1766091957

Even "code monkey" is generous.

jmpeax · 2025-12-17T13:22:50 1765977770

How is blocking ad blockers going to make them $150m?

jmpeax · 2025-12-16T08:06:09 1765872369

> They typically need to compare many or all points to each other, leading to O(N²) complexity.

UMAP is not O(n^2); it is O(n log n).

romanfll · 2025-12-16T08:45:32 1765874732

Thanks for your comment! You are right, the Barnes-Hut implementation brings UMAP down to O(N log N). I should have been more precise in the document. The main point is that even O(N log N) could be too much if you run this in a browser. Thanks for clarifying!

emil-lp · 2025-12-16T09:57:23 1765879043

If k=50, then I'm pretty sure O(n log n) beats O(nk).

romanfll · 2025-12-16T13:16:12 1765890972

You are strictly correct for a single pass! log2(9000) ≈ 13, which is indeed much smaller than k=50. The missing variable in that comparison is iterations. t-SNE and UMAP are iterative optimisation algorithms. They repeat that O(N log N) step hundreds of times to converge. My approach is a closed-form linear solution (Ax=b) that runs exactly once. So the wall-clock comparison is effectively: iterations * (N log N) vs. 1 * (N * k). That need for convergence is where the speedup comes from, not the complexity class per se.
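The arithmetic in this exchange can be laid out as a rough operation count. This is only a back-of-the-envelope sketch: it ignores constant factors, and the iteration count of 200 is an assumed stand-in for "hundreds", not a measured figure.

```python
import math

def single_pass_costs(n, k):
    """Rough operation counts for one pass: (n * log2(n), n * k)."""
    return n * math.log2(n), n * k

def total_costs(n, k, iterations):
    """Iterative method repeats its pass; the closed-form solve runs once."""
    per_pass, closed_form = single_pass_costs(n, k)
    return iterations * per_pass, closed_form

n, k = 9000, 50    # values quoted in the thread
iterations = 200   # assumption standing in for "hundreds of times"

per_pass, closed_form = single_pass_costs(n, k)
iterative_total, _ = total_costs(n, k, iterations)

print(f"single pass: n log n = {per_pass:,.0f}, n*k = {closed_form:,}")
print(f"with {iterations} iterations: {iterative_total:,.0f} vs {closed_form:,}")
```

Under these assumptions a single O(N log N) pass is cheaper than N*k, but the repeated passes are not, which is the point being made about convergence.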

jmpeax · 2025-12-03T08:50:24 1764751824

Polars made the mistake of not maintaining row order for all operations, via the False-by-default argument of maintain_order. This is basically the billion-dollar null mistake for data frames.

jononor · 2025-12-03T13:48:36 1764769716

Yeah that really should have been default. Very big footgun, especially when preserving ordering is default in pandas, numpy, etc. And especially when there is no ingrained index concept in polars, people might very well forget that one needs to have some natural keys and not rely on ordering. One needs to bring more of an SQL mindset.

jmpeax · 2025-11-29T23:01:58 1764457318

> always respect human dignity even when nasty players try to make a dirty move against you

What a gem of a quote. A great way to avoid becoming a bitter person.

jmpeax · 2025-11-29T20:12:22 1764447142

> does not provide any concrete proof, but it confirms many people's suspicions

Without proof there is no confirmation.

lazide · 2025-11-30T15:09:31 1764515371

Formally? Sure. In the current zeitgeist it’s more than enough to start pointing fingers around, etc.

jmpeax · 2025-11-23T23:43:27 1763941407

The pro version comes with a "Professional-grade creative suite", but they don't tell you what you're actually getting. It's just opaque corporate-speak one-liners like "Make real progress toward your goals".

jmpeax · 2025-11-18T21:29:55 1763501395

Except on figure 1 they're all at 0, making it look like the authors didn't know how to use the models or deliberately made them do nothing.

andai · 2025-11-19T15:14:52 1763565292

I think it just looks that way because they used a linear x axis for comedic effect.