yurivish · 2025-03-06T19:41:37 1741290097

I also emailed Gonzalo Navarro once to ask a question, and we had a great discussion and ended up writing a paper together about the answer. [1]

Another paper of his that I really like combines a few elegant ideas into a simple implementation of bitvector rank/select: https://users.dcc.uchile.cl/~gnavarro/ps/sea12.1.pdf

During this time I got really excited about succinct data structures and wrote a Rust library implementing many bitvector types and a wavelet matrix. [2]

My interest came from a data visualization perspective -- I was curious if space-efficient data structures could fundamentally improve the interactive exploration of large datasets on the client side. Happy to chat about that if anyone's curious.

[1] Paper: https://archive.yuri.is/pdfing/weighted_range_quantile_queri... though it's pretty hard to understand without some background context. I've been meaning to write a blog post explaining the core contribution, which is a simple tweak to one of Navarro's textbook data structures.

[2] The Rust version is here: https://github.com/yurivish/made-of-bits/tree/main/rust-play... and an earlier pure-JS implementation is here: https://github.com/yurivish/made-of-bits/tree/main

sitkack · 2025-03-06T20:15:32 1741292132

Reading a Gonzalo Navarro paper is like going for a walk, taking a shower and having a wonderful coffee. It literally sets the mind on fire.

https://dblp.org/pid/n/GonzaloNavarro.html

SoftTalker · 2025-03-07T00:29:02 1741307342

Well not literally.

dspillett · 2025-03-07T12:21:51 1741350111

Many dictionaries now list one common use of “literally” as meaning “figuratively, with emphasis”. So literally officially sometimes now literally means figuratively.

I suspect some people are literally having conniption fits about this…

cowsandmilk · 2025-03-07T12:54:37 1741352077

I’m sorry, but your comment mixes two different types of dictionaries. You talk about “official” meanings which would be a prescriptive dictionary telling you the way you are allowed to use a word. But the dictionaries that include “figuratively” in their definitions are clearly descriptive, presenting all the ways words are commonly used.

You can’t take a descriptive dictionary and then claim it is prescriptive.

dspillett · 2025-03-07T16:14:45 1741364085

There are no prescriptive dictionaries, at least not correct ones, for living languages.

IIRC both the OED and CED list figurative uses for the word. Do you know of any publications considered more authoritative than those for English? Webster too, for those who prefer simplified English.

ForTheKidz · 2025-03-07T16:21:15 1741364475

I think French has prescriptive dictionaries (to varying degrees of success)

dspillett · 2025-03-07T16:37:15 1741365435

They have the Académie Française, which intends to control the language to an extent, in recent times focussing a lot on resisting the encroachment of English words and phrases, but IIRC their recommendations don't carry as much weight as many think and are often ignored even by government departments and other official French bodies.

The Académie do publish a dictionary every few decades though, there was a new edition recently, so there is a prescriptive dictionary for French even though it carries little weight in reality.

French is the only living language to attempt it to this extent, though the existence of one is enough to make my “there are none for living languages” point incorrect. It is difficult to pin a language down until no one really speaks it day-to-day (so it doesn't evolve at the rates commonly used languages do).

throwaway290 · 2025-03-08T04:29:09 1741408149

https://en.wikipedia.org/wiki/Linguistic_prescription#Formal...

dspillett · 2025-03-08T10:11:22 1741428682

Very few of those have official force or cover much more than a subset of language properties (i.e. spelling rules), but definitely more than the "none" of my original assertion.

throwaway290 · 2025-03-09T09:29:52 1741512592

"prescriptive" does not mean "have legal force" though...

throwaway290 · 2025-03-08T04:25:15 1741407915

> There are no prescriptive dictionaries, at least not correct ones, for living languages.

there are no 100% correct descriptive dictionaries. Any prescriptive dictionary is automatically correct.

dspillett · 2025-03-08T10:15:53 1741428953

> Any prescriptive dictionary is automatically correct.

… in the view of their compilers.

I could write a prescriptive dictionary far more easily than I could get others to accept it as correct.

throwaway290 · 2025-03-09T09:31:42 1741512702

If you write a prescriptive dictionary it is correct because you are dictating the norms not describing what is real.

Yes you would have to be involved with a regulatory institution first

computably · 2025-03-12T07:34:23 1741764863

Right, just like every law is automatically just. /s

throwaway290 · 2025-03-13T05:19:55 1741843195

If it's not just then change the law!

cycomanic · 2025-03-08T02:48:03 1741402083

The Duden is prescriptive for German AFAIK.

davidcalloway · 2025-03-08T06:03:00 1741413780

Isn't this more of a cultural thing, that Germans seem to agree that it is authoritative and use it as a reference?

I'm not sure what would even make a dictionary prescriptive other than an explicit declaration that it is so or, ridiculously, a law declaring the same.

soulofmischief · 2025-03-07T19:27:22 1741375642

I'm sorry, can you point to such a prescriptive dictionary? People can talk however they please, and dictionaries are tasked with keeping up with the vernacular.

The "literally" ship sailed centuries ago. Sorry, but that battle has been lost. Even so-called "prescriptive" dictionaries would be categorically incorrect if they ignore nearly three centuries of common vernacular.

bee_rider · 2025-03-07T15:09:48 1741360188

There aren’t prescriptive dictionaries for (American, at least) English.

brandly · 2025-03-07T14:54:36 1741359276

But “official” is defined in descriptive dictionaries to include descriptive dictionaries.

penguin_booze · 2025-03-07T08:10:10 1741335010

Well, literally doesn't mean literally anymore--literally.

gwd · 2025-03-07T09:19:30 1741339170

It never has, it always will. We've already lost a host of words that meant "I'm not exaggerating, I actually mean it": "really", "very", etc. I'm going to keep up the fight.

Zecc · 2025-03-07T09:55:27 1741341327

Since there are _literally_ people who use, and have been using for a while, the word without the same exact meaning as we both agree on... well.

Having said that, I will join you in this fight.

See also: exponentially.

gwd · 2025-03-07T20:58:01 1741381081

Language is defined by its speakers, as basically a "vote". I'm going to keep voting for "literally" meaning "this actually happened" as long as it's practical, because 1) there are dozens of other ways to emphasize something 2) we need some way to say "this is not an exaggeration".

bee_rider · 2025-03-07T15:19:31 1741360771

“Exponentially” and “quantum” are the only language hills I’d die on.

nxobject · 2025-03-08T00:31:00 1741393860

Why a quantum leap isn’t the length of an Ångstrom will always sadden me. I’m sure there are other scientific concepts you can use to describe a Great Leap Forward…

Quekid5 · 2025-03-08T13:49:04 1741441744

I think the Quantum Leap expression can also be understood as a "step" with no intermediate stages, i.e. very abrupt or transformative.

gwd · 2025-03-07T09:23:23 1741339403

The moreso that those things don't even figuratively set my mind on fire.

sitkack · 2025-03-07T16:47:47 1741366067

What about metaphorically?

__tidu · 2025-03-07T12:01:52 1741348912

the "technical note" link in the RLE bit vector section of the rust repo is broken (https://yuri.is/pdfing/weighted_range_quantile_queries.pdf 404s)

__tidu · 2025-03-07T12:02:44 1741348964

oh wait nvm just realised you linked a working archive link in your post... still worth updating the link in the repo for people who stumble upon it

yurivish · 2025-03-07T13:51:15 1741355475

Fixed, thanks!

yurivish · 2024-12-24T14:08:25 1735049305

Why Brave is blocked: https://github.com/lobsters/lobsters-ansible/issues/45

DaSHacka · 2024-12-24T17:00:30 1735059630

Looks as though it's not currently in effect, however?

https://github.com/lobsters/lobsters/issues/761

What a trite cat-and-mouse game, though at least it's entertaining to watch them try.

yurivish · 2024-12-19T22:00:38 1734645638

Previously: https://news.ycombinator.com/item?id=2089615

kapitalx · 2024-12-20T07:26:28 1734679588

I totally remember that post. I think I spent 30 minutes just clicking around on it back then. Very nice.

yurivish · on Nov 25, 2023

Good point, that's worth emphasizing in the post – I added a section about it with reference to the paper (& to your comment).

yurivish · on July 13, 2022

Hi Charles, this is my fault for missing the attribution – I'm very sorry.

I've just added a credit to you and your repository (see the second sentence).

I had put together this minimal example based on your repository together with a StackOverflow answer containing the build command (https://stackoverflow.com/questions/68476647/errors-with-com...).

Being just a single file with a simple build command it seemed like a minimal advancement on the state of the art, so I quickly decided to publish, and did not appropriately credit the original as I should have. I hope you can accept my apology – this was an honest mistake.

meheleventyone · on July 14, 2022

No problem Yuri, thanks for the credit and the apology! I do recommend writing articles based on doing the thing from scratch yourself though as you can write something more nuanced and interesting that way.

yurivish · on May 26, 2022

See also: http://sqlime.org

Which is another nice WASM-based browser SQLite user interface.

sgbeal · on May 26, 2022

> ... another nice WASM-based browser SQLite user interface.

Thank you for pointing that one out. Every conceptually similar project is a great source of ideas. sqlite's fiddle app is literally less than 2 weeks old so still has lots of room left for feature creep ;).

yurivish · on April 11, 2022

That's really interesting! Could you say more about the job-like aspects of having users? Do you have advice on infrastructure that could be built to make the job easier?

I'm currently working on my first-ever side project with user accounts, and now I'm wondering what I'm in for. :-)

simonw · on April 11, 2022

For me it's about the moral responsibility. If people are trusting your site with their data, you have an obligation to keep it running, and to keep it secure. This is a big responsibility! Especially since over the long-term the vast majority of projects eventually cease to exist.

ehnto · on April 11, 2022

In addition to user PII responsibilities, you may also be responsible for user generated content depending on where you live. Both real users and bots will inevitably submit nefarious material on your servers.

exdsq · on April 11, 2022

Not OP but if you have user accounts you suddenly have legal responsibilities (in Europe) to follow GDPR rules etc…

ThunderSizzle · on April 11, 2022

The easy solution there is to just ban European users if it's just a hobby project and you're concerned about that. Probably not the solution that GDPR would prefer.

aquarin · on April 11, 2022

And if your users are Europeans?

kybernetikos · on April 11, 2022

That's not as easy a solution as it appears - the GDPR isn't the only piece of personal data legislation in the world. If your strategy is to keep track of all the places that place responsibilities on you for collecting personal data and reject users from those locations, then you need to be looking at every state in the USA (Californian citizens have a constitutional right to privacy), and many countries across the world have various data protection laws.

yurivish · on Jan 28, 2021

Thanks! I've used it when the data is itself binned frequency data, which is one way to reduce data volume (e.g. binning 2 weeks of minutely time series data into (hour, percentile) buckets and plotting them as weighted points).

When the number of bins in the data is not an exact integer multiple of the number of bins in the histogram, adjacent data bins can get mapped to the same histogram bin, resulting in e.g. 2x the data volume in some rows/columns of the histogram.

When a bit of loss of fidelity is acceptable, the two solutions I've used are to render the histogram at an exact factor of the number of data bins and then set the canvas dimensions to the desired size (relying on the browser to downsample the resulting image), or to use `max` as the reduceOp and render the histogram directly at the intended size.
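
A rough one-dimensional sketch of the effect, in Python rather than the library's own code. The floor-based `histogram_bin` mapping and the `render` helper are assumptions made up for illustration, not the library's API; only the idea of passing `max` as the reduce op comes from the description above.

```python
def histogram_bin(i, n_data, n_hist):
    """Map data bin i (0..n_data-1) to a histogram bin (0..n_hist-1).

    Hypothetical floor-based mapping, assumed for this sketch.
    """
    return i * n_hist // n_data


def render(weights, n_hist, reduce_op):
    """Fold the per-data-bin weights into n_hist histogram bins."""
    out = [0] * n_hist
    for i, w in enumerate(weights):
        j = histogram_bin(i, len(weights), n_hist)
        out[j] = reduce_op(out[j], w)
    return out


def add(a, b):
    return a + b


weights = [1] * 10  # 10 data bins, each carrying the same weight

# 7 does not divide 10, so some histogram bins receive two data bins.
summed = render(weights, 7, add)

# With max as the reduce op, doubled-up bins no longer stand out.
maxed = render(weights, 7, max)

# 5 is an exact factor of 10, so every histogram bin gets two data bins.
exact = render(weights, 5, add)

print(summed)  # [2, 1, 2, 1, 2, 1, 1]
print(maxed)   # [1, 1, 1, 1, 1, 1, 1]
print(exact)   # [2, 2, 2, 2, 2]
```

With summation, the uneven `2, 1, 2, 1, ...` pattern is what shows up as stripes in some rows/columns. Rendering at an exact factor keeps the counts uniform, and `max` trades exact totals for a stripe-free picture.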

yurivish · on Jan 28, 2021

This page was specifically created as a library example. Here's the full collection: https://observablehq.com/collection/@twitter/density

If you're interested in the technique behind the plot you can read more about how that works in this paper: https://arxiv.org/abs/1808.06019