Awesome! Only a little over a billion more to go before GitHub’s very own OpenAPI Spec can start overflowing int32 on repositories too, just like it already does for workflow run IDs!
At the company where I did my stint as CTO, I turned up and noticed they were using 32-bit integers as primary keys on one of their key tables. That table already had 1.3 billion rows and, at the rate they were adding them, would have overflowed its primary key values within months, so we ran a fairly urgent project to upgrade the IDs to 64-bit and avoid the total meltdown that would have ensued otherwise.
heh, that's happened at at least 5 companies I have worked at - go check the database and find: currency as floats, hilarious indexes, integers about to overflow, gigantic types with nothing in them.
Just curious, as someone with limited experience on this: what's wrong with it? Decimal is consistent & predictable (compared to float), so it shouldn't be that big of a deal, right? CMIIW
Yeah, not a big deal, but completely useless nonetheless: you would never really query your table for just the one decimal column (eg the price) but for a couple more (eg the category and the price), so you'd have a multi-column index on those columns. The index on just the price column never gets used.
What if you wanted to select the "top 100 most expensive products", or the number of products between $0.01 and $10, $10.01 and $100, and $100.01 and $1000? Sure, you could do a full table scan on your products table for both queries, but an index on price would speed both up a lot if you have a lot of products. Of course, you have to determine whether the index would be used enough to make up for the extra time spent updating it when prices change or products are added or deleted.
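For example, a rough sketch of both queries leaning on a single-column price index, assuming Postgres via node-postgres and a hypothetical products(id, name, category, price) table (all names here are made up for illustration):

    import { Pool } from "pg";

    const pool = new Pool();

    async function priceQueries() {
      // A plain btree index on price is what both queries below can use.
      await pool.query(
        "CREATE INDEX IF NOT EXISTS idx_products_price ON products (price)"
      );

      // "Top 100 most expensive products": the planner can walk the index
      // backwards instead of scanning and sorting the whole table.
      const top = await pool.query(
        "SELECT id, name, price FROM products ORDER BY price DESC LIMIT 100"
      );

      // One price band; the $0.01-$10, $10.01-$100, $100.01-$1000 counts are
      // the same query with different bounds, each an index range scan.
      const cheap = await pool.query(
        "SELECT count(*) AS n FROM products WHERE price BETWEEN $1 AND $2",
        [0.01, 10]
      );

      console.log(top.rows.length, cheap.rows[0].n);
      await pool.end();
    }

    priceQueries().catch(console.error);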
Cheap solution, sure, add an index. But you're asking an OLAP question of an OLTP system. Questions like that are best asked at least of an out-of-production read replica, or better, of an analytics db.
In general, it's about avoiding mixed types of load. Predictable, audited application queries in a user request shouldn’t be mixed with potentially extremely expensive long-running analytics queries. Different replica sets isolate customers from potential performance impacts caused by data analytics.
You stream CDC events to maintain a 1:1 read replica in something like Snowflake/Databricks, where you can run all kinds of OLAP workloads against that analytics copy.
Oh, sure, but wouldn't the whole website be served out of a read-friendly database? Why would you have a separate "analytics" database from the main database(s) driving the site?
They'd certainly need decimals in the first place, but yeah, I have seen indexes on every column, multiple times, and I have seen indexes whose combined size was 26 times the size of the original data... on a table that's actively being written to.
What are the challenges of such projects? How many people are usually involved? Does it incur downtime or significant technical challenges for either the infrastructure or the codebase?
Changing the type of the column is no big deal per se, except that on a massive table it’s a non-trivial operation, BUT you also have to change the type in everything that touches it: everywhere it’s assigned or copied, everywhere it’s sent over the wire and deserialized and assumptions might be made, any tests, and on, and on. And god help you if you’ve got stuff like int.MaxValue having a special meaning (we didn’t in this context, fortunately).
Our hosting environment at that time was a data centre, so we were limited on storage, which complicated matters a bit. Ideally you’d create a copy of the table but with a wider PK column, write to both tables, then migrate your reads, etc., but we couldn’t do that because the table was massive and we didn’t have enough space. Procuring more drives was possible but sometimes took weeks - not just dragging a slider in your cloud portal. And then of course you’d have to schedule a maintenance window for somebody to plug them in. It was absolutely archaic, especially when you consider this was late 2017/early 2018.
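For reference, the copy-plus-dual-writes approach we couldn't afford the disk for looks roughly like this (just a sketch; it assumes Postgres and made-up table names, not our actual schema):

    import { Pool } from "pg";

    const pool = new Pool();

    // One-time setup (DBA side): same shape as the old table, but a BIGINT PK.
    //   CREATE TABLE orders_v2 (LIKE orders INCLUDING ALL);
    //   ALTER TABLE orders_v2 ALTER COLUMN id TYPE BIGINT;

    // 1. Until cutover, application writes go to both tables.
    async function insertOrder(customerId: number, total: string) {
      const client = await pool.connect();
      try {
        await client.query("BEGIN");
        const res = await client.query(
          "INSERT INTO orders (customer_id, total) VALUES ($1, $2) RETURNING id",
          [customerId, total]
        );
        await client.query(
          "INSERT INTO orders_v2 (id, customer_id, total) VALUES ($1, $2, $3)",
          [res.rows[0].id, customerId, total]
        );
        await client.query("COMMIT");
      } catch (err) {
        await client.query("ROLLBACK");
        throw err;
      } finally {
        client.release();
      }
    }

    // 2. Backfill historical rows in small batches (repeat, advancing afterId),
    //    then move reads over to orders_v2 and drop the old table.
    async function backfillBatch(afterId: number, batchSize = 10_000) {
      await pool.query(
        `INSERT INTO orders_v2
         SELECT * FROM (
           SELECT * FROM orders WHERE id > $1 ORDER BY id LIMIT $2
         ) batch
         ON CONFLICT (id) DO NOTHING`,
        [afterId, batchSize]
      );
    }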
You need multiple environments so you can do thorough testing, which we barely had at that point, and because every major system component was impacted, we had to redeploy our entire platform. Also, because it was the PK column affected, we couldn’t do any kind of staged migration or rollback without the project becoming much more complex and taking a lot longer - time we didn’t have due to the rate at which we were consuming 32-bit integer values.
In the end it went off without a hitch, but pushing it live was still a bit of a white knuckle moment.
If you've written your services in JavaScript, going from i32 to i64 means your driver is probably going to return it as a string (or a BigInt, or some custom Decimal type) rather than the IEEE754 number you were getting before. That means you now need to change your interfaces (both internal and public-facing) to a string or some other safely serializable representation. And if you are going to go through all that trouble, you may as well take the opportunity to just switch to some uuid strategy anyway.
The alternative is that you can monkey-patch the database driver to parse the i64 id as an IEEE754 number anyway and deal with this problem later, when you overflow the JavaScript max safe integer (2^53 - 1), except when that happens it will manifest in some really wacky ways, rather than the db just refusing to insert a new row.
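Concretely, with node-postgres (one common driver) the monkey-patch is a one-liner, and the failure mode past 2^53 is silent; the ids below are just for illustration:

    import { types } from "pg";

    // pg returns BIGINT (int8) columns as strings by default, precisely because
    // they may not fit in an IEEE754 double. The risky shortcut forces them back
    // into plain numbers:
    types.setTypeParser(types.builtins.INT8, (value) => parseInt(value, 10));

    // ...which works right up until ids pass Number.MAX_SAFE_INTEGER:
    console.log(Number.MAX_SAFE_INTEGER);          // 9007199254740991 (2^53 - 1)
    console.log(parseInt("9007199254740993", 10)); // 9007199254740992 - silently wrong

    // The safer, more invasive option: keep ids as BigInt (or strings) and update
    // every interface that carries one.
    types.setTypeParser(types.builtins.INT8, (value) => BigInt(value));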
Maybe you are better off moving to UUIDs then? It seems there are packages to make handling them easier, but you'll still need a tiny hack to map old i32 IDs to some UUID.
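One way to do that tiny hack is a deterministic UUIDv5 derived from the old integer, e.g. with the uuid package (the namespace below is made up; any fixed UUID you generate once will do):

    import { v5 as uuidv5 } from "uuid";

    // Fixed, made-up namespace: generate one once and never change it.
    const LEGACY_ID_NAMESPACE = "6f1c2a4e-8b3d-4c5a-9e7f-0123456789ab";

    // Same i32 in, same UUID out, forever - so rows migrated at different times
    // (and their foreign keys) still line up.
    function legacyIdToUuid(id: number): string {
      return uuidv5(String(id), LEGACY_ID_NAMESPACE);
    }

    legacyIdToUuid(1234567890); // always yields the same v5 UUID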
I remember such a project; with our large and aging TypeScript frontend projects it would have added a couple of weeks to adjust all the affected types. IDs used in many places deep in the code caused thousands of errors from the mismatch, which was a nightmare. I can't remember exactly why it was so tough to go through them all, but we were under intense time pressure.
To speed things up we decided to correct the ID types for the server response, which was key since those types were generated from protobuf. But we kept number-typed IDs everywhere else, even though the values would actually be strings, which would not cause many issues because there ain't much reason to be doing numeric operations on an ID, except the odd sort function.
I remember the smirk on my face when I suggested it to my colleague and at the time we knew it was what made sense. It must have been one of the dumbest solutions I've ever thought of, but it allowed us to switch the type eventually to string as we changed code, instead of converting the entire repos at once. Such a Javascript memory that one :)
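In TypeScript terms the hack was basically this (illustrative only, not the real code):

    // The declared type stays `number` while the runtime value is actually the
    // string that came off the wire, so the old code keeps compiling.
    interface CommentDto {
      id: number; // lie: at runtime this is something like "9007199254740993"
      body: string;
    }

    function parseResponse(json: string): CommentDto {
      const raw = JSON.parse(json);
      // Deliberately NOT converting - we just pretend.
      return { id: raw.id as unknown as number, body: raw.body };
    }

    // The one place it bites you: the odd sort function.
    const ids = ["9", "10", "100"] as unknown as number[];
    ids.sort((a, b) => a - b); // subtraction coerces the strings, so this happens to work
    [...ids].sort();           // default lexicographic sort: "10", "100", "9"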
Not the original commenter, but I've read through half a dozen post-mortems about this kind of thing. The answer is: yes. There are challenges, and sometimes downtime and/or breaking changes are inevitable.
For one, if your IDs are approaching the 2^31 signed integer limit, then by definition, you have nearly two billion rows, which is a very big DB table! There are only a handful of systems that can handle any kind of change to that volume of data quickly. Everything you do to it will either need hours of downtime or careful orchestration of incremental/rolling changes. This issue tends to manifest first on the "biggest" and hence most important table in the business such as "sales entries" or "user comments". It's never some peripheral thing that nobody cares about.
Second, if you're using small integer IDs, that decision was probably motivated in part by using those integers as foreign keys and by keeping your secondary indexes efficient. GUIDs are "simpler" in some ways but need 4x the data storage (assuming you're using a clustered database like MySQL or SQL Server). Even just the change from 32 bits to 64 bits doubles the size of that storage in a lot of places. For 2 billion rows, this is 8 GB more data minimum, but it is almost certainly north of 100 GB across all tables and indexes (rough math sketched after the fourth point below).
Third, many database engines will refuse to establish foreign key constraints if the types don't match. This can force big-bang changes or very complex duplication of data during the migration phase.
Fourth, this is a breaking change to all of your APIs, both internal and external. Every ORM, REST endpoint, etc... will have to be updated with a new major version. There's a chance that all of your analytics, ETL jobs, etc... will also need to be touched.
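To put rough numbers on the second point (everything beyond the two-billion row count is a made-up assumption):

    const rows = 2_000_000_000;        // ~2^31 ids consumed
    const extraBytesPerValue = 8 - 4;  // int32 -> int64

    // The PK column alone:
    const pkOnlyGiB = (rows * extraBytesPerValue) / 1024 ** 3; // ~7.5 GiB

    // Every secondary index and every referencing FK column stores the key again;
    // assume, hypothetically, 5 secondary indexes plus 3 similarly sized child tables:
    const copiesOfTheKey = 1 + 5 + 3;
    const totalGiB = pkOnlyGiB * copiesOfTheKey; // ~67 GiB before any page/row overhead

    console.log(pkOnlyGiB.toFixed(1), totalGiB.toFixed(1));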
> For one, if your IDs are approaching the 2^31 signed integer limit, then by definition, you have nearly two billion rows
Just wanted to nitpick this; it is not actually necessarily true. A failed insert in some systems will increment the counter, and deleting rows usually does not allow the deleted ID to be re-used (new inserts use the current counter). Of course, that is beside the point: the typical case of a table approaching this limit is a very large table.
It's actually fairly common to see this problem crop up in systems that are using a database table as a queue (which is a bad idea for many reasons, but people still do it) in which case the number of live rows in the table can be fairly small.
Lived that with a MySQL table. The best thing is that the table was eventually retired (long after the migration) because the whole data model around it was basically wrong.
A couple of weeks ago there was an issue in the Lua community because LuaRocks surpassed 65,535 packages.
There was a conflict between this and the LuaRocks implementation under LuaJIT [1] [2], inflicting pain on a narrow set of users as their CI/CD pipelines and personal workflows failed.
I moved to Max after projecting a $2,000 annual API bill. I haven't yet hit the five-hour limits, but /login toggles easily between plans. I believe the interface tells you when you've hit a limit, but as I said, I don't know first hand.
According to CCUsage, I hit limits on Opus usage around the equivalent of $150. If we naively extrapolate, that suggests about $600 of Opus usage per session on Max 20x.
I would prefer the BSL, with some sort of trial-period grant and source available, over closed source.
The other nice thing about the BSL is that it converts to an Open Source license after 3-4 years, which addresses the concern of "what if the software vendor goes out of business?" After that time period you can support it yourself, or another vendor can pick it up and support it.
LGPL can make it a tough decision to use, since that essentially rules out single-file publish, R2R, and Native AOT in the .NET world. Your project seems like it could be very useful at the edge, where those packaging models can be very attractive.
Ah nice! Unfortunately, many people (like me) won’t notice that and probably won’t be able to consider using it if they are subject to corporate license scanners or blanket enterprise policies that don’t allow LGPL.
But that is a nice special permission for those who are still able to use it!
And that could also be a good model if the plan is to dual license it and sell to enterprise.
Maybe funny now but once (if?) it can eventually contribute meaningfully to dotnet/runtime, AI will probably be laughing at us because that is the pinnacle of a massive enterprise project.
I love Michael Lewis’ writing style! I have probably read half of the books in this blog article and can remember hardly any of them, but I could still summarize Flash Boys because it was such a cool story.
Anthropic is releasing blog articles where they are discovering how Claude works through experiments and observations. It seems more like science than engineering when even the creators have to run scientific experiments to figure out how what they engineered works.
The description of how Claude does "arithmetic" was enlightening. Shows how it uses clever pattern matching to fit the data, without ever learning the algorithm, even though the algorithm description was certainly in its training data.
https://github.com/github/rest-api-description/issues/4511