The blog post says it's available in the API too.
EDIT:
> GPT‑5.3 Instant is available starting today to all users in ChatGPT, as well as to developers in the API as ‘gpt-5.3-chat-latest.’ Updates to Thinking and Pro will follow soon. GPT‑5.2 Instant will remain available for three months for paid users in the model picker under the Legacy Models section, after which it will be retired on June 3, 2026.
Congrats guys! Curious how reliable the read/write splitting is in practice, given replication lag. Do you need to run the underlying cluster with synchronous replication?
The way we solved it was to check the LSN on the primary, and then wait for the replica to catch up to that LSN before doing reads on the replica in various scenarios.
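A minimal sketch of that fence in plain Ruby. The SQL in the comments uses PostgreSQL's real functions (`pg_current_wal_lsn()` on the primary, `pg_last_wal_replay_lsn()` on the replica); the function names and the connection plumbing around them are my own illustration, not the commenter's actual code:

```ruby
# PostgreSQL LSNs are text in "high/low" hex form, e.g. "16/B374D848".
# To compare them, convert to a single 64-bit integer.
def lsn_to_i(lsn)
  hi, lo = lsn.split("/").map { |part| part.to_i(16) }
  (hi << 32) | lo
end

# On the primary, right after the write:
#   SELECT pg_current_wal_lsn();
# On the replica, before routing a read there:
#   SELECT pg_last_wal_replay_lsn();
# Route the read to the replica only once it has replayed past the
# primary's LSN at write time.
def replica_caught_up?(primary_lsn, replica_lsn)
  lsn_to_i(replica_lsn) >= lsn_to_i(primary_lsn)
end
```

In practice you would poll `replica_caught_up?` with a timeout and fall back to the primary if the replica doesn't catch up quickly enough.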
Not really, replication lag is generally an accepted trade-off. Sync replication is rarely worth it: you take a ~30% performance hit on commits and add more points of failure.
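For context, this is roughly what opting into full synchronous replication looks like in `postgresql.conf`; the values are illustrative, not a recommendation. Note that of the `synchronous_commit` levels, only `remote_apply` actually guarantees a subsequent replica read sees the commit, and it is also the slowest:

```
# postgresql.conf on the primary (illustrative)
synchronous_standby_names = 'ANY 1 (replica1, replica2)'

# remote_apply waits until the standby has *applied* the commit,
# which is what read-your-writes on a replica would require.
synchronous_commit = remote_apply
```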
We will add some replication lag-based routing soon. It will prioritize replicas with the lowest lag to maximize the chance of the query succeeding and remove replicas from the load balancer entirely if they have fallen far behind. Incidentally, removing query load helps them catch up, so this could be used as a "self-healing" mechanism.
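A rough sketch of that routing policy (the struct, names, and threshold here are hypothetical, not the actual implementation): ban replicas whose lag exceeds a cutoff so they can catch up unloaded, and prefer the freshest of the rest.

```ruby
Replica = Struct.new(:host, :lag_seconds)

# Pick a replica for a read: drop anything lagging beyond max_lag
# (removing its query load lets it catch up), then route to the
# replica with the lowest measured lag. Returns nil if none qualify,
# in which case the caller would fall back to the primary.
def route(replicas, max_lag: 30.0)
  healthy = replicas.reject { |r| r.lag_seconds > max_lag }
  healthy.min_by(&:lag_seconds)
end
```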
It sounds like this is one of the few places that might be a leaky abstraction in that queries _might_ fail and the failure might effectively be silent?
It can be silent, but usually it's loud and confusing because people do something like this (Rails example):
    user = User.create(email: "test@test.com")
    SendWelcomeEmail.perform_later(user.id)
And the job code fetches the row like so:
    user = User.find(id)
This blows up because `find` raises an error if the record isn't there, and job queues typically route their reads to replicas. It's a common gotcha: code that runs async expects the data to be there right after creation.
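The usual mitigation is to make the job tolerate the race instead of assuming the row exists; in Rails, `retry_on ActiveRecord::RecordNotFound` on the job class does exactly this. A dependency-free sketch of the same idea (the error class and helper are stand-ins, not Rails API):

```ruby
# Stand-in for ActiveRecord::RecordNotFound.
class RecordNotFound < StandardError; end

# Retry a read that may race replication lag: if the record isn't
# visible yet, back off and try again while the replica catches up.
def with_replica_retry(attempts: 3, base_delay: 0.0)
  tries = 0
  begin
    yield
  rescue RecordNotFound
    tries += 1
    raise if tries >= attempts
    sleep(base_delay * tries) # linear backoff between attempts
    retry
  end
end

# Simulated replica: the row only becomes visible on the third read.
reads = 0
user = with_replica_retry(attempts: 5) do
  reads += 1
  raise RecordNotFound if reads < 3
  { id: 42, email: "test@test.com" }
end
```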
There can be others, of course, especially in fintech where you have an atomic ledger, but people are usually pretty conscious about this and send those types of queries to the primary.
In general though, I completely agree, this is leaky and an unsolved problem. You can have performance or accuracy, but not both, and most solutions skew towards performance and make applications handle the lack of accuracy.
Why is that? Personally I appreciated the throwback look, and it probably accomplished its goal of being memorable. Turbopuffer is another notable one seemingly leaning into this flavor of marketing.
Tight integration with the TypeScript toolchain has been great for us with EdgeQL, and it's about an order of magnitude less error-prone than the ORMs I've interacted with. Gel is a winning formula, especially in the TypeScript world.
We used TipTap to great effect in an old iteration of our product at credal.ai. It helped us create nuanced text tagging behavior without too much time investment. Would happily recommend it.
The AI Chief of Staff had a few layers. The first was data integration of both productivity data (Slack, Notion, etc.) and "big data" lakes/warehouses: the former tells you what is getting done at a human level, and the latter has the potential to tell you whether and how it is working. The second layer was modeling your business strategy, including dependencies between concepts like projects and teams, which allowed us to back out things like stakeholders and early-warning recipients for any given piece of progress or problem. The third was a presentation layer that gave humans a bird's-eye view of what's happening, including generating artifacts like meeting decks.
Ultimately this 1) wasn't successfully solving an urgent enough problem for most businesses and 2) was too difficult to adopt.
LLMs do break open opportunities in this space so I expect to see some more versions of this, perhaps on top of the Credal API!
Thank you! We haven't gone down the Foundry route yet. We do have some smaller-scale apps and companies using Credal, either as their AI API or as a chat platform. Would be interested to hear a bit about your use case and see if it's a match?
There was definitely a browsing component, at least originally. I remember seeing part of the prompt leaked, and it had something like "internet browsing: disabled" in it.