From a black-box perspective, LLMs are pretty simple: you put text or images in, and (possibly structured) text comes out, maybe with some tool invocations.
If you use a good library for this, like Python's litellm, all it takes is changing one string in your code or config, since the library exposes most providers' APIs under a simple, uniform interface.
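For the basic case, that one-string swap looks something like the sketch below (the model identifiers are illustrative; check litellm's docs for the ones your providers expose):

    from litellm import completion

    MODEL = "gpt-4o-mini"  # change this one string, e.g. to "claude-3-5-sonnet-20240620"

    response = completion(
        model=MODEL,
        messages=[{"role": "user", "content": "Summarize this ticket in one line."}],
    )
    print(response.choices[0].message.content)

litellm mirrors the OpenAI response shape regardless of which provider sits behind the string, which is what makes the swap a one-line change.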
You might need to tweak your prompt and rerun your evals on whatever task your app is solving, but even the largest providers regularly deprecate old models and introduce vastly better ones, so you should have a pipeline for that anyway.
These models have very little "stickiness" or lock-in. If your app is a Twitter client built around the Twitter API, turning it into a Mastodon client built around the Mastodon API would take a lot of work. If your app uses Grok and is designed properly, switching to a different model is so simple that it might be worth doing just to ride out a half-hour outage.
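To make the outage scenario concrete, here's a hedged sketch of a fallback loop (the model names are illustrative, not a recommendation):

    from litellm import completion

    # Try providers in order of preference; if one is down, fall through.
    CANDIDATES = ["xai/grok-beta", "gpt-4o", "claude-3-5-sonnet-20240620"]

    def robust_completion(messages):
        last_err = None
        for model in CANDIDATES:
            try:
                return completion(model=model, messages=messages)
            except Exception as err:  # a provider outage surfaces here
                last_err = err
        raise last_err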
Prompt-to-output quality varies by a large amount between models, IMO. The equivalent analogy would be "let's switch programming languages for this solved problem".
The models are still at a level where, for less common or less benchmarked tasks, there's often only one model that's very good at the job, and whichever is second best is markedly worse, possibly to the point of being unusable for anything serious.
I assume it'll be a paid API, so the "contract" is a lot clearer. Twitter never understood what to do with its API, so pulling that particular rug makes sense.
But I too wouldn't use this. X is playing fast and loose with ... everything, so having a business rely on their product seems risky.
The nice thing with LLMs is that the API is relatively simple - for the most basic case, it's string in, string out. While you may need to redesign your prompt a bit, I bet for many use cases, LLMs are reasonably interchangeable, and the integration work required for an API change should be minimal.
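One way to keep that integration surface small is to pin the whole dependency behind a single string-in, string-out function; a hypothetical sketch:

    from litellm import completion

    MODEL = "gpt-4o"  # illustrative; swap the identifier and rerun your evals

    def ask(prompt: str) -> str:
        # The rest of the app only ever calls ask(), so a provider or
        # API change is confined to this one function.
        resp = completion(model=MODEL, messages=[{"role": "user", "content": prompt}])
        return resp.choices[0].message.content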
Those who would build on top of the API might be weighing a couple of past changes that were significant, but that aren't necessarily a reason to think there'll be further pain in the future: the company's ownership changed, and those who train LLMs all of a sudden want all the human-created text on the internet.
Sometimes having a fast enough model at a low enough price makes you the obvious choice. For example, I know Claude is better than gpt-4o-mini, but I use the latter for a lot more of my data processing because it's significantly cheaper and faster, and the gains I'd get out of Claude seem marginal for my use case.
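A back-of-envelope comparison shows why (the figures are approximate mid-2024 list prices per million tokens and may have changed since):

    # Approximate mid-2024 list prices, $ per 1M tokens (input, output).
    GPT_4O_MINI = (0.15, 0.60)
    CLAUDE_35_SONNET = (3.00, 15.00)

    def job_cost(prices, in_tok=10_000_000, out_tok=2_000_000):
        cost_in, cost_out = prices
        return (in_tok * cost_in + out_tok * cost_out) / 1_000_000

    print(job_cost(GPT_4O_MINI))       # ~$2.70 for this example workload
    print(job_cost(CLAUDE_35_SONNET))  # ~$60.00, roughly 20x more

At that ratio, the cheaper model only has to be good enough to win a high-volume data-processing job.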
Best at product / market fit. And that space is very very wide. Does the GenAI serve as a feature in a larger product (like realtime “reasoning” on X or in Apple’s case in iOS)? Is it a standalone product that general public or enterprises use? Does it play in a niche area? Etc.
I wasn't really talking about the marginal differences we see right now in August 2024.
I'm talking about the next huge step forward that only one company will achieve, because it simply secures the most GPUs (which are in limited supply) plus the energy to run them first, and keeps that advantage.
At some point this becomes a runaway, self-amplifying differentiator, and it will make that company win regardless of all else.
My money is on xAI in 2025.
PS: the fact that prompts need to be optimized for each model is just a symptom of models not being that good yet. This need will vanish in the near future as you get way better models. A recent hint of what I mean: Midjourney needed very elaborate prompts (and even LoRAs) to get what you want; in Flux the prompt can be much shorter (without LoRAs) and it still gets closer to what you want. The same will happen with LLMs. Another example: with GPT-4 you need to literally beg the model to return only what you ask for (for example, JSON) or put it in a certain mode (JSON mode); Claude 3.5 Sonnet will simply listen to what you ask for. So again: that's not because every model needs model-specific fine-tuning, it's because previous models were simply not as good.
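The JSON example translates directly into code; a hedged sketch of the contrast (model identifiers illustrative):

    from litellm import completion

    messages = [{"role": "user",
                 "content": 'Classify the sentiment of "great phone". '
                            'Reply with JSON like {"sentiment": "positive"}.'}]

    # Older GPT-4-era models: you had to switch on JSON mode explicitly
    # (OpenAI's json_object mode even requires the word "JSON" in the prompt).
    strict = completion(model="gpt-4-turbo", messages=messages,
                        response_format={"type": "json_object"})

    # Claude 3.5 Sonnet: just asking tends to suffice.
    relaxed = completion(model="claude-3-5-sonnet-20240620", messages=messages)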
> People who worked at Twitter said it was bullshit
No, we have no idea from The Verge article whether the sources are even qualified to make such statements, or whether the statements are even true. In fact, on the basis of the "99 percent" speculative quote, we can disregard the sourced quotes altogether. I'll say this: I work on far less significant software than X, and we get DDoSed all the time.
> every other spaces event that was run at the same time was unaffected.
That's not true; I wasn't even able to load my feed during the initial part of the stream.
You seem to be invested in this topic in a weird and unhealthy way, and there is nothing of value in this comment.
You baselessly accuse journalists of straight-up making things up, and then go on to give anecdotal evidence that conveniently nobody can disprove.
- every other spaces event that was run at the same time was unaffected.
- no other part of the website was impacted in any way whatsoever.
Aren’t these last two an argument FOR a DDoS attack? It seems reasonable to assume that, were there a DDoS attack at that time, it would have been aimed at the Elon/Trump stream explicitly.
I’d like to see an explanation of how it's even possible to achieve that level of targeting without knowing the connection details of either Elon or Trump. The rest of the attack surface is surely infrastructure shared with the rest of the website.
So no, I think it was just a straight-up technical failure on their end.
"The Verge has no political bias". Okay, in the same way that wired has no political bias. They're so unbiased yet you know exactly the way an article is slanted towards given the topic and persons. Just like I know the slant given a reddit /r/all post or Fox News/msnbc article.
The Verge's editors most definitely are biased, as are all humans. Journalists are not neutral. In this case someone made a speculative "99 percent chance" statement, and the publication decided to print it as if it were fact rather than dismissing it as coming from someone who knew nothing.
We know nothing about the sources, and writers are not above making stuff up. I could just as easily spin it on them: there's a 99 percent chance they made up the sources.
Trump is both partisan and biased and doesn't claim to be neutral. Of course he was trashing things to do with his political opponents (he was running against DeSantis in the primary at the time).
The Register weighed in with a "yeah, right" skeptical attitude:
> The Register has found no evidence of a denial of service attack directed at X. Check Point Software's live cyber threat map does not record unusual levels of activity at the time of writing. NetScout's real-time DDoS map recorded only small attacks on the US.
> If a DDoS was indeed the reason for the delayed start of the event, it appears not to have impacted the rest of X's operations – there were plenty of posts commenting on the problems with the Space occupied by the interview. And Musk was tweeting from the very network said to be under attack.
> The interview commenced some 40 minutes after its advertised time. Live audience statistics reported 1.1 to 1.3 million attendees during the portions of the event The Register observed – although during the stream Trump claimed that the event had an audience of 60 million or more, exceeding targets of 25 million.
This is why we teach kids stories like "The Boy Who Cried Wolf": it's such a fundamental thing that when you lie about absolutely everything, all the time, people will just never trust you again, even when you happen to be telling the truth one particular time.
And unfortunately, both of these men are known for bullshitting more than anything else, and have been for a long time now.