Hacker News

I love the title "Big LLMs" because it means that we are now making a distinction between big LLMs and minute LLMs and maybe medium LLMs. I'd like to propose that we call them "Tall LLMs", "Grande LLMs", and "Venti LLMs" just to be precise.



I'd prefer to see olive sizes get a renaissance. I was always amused by Super Colossal when following my mom around a store as a little kid.

From a random web search, it seems the sizes above Large are: Extra Large, Jumbo, Extra Jumbo, Giant, Colossal, Super Colossal, Mammoth, Super Mammoth, Atlas.


How about wine bottle sizes since we're "bottling" a "distillation" of information...

https://en.wikipedia.org/wiki/Wine_bottle#Sizes


To get pedantic, wine is not a product of distillation.


That almost makes the metaphor more apt. Wine is the real deal, and brandy is the distilled approximation.


Needs more superlatives. “Biggest” < “Extra Biggest” < “Maximum Biggest”. :D


maximum_biggest_final_2


"Non Plus Ultra"

Followed by another company introducing their "Plus Ultra" model.


And I'd love to see data compression terminology get an overhaul. Do we need big LLMs or just succinct data structures? Or maybe "compact" would be good enough? (Yeah LLMs are cool but why not just, you know, losslessly compress the actual data in a way that lets us query its content?)
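To make the "query its content without decompressing" idea concrete, here's a toy Python sketch (class and names are hypothetical, not from any library) of the rank operation that succinct bitvectors support: count the 1-bits in a prefix in O(1) per query using a small precomputed index. Real succinct structures (wavelet trees, FM-indexes) also compress the underlying bits, which this toy skips.

```python
# Toy sketch of a succinct-structure query: rank(i) = number of 1-bits
# in bits[0:i], answered from a small block index instead of rescanning
# the raw data on every query.
class RankBitvector:
    BLOCK = 64  # bits covered by each index entry

    def __init__(self, bits: str):
        self.bits = bits
        # cumulative popcount at each block boundary
        self.blocks = [0]
        for i in range(0, len(bits), self.BLOCK):
            self.blocks.append(self.blocks[-1] + bits[i:i + self.BLOCK].count("1"))

    def rank(self, i: int) -> int:
        # popcount of the full blocks before i, plus a scan of at most
        # one partial block
        b, off = divmod(i, self.BLOCK)
        start = b * self.BLOCK
        return self.blocks[b] + self.bits[start:start + off].count("1")
```

The index costs one integer per 64 bits of data; production structures shrink that overhead to o(n) bits while keeping constant-time queries.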


Well the obvious answer is that LLMs are more than just pure search. They can synthesize novel information from their learned knowledge.


And the US ‘small’ LLMs will actually be slightly larger than the ‘large’ LLMs in the UK.


I wonder how skinny people get dressed overseas: I wear a European S, which translates to XXS in the US, but there are many people skinnier than me, still within a "normal" BMI. Do they have to find XXXS? Do they wear oversized clothes? Choosing trousers is way easier because the system of cm/inches of length+perimeter correspond to real values.


It's a crazy experience being just physically larger than most of the world. Especially when the size on the label carries some implicit shame/judgement. Like I'm skinny, I'm pretty much the lowest weight I can be and not look emaciated / worrying. But when shopping for a skirt in Asian sizes I was a 4XL, and usually an L-2XL in European sizes. Having to shift my mental space to a US M being the "right" size for me was hard for many years. But like I guess this is how sizing was always kinda supposed to work.


The shame you feel is yours, it's not inherent to the sizing.


The shame is inherent to the crushing expectations put on women's appearances and the pressure to be small. It manifests in clothing sizing for the same reason it manifests standing on a scale, it's a measure of your smallness. And what makes it insidious is that the measures are juuust comparable enough across different people to make people feel bad for not having the same numbers as someone 5" shorter than you.

And my experience isn't unique in any way here and it's really hard to not see it pervasive through our culture.


Short men also tend to voice the same complaints you have. They tend to have the added strain of absolute helplessness in their situation.


Uniqlo sizing looks pretty similar to what we have in Europe...


> Choosing trousers is way easier because the system of cm/inches of length+perimeter correspond to real values.

They're not merely real values, they're also rational.


I'm not so sure, there's pi involved here!


I worked at a Norwegian hospital once which had sizes from xxl (ekstra ekstra liten, "extra extra small") to xxs (ekstra ekstra stor, "extra extra large"). So it's simple: you cross the ocean, you go from size xxl to xxs without having to do anything at all...

I should say though, that's the only place I've seen this particular localization.


We ordered swag T-shirts for a conference from two providers, but EU provider L's were actually larger than US L!


It's funny you say that, but when travelling abroad I wondered how Europeans and Japanese stay sufficiently hydrated.


For healthy adults, thirst is a perfectly adequate guide to hydration needs. Historically normal patterns of drinking - e.g. water with meals and a few cups of tea or coffee in between - are perfectly sufficient unless you're doing hard physical labour or spending long periods of time outdoors in hot weather. The modern American preoccupation with constantly drinking water is a peculiar cultural phenomenon with no scientific basis.


Don't many medications dehydrate you though? And Americans are on a lot of medications.


I've always understood constantly drinking water as a ruse to use the bathroom more often, which is helpful for Americans with sedentary lifestyles.


If you are thirsty you are already dehydrated.


Try getting a kidney stone and then find out if adequate hydration is what you want to squeak by with.


Diabetes causes dehydration


Is this a thing about how restaurants in some European countries charge for water?


It's a joke about Americans carrying around giant water bottles.


And for public toilets. I mean restrooms.


> The UK

You mean the EU, right? The UK isn't covered by the AI act.

/s


Big LLM is too long as a name. We should agree on calling them BLLMs. Surely everyone is going to remember what the letters stand for.


I still like Big Data Statistical Model



Bureau of Large Land Management


I want to apologize for this joke in advance. It had to be done.

We could take a page from Trump’s book and call them “Beautiful” LLMs. Then we’d have “Big Beautiful LLMs” or just “BBLs” for short.

Surely that wouldn’t cause any confusion when Googling.


Weirdly enough, the ITU already chose the superlative for the bigliest radio frequency band to be Tremendous:

- Extremely Low Frequency (ELF)

- Super Low Frequency (SLF)

- Ultra Low Frequency (ULF)

- Very Low Frequency (VLF)

- Low Frequency (LF)

- Medium Frequency (MF)

- High Frequency (HF)

- Very High Frequency (VHF)

- Ultra High Frequency (UHF)

- Super High Frequency (SHF)

- Extremely High Frequency (EHF)

- Tremendously High Frequency (THF)

Maybe one day some very smart people will make Tremendously Large Language Models. They will be very large and need a lot of computer. And then you'll have the Extremely Small Language Model. They are like nothing.

https://en.wikipedia.org/wiki/Radio_frequency?#Frequency_ban...


"The Overwhelmingly Large Telescope (OWL) was a conceptual design by the European Southern Observatory (ESO) organisation for an extremely large telescope, which was intended to have a single aperture of 100 metres in diameter. Because of the complexity and cost of building a telescope of this unprecedented size, ESO has decided to focus on the 39-metre diameter Extremely Large Telescope instead."

https://en.m.wikipedia.org/wiki/Overwhelmingly_Large_Telesco...


AFAIK "tremendously" was chosen partly because the range includes 1 "T"Hz.


I like tremendous as an adjective for a frequency range because etymologically it can be traced to the Latin word for 'shaking'. Tremendous, horrendous, terrible all kinda mean "makes you shake".

Horrendous being based on the Latin root for "trembling with fear", tremendous on another Latin root meaning "shaking from excitement", and terrible deriving from a Latin root for, again, "trembling with fear".


I hope they go with "Ludicrous" like in Spaceballs.


It bothers me that the level below 3 Hz is not given the name "Tremendously low". Now it's not symmetrical. I hope the ITU is happy...


XKCD telescope sizes also could provide some guidance

https://xkcd.com/1294/


TLLM is close to TLM


I've sat in more than one board meeting watching them take 20 minutes to land on t-shirt sizes. The greatest enterprise sales minds of our generation...


I've seen things you people wouldn't believe.

I’ve seen corporate slogans fired off from the shoulders of viral creatives. Synergy-beams glittering in the darkness of org charts. Thought leadership gone rogue… All these moments will be lost to NDAs and non-disparagement clauses, like engagement metrics in a sea of pivot decks.

Time to leverage.


... destroyed by madness, starving hysterical! Buying weed in a store then meeting with someone off Craigslist to score eggs.


Name them like clothing sizes: XXLLM, XLLM, LLM, MLM, SLM, XSLM, XXSLM.


i did this!

XXLLM: ~1T (GPT4/4.5, Claude Opus, Gemini Pro)

XLLM: 300~500B (4o, o1, Sonnet)

LLM: 20~200B (4o, GPT3, Claude, Llama 3 70B, Gemma 27B)

~~zone of emergence~~

MLM: 7~14B (4o-mini, Claude Haiku, T5, LLaMA, MPT)

SLM: 1~3B (GPT2, Replit, Phi, Dall-E)

~~zone of generality~~

XSLM: <1B (Stable Diffusion, BERT)

4XSLM: <100M (TinyStories)

https://x.com/swyx/status/1679241722709311490
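The tiers above could be sketched as a simple lookup. This is a hypothetical helper, not anything from the linked post; it uses the boundaries given in the comment, resolving the gaps between ranges at each tier's lower bound.

```python
# Map a model's parameter count (in billions) to the size tiers proposed
# above. Boundaries are the comment's, with gaps between ranges assigned
# to the tier whose lower bound they exceed.
def size_class(params_b: float) -> str:
    tiers = [
        (1000, "XXLLM"),  # ~1T class
        (300,  "XLLM"),   # 300~500B
        (20,   "LLM"),    # 20~200B
        (7,    "MLM"),    # 7~14B
        (1,    "SLM"),    # 1~3B
        (0.1,  "XSLM"),   # <1B
    ]
    for floor, name in tiers:
        if params_b >= floor:
            return name
    return "4XSLM"        # <100M
```

So a 70B model lands in "LLM" and an 8B model in "MLM", matching the examples in the list.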


MLM... uh oh


I hate those ponzi schemes! Never buy a cutco knife or those crappy herbalife supplements.

Alternatively, just make sure you keep things consensual, and keep yourself safe, no judgement or labels from me :)


I've been labeling LLMs as "teensy", "smol", "mid", "biggg", "yuuge". I've been struggling to figure out where to place the lines between them though.


itsy-bitsy <= 3B

teensy 4B to 29B

smol 30B to 59B

mid 60B to 99B

biggg 100B to 299B

yuuge 300B+


But of course these are all flavors of "large", so then we have big large language models, medium large language models, etc, which does indeed make the tall/grande/venti names appropriate, or perhaps similar "all large" condom size names (large, huge, gargantuan).


Why not LLLM for large LLMs and SLLM for small LLMs, assuming there is no middle ground?


M, LM, LLM, LLLM, L3M, L4M.

Gotta leave room for future expansion.


Hopefully the USB making team does NOT step into this...

LLM 3.0, LLM 3.1 Gen 1, LLM 3.2 Gen 1, LLM 3.1, LLM 3.1 Gen 2, LLM 3.2 Gen 2, LLM 3.2, LLM 3.2 Gen 2x2, LLM 4, etc...


2L4M


VLLM, Super VLLM, Almost Large Language Model


What makes it a Small Large Language Model? Why not just an SLM?


Smedium Language Model


Lousy Smarch weather


If we can’t have fun with names, why even be in IT?


S and L cancel out, so it's just an LM.


Small !== -Large


SLM is a widespread term already.


Slim pickings, then?


LLM, LLM 2.0, LLM 3.0, Mini LLM, Micro LLM, LLM C.


LLM 95, LLM 98, LLM Millennium Edition, LLM NT, LLM XP, LLM 2000, LLM 7

I really appreciated the way they managed to come up with a new naming scheme each time, usually used exactly once.


Could always go with the Bungie approach for the Marathon series: LLM, LLM2, LLM∞, ℵ₁ — https://alephone.lhowon.org

(Obviously ∞ is for the actual singularity, and ℵ₁ is the thing after that).


Are you sure that ℵ1 is the thing after that?

https://en.m.wikipedia.org/wiki/Continuum_hypothesis

;-)


LLM 3.11 for Workgroups


Can we have a tiny LLM that can run on a smartphone now?


Apple Intelligence has an LLM that runs locally on the iPhone (15 Pro and up).

But the quality of Apple Intelligence shows us what happens when you use a tiny ultra-low-wattage LLM. There’s a whole subreddit dedicated to its notable fails: https://www.reddit.com/r/AppleIntelligenceFail/top/?t=all

One example of this is “Sorry I was very drunk and went home and crashed straight into bed” being summarized by Apple Intelligence as ”Drunk and crashed”.


I think the real problem with LLMs is we have deterministic expectations of non-deterministic tools. We’ve been trained to expect that the computer is correct.

Personally, I think the summaries of alerts are incredibly useful. But my expectation of accuracy for a 20 word summary of multiple 20-30 word summaries is tempered by the reality that there's gonna be issues given the lack of context. The point of the summary is to help me determine if I should read the alerts.

LLMs break down when we try to make them independent agents instead of advanced power tools. A lot of people enjoy navel gazing and hand waving about ethics, "safety" and bias... then proceed to do things with obvious issues in those areas.


Larger LLMs can summarize all of this quite well though.


Determinism isn't the issue though. Many responses are fine. The displayed one is bad, whether chosen deterministically or not. Some alternatives:

- Passed out drunk

- Crashed in bed

- Slacking because drunk

...

The issue isn't a lack of context; it's that even the available context was handled poorly.


No. Smartphone only spin animated gif while talk to big building next to nuclear reactor. New radio inside make more efficient.


Is a tiny large language model equivalent to a normal sized one?


Yes, it's called an MLM (Medium Language Model).


I expect that the phone will only do the prompt parsing


I want a tiny_phone_based LLM to do thought tracking and comms awareness..

I actually applied to YC in like ~2014 or so for this:

- JotPlot - I wanted a timeline for basically giving a histo timeline of comms between me and others - such that I had a sankey-ish diagram for when and with whom and via which method I spoke with folks, and then each node was the message, call, text, meta links...

I think it's still viable - but my thought process is too currently chaotic to pull it off.

Basically looking at a timeline of your comms and thoughts and expanding into links of thought - now with LLMs you could have a Throw Tag of some sort whereby you have the bot do work on research expanding on certain things and putting up a site for that idea on LOCALHOST (i.e. your phone) so that you can pull up data relevant to the convo - and it's all in a timeline of thought/stream of consciousness

hopefully you can visualize it...


I had a thought that I think some people value social media (e.g. Facebook) essentially for this. Like giving up your Facebook profile means giving up your history or family tree or even your memories.

So in that sense, maybe people would prefer a private alternative.


I read this in Sam Wattersons voice with a pipe abt maybey an inch from his beard,

(Fyi I was a designer at fb and while it was luxious I still hated what I saw in zucks eyes every morn when I passed him.

Super diff from Andy Grove at intel where for whateveer reason we were in the sam oee schekdule

(That was me typing with eues ckised as a test (to myself, typos abound


Terrible names, to be honest. My proposal: Hyper LLMs, Ultra LLMs, Large LLMs, Micro LLMs, Mobile LLMs.


LLM M4 Ultra Pro Max 16e (with headphone jack)


GPT Inside


LLM already has one large in it…


If we can have a "Personal PIN Identification Number", we can have a "Large LLM Language Model".


What about Impersonal PIN anonymization letter?


Redundundant


What does a 20 LLM signify?


or "DietLLM, RegularLLM, MealLLM and SuperSizedLLMWithFries"


It's too bad vLLM and VLM are taken, because it would have been nice to recycle the VLSI solution to describing sizes - get to very large language models and leave it at that.


After very large language models, the next step is mega language models, or MLMs. As a bonus, it describes the VC funding scheme that backs them too.


we could also look to magnetoresistance and go for giant, colossal, extraordinary


Doesn't the first L in LLM mean large already?

It's like saying Automated ATM. Whoever wrote it barely knows what the acronym means.

This whole article feels like written by someone who doesn't understand the subject matter at all


We’re fine with “The Big Friendly Giant” and the Sahara Desert (“desert desert”); Big LLM could join the family of pleonasms.

https://en.m.wikipedia.org/wiki/Pleonasm


When it's a different language it's fine.


Yes, that's the point of the comment and the whole discussion here. LLMs are already Large so what should the prefix be? Big LLM is a strong contender. I'm also pretty sure the creator of redis is not "someone who doesn't understand the subject matter at all".


It's very common for experts on one subject to take a jab at another subject and depend on their reputation while their skillset doesn't translate at all.


Almost everyone says ‘PIN number’ as well.


Dismissed, Big LLM will live on along with Big Data.


Well, big data for me was always clear -- when data sizes are too large to use regular tools (ls, du, wc, vi, pandas).

I.e. when pretty much every tool or script I used before doesn't work anymore and I need a special tool (gsutil, bq, dask, slurm), it's a mind shift.


Then there will be "decaf LLM"


Pro, max, ultra…


"big large language model" renminds me uncomfortably of "automated teller machine machine"


“There are 2 hard problems in computer science: cache invalidation, naming things, and off-by-1 errors.”




