More

jeffreysmith · 2025-11-22T02:29:43 1763778583

Weird that this late, dupe thread came alive after this/my earlier submission didn't seem to get noticed: https://news.ycombinator.com/item?id=45993118

jeffreysmith · 2025-11-21T00:32:56 1763685176

Totally. I don't get why people sleep on AI2's launches. They're such powerful platforms for AI R&D.

jeffreysmith · 2025-11-07T19:43:42 1762544622

I'm one of the many people who Soumith hired to Meta and PyTorch. I had the privilege of working on PyTorch with him and lots of the folks on this post.

As his longtime colleague, the one thing I would want people to know about him and this decision is that Soumith has always viewed PyTorch as a community project. He consistently celebrated the contributions of his co-creators Adam and Sam, and he extended the same view towards the Yangqing and the Caffe2 crew that we merged into PyTorch. At the very beginning, by Soumith's highly intentional design, PyTorch was aimed at being truly developed by and for the AI research community and for many years that was the key way in which we grew the framework, FB PT team, and the wider community. At every single stage of PT's lifecycle, he always ensured that our conception of PT and its community grew to include and celebrate the new people and organizations growing what was possible with PT. He's an incredible talent magnet, and thus more and more smart people kept dedicating their blood, sweat, and tears to making PT bigger and better for more people.

I've worked with some very well known and highly compensated leaders in tech, but *no one* has done the job he has done with ameliorating a bus factor problem with his baby. PT has a unique level of broad support that few other open source technology can reach. In a world of unbounded AI salaries, people who want to move AI research methods forward still freely give their time and attention to PyTorch and its ecosystem. It's the great lever of this era of AI that is moving the world, *due in large part* to the strength of the community he fostered and can now let continue without his direct involvement.

His departure is the end of an era, but it's also operationally a true non-event. PyTorch is going strong and can afford to let one of its creators retire from stewardship. This is precisely what success looks like in open source software.

He deserves our congratulations and our thanks. Enjoy your PT retirement, man.

casualscience · 2025-11-07T20:52:49 1762548769

Also worked with Soumith. The man is a legend, moves mountains and completely changed the course of my career because he liked something I wrote. No arrogance, no politics, just an extremely down to earth and chill guy who elevates everyone around him.

Hope him the best!

sumedh · 2025-11-07T22:11:26 1762553486

What did you write?

jeffreysmith · 2025-09-14T03:31:01 1757820661

American here who went to a Chinese (grad) school for CS and was admitted to every Chinese school I applied to. This is very much a possible route, if you’re appropriately qualified for the program. The main issue is language: outside of HK, programs in English are rare.

contrarian1234 · 2025-09-14T03:48:48 1757821728

That's extremely impressive that you managed to reach such a high level of fluency. I find written and technical Chinese is extremely tricky and different from spoken Chinese

jeffreysmith · 2025-08-14T15:22:21 1755184941

Not sure what's with the HN tone on this announcement. AI2 are really some of the best people around for creating truly open artifacts for the whole ecosystem. Their work on OLMo and Molmo is some of the most transparent and educational material you can find on model building. This is just great news for everyone.

Guthur · 2025-08-14T16:43:48 1755189828

Maybe because many of us are not from the US. The stated goal is US dominance of the AI field, and sorry if the rest of us don't see that as a good thing nor particularly open.

philipkglass · 2025-08-14T17:02:02 1755190922

The Allen Institute for Artificial Intelligence projects so far have been very open. They are open about the trained models, the inference code, the training data sets, and the training code. A research group from any country can pick up where AI2 left off if they want to try a different approach or extension. I want to live in a world where there are many models near the top of leader boards, from many different research groups and countries, and I think that AI2 helps enable that.

The stated "US dominance" goal just pays lip service to what appeals to the funders, kind of like how supercomputing projects traditionally claim that they contribute to curing disease or producing clean energy. (Even if it's something far removed from concrete applications, like high fidelity numerical simulations of aqueous solutions.)

FirmwareBurner · 2025-08-14T16:55:22 1755190522

>The stated goal is US dominance of the AI field

Any country tries to dominate any field if they can do it, it's just human nature. Why is that a bad thing?

That constant competition for superiority between nations is how humanity has evolved from hunter gatherer to having tractors, microwave ovens, airplanes, internet and penicillin.

Herring · 2025-08-14T18:28:46 1755196126

Yes, competition is good. Monopoly is bad. A more distributed power structure is much better for overall progress, and even for the monopolist in the long run (Ex: Intel).

FirmwareBurner · 2025-08-14T21:13:22 1755206002

>Monopoly is bad.

So what do you propose? Should the US stop development till other countries catch up?

Herring · 2025-08-14T21:32:04 1755207124

Nah, I'd say just do more anti-monopoly anti-inequality work. Probably start internally, that's a massive enough task on its own (eg breaking up big tech). Assist other countries eg with aid if (and only if) they are doing the same. This is a big topic, ask your favorite frontier LLM about it.

FirmwareBurner · 2025-08-14T21:54:32 1755208472

Unless China does the same that's an unrealistic ask. That would be like doing nuclear disarmament but only you and everyone else gets to keep their nukes.

Herring · 2025-08-14T22:20:41 1755210041

Your "nukes" are leaking into the water supply. Inequality shows up a million different ways that Americans don't fully understand yet, eg inflation (dominant companies increasing profits), teacher shortages (low wages), student debt (not an issue for the wealthy so why fix it), housing prices (corporate landlords, exclusionary zoning), layoffs (despite record profits) etc etc. This situation (Trump/Musk/Bezos taking most of the gains) is just not long-term stable, and if any other country wants to do the same to themselves let them. The longer it goes the harder it will be to fix.

Again, go have this discussion with the LLM you trust, it's much more informative.

FirmwareBurner · 2025-08-15T22:37:05 1755297425

The issues you described have nothing to do with AI development.

India also has an advanced space program despite many of its people starving and not having running water.

If you decide to invest in advanced tech development only when all your citizens don't have any issues, tech development would stand still.

byteknight · 2025-08-14T16:56:36 1755190596

As an American, I obviously can get behind it, but I can easily see how a declared goal of superiority of others would rub those others the wrong way (and possibly prevent their contribution)

[Insert xkcd new standard image here]

laughingcurve · 2025-08-14T17:04:34 1755191074

As an American researcher, I can assure you that the Chinese superiority and behavior in the field is certainly ENCOURAGING my contributions.

FirmwareBurner · 2025-08-14T17:02:28 1755190948

>a declared goal of superiority of others would rub those others the wrong way

So what? Does that change anything in how things work in reality? Everyone knows it, so why pussyfoot around it?.

Why are people nowadays so sensitive about saying the truth of how things work? Have people been coddled that much that they've can't handle reality? A good life lesson is that the world does not revolve around your feelings.

Guthur · 2025-08-14T17:12:11 1755191531

It's not my feelings mate, if you don't live outside the US and have not been subjected to their unipolar attitude you will probably never understand and there is literally nothing I'm going to say to convince you of the objective reality the rest of us face.

FirmwareBurner · 2025-08-14T17:13:52 1755191632

Sorry, I wasn't talking about you specifically, but the general "you" as in you the reader.

Guthur · 2025-08-14T17:09:29 1755191369

Of course you can justify this, as people have, but you can't then blame the rest of us non US citizens for not aligning with that goal. The US is only a small portion of the global population and the government itself has a long history of stamping on the rest of us.

insane_dreamer · 2025-08-14T17:38:07 1755193087

But better for the rest of the world than private US tech companies dominating.

nativeit · 2025-08-14T21:05:11 1755205511

This just in: AI2 pivoting to a for-profit model, and is seeking venture capital funding.

Oops, sorry that’s next year’s news. Anyway, this is all ringing very familiar.

laughingcurve · 2025-08-14T17:03:40 1755191020

Good luck trying to raise money from a NATIONAL science foundation without it being in the NATIONAL interest.

jejcndj1848 · 2025-08-15T01:21:24 1755220884

I think there’s an interesting cultural phenomena when dominance and interest are seen as one and the same thing

jeffreysmith · 2025-08-14T14:38:23 1755182303

Howdy, HN. Authors here. We got tired of text-to-image leaderboards that only focus on aesthetics, so we built our own benchmarks to test what matters for real work: fidelity to complex prompts, safety, bias, and IP infringement.

We analyzed 18 models and found that no single model is good at everything. For example, GPT-4o has the best safety guardrails but also a 98% IP infringement rate on celebrity likenesses. Google's Imagen 4 Ultra actively counters bias (e.g., 90% of its "CEOs" are female) but struggles with generating crowds. X AI's Grok 2 blocks almost nothing.

Lots more detail in the post. We'll be here all day to answer questions.

jeffreysmith · 2025-07-01T16:25:23 1751387123

[Stealth GenAI startup] | NYC ONSITE | Founding Engineering Lead | Full-Time]

We're a stealth mode genAI startup working to advance the state-of-the-art in generative visual media. We've just closed our preseed round and are now backed by 3 expert VC funds and a collection of AI leader angels. Our team consists of highly experienced AI leaders from leading companies. And we're working with some of the biggest companies in media.

The role we're hiring for is an engineering leader to drive the evolution of our research platform into our first commercial-grade product. This work centers around the orchestration and scaling of multimodal understanding and image and video generation models. You'll be the primary architect of our platform capabilities and working closely with both cloud and model partners as well as enterprise customers deploying our technology.

This is a unique opportunity to be part of the founding technical cohort of a deeply innovative team with a unique technology that no one else is building.

More about the role here: https://docs.google.com/document/d/1xMiTuj3VTgA96Yg3EzORsaDj...

jeffreysmith · 2025-06-02T17:58:53 1748887133

[Stealth GenAI startup] | NYC ONSITE | Founding Engineering Lead | Full-Time]

We're a stealth mode genAI startup working to advance the state-of-the-art in generative visual media. We've just closed our preseed round and are now backed by 3 expert VC funds and a collection of AI leader angels. Our team consists of highly experienced AI leaders from leading companies. And we're working with some of the biggest companies in media.

The role we're hiring for is an engineering leader to drive the evolution of our research platform into our first commercial-grade product. This work centers around the orchestration and scaling of multimodal understanding and image and video generation models. You'll be the primary architect of our platform capabilities and working closely with both cloud and model partners as well as enterprise customers deploying our technology.

This is a unique opportunity to be part of the founding technical cohort of a deeply innovative team with a unique technology that no one else is building.

More about the role here: https://docs.google.com/document/d/1xMiTuj3VTgA96Yg3EzORsaDj...

jeffreysmith · 2025-05-01T15:40:02 1746114002

[Stealth GenAI startup] | NYC ONSITE | Founding Engineering Lead | Full-Time]

We're a stealth mode genAI startup working to advance the state-of-the-art in generative visual media. We've just closed our preseed round and are now backed by 3 expert VC funds and a collection of AI leader angels. Our team consists of highly experienced AI leaders from leading companies. And we're working with some of the biggest companies in media.

The role we're hiring for is an engineering leader to drive the evolution of our research platform into our first commercial-grade product. This work centers around the orchestration and scaling of vision and image generation models. You'll be the primary architect of our platform capabilities and working closely with both cloud and model partners as well as enterprise customers deploying our technology.

This is a unique opportunity to be part of the founding technical cohort of a deeply innovative team with a unique technology that no one else is building.

More about the role here: https://docs.google.com/document/d/1xMiTuj3VTgA96Yg3EzORsaDj...

jeffreysmith · 2025-04-03T20:31:00 1743712260

I just tested this on NVIDIA's Sana codebase, and it did a pretty awesome job: https://gitsummarize.com/NVlabs/Sana

Impressive stuff!