Bagel • Unified Model for Multimodal Understanding and Generation

yreg · 2025-05-22T20:35:26 1747946126

Curious there is no discussion on this. I think it looks interesting.

If nothing else, I'm glad someone is still working on open-weight image models. AFAIK there hasn't been much movement in the area since Flux.

wsintra2022 · 2025-05-25T12:16:57 1748175417

I was looking at the model and was curious about the HN comments. I thought this would be a good talking piece since it has been released openly. I haven’t tried to run it locally yet, but I will as soon as I can.

yreg · 2025-05-25T20:56:40 1748206600

There has been some discussion in /r/stablediffusion. I'm not sure if anyone has tried to run it, though.

mdaniel · 2025-05-26T16:41:06 1748277666

It's the luck of the submission time window; currently: https://news.ycombinator.com/item?id=44094362

wsintra2022 · 2025-05-25T12:19:55 1748175595

The model itself appears to be around 30 GB. My rule of thumb is to double that for RAM, so it should run on 60 GB of VRAM/unified RAM?
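
For anyone who wants to plug in their own numbers, here is a minimal sketch of that rule of thumb. The function name and the 2x default are purely illustrative; the factor is a guess at headroom, not a measured requirement, and real usage will depend on precision, offloading, and output resolution.

```python
def estimate_memory_gb(weights_gb: float, headroom_factor: float = 2.0) -> float:
    """Rough memory estimate: checkpoint size times a headroom factor.

    headroom_factor=2.0 is the rule of thumb from the comment above,
    an assumption rather than a benchmarked figure.
    """
    if weights_gb < 0:
        raise ValueError("weights_gb must be non-negative")
    if headroom_factor < 1:
        raise ValueError("headroom_factor must be at least 1")
    return weights_gb * headroom_factor


# ~30 GB of weights with the 2x rule of thumb
print(estimate_memory_gb(30))
```

With the roughly 30 GB checkpoint this gives 60 GB, which is where the figure above comes from.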