Bagel • Unified Model for Multimodal Understanding and Generation (github.com/bytedance-seed)
7 points by montyanderson 32 days ago | 5 comments



Curious there is no discussion on this. I think it looks interesting.

If nothing else, I'm glad someone is still working on open-weight image models. AFAIK there hasn't been much movement in the area since Flux.


I was looking at the model and was curious what HN would make of it. I thought it would be a good talking point since it has been released openly. I haven't tried to run it locally yet, but will as soon as I can.


There has been some discussion in /r/stablediffusion. I'm not sure if anyone has tried to run it, though.


It's the luck of the submission time window; currently: https://news.ycombinator.com/item?id=44094362


The model itself appears to be around 30 GB. My rule of thumb is to double that for RAM, so it should run on about 60 GB of VRAM or unified memory?
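
For anyone who wants to plug in their own numbers, here is a minimal sketch of that doubling rule in Python. The function names and the 2x default are just illustrations of the rule of thumb above, not a measured requirement for this model; real usage depends on precision, resolution, and whatever offloading the runtime does.

```python
def estimate_memory_gb(weights_gb: float, overhead_factor: float = 2.0) -> float:
    """Rough memory estimate: weight size times a fudge factor.

    The 2.0 default is the "double it" rule of thumb from the comment,
    an assumption rather than a benchmarked figure.
    """
    if weights_gb <= 0:
        raise ValueError("weights_gb must be positive")
    if overhead_factor < 1:
        raise ValueError("overhead_factor must be at least 1")
    return weights_gb * overhead_factor


def fits(weights_gb: float, available_gb: float, overhead_factor: float = 2.0) -> bool:
    """True if the rough estimate fits in the available VRAM/unified memory."""
    return estimate_memory_gb(weights_gb, overhead_factor) <= available_gb


if __name__ == "__main__":
    # ~30 GB of weights, as estimated in the comment above
    print(estimate_memory_gb(30))   # 60.0
    print(fits(30, 64))             # True: a 64 GB machine clears the estimate
    print(fits(30, 48))             # False: 48 GB falls short under this rule
```

By this estimate a 64 GB unified-memory machine would clear the bar and a 48 GB card would not, though quantized weights would change the picture.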



