I’m no expert but Florence2 has been my go-to. It’s pretty great at picking up a...

wruza · on Sept 12, 2024

Yeah, blip mostly ignores prompt too. I tried to disassemble it and feed my prompts, to no avail. Although I found that default kohya gui arguments are not even remotely the best. Here's my args:

  finetune/make_captions.py ... \
    --num_beams=12 \
    --top_p=0.9 \
    --max_length=75 \
    --min_length=24 \
    --beam_search \
    ...

With this, it's very often that I just take its caption as is, or add little.

TagGui

Oh, interesting, thanks!