I see, it was probably my high learning rate that caused problems. To be honest, I got a bit lazy about retrying full finetuning since LoRA worked so well, but maybe I'll revisit this in the future, perhaps with Qwen Image.
Perhaps what you were dealing with was actually exploding gradients from fp16 training, which _are_ prone to corrupting a model, and that can depend heavily on the learning rate.
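In case it helps for the retry: a minimal sketch (not your actual training setup; the model, batch, and learning rate are placeholders) of the two usual guards against exploding gradients in fp16 training: loss scaling via a GradScaler and gradient-norm clipping before the optimizer step.

```python
import torch
import torch.nn as nn

device = "cuda" if torch.cuda.is_available() else "cpu"
model = nn.Linear(512, 512).to(device)                       # stand-in for the finetuned model
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)   # lower LR than a run that diverged
scaler = torch.cuda.amp.GradScaler(enabled=(device == "cuda"))

for step in range(100):
    x = torch.randn(8, 512, device=device)                   # dummy batch
    target = torch.randn(8, 512, device=device)

    optimizer.zero_grad(set_to_none=True)
    with torch.autocast(device_type=device, dtype=torch.float16, enabled=(device == "cuda")):
        loss = nn.functional.mse_loss(model(x), target)

    scaler.scale(loss).backward()
    scaler.unscale_(optimizer)                                # so clipping sees true gradient magnitudes
    torch.nn.utils.clip_grad_norm_(model.parameters(), max_norm=1.0)
    scaler.step(optimizer)                                    # skips the step if grads are inf/NaN
    scaler.update()
```

The other common fix is simply switching to bf16, which has the same exponent range as fp32 and is much less prone to overflow than fp16.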