I'm pretty sure anyone finetuning Lllama now on a regular basis is using https:/...

syntaxing · 2024-09-12T00:41:44 1726101704

I remember seeing them on HN when the first started! I never understood what’s the price you pay, how did they get such a big speed up and less memory usage?

randomcatuser · 2024-09-12T01:02:00 1726102920

There's previous comments, apparently the founder did a lot of math re-deriving things from scratch :)

https://news.ycombinator.com/item?id=39672070

https://unsloth.ai/blog/gemma-bugs

mistrial9 · 2024-09-12T13:45:06 1726148706

nice work in gemma-bugs -- compared to plenty of research work that is a km deep in real math, this tech note is a just few python tweaks. But finding those and doing it? apparently this is useful and they did it. Easy to read (almost child-like) writeup.. thx for pointing to this.

segmondy · 2024-09-12T14:49:19 1726152559

They main author used to worth Nvidia. There's a free plan, and you can pay to get multiple GPU support.

pilooch · 2024-09-12T06:22:41 1726122161

Indeed, a lora finetune of llama 3.1 8B works on a single 24GB GPU and takes from a few hours to a few days depending on the dataset size.