What would the hardware requirement for fine-tuning a Llama 2 ? | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		xmly on Aug 25, 2023 \| parent \| context \| favorite \| on: Beating GPT-4 on HumanEval with a fine-tuned CodeL... What would the hardware requirement for fine-tuning a Llama 2 ?

rushingcreek on Aug 25, 2023 [–]

We didn't want to use LoRA to maximize quality, so we used 32 A100-80GB with a sequence length of 4096. It's possible to do a native fine-tune on as little as 8 A100-80GB with DeepSpeed Zero 3, but it will take longer.

With LoRA you can probably get away with just a few 4090s.

Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4
Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact