YaFSDP – a tool for faster LLM training and optimized GPU consumption

Last week, we open-sourced YaFSDP, a new tool designed to dramatically speed up the training of large language models.
