You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is your feature request related to a problem? Please describe.
This new optimizer seems to be perfect for distributed training https://github.com/NousResearch/DisTrO - reducing communication bandwidth by several orders of magnitude. I apologize if I misunderstood since I am new to both projects. I just got excited about the potential of combining that with what hivemind can do.
Describe the solution you'd like
An optimizer class implementing DeMo.
Describe alternatives you've considered
No alternative currently exists as of my knowledge
Is your feature request related to a problem? Please describe.
This new optimizer seems to be perfect for distributed training https://github.com/NousResearch/DisTrO - reducing communication bandwidth by several orders of magnitude. I apologize if I misunderstood since I am new to both projects. I just got excited about the potential of combining that with what hivemind can do.
Describe the solution you'd like
An optimizer class implementing DeMo.
Describe alternatives you've considered
No alternative currently exists as of my knowledge
Additional context
Paper
https://arxiv.org/abs/2411.19870
Code
https://github.com/bloc97/DeMo
15B training run
https://distro.nousresearch.com/
The text was updated successfully, but these errors were encountered: