Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature Request] Integrate DeMO optimizer #640

Open
jzthree opened this issue Dec 20, 2024 · 0 comments
Open

[Feature Request] Integrate DeMO optimizer #640

jzthree opened this issue Dec 20, 2024 · 0 comments
Labels
enhancement New feature or request help wanted Extra attention is needed

Comments

@jzthree
Copy link

jzthree commented Dec 20, 2024

Is your feature request related to a problem? Please describe.
This new optimizer seems to be perfect for distributed training https://github.com/NousResearch/DisTrO - reducing communication bandwidth by several orders of magnitude. I apologize if I misunderstood since I am new to both projects. I just got excited about the potential of combining that with what hivemind can do.

Describe the solution you'd like
An optimizer class implementing DeMo.

Describe alternatives you've considered
No alternative currently exists as of my knowledge

Additional context
Paper
https://arxiv.org/abs/2411.19870
Code
https://github.com/bloc97/DeMo
15B training run
https://distro.nousresearch.com/

@jzthree jzthree added enhancement New feature or request help wanted Extra attention is needed labels Dec 20, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

1 participant