Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bloom Supporting #44

Open
c-box opened this issue Mar 13, 2023 · 3 comments
Open

Bloom Supporting #44

c-box opened this issue Mar 13, 2023 · 3 comments

Comments

@c-box
Copy link

c-box commented Mar 13, 2023

The repository uses transformers version 4.18, which does not support bloom, is there any way to use bloom as the initial policy for training?

@rajcscw
Copy link
Contributor

rajcscw commented Mar 14, 2023

We are working on removing this hard constraint. Will keep you posted

@c-box
Copy link
Author

c-box commented Mar 15, 2023

We are working on removing this hard constraint. Will keep you posted

That's very nice, is there any time schedule?

@arian-askari
Copy link

arian-askari commented Jun 13, 2023

@c-box In arian-askari#1, I have made several modifications to facilitate the upgrade of the transformers version for RL4LMs. While I cannot guarantee that these changes won't introduce any unforeseen issues, I have successfully trained the BLOOM-560M model with batch size 2 on PPO policy for the summarization after this modifications.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants