Bloom Supporting #44

c-box · 2023-03-13T12:33:04Z

The repository uses transformers version 4.18, which does not support bloom, is there any way to use bloom as the initial policy for training?

rajcscw · 2023-03-14T13:10:59Z

We are working on removing this hard constraint. Will keep you posted

c-box · 2023-03-15T02:16:10Z

We are working on removing this hard constraint. Will keep you posted

That's very nice, is there any time schedule?

arian-askari · 2023-06-13T10:52:42Z

@c-box In arian-askari#1, I have made several modifications to facilitate the upgrade of the transformers version for RL4LMs. While I cannot guarantee that these changes won't introduce any unforeseen issues, I have successfully trained the BLOOM-560M model with batch size 2 on PPO policy for the summarization after this modifications.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bloom Supporting #44

Bloom Supporting #44

c-box commented Mar 13, 2023

rajcscw commented Mar 14, 2023

c-box commented Mar 15, 2023

arian-askari commented Jun 13, 2023 •

edited

Loading

Bloom Supporting #44

Bloom Supporting #44

Comments

c-box commented Mar 13, 2023

rajcscw commented Mar 14, 2023

c-box commented Mar 15, 2023

arian-askari commented Jun 13, 2023 • edited Loading

arian-askari commented Jun 13, 2023 •

edited

Loading