XTuner Release V0.1.21
What's Changed
- [Feature] Support DPO, ORPO and Reward Model by @RangiLyu in #743
- [Bugs] Fix dispatch bugs by @HIT-cwh in #775
- [Bugs] Fix HFCheckpointHook bugs when training deepseekv2 and mixtral withou… by @HIT-cwh in #774
- [Feature] Support the scenario where attn head num is not divisible by sp size by @HIT-cwh in #769
- bump version to 0.1.21 by @HIT-cwh in #776
Full Changelog: v0.1.20...v0.1.21