
[memory] Avoid storing trainer in ModelCardCallback and SentenceTransformerModelCardData #3144

Open
wants to merge 1 commit into
base: master

Conversation

tomaarsen
Collaborator

Resolves #3136

Hello!

Pull Request overview

  • Avoid storing trainer in ModelCardCallback and SentenceTransformerModelCardData

Details

Storing the trainer here prevents proper cleanup, because it creates a cyclic dependency: trainer -> model -> model card -> trainer. Once the trainer and model are overridden (e.g. in https://github.com/UKPLab/sentence-transformers/blob/master/examples/training/data_augmentation/train_sts_seed_optimization.py), the old model/trainer/model_card_data are kept alive by the cycle and are not promptly garbage collected.
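To illustrate why the cycle is a problem, here is a minimal sketch with hypothetical stand-in classes (only the reference structure mirrors the real code): rebinding the names drops the direct references, but CPython's refcounting cannot free a cycle, so the old objects (and any GPU tensors they hold) survive until a cyclic garbage-collection pass happens to run.

```python
import gc
import weakref

# Disable automatic collection so the demo's timing is deterministic.
gc.disable()

# Hypothetical stand-ins for the real classes; only the reference
# structure matters here.
class ModelCardData:
    def __init__(self):
        self.trainer = None  # back-reference that closes the cycle

class Model:
    def __init__(self):
        self.model_card_data = ModelCardData()

class Trainer:
    def __init__(self, model):
        self.model = model
        # trainer -> model -> model card -> trainer
        model.model_card_data.trainer = self

model = Model()
trainer = Trainer(model)
old_model = weakref.ref(model)

# Rebind the names, as a seed-optimization loop does each iteration.
model, trainer = Model(), None

# Refcounting alone cannot free the old objects: the cycle keeps them alive.
assert old_model() is not None

# Only a pass of the cyclic garbage collector reclaims them.
gc.collect()
assert old_model() is None
```

Until that `gc.collect()` pass, every superseded model/trainer pair keeps its memory, which is why repeated reinitialization accumulates VRAM.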

I've moved a number of components around so that neither ModelCardCallback nor SentenceTransformerModelCardData needs to store the Trainer. Although somewhat inconvenient, this means that memory is actually released when the model/trainer is overridden or deleted.
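For context, another common way to break such a cycle (not what this PR does, which restructures the code to avoid storing the trainer at all) is to hold only a weak back-reference. A hedged sketch with a hypothetical `register_trainer` helper:

```python
import weakref

class SentenceTransformerModelCardData:
    """Sketch: keep only a weak back-reference to the trainer."""

    def __init__(self):
        self._trainer_ref = lambda: None  # starts out "dead"

    def register_trainer(self, trainer):
        # A weakref does not increase the refcount, so this attribute
        # can never keep the trainer (and its GPU tensors) alive.
        self._trainer_ref = weakref.ref(trainer)

    @property
    def trainer(self):
        return self._trainer_ref()  # None once the trainer is gone

class Trainer:
    def __init__(self, card_data):
        self.card_data = card_data
        card_data.register_trainer(self)

card = SentenceTransformerModelCardData()
trainer = Trainer(card)
assert card.trainer is trainer

del trainer  # no strong cycle: refcounting frees it immediately
assert card.trainer is None
```

With no strong cycle, cleanup happens deterministically as soon as the last strong reference is dropped, rather than waiting for a cyclic GC pass.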

Before:

Approximate highest recorded VRAM during train_sts_seed_optimization:

16332MiB /  24576MiB

After:

Approximate highest recorded VRAM during train_sts_seed_optimization:

8222MiB /  24576MiB

Note that VRAM usage does still grow, albeit much more slowly, so this may not resolve every leak. That said, since most users only create a single trainer, I suspect the remaining growth is not a major issue.
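To check whether memory still accumulates across repeated trainer creations, one can record a per-iteration baseline after forcing a collection. This CPU-side sketch uses the stdlib `tracemalloc` with a hypothetical `build_and_train` workload; for VRAM the analogous metric would be `torch.cuda.max_memory_allocated` (assuming CUDA is available).

```python
import gc
import tracemalloc

def build_and_train():
    # Hypothetical stand-in for creating a model + trainer and fitting;
    # it just allocates roughly 1 MB of memory.
    return [bytearray(1024) for _ in range(1000)]

tracemalloc.start()
baselines = []
for seed in range(3):
    artifacts = build_and_train()
    del artifacts
    gc.collect()
    current, peak = tracemalloc.get_traced_memory()
    baselines.append(current)

# If references leaked across iterations, `baselines` would grow
# monotonically; with everything freed, it stays roughly flat.
assert max(baselines) - min(baselines) < 100 * 1024
```

A steadily rising baseline across iterations is the signature of surviving references, like the trainer back-reference this PR removes.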

Commits

  • [memory] Avoid storing trainer in ModelCardCallback and SentenceTransformerModelCardData — This prevents a proper cleanup (Tom Aarsen)
Development

Successfully merging this pull request may close these issues.

Memory leaked when the model and trainer were reinitialized