COCOLA

Introduction

This is the official repository COCOLA: Coherence-Oriented Contrastive Learning of Musical Audio Representations.

COCOLA is a contrastive model which is able to estimate the harmonic and rhythmic coherence between pairs of music audio examples.

Installation

Create virtual environment (Optional)

conda create --name cocola python=3.11
conda activate cocola

Install dependencies

pip install -r requirements.txt

Install datasets

If you wish to use MoisesDB for training/validation/test, download it from the official website and unzip it inside ~/moisesdb_contrastive. The other datasets (CocoChorales, Slakh2100, Musdb) are automatically downoladed and extracted by the respective PyTorch Datasets.

Usage

This project uses LightningCLI. For info about usage:

python main.py --help

For info about subcommands usage:

python main.py fit --help
python main.py validate --help
python main.py test --help
python main.py predict --help

You can pass a YAML config file as command line argument instead of specifying each parameter in the command:

python main.py fit --config path/to/config.yaml

See configs for examples of config files.

Example: Training a contrastive model on CocoChorales + MoisesDB + Slakh2100

python main.py fit --config configs/train_all_submixtures.yaml

Pretrained Models

Model Name	Model Checkpoint	Train Dataset	Train Config File	Description
COCOLA_HP_v1	https://drive.google.com/file/d/1HdKgDV2wCdGwCWPlIIRm2ytlbUNah8fo/view?usp=sharing	Moisesdb, Slakh2100, CocoChorales	`configs/train_all_submixtures_hpss.yaml`	Allows to compute COCOLA Score, COCOLA Harmonic Score and COCOLA Percussive Score.

Example 1: calculating COCOLA (Harmonic/Percussive) Score with COCOLA_HP_v1 on a of pair of music audio examples.

from contrastive_model import constants
from contrastive_model.contrastive_model import CoCola
from feature_extraction.feature_extraction import CoColaFeatureExtractor

model = CoCola.load_from_checkpoint("/path/to/checkpoint.ckpt")
feature_extractor = CoColaFeatureExtractor()

model.eval()

# Set to:
# - constants.EmbeddingMode.BOTH for standard COCOLA Score
# - constants.EmbeddingMode.HARMONIC for COCOLA Harmonic Score
# - constants.EmbeddingMode.PERCUSSIVE for COCOLA Percussive Score
model.set_embedding_mode(constants.EmbeddingMode.BOTH)

features_x = feature_extractor(x)
features_y = feature_extractor(y)
score = model.score(features_x, features_y)

where x and y are tensors of shape [1, 16000*5] (audio tracks of 5 seconds sampled at 16000 kHz).

Example 2: calculating COCOLA (Harmonic/Percussive) Scores with COCOLA_HP_v1 on a batch of pairs of music audio examples.

from contrastive_model import constants
from contrastive_model.contrastive_model import CoCola
from feature_extraction.feature_extraction import CoColaFeatureExtractor

model = CoCola.load_from_checkpoint("/path/to/checkpoint.ckpt")
feature_extractor = CoColaFeatureExtractor()

model.eval()

# Set to:
# - constants.EmbeddingMode.BOTH for standard COCOLA Score
# - constants.EmbeddingMode.HARMONIC for COCOLA Harmonic Score
# - constants.EmbeddingMode.PERCUSSIVE for COCOLA Percussive Score
model.set_embedding_mode(constants.EmbeddingMode.BOTH)

features_x = feature_extractor(x)
features_y = feature_extractor(y)
scores = model.score(x, y)

where x and y are tensors of shape [B, 1, 16000*5] (B audio tracks of 5 seconds sampled at 16000 kHz).

scores[i] contains the COCOLA score between x[i] and y[i].

Example 3: calculating COCOLA (Harmonic/Percussive) cross-scores matrix with COCOLA_HP_v1 on a batch of pairs of music audio examples.

from contrastive_model import constants
from contrastive_model.contrastive_model import CoCola
from feature_extraction.feature_extraction import CoColaFeatureExtractor

model = CoCola.load_from_checkpoint("/path/to/checkpoint.ckpt")
feature_extractor = CoColaFeatureExtractor()

model.eval()

# Set to:
# - constants.EmbeddingMode.BOTH for standard COCOLA Score
# - constants.EmbeddingMode.HARMONIC for COCOLA Harmonic Score
# - constants.EmbeddingMode.PERCUSSIVE for COCOLA Percussive Score
model.set_embedding_mode(constants.EmbeddingMode.BOTH)

features = feature_extractor(x)
scores = model(features)

where x is like:

x = {
    "anchor": torch.randn(batch_size, 1, 16000*5, dtype=torch.float32), # 5 seconds, 16000 kHz
    "positive": torch.randn(batch_size, 1, 16000*5, dtype=torch.float32) # 5 seconds, 16000 kHz
}

scores[i, j] contains the COCOLA score between x['anchor'][i] and y['positive'][j].

Troubleshooting

CocoChorales Dataset

Remove string_track001353 from the train split as one stem contains less frames than the other ones.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

COCOLA

Introduction

Installation

Create virtual environment (Optional)

Install dependencies

Install datasets

Usage

Example: Training a contrastive model on CocoChorales + MoisesDB + Slakh2100

Pretrained Models

Example 1: calculating COCOLA (Harmonic/Percussive) Score with COCOLA_HP_v1 on a of pair of music audio examples.

Example 2: calculating COCOLA (Harmonic/Percussive) Scores with COCOLA_HP_v1 on a batch of pairs of music audio examples.

Example 3: calculating COCOLA (Harmonic/Percussive) cross-scores matrix with COCOLA_HP_v1 on a batch of pairs of music audio examples.

Troubleshooting

CocoChorales Dataset

About

Releases

Packages

Contributors 3

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 84 Commits
assets		assets
configs		configs
contrastive_model		contrastive_model
data		data
feature_extraction		feature_extraction
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

License

gladia-research-group/cocola

Folders and files

Latest commit

History

Repository files navigation

COCOLA

Introduction

Installation

Create virtual environment (Optional)

Install dependencies

Install datasets

Usage

Example: Training a contrastive model on CocoChorales + MoisesDB + Slakh2100

Pretrained Models

Example 1: calculating COCOLA (Harmonic/Percussive) Score with COCOLA_HP_v1 on a of pair of music audio examples.

Example 2: calculating COCOLA (Harmonic/Percussive) Scores with COCOLA_HP_v1 on a batch of pairs of music audio examples.

Example 3: calculating COCOLA (Harmonic/Percussive) cross-scores matrix with COCOLA_HP_v1 on a batch of pairs of music audio examples.

Troubleshooting

CocoChorales Dataset

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages