In this preliminary work, we interpolate between two music samples in the embedding space of Music2Latent. We compare two classical schemes that allow for averaging two feature sequences with arbitrary lengths: symmetric averaging (by Kruskal and Liberman) and DTW Barycenter Averaging. The former method results in stable samples that are coherent with temporal alignment.
Although I failed to visit the conference in person, I attach the paper and the poster, in case anyone is interested. I intend to expand the work in the future, so do not miss any updated in the repository.
- Clone the repository
- Install the required packaged via
pip install -r requirements.txtin your virtual or conda environment - Follow the notebooks
barycenters.ipynbandinterpolation.ipynbfor the examples of usage
