--- title: 'Singing voice conversion' --- Singing voice conversion === ## Table of Contents [TOC] ## Latest related work 1. 2019 Unsupervised Singing Voice Conversion - https://arxiv.org/pdf/1904.06590.pdf - [Nachmani, et al., INTERSPEECH’19] - Facebook AI Research - demo: https://enk100.github.io/Unsupervised_Singing_Voice_Conversion/ 2. 2020 PITCHNET: Unsupervised Singing Voice Conversion with Pitch Adversarial Network - https://arxiv.org/pdf/1912.01852.pdf - [Deng, et al., ICASSP’20] - Tencent AI Lab - demo: https://tencent-ailab.github.io/pitch-net/ 3. 2020 Unsupervised Cross-Domain Singing Voice Conversion - https://arxiv.org/pdf/2008.02830.pdf - Facebook AI Research ## Datasets 1. Standford CCMA - Smule Karaoke App - https://ccrma.stanford.edu/damp/ - > [name=2019 FB AI Research] From the “DAMP-multiple” section of this dataset, we selected five singers at random. Excluding several singers with low quality audio. Each singer has 10 vocal songs, out of which 9 songs are used for training, and the tenth for validation. 2. NUS-48E - https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6694316 - [downloads here](https://drive.google.com/open?id=12pP9uUl0HTVANU3IPLnumTJiRjPtVUMx) - 12 singers with 4 songs for each singer - 48 pairs of sung and spoken 3. LCSING - https://singing-conversion.github.io - LJS - voice learned from single speaker speech dataset - LCSING - voice learned from single speaker singing dataset - VCTK - Voices learned from multi-speaker speech dataset - NUS-48E - Voices learned from multi-speaker singing dataset ###### tags: `SingingSynthesis`