location privacy analysis

# location privacy analysis --- ## Q1. indoor, outdoor, transportation 哪一直種noise最容易暴險? MSP-PODCAST ![](https://i.imgur.com/aJvgARx.png) IEMOCAP ![](https://i.imgur.com/abjnoU7.png) LibriSpeech clean ![](https://i.imgur.com/wu7SkQ2.png) LibriSpeech other ![](https://i.imgur.com/RW0icKj.png) ### A: Noise 本身 transportation 最容易被辨識， indoor最難 ## Q2. DEMUCS denoise過後哪一種location消除得最好 MSP-PODCAST ![](https://i.imgur.com/6TDsTed.png) LibriSpeech clean ![](https://i.imgur.com/lav3btv.png) LibriSpeech other ![](https://i.imgur.com/cYRB860.png) ### A: 拿Demucs的f1 score去減No的f1 score: 三個資料庫outdoor消最好的情況比較多（5個snr中出現4個） ## Q3. 使用WAD比DEMUCS在哪種location消除更多 MSP-PODCAST ![](https://i.imgur.com/xApFZgI.png) IEMOCAP ![](https://i.imgur.com/eV6F6PT.png) LibriSpeech clean ![](https://i.imgur.com/q0hzlFU.png) LibriSpeech other ![](https://i.imgur.com/wUxNjSa.png) ### A: 各個資料庫有自己的特性，我們分成情緒資料庫(MSP-PODCAST, IEMOCAP)以及語音資料庫(LibriSpeech clean, other)來討論 1. 情緒資料庫：大部份是indoor還有transportation被辨識的機會降得比較多（WAD跟DEMUCS比），而outdoor的消除能力比較容易被犧牲掉， 2. 語音資料庫：整體都有降（WAD跟DEMUCS比），不過outdoor被辨識的機會降得比較多 ## Q3-1: 至於在不同SNR狀況的情境下WAD優於DEMUCS的地方有沒有差別？ ### A: 我們在LibriSpeech other與IEMOCAP上分別觀察到高低SNR之間的差別 ![](https://i.imgur.com/IACOIbk.png) 1. 我們可以看出IEMOCAP在SNR=5的時候會處理比較多outdoor ![](https://i.imgur.com/BEUWsFi.png) 3. 我們可以看出LibriSpeech other在SNR=-10的時候會處理比較多indoor # 驗證看看SpecStat（頻率能量統計）有沒有等值於原本的Spectrogram實驗 --- 大致上就把每個音檔（有把padding的部份移除掉）的頻譜對每個band作mean來代表它，然後丟SVM看能不能學得起來的實驗，最後看DEMUCS - NO 跟原本spectrogram + FCNN的prediction reduction狀況有沒有雷同 Librispeech clean的部份： ![](https://i.imgur.com/2IOVp3m.png) Librispeech other的部份： ![](https://i.imgur.com/irkDqfu.png) 結論：比較低的snr (snr=-10) 的趨勢一致（transportation降最多，indoor降最少。）而且降低的幅度也比較像，然後snr越高的幅度就有差了（趨勢可以保留一致到snr=5）。所以如果要拿SpecStat來探討頻率上的差別的話盡量拿snr=-10的，結論可能會比較明顯。