data_loader.py 文件源码

python
阅读 27 收藏 0 点赞 0 评论 0

项目:make_dataset 作者: hyzhan 项目源码 文件源码
def parse_audio(self, audio_path):
        if self.augment:
            y = load_randomly_augmented_audio(audio_path)
        else:
            y = load_audio(audio_path)
        if self.noiseInjector:
            add_noise = np.random.binomial(1, self.noise_prob)
            if add_noise:
                y = self.noiseInjector.inject_noise(y)
        n_fft = int(self.sample_rate * self.window_size)
        win_length = n_fft
        hop_length = int(self.sample_rate * self.window_stride)
        # STFT
        D = librosa.stft(y, n_fft=n_fft, hop_length=hop_length,
                         win_length=win_length, window=self.window)
        spect, phase = librosa.magphase(D)
        # S = log(S+1)
        spect = np.log1p(spect)
        spect = torch.FloatTensor(spect)
        if self.normalize:
            mean = spect.mean()
            std = spect.std()
            spect.add_(-mean)
            spect.div_(std)

        return spect
评论列表
文章目录


问题


面经


文章

微信
公众号

扫码关注公众号