porting the dithering function from kaldi #40

KarelVesely84 · 2024-03-13T16:24:07Z

No description provided.

kaldi-native-fbank/csrc/kaldi-math.h

KarelVesely84 · 2024-03-14T09:00:07Z

okay, I added the missing part, the local build including the test binaries was okay...

kaldi-native-fbank/csrc/feature-window.cc

KarelVesely84 · 2024-03-14T14:32:06Z

the failed linux-macos test seems unrelated

there is a return at unusual place in that failing test test_rfft.py

- 1.0 is too much, and would break the systems

KarelVesely84 · 2024-03-14T14:43:12Z

for me it is ready, i just changed the default dithering constant

k2 uses audio samples [-1..+1]
kaldi was using audio samples [-32k..+32k], hence the constant is 2^15x smaller

csukuangfj · 2024-03-14T15:43:02Z

Thanks! Will check it tomorrow morning.

csukuangfj · 2024-03-15T03:10:54Z

Thank you for your contribution!

nshmyrev · 2024-04-09T04:38:40Z

Hey, can we please change dither constant to be 1.0 like it was before. 0.00003 vs 1.0 can cause a lot of confusion in existing code.

csukuangfj · 2024-04-09T04:41:26Z

Could you create a PR to change it to 1.0?

KarelVesely84 · 2024-04-12T09:17:27Z

The problem is, that in some situations 1.0 is the correct value (for +/- 32k audio signal), while in other situation 0.00003 is the correct value (+/- 1.0 audio signal range), and both options are possible use-cases. So there will be confusion either on one or the other side. And the detection cannot be 100% reliable, as there can be an input signal with hard zero's in it.

So it is better to document it, and give a hint to the users.

But, it is true that if default is 1.0, being internally divided into 0.00003. And, occasionally it is used with +/-32k input singal, it will effectively disable the dithering. While as it is now, using 1.0 will break the system with +/-1.0 audio signal. So, changing the default to 1.0 is a safer way...

nshmyrev · 2024-04-12T10:57:47Z

Let it stay then, sorry for confusion and thanks for the patch.

KarelVesely84 mentioned this pull request Mar 13, 2024

Configurable low_freq high_freq, dithering k2-fsa/sherpa-onnx#664

Merged

csukuangfj reviewed Mar 14, 2024

View reviewed changes

kaldi-native-fbank/csrc/kaldi-math.h Show resolved Hide resolved

porting the dithering function from kaldi

1c96f64

KarelVesely84 force-pushed the add_dithering branch from 1a60898 to 1c96f64 Compare March 14, 2024 08:59

KarelVesely84 commented Mar 14, 2024

View reviewed changes

kaldi-native-fbank/csrc/feature-window.cc Outdated Show resolved Hide resolved

removing the debug message

89f98be

change the default value for dithering

3553db2

- 1.0 is too much, and would break the systems

csukuangfj merged commit 2c7ecc6 into csukuangfj:master Mar 15, 2024
4 of 7 checks passed

KarelVesely84 deleted the add_dithering branch March 20, 2024 17:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

porting the dithering function from kaldi #40

porting the dithering function from kaldi #40

KarelVesely84 commented Mar 13, 2024

KarelVesely84 commented Mar 14, 2024 •

edited

Loading

KarelVesely84 commented Mar 14, 2024

KarelVesely84 commented Mar 14, 2024

csukuangfj commented Mar 14, 2024

csukuangfj commented Mar 15, 2024

nshmyrev commented Apr 9, 2024

csukuangfj commented Apr 9, 2024

KarelVesely84 commented Apr 12, 2024 •

edited

Loading

nshmyrev commented Apr 12, 2024

porting the dithering function from kaldi #40

porting the dithering function from kaldi #40

Conversation

KarelVesely84 commented Mar 13, 2024

KarelVesely84 commented Mar 14, 2024 • edited Loading

KarelVesely84 commented Mar 14, 2024

KarelVesely84 commented Mar 14, 2024

csukuangfj commented Mar 14, 2024

csukuangfj commented Mar 15, 2024

nshmyrev commented Apr 9, 2024

csukuangfj commented Apr 9, 2024

KarelVesely84 commented Apr 12, 2024 • edited Loading

nshmyrev commented Apr 12, 2024

KarelVesely84 commented Mar 14, 2024 •

edited

Loading

KarelVesely84 commented Apr 12, 2024 •

edited

Loading