Really, no one has encountered such problems? :(
Hello,

I am using online recognition as in this example:
https://github.com/NVIDIA/NeMo/blob/main/tutorials/asr/Online_ASR_Microphone_Demo.ipynb

But when a large buffer size is hard-coded (for example, 40 seconds), recognition quality on short files suffers. This is because the unused part of the buffer is filled with zeros in the `transcribe()` function:
```python
frame_len = 40
self.n_frame_len = int(frame_len * self.sr)
self.buffer = np.zeros(shape=self.n_frame_len, dtype=np.float32)

def _decode(self, frame):
    assert len(frame) == self.n_frame_len
    self.buffer[:len(frame)] = frame
    logits = infer_signal(asr_model, self.buffer).cpu().numpy()[0]
    decoded = self._greedy_decoder(logits, self.vocab)
    return decoded

@torch.no_grad()
def transcribe(self, frame=None, merge=True):
    if frame is None:
        frame = np.zeros(shape=self.n_frame_len, dtype=np.float32)
    if len(frame) < self.n_frame_len:
        frame = np.pad(frame, [0, self.n_frame_len - len(frame)], 'constant')
    unmerged = self._decode(frame)
    if not merge:
        return unmerged
    return self.greedy_merge(unmerged)
```
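To put a number on the padding (a quick sketch of my own, not code from the notebook; the 16 kHz sample rate is an assumption based on typical NeMo ASR models): a 2-second utterance placed in a 40-second buffer is 95% zeros, so the acoustic model sees mostly silence.

```python
import numpy as np

sr = 16000                    # assumed sample rate of the demo model
frame_len = 40                # hard-coded buffer length in seconds
n_frame_len = frame_len * sr

clip = np.random.randn(2 * sr).astype(np.float32)  # a 2-second utterance
buffer = np.zeros(n_frame_len, dtype=np.float32)
buffer[:len(clip)] = clip                          # same fill as _decode()

zero_fraction = 1.0 - len(clip) / n_frame_len
print(f"{zero_fraction:.0%} of the buffer is zeros")  # 95%
```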
Switching to a resizable buffer fixes the quality issue, but leads to GPU memory leaks:
```python
def _decode(self, frame):
    logits = infer_signal(asr_model, frame).cpu().numpy()[0]
    decoded = self._greedy_decoder(logits, self.vocab)
    return decoded

@torch.no_grad()
def transcribe(self, frame=None, merge=True):
    if frame is None:
        frame = np.zeros(shape=self.n_frame_len, dtype=np.float32)
    unmerged = self._decode(frame)
    if not merge:
        return unmerged
    return self.greedy_merge(unmerged)
```
What is the cause of this memory leak, and what solutions are there other than a fixed buffer size?
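One middle ground I have been considering (a hypothetical sketch, not the notebook's API; the bucket sizes and the `pad_to_bucket` helper are my own): pad each frame up to the nearest of a few fixed bucket lengths, so the model only ever sees a handful of input shapes instead of an unbounded variety, which bounds how many differently-sized GPU tensors get allocated.

```python
import numpy as np

SR = 16000                      # assumed sample rate
BUCKETS_SEC = (5, 10, 20, 40)   # assumed bucket sizes; tune for your audio

def pad_to_bucket(frame: np.ndarray, sr: int = SR) -> np.ndarray:
    """Zero-pad `frame` up to the smallest bucket length that fits it."""
    for sec in BUCKETS_SEC:
        n = sec * sr
        if len(frame) <= n:
            return np.pad(frame, (0, n - len(frame)), mode="constant")
    raise ValueError("frame longer than the largest bucket")
```

A short clip then gets at most a few seconds of zero padding instead of 38, while long clips still fit. Whether this fully avoids the growth depends on the allocator behavior, but it at least caps the number of distinct input shapes.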