Clap processor: remove wasteful np.stack operations #27454

m-bain · 2023-11-12T08:08:44Z

What does this PR do?

Upon profiling, it showed some strange result that the ClapProcessor was taking 0.5s to apply _get_input_mel(...) on short audio (less than 10s), whereas medium length audio (10s-20s) was taking only 0.02s

As it turns out there was a wasteful np.stack operation on the 1-D waveform numpy array, meaning that the 1-D array is unpacked then stacked back together again, with no effect. This PR removes this wasteful op and short audio is now also processed in 0.02s

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

@ArthurZucker
@sanchit-gandhi

Np.stack on large 1-D tensor, causing ~0.5s processing time on short audio (<10s). Compared to 0.02s for medium length audio

amyeroberts

Thanks for updating @m-bain!

Could you provide an example snippet you used to run this for future reference of anyone visiting this PR?

HuggingFaceDocBuilderDev · 2023-11-13T16:26:13Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

m-bain · 2023-11-14T09:42:46Z

@amyeroberts hows this ?

import time
import numpy as np

waveform = np.random.rand(100_000)
n_repeat = 10

t1_p = time.time()
prev_impl = np.stack(np.tile(waveform, n_repeat))
t2_p = time.time()

t1_n = time.time()
new_impl = np.tile(waveform, n_repeat)
t2_n = time.time()

assert (prev_impl == new_impl).all()
print(f"Time to process [prev. impl.]: {t2_p-t1_p:.3f}s")
print(f"Time to process [new. impl.]: {t2_n-t1_n:.3f}s")

Time to process [prev. impl.]: 0.883s
Time to process [new. impl.]: 0.001s

amyeroberts · 2023-11-14T09:44:22Z

@m-bain Thanks!

ArthurZucker

Thanks for the catch! 🤗

remove wasteful np.stack Np.stack on large 1-D tensor, causing ~0.5s processing time on short audio (<10s). Compared to 0.02s for medium length audio

sanchit-gandhi

Awesome - thanks @m-bain! Also cc @ylacombe

remove wasteful np.stack Np.stack on large 1-D tensor, causing ~0.5s processing time on short audio (<10s). Compared to 0.02s for medium length audio

remove wasteful np.stack

63fc1bd

Np.stack on large 1-D tensor, causing ~0.5s processing time on short audio (<10s). Compared to 0.02s for medium length audio

m-bain changed the title ~~remove wasteful np.stack~~ Clap processor: remove wasteful np.stack operations Nov 12, 2023

amyeroberts approved these changes Nov 13, 2023

View reviewed changes

ArthurZucker approved these changes Nov 14, 2023

View reviewed changes

amyeroberts merged commit b86c54d into huggingface:main Nov 14, 2023
3 checks passed

sanchit-gandhi reviewed Nov 16, 2023

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Clap processor: remove wasteful np.stack operations #27454

Clap processor: remove wasteful np.stack operations #27454

m-bain commented Nov 12, 2023

amyeroberts left a comment

HuggingFaceDocBuilderDev commented Nov 13, 2023

m-bain commented Nov 14, 2023 •

edited

Loading

amyeroberts commented Nov 14, 2023

ArthurZucker left a comment

sanchit-gandhi left a comment •

edited

Loading

Clap processor: remove wasteful np.stack operations #27454

Clap processor: remove wasteful np.stack operations #27454

Conversation

m-bain commented Nov 12, 2023

What does this PR do?

Before submitting

Who can review?

amyeroberts left a comment

Choose a reason for hiding this comment

HuggingFaceDocBuilderDev commented Nov 13, 2023

m-bain commented Nov 14, 2023 • edited Loading

amyeroberts commented Nov 14, 2023

ArthurZucker left a comment

Choose a reason for hiding this comment

sanchit-gandhi left a comment • edited Loading

Choose a reason for hiding this comment

m-bain commented Nov 14, 2023 •

edited

Loading

sanchit-gandhi left a comment •

edited

Loading