Add a `wait_for_samples` method to the `MovingWindow` #1159

shsms · 2025-02-04T14:20:57Z

Closes #967

cwasicki · 2025-02-06T21:02:40Z

src/frequenz/sdk/timeseries/_ringbuffer/buffer.py

+        Args:
+            since: The timestamp from which to start counting.  If `None`, the oldest
+                timestamp in the buffer is used.
+            until: The timestamp until which to count.  If `None`, the newest timestamp


It would be better to clarify that `until` is inclusive, since the end timestamp (or index) is usually exclusive in Python.
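To illustrate the difference the reviewer points at, here is a minimal sketch. `count_in_range` is a hypothetical helper written for this note, not the SDK's implementation; it only assumes the semantics described in the docstring above, where `since` and `until` default to the oldest and newest timestamps and both ends are counted.

```python
from datetime import datetime, timedelta, timezone
from typing import Optional


def count_in_range(
    timestamps: list,
    since: Optional[datetime] = None,
    until: Optional[datetime] = None,
) -> int:
    """Count timestamps in [since, until], both ends inclusive (hypothetical helper)."""
    if not timestamps:
        return 0
    lo = min(timestamps) if since is None else since
    hi = max(timestamps) if until is None else until
    return sum(1 for ts in timestamps if lo <= ts <= hi)


start = datetime(2025, 1, 1, tzinfo=timezone.utc)
stamps = [start + timedelta(seconds=i) for i in range(5)]

# Inclusive end: the sample at `until` itself is counted.
inclusive = count_in_range(stamps, since=stamps[1], until=stamps[3])

# A Python slice over the same bounds excludes the end index.
exclusive = len(stamps[1:3])
```

With the same bounds, the inclusive count is one larger than the slice length, which is why the docstring should state the convention explicitly.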

cwasicki · 2025-02-06T21:07:57Z

src/frequenz/sdk/timeseries/_moving_window.py

+            raise ValueError(
+                "The number of samples to wait for must be greater than 0."
+            )
+        if n > self.capacity:


Theoretically I don't see why this is required but also don't have a good example where it hurts.

We are counting the number of items in the buffer, for reporting. If we have to support `n > capacity`, we'll need a different implementation. But I couldn't think of a use case for allowing `n > capacity`.
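A rough sketch of why a count-based wait cannot exceed the capacity, using a `collections.deque` with `maxlen` as a stand-in for the ring buffer (an assumption made for illustration; the real buffer is a different class):

```python
from collections import deque

capacity = 3
buffer = deque(maxlen=capacity)

counts = []
for sample in range(10):
    buffer.append(sample)  # the oldest item is dropped once the buffer is full
    counts.append(len(buffer))

# The item count saturates at `capacity`, so a target above it is never reached.
max_count = max(counts)
```

Because the number of stored items never grows past `capacity`, a wait that counts items in the buffer would block forever for any larger `n`, so rejecting it up front seems like the safer behaviour.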

cwasicki · 2025-02-06T21:20:26Z

src/frequenz/sdk/timeseries/_moving_window.py

+        start_timestamp = self.newest_timestamp
+        if n < self.capacity:
+            n += self.count_valid(since=start_timestamp)
+        while True:


I am not familiar with the `Condition` concept and cannot follow what is happening here. Maybe you could add some explanatory comments here and in the tests below?

I've added a few lines, hope that's sufficient.
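For readers new to `asyncio.Condition`, the general pattern can be sketched as follows. `SampleCounter` is a toy class invented for this note; it counts pushed samples instead of inspecting a buffer, so it is not the code in this PR, only the same wait-and-notify idea.

```python
import asyncio


class SampleCounter:
    """Toy stand-in for a buffer that notifies waiters (not the SDK's class)."""

    def __init__(self) -> None:
        self._received = 0
        self._condition = asyncio.Condition()

    async def push(self) -> None:
        # The lock must be held both to change the state and to notify.
        async with self._condition:
            self._received += 1
            self._condition.notify_all()

    async def wait_for_samples(self, n: int) -> None:
        if n <= 0:
            raise ValueError("The number of samples to wait for must be greater than 0.")
        async with self._condition:
            target = self._received + n
            # wait_for() releases the lock while sleeping and rechecks the
            # predicate each time a producer calls notify_all().
            await self._condition.wait_for(lambda: self._received >= target)


async def demo() -> list:
    counter = SampleCounter()
    task = asyncio.create_task(counter.wait_for_samples(3))
    states = []
    for _ in range(3):
        await asyncio.sleep(0)  # let the waiter run
        states.append(task.done())
        await counter.push()
    await task
    states.append(task.done())
    return states


states = asyncio.run(demo())
```

The waiter stays pending while fewer than three samples have arrived and completes only after the third `push()`.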

cwasicki · 2025-02-06T21:28:42Z

src/frequenz/sdk/timeseries/_moving_window.py

@@ -318,6 +320,34 @@ def window(
            start, end, force_copy=force_copy, fill_value=fill_value
        )

+    async def wait_for_samples(self, n: int) -> None:
+        """Wait until the next `n` samples are available in the MovingWindow.


I guess this means valid samples? If so, I am not sure whether we want this or something time-based, i.e. something that allows for not all samples being valid. However, for our current use case this also works, since we would set `n=1` anyway.

That's interesting. "Valid" means that some data was received. If a component is missing data, the resampler will send `None`, and that is not a valid value.

If a component is sending only None, should this function return after n Nones are received? I'm guessing it should?

I see. In many scenarios I wouldn't distinguish between missing and `None` values. I think it shouldn't return after `n` `None`s, but after `n` new time steps that each have at least one real value. However, we could also leave this for later.

I've updated it to return after n samples are received. Whether they were valid or not needs to be checked with a call to count_valid. I've also updated the docs to state this.

Oh, I see the confusion. I understood that it triggers when `n` new output samples have been "received", i.e. there are new timestamps in the resulting moving window. But this is about input samples. So even if we receive 100 samples, if they are all older than the newest timestamp, we wouldn't get any new timestamp in the window, only updated data points for older timestamps.

This makes sense to me, but I would stress it in the docs. For example, the "valid samples" part is confusing IMO, since this is really about the new samples.

Maybe:

"""Wait until the next `n` samples have been received in the MovingWindow. This function returns after `n` input samples have been received, without considering whether the received samples are valid or which timestamp they have. The validity of the samples in the updated moving window can be verified by calling the [`count_valid`][frequenz.sdk.timeseries.MovingWindow.count_valid] method.

No, it would wait until there are n output samples, but some output samples could be nan.

It does consider the timestamps when the samples are received. It expects n "new" samples to be available in the buffer before it returns.

The current tests only cover cases where there is no resampling in the moving window. I'll rectify that.
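The distinction discussed in this thread can be sketched with a toy window. The dictionary below is a hypothetical stand-in that maps a time step to its value, with NaN marking a step that has no real data; it is not how the `MovingWindow` stores samples.

```python
import math

# Hypothetical window: maps a time step (in seconds) to its value.
window = {0: 1.0, 1: 2.0}
start = max(window)  # newest time step when the wait begins


def new_steps(since: int) -> int:
    """Count time steps strictly newer than `since`."""
    return sum(1 for step in window if step > since)


window[1] = 2.2  # late update to an existing step: no new step appears
after_update = new_steps(start)

window[2] = math.nan  # new step without a real value: still a new step
window[3] = 4.0
after_new = new_steps(start)

# Validity has to be checked separately, as with `count_valid`.
valid_new = sum(
    1 for step, value in window.items() if step > start and not math.isnan(value)
)
```

Here two new output steps exist after the start timestamp, but only one of them holds a real value, which matches the explanation that the wait returns on new samples and validity is checked afterwards.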

cwasicki · 2025-02-10T21:40:50Z

tests/timeseries/test_moving_window.py

+        await push_logical_meter_data(sender, range(0, 5))
+        await asyncio.sleep(0)
+        # After pushing 5 values, the `wait_for_samples` task should be done.
+        assert task.done()


Now, looking at the intended usage example here, would it be possible to wrap this into a receiver that sends the moving window content each time the method triggers? We could implement this in downstream apps, of course, but I guess in the end this is the pattern we would use.

Yup, can do that.
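One possible shape for such a wrapper, sketched as an async generator. `ToyWindow`, `snapshot` and `window_updates` are hypothetical names made up for this note, and the toy class counts pushed samples rather than new timestamps; the actual receiver in the SDK may look quite different.

```python
import asyncio
import contextlib
from collections import deque
from typing import AsyncIterator


class ToyWindow:
    """Minimal stand-in for a moving window (hypothetical, not the SDK's class)."""

    def __init__(self, capacity: int) -> None:
        self._data = deque(maxlen=capacity)
        self._received = 0
        self._condition = asyncio.Condition()

    async def push(self, value: float) -> None:
        async with self._condition:
            self._data.append(value)
            self._received += 1
            self._condition.notify_all()

    async def wait_for_samples(self, n: int) -> None:
        async with self._condition:
            target = self._received + n
            await self._condition.wait_for(lambda: self._received >= target)

    def snapshot(self) -> list:
        return list(self._data)


async def window_updates(window: ToyWindow, n: int) -> AsyncIterator[list]:
    """Yield the window content each time `n` more samples have arrived."""
    while True:
        await window.wait_for_samples(n)
        yield window.snapshot()


async def demo() -> list:
    window = ToyWindow(capacity=3)
    results = []

    async def consume() -> None:
        async for content in window_updates(window, n=2):
            results.append(content)

    consumer = asyncio.create_task(consume())
    await asyncio.sleep(0.01)  # let the consumer start waiting
    for value in (1.0, 2.0, 3.0, 4.0):
        await window.push(value)
        await asyncio.sleep(0.01)  # give the consumer time to react
    consumer.cancel()
    with contextlib.suppress(asyncio.CancelledError):
        await consumer
    return results


results = asyncio.run(demo())
```

A downstream app would then iterate over `window_updates(...)` instead of calling `wait_for_samples` in its own loop.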

shsms requested a review from a team as a code owner February 4, 2025 14:20

shsms requested review from daniel-zullo-frequenz and removed request for a team February 4, 2025 14:20

github-actions bot added labels part:docs (Affects the documentation), part:tests (Affects the unit, integration and performance (benchmarks) tests), and part:data-pipeline (Affects the data pipeline) Feb 4, 2025

shsms mentioned this pull request Feb 4, 2025

[MovingWindow] Add a trigger that fires after having received a fixed number of samples #967

Open

cwasicki reviewed Feb 6, 2025

shsms added 4 commits February 10, 2025 16:01

[MovingWindow] Accept a time range in the count_valid method

2a718ba

It retains the original behaviour of counting all the valid samples in the buffer when no time range is specified. Signed-off-by: Sahas Subramanian <sahas.subramanian@proton.me>

[MovingWindow] Accept a time range in the count_covered method

5057e0a

It retains the original behaviour of counting all the valid samples in the buffer when no time range is specified. Signed-off-by: Sahas Subramanian <sahas.subramanian@proton.me>

[MovingWindow] Add a wait_for_samples method

2c92dd9

Signed-off-by: Sahas Subramanian <sahas.subramanian@proton.me>

Update release notes

f94c4fa

Signed-off-by: Sahas Subramanian <sahas.subramanian@proton.me>

shsms force-pushed the moving-window-samples-trigger branch from a001bf7 to f94c4fa Compare February 10, 2025 16:33

shsms requested a review from cwasicki February 10, 2025 16:36

cwasicki reviewed Feb 10, 2025

shsms requested a review from cwasicki February 11, 2025 08:12
