
Resampler - work in progress #14

Open
HpLightcorner opened this issue Mar 17, 2022 · 4 comments

@HpLightcorner
Contributor

Hi,
@matthiasstraka, I have several questions regarding the newly implemented Resampler. Since we can discuss source code directly here, it is probably easiest to handle them in this issue and simply close it once we are confident everything is working.

Out of Range issues
I think the following check should be

ODK_ASSERT_LTE(idx + 1, fp.size() - 1);

but is

ODK_ASSERT_LTE(idx + 1, fp.size());

First Sample Timestamp
Shouldn't the first sample timestamp be calculated based on the current number of samples?
So, instead of

first_sample_timestamp -= m_input_buffer.size() * estimated_sample_duration;

I suggest

first_sample_timestamp -= num_samples * estimated_sample_duration;

Furthermore, an additional safety check is a good idea to prevent negative timestamps

first_sample_timestamp -= num_samples * estimated_sample_duration;
ODK_ASSERT_GTE(first_sample_timestamp, 0);

What are your thoughts on that?

@matthiasstraka
Member

  1. You are correct, the ASSERT should be less-than fp.size(). Since we currently do not have ASSERT_LT, I will change it to

ODK_ASSERT_LTE(idx + 2, fp.size());

to prevent issues with the unsigned size() being 0 (or add ASSERT_LT, which requires a change in two codebases). But this immediately triggers a unit test failure -- I will look into that.
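For illustration only, a minimal sketch of the interpolation pattern the assert protects (not the actual Resampler code): both fp[idx] and fp[idx + 1] are read, so idx + 1 must remain a valid index, which idx + 2 <= fp.size() expresses without hitting the unsigned wrap-around of fp.size() - 1 on an empty vector.

#include <cstddef>
#include <vector>

// Minimal sketch of linear interpolation between two neighbouring samples.
double lerpSample(const std::vector<double>& fp, std::size_t idx, double frac)
{
    // Accesses fp[idx] and fp[idx + 1], hence the bound idx + 2 <= fp.size().
    return fp[idx] + frac * (fp[idx + 1] - fp[idx]);
}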

  2. If m_input_buffer is not empty, there are samples left over from a previous call that will sit in front of the new "num_samples" samples (or, as in the code, the new samples are simply appended after the old ones). We need them so the interpolation code can run smoothly across blocks of data.
    So,

    first_sample_timestamp -= m_input_buffer.size() * estimated_sample_duration;

    simply means: correct the first timestamp of the final m_input_buffer. I am pretty sure that this works, but the code definitely keeps more "old" samples than needed. And the more estimated_sample_duration-scaled timestamps we accumulate, the larger the estimation error becomes - so we might have to improve on that once there is a problem with real-world data. Is that understandable?
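To make the difference between the two corrections explicit, here is a small sketch (hypothetical free functions, not the actual Resampler members), assuming the incoming timestamp refers to the end of the newly added block:

#include <cstddef>

// Timestamp of the first sample of the newly added block only.
double firstTimestampOfNewBlock(double end_of_block, std::size_t num_samples,
                                double estimated_sample_duration)
{
    return end_of_block - num_samples * estimated_sample_duration;
}

// Timestamp of the first sample of the whole m_input_buffer (old samples + new block).
double firstTimestampOfBuffer(double end_of_block, std::size_t buffer_size,
                              double estimated_sample_duration)
{
    return end_of_block - buffer_size * estimated_sample_duration;
}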

  3. Additional ASSERTs for timestamps are definitely a good idea, especially since this code has not been used with non-synthetic data so far, and who knows what kind of data users will throw at it.

I will look into the ASSERT problems and push an updated version probably by tomorrow.

@HpLightcorner
Contributor Author

Great! Regarding 2)

It seems that I am having issues here with real-world data, especially at the start of the stream. I cannot guarantee that the first packet of the stream (let's call it start of stream, SoS) arrives perfectly in time with the Oxygen sample time restart, so I have to add zeros to align the SoS with the Oxygen time (NaN would probably be better, but it is not supported by the Resampler for now). This is an arbitrary number of zeros, hence I cannot guarantee a constant packet size, which seems to cause issues with the calculation of first_sample_time based on m_input_buffer. Using num_samples instead of m_input_buffer.size() solves this for now, but may cause other issues; I guess we will have to take another look at that.

@matthiasstraka
Member

I added a fix that addresses the problem with reading past the last element of the input vector.
I have a hard time picturing what real-world data looks like; I only rely on synthetic data provided to me by @moberhofer, which works similarly to what is generated in

auto samples = generateSignalRamp(block_size); // synthetic block of block_size ramp samples
double t = m_t_prev + samples.size() / m_true_sample_rate->getValue().m_val; // advance the timestamp by the duration of this block at the true sample rate
m_resampler.addSamples(host, out_channel->getLocalId(), t, samples.data(), samples.size()); // hand the block and its timestamp to the resampler

Currently, the assumption is that all data starts at time 0, but it should not be hard to initialize m_last_timestamp of the Resampler to a different value here:

ADD_CONTIGUOUS_SAMPLES should be able to handle pauses at the beginning of the stream (after the initial pause, all samples must be provided since it is a sync channel).
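Sketched from the caller's side (stream_start_time is hypothetical, the other names are taken from the snippet above), the idea would be to start the time bookkeeping at the first packet's arrival time instead of 0, provided m_last_timestamp is initialized to match:

double stream_start_time = 0.25; // hypothetical: first packet arrives at Oxygen time 0.25 s
m_t_prev = stream_start_time;    // instead of implicitly starting the stream at 0
auto samples = generateSignalRamp(block_size);
double t = m_t_prev + samples.size() / m_true_sample_rate->getValue().m_val;
m_resampler.addSamples(host, out_channel->getLocalId(), t, samples.data(), samples.size());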

@HpLightcorner
Contributor Author

HpLightcorner commented Mar 18, 2022

I will test the improvements on my code base.

@matthiasstraka When talking about real-world data, let's first define a use case: for example, microcontrollers stream ADC samples to the host by sending, e.g., 1000 samples every 100 ms, giving us 10,000 samples/second. The microcontrollers are synced via, e.g., gPTP and have a common understanding of when exactly the 1000 samples were acquired, so the timestamp is a global time shared among all microcontrollers.

Scenario A
The microcontrollers start their stream perfectly in sync with the Oxygen sampling (re)start. This is never going to happen unless Oxygen implements some further communication with these devices (e.g., sending a start command).

Scenario B
The first packet of the stream arrives at an arbitrary Oxygen time. The Resampler needs to "fill up" the time from 0 (start of Oxygen time) to the point at which the first packet arrived with NaN or zeros. Furthermore, this first packet maps the Oxygen time to the global microcontroller time; from my understanding this should happen only once and is then valid for all microcontrollers in the network (however, this only holds when all devices are synced via, e.g., gPTP, so we should either make this a requirement when working with multiple streams from multiple sources and the Resampler, or make it optional through an OOP design pattern). We can further demand that every subsequent packet contains the same number of samples.

So, I think the Resampler should fill up samples with 0 or NaN such that the stream of data is aligned with the Oxygen time - or ADD_CONTIGUOUS_SAMPLES should be able to pause a stream at the beginning 👍
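As a minimal sketch of the alignment I have in mind (hypothetical helper functions; sos_oxygen_time is the Oxygen time at which the SoS packet arrives):

#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical helper: number of fill samples (0 or NaN) needed to align the
// start of stream (SoS) with Oxygen time 0 at the nominal sample rate.
std::size_t leadingFillSamples(double sos_oxygen_time, double nominal_sample_rate)
{
    return static_cast<std::size_t>(std::llround(sos_oxygen_time * nominal_sample_rate));
}

// Build the fill block, using NaN (or 0.0 as long as NaN is not supported).
std::vector<double> makeFillBlock(std::size_t count, double fill_value)
{
    return std::vector<double>(count, fill_value);
}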

Packet Loss
The Resampler must be able to deal with packet losses in order to recover from network faults. There are multiple ways to implement that; a very simple one could be:

  • We know the timestamp of the last packet that has arrived
  • We know the timestamp of the packet indicating stream recovery
  • We know the nominal sample rate

Hence we can calculate the number of missing samples and fill up the stream again with either 0 or NaN.
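A sketch of that simple approach (hypothetical helper; last_packet_end is the timestamp right after the last received packet, recovery_time the timestamp of the packet indicating stream recovery):

#include <cmath>
#include <cstddef>

// Hypothetical helper: number of samples lost between the last received packet
// and the packet signalling stream recovery, at the nominal sample rate.
std::size_t missingSamples(double last_packet_end, double recovery_time,
                           double nominal_sample_rate)
{
    const double gap = recovery_time - last_packet_end;
    return gap > 0.0 ? static_cast<std::size_t>(std::llround(gap * nominal_sample_rate)) : 0;
}

The gap would then be filled with that many 0 or NaN samples before appending the recovered packet's data.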
