SampleStream

SampleStream is a small Python library used to extract data from ARDI for time-series analysis. It's particularly useful for…

* AI applications where you need to work on time-series data,
* Applications where you need to search for events across time-windows,
* Applications where you need very reliable 'bucketing' of data over even intervals of time.

How It Works

SampleStream lets you iterate over a query.

SampleStream takes an AQL request, a window size, a number of samples and a step size.

Every time you ask SampleStream for the next sample of data, it returns window size seconds of data, broken into number of samples buckets.

For example, if you wanted to look at events happening over a 30-second window at 1-second resolution, your window size would be 30 and your number of samples would also be 30 (30 samples over 30 seconds = 1 sample per second).
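The arithmetic in this example can be sketched in a few lines of plain Python. This is only an illustration of how a window divides into even buckets. The function name bucket_edges and its parameters are hypothetical and are not part of SampleStream's API.

```python
def bucket_edges(window_start, window_size, samples):
    """Split one window into evenly sized buckets.

    Hypothetical helper for illustration only (not SampleStream's API).
    window_start and window_size are in seconds; samples is the bucket count.
    Returns a list of (bucket_start, bucket_end) pairs in seconds.
    """
    if samples <= 0:
        raise ValueError("samples must be a positive integer")
    width = window_size / samples  # seconds covered by each bucket
    return [
        (window_start + i * width, window_start + (i + 1) * width)
        for i in range(samples)
    ]


# A 30-second window split into 30 samples gives 1-second buckets.
edges = bucket_edges(0, 30, 30)
resolution = edges[0][1] - edges[0][0]
```

Changing samples to 60 for the same window would halve the bucket width to 0.5 seconds.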

The next time you ask for a sample of data, it will be step size seconds away from the previous sample.

This makes it quite easy to scan over a period of time.
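The scanning behaviour described above can be sketched as a simple generator. This is a minimal illustration of the stepping logic, not SampleStream's actual implementation: the name iter_windows, its parameters, and the choice to stop once a full window no longer fits before the end time are all assumptions.

```python
from datetime import datetime, timedelta


def iter_windows(start, end, window_size, step_size):
    """Yield (window_start, window_end) pairs across a time range.

    Hypothetical sketch, not SampleStream's API. Each window covers
    window_size seconds and begins step_size seconds after the previous one.
    Assumption: a trailing window that would run past `end` is skipped.
    """
    if step_size <= 0:
        raise ValueError("step_size must be positive")
    window = timedelta(seconds=window_size)
    step = timedelta(seconds=step_size)
    current = start
    while current + window <= end:
        yield current, current + window
        current += step


# Scan one minute of time with 30-second windows, moving 10 seconds each step.
windows = list(
    iter_windows(
        datetime(2024, 1, 1, 0, 0, 0),
        datetime(2024, 1, 1, 0, 1, 0),
        window_size=30,
        step_size=10,
    )
)
```

When the step size is smaller than the window size, as here, consecutive windows overlap. Setting the two equal would tile the period with no overlap.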

Why is it different to a normal query?

The window size determines how many seconds of data each sample contains.

Instead of requesting very large chunks of time, SampleStream only requests the time around the current