t-digest overview

Since 1.0.0 Estimate the value at a given percentile, or the percentile rank of a given value, using the t-digest algorithm. This estimation is more memory- and CPU-efficient than an exact calculation using PostgreSQL’s percentile_cont and percentile_disc functions. tdigest is one of two advanced percentile approximation aggregates provided in TimescaleDB Toolkit. It is a space-efficient aggregation, and it provides more accurate estimates at extreme quantiles than traditional methods. tdigest is somewhat dependent on input order. If tdigest is run on the same data arranged in different order, the results should be nearly equal, but they are unlikely to be exact. The other advanced percentile approximation aggregate is uddsketch, which produces stable estimates within a guaranteed relative error. If you aren’t sure which to use, try the default percentile estimation method, percentile_agg. It uses the uddsketch algorithm with some sensible defaults.

Two-step aggregation

This group of functions uses the two-step aggregation pattern. Rather than calculating the final result in one step, you first create an intermediate aggregate by using the aggregate function. Then, use any of the accessors on the intermediate aggregate to calculate a final result. You can also roll up multiple intermediate aggregates with the rollup functions. The two-step aggregation pattern has several advantages:

More efficient because multiple accessors can reuse the same aggregate
Easier to reason about performance, because aggregation is separate from final computation
Easier to understand when calculations can be rolled up into larger intervals, especially in window functions and continuous aggregates
Perform retrospective analysis even when underlying data is dropped, because the intermediate aggregate stores extra information not available in the final result

To learn more, see the blog post on two-step aggregates.

Samples

Aggregate and roll up percentile data to calculate daily percentiles

Create an hourly continuous aggregate that contains a percentile aggregate:

CREATE MATERIALIZED VIEW foo_hourly
WITH (timescaledb.continuous)
AS SELECT
    time_bucket('1 h'::interval, ts) AS bucket,
    tdigest(100, value) AS tdigest
FROM foo
GROUP BY 1;

Use accessors to query directly from the continuous aggregate for hourly data. You can also roll the hourly data up into daily buckets, then calculate approximate percentiles:

SELECT
    time_bucket('1 day'::interval, bucket) AS bucket,
    approx_percentile(0.95, rollup(tdigest)) AS p95,
    approx_percentile(0.99, rollup(tdigest)) AS p99
FROM foo_hourly
GROUP BY 1;

Available functions

Aggregate

tdigest(): aggregate data in a t-digest for percentile calculation

Accessors

approx_percentile(): estimate the value at a given percentile from a t-digest
approx_percentile_rank(): estimate the percentile rank of a given value from a t-digest
max_val(): get the maximum value from a t-digest
mean(): calculate the exact mean from values in a t-digest
min_val(): get the minimum value from a t-digest
num_vals(): get the number of values in a t-digest

Rollup

rollup(): combine multiple t-digest aggregates

Approximate count distinct

Statistical and regression analysis

Minimum and maximum

Financial analysis

Percentile approximation

Counters and gauges

Time-weighted calculations

Downsampling

Frequency analysis

State tracking

Saturating math

Two-step aggregation

Samples

Aggregate and roll up percentile data to calculate daily percentiles

Available functions

Aggregate

Accessors

Rollup

Approximate count distinct

Statistical and regression analysis

Minimum and maximum

Financial analysis

Percentile approximation

Counters and gauges

Time-weighted calculations

Downsampling

Frequency analysis

State tracking

Saturating math

​Two-step aggregation

​Samples

​Aggregate and roll up percentile data to calculate daily percentiles

​Available functions

​Aggregate

​Accessors

​Rollup

Two-step aggregation

Samples

Aggregate and roll up percentile data to calculate daily percentiles

Available functions

Aggregate

Accessors

Rollup