GorillaStream.Validator (gorilla_stream v3.0.3)

Copy Markdown View Source

Data validation and quality checking utilities for time series data.

Provides functions to validate, clean, and assess the quality of time series data before compression.

Summary

Functions

Assesses the compression-friendliness of data.

Cleans and fixes common issues in time series data.

Validates time series data and reports any issues.

Functions

assess_compression_potential(data)

Assesses the compression-friendliness of data.

clean(data, opts \\ [])

Cleans and fixes common issues in time series data.

Options

  • :remove_duplicates - Remove duplicate timestamps (default: true)
  • :sort - Sort by timestamp (default: true)
  • :fix_values - Attempt to fix invalid values (default: false)
  • :interpolate_gaps - Fill small gaps with interpolated values (default: false)

validate(data, opts \\ [])

Validates time series data and reports any issues.

Examples

iex> data = [{1609459200, 23.5}, {1609459201, 23.6}]
iex> GorillaStream.Validator.validate(data)
{:ok, %{valid_points: 2, issues: []}}

iex> bad_data = [{1609459200, 23.5}, {"invalid", 23.6}]
iex> GorillaStream.Validator.validate(bad_data)
{:error, %{valid_points: 1, issues: [:invalid_timestamp]}}