Real-Time Sensor Data Aggregation & Anomaly Detection

This example demonstrates a high-performance multistage pipeline for processing streaming sensor data using the Roda Engine. It showcases statistical windowing (Aggregation) and stateful delta analysis (Anomaly Detection) in a thread-per-stage architecture.

The implementation is located in main.rs.

Key Features

Multistage Pipeline: Decouples data ingestion, statistical aggregation, and anomaly detection into separate CPU-bound stages.
Stateful Windowing: Maintains running statistics (min, max, average) for sensors using the stateful pipe component.
Low-Latency Alerting: Detects anomalies (e.g., sudden spikes in average value) using the delta component to compare current window state with the previous one.
Performance Metrics:
- Execution Latency: Measures time spent within each stage using the latency pipe component.
- End-to-End Latency: Tracks "Tick-to-Alert" latency from raw reading to signal generation.
- Throughput: Processes millions of sensor readings per second (~56.1 MEPS in the benchmark reported under Performance Metrics).
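
To make the stateful windowing idea concrete, here is a minimal sketch of per-sensor running statistics in plain Rust. It does not use the Roda Engine API; `WindowStats`, `Aggregator`, and `observe` are hypothetical names chosen for illustration, and the field types are assumptions.

```rust
use std::collections::HashMap;

/// Running statistics for one sensor within the current window.
#[derive(Debug, Clone, Copy, PartialEq)]
pub struct WindowStats {
    pub min: f64,
    pub max: f64,
    pub sum: f64,
    pub count: u64,
}

impl WindowStats {
    /// Starts a window from its first observation.
    pub fn new(value: f64) -> Self {
        Self { min: value, max: value, sum: value, count: 1 }
    }

    /// Folds one more observation into the window in constant time.
    pub fn update(&mut self, value: f64) {
        self.min = self.min.min(value);
        self.max = self.max.max(value);
        self.sum += value;
        self.count += 1;
    }

    /// Mean of all observations seen so far in this window.
    pub fn average(&self) -> f64 {
        self.sum / self.count as f64
    }
}

/// Hypothetical per-sensor state table, keyed by sensor id.
#[derive(Default)]
pub struct Aggregator {
    windows: HashMap<u64, WindowStats>,
}

impl Aggregator {
    /// Records a reading and returns a copy of that sensor's updated statistics.
    pub fn observe(&mut self, sensor_id: u64, value: f64) -> WindowStats {
        *self
            .windows
            .entry(sensor_id)
            .and_modify(|w| w.update(value))
            .or_insert_with(|| WindowStats::new(value))
    }
}
```

Calling `observe(1, 10.0)`, `observe(1, 30.0)`, and `observe(1, 20.0)` on a fresh `Aggregator` leaves sensor 1 with a minimum of 10, a maximum of 30, and an average of 20. Keeping only `sum` and `count` avoids storing individual readings, which is one common way to keep the per-record cost constant.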

Pipeline Architecture

```mermaid
graph LR
    A[Raw Reading] --> B(Stage 1: Aggregation)
    B -->|Summary| C(Stage 2: Anomaly Detection)
    C -->|Alert| D[Alert Journal]

    subgraph "Worker Thread 1"
    B
    end
    subgraph "Worker Thread 2"
    C
    end
```
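
The thread-per-stage layout in the diagram can be approximated with the standard library alone. The sketch below is not the Roda Engine implementation: it swaps in `std::sync::mpsc` channels between two worker threads, and it assumes a fixed-size tumbling window of `window` readings per sensor. `run_pipeline` and the simplified structs are hypothetical.

```rust
use std::collections::HashMap;
use std::sync::mpsc;
use std::thread;

#[derive(Debug, Clone, Copy)]
pub struct Reading { pub sensor_id: u64, pub value: f64 }

#[derive(Debug, Clone, Copy)]
pub struct Summary { pub sensor_id: u64, pub average: f64 }

#[derive(Debug, Clone, Copy, PartialEq)]
pub struct Alert { pub sensor_id: u64, pub previous: f64, pub current: f64 }

/// Runs a two-stage pipeline, one worker thread per stage, and collects the alerts.
/// `window` is the number of readings folded into one Summary (an assumption).
pub fn run_pipeline(readings: Vec<Reading>, window: usize) -> Vec<Alert> {
    let (reading_tx, reading_rx) = mpsc::channel::<Reading>();
    let (summary_tx, summary_rx) = mpsc::channel::<Summary>();
    let (alert_tx, alert_rx) = mpsc::channel::<Alert>();

    // Worker thread 1: aggregation. Keeps (sum, count) per sensor.
    let aggregate = thread::spawn(move || {
        let mut windows: HashMap<u64, (f64, usize)> = HashMap::new();
        for r in reading_rx {
            let w = windows.entry(r.sensor_id).or_insert((0.0, 0));
            w.0 += r.value;
            w.1 += 1;
            if w.1 == window {
                let summary = Summary { sensor_id: r.sensor_id, average: w.0 / window as f64 };
                *w = (0.0, 0); // start the next window
                if summary_tx.send(summary).is_err() {
                    break; // stage 2 has shut down
                }
            }
        }
        // summary_tx is dropped here, which closes stage 2's input.
    });

    // Worker thread 2: anomaly detection. Remembers the previous average per sensor.
    let detect = thread::spawn(move || {
        let mut previous: HashMap<u64, f64> = HashMap::new();
        for s in summary_rx {
            if let Some(prev) = previous.insert(s.sensor_id, s.average) {
                if prev > 0.0 && s.average > prev * 1.5 {
                    let alert = Alert { sensor_id: s.sensor_id, previous: prev, current: s.average };
                    if alert_tx.send(alert).is_err() {
                        break; // nobody is collecting alerts
                    }
                }
            }
        }
    });

    // Ingestion runs on the calling thread.
    for r in readings {
        reading_tx.send(r).expect("aggregation stage hung up");
    }
    drop(reading_tx); // signal end of input

    aggregate.join().expect("aggregation stage panicked");
    detect.join().expect("detection stage panicked");
    alert_rx.iter().collect()
}
```

With a window of 2 and readings 10, 10, 20, 20, 21, 21 for a single sensor, the window averages are 10, 20, and 21, so only the step from 10 to 20 produces an alert. Unbounded channels keep the sketch short; a production pipeline would more likely use bounded or lock-free queues to apply backpressure.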

Data Models

Reading: Raw sensor data with sensor_id, value, and receive timestamp.
Summary: Statistical window containing min, max, average, and observation count.
Alert: Signal generated when a sensor's average value jumps by more than 50% compared to the previous window.
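
As an illustration of how these models and the 50% rule fit together, the following sketch defines the three types and a comparison function. Field types, the nanosecond timestamp unit, the `sensor_id` field on `Summary` and `Alert`, and the names `check_jump` and `tick_to_alert_ns` are assumptions made for this example; the actual definitions live in main.rs.

```rust
/// Raw sensor reading.
#[derive(Debug, Clone, Copy, PartialEq)]
pub struct Reading {
    pub sensor_id: u64,
    pub value: f64,
    /// Receive timestamp in nanoseconds (assumed unit).
    pub timestamp_ns: u64,
}

/// Statistical window for one sensor.
#[derive(Debug, Clone, Copy, PartialEq)]
pub struct Summary {
    pub sensor_id: u64, // assumed field, used to tie a window to its sensor
    pub min: f64,
    pub max: f64,
    pub average: f64,
    pub count: u64,
}

/// Signal emitted when the average jumps between consecutive windows.
#[derive(Debug, Clone, Copy, PartialEq)]
pub struct Alert {
    pub sensor_id: u64,
    pub previous_average: f64,
    pub current_average: f64,
}

/// Relative increase that triggers an alert: more than 50%.
pub const JUMP_THRESHOLD: f64 = 0.5;

/// Compares the current window with the previous one for the same sensor.
/// Non-positive previous averages are skipped because a relative change
/// is not well defined for them (a choice made for this sketch).
pub fn check_jump(previous: &Summary, current: &Summary) -> Option<Alert> {
    if previous.average <= 0.0 {
        return None;
    }
    let relative_change = (current.average - previous.average) / previous.average;
    (relative_change > JUMP_THRESHOLD).then(|| Alert {
        sensor_id: current.sensor_id,
        previous_average: previous.average,
        current_average: current.average,
    })
}

/// "Tick-to-Alert" latency: time from receiving the reading to emitting the alert.
pub fn tick_to_alert_ns(reading: &Reading, alert_emitted_ns: u64) -> u64 {
    alert_emitted_ns.saturating_sub(reading.timestamp_ns)
}
```

Under this reading of the rule, a move from an average of 10 to 16 raises an alert, while a move from 10 to exactly 15 does not, because the jump must exceed 50%.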

Usage

```bash
# Run the example with optimizations
cargo run --release --example sensor_test
```

Performance Metrics (tested on MacBook M2 Max)

Based on the latest benchmarks (perf.log):

Throughput: ~56.1 MEPS (Million Events Per Second).
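Stage Latency: ~17.8ns per record for aggregation logic (derived as the reciprocal of the measured throughput, so it is an amortized per-record cost rather than a directly timed latency).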
Architectural Efficiency:
- Instructions Per Cycle (IPC): 1.00 (Standard efficiency for branch-heavy logic).
- Branch Prediction: 94.5% accuracy (5.47% branch misses).
- L1 Cache Performance: 3.16% dcache misses, 1.09% icache misses (Highly optimized memory patterns).
End-to-End Latency: < 500ns (median) for alert generation.
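
The derived figures above follow from simple arithmetic, sketched here so the numbers can be rechecked against a new perf.log. The helper names are hypothetical and are not part of the example's code.

```rust
/// Amortized time per event in nanoseconds, given throughput in
/// million events per second (MEPS): 1e9 ns/s divided by (meps * 1e6) events/s.
pub fn ns_per_event(meps: f64) -> f64 {
    1_000.0 / meps
}

/// Branch prediction accuracy in percent, given the branch miss rate in percent.
pub fn branch_accuracy_percent(miss_percent: f64) -> f64 {
    100.0 - miss_percent
}
```

For the reported values, `ns_per_event(56.1)` is about 17.8 and `branch_accuracy_percent(5.47)` is about 94.5, matching the Stage Latency and Branch Prediction lines.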
