Tech Notes: Evaluation of Scalable Solutions for Time Series Database Streaming
Creators
Description
This Tech Note presents an evaluation of scalable solutions for streaming time-series data, critical for real-time analysis in large-scale national research facilities like the NSF Laser Interferometer Gravitational-Wave Observatory (LIGO). The study assesses various time-series databases (ClickHouse, InfluxDB, TimescaleDB) and communication protocols (Kafka, Arrow Flight), focusing on query performance, data ingestion, and scalability. ClickHouse and Kafka emerged as preferred solutions, providing high performance and flexibility for environments with large-scale data requirements. The evaluation is based on use cases from facilities like LIGO, aiming to improve real-time data processing capabilities in NSF Major Facilities.
Notes (English)
Files
Tech Notes_LIGO Data Streaming - Final.pdf
Files
(1.0 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:393911786d1568b2793e0f0563da37d0
|
1.0 MB | Preview Download |
Additional details
Funding
- U.S. National Science Foundation
- CI CoE: CI Compass: An NSF Cyberinfrastructure (CI) Center of Excellence for Navigating the Major Facilities Data Lifecycle 2127548