Database
Last updated
Last updated
All data imported into Gaio DataOS is stored internally in a high-performance local database. Behind the scenes, Gaio uses the ClickHouse database engine to store, manage, and query data across your flows, dashboards, and APIs.
Blazing-fast performance
ClickHouse is optimized for analytical queries and can process billions of rows per second on commodity hardware.
Columnar storage
Stores data by columns instead of rows, reducing disk I/O and improving query speed dramatically.
Real-time analytics
Supports real-time data ingestion and instant query execution β perfect for dashboards and live monitoring.
Efficient compression
Automatically compresses data, reducing storage footprint while preserving performance.
Scalability
Easily scales horizontally with distributed architecture β useful for enterprise-scale workloads.
SQL-compatible
Uses familiar SQL syntax, making it accessible for analysts, engineers, and data scientists.
Fault-tolerant & reliable
Includes support for replication, backup, and high availability across nodes.
When data is uploaded or ingested into a Data Project, it is stored in your projectβs isolated ClickHouse schema.
All flows, dashboards, and AI features run queries directly on top of ClickHouse, ensuring high performance without additional infrastructure.
Every action, from filtering a table to exploring data distributions (as shown in the image above), is powered by ClickHouseβs real-time query engine.
ClickHouse consistently outperforms traditional and cloud-native databases in OLAP scenarios. Below is a comparison of query performance based on public benchmarks and real-world use cases.
Note: Benchmarks may vary depending on infrastructure, dataset size, and query complexity. The table below reflects averages from public sources like ClickBench, Altinity, and independent performance tests.
ClickHouse
Columnar, OLAP
<1 second
3β8x
β Yes
β Yes
PostgreSQL
Row-based, OLTP
~30β60 seconds
1β2x
β No
β Limited
BigQuery
Serverless OLAP
~3β10 seconds (cold start)
2β4x
β Yes
β No (batch-based)
Redshift
Columnar, OLAP
~5β15 seconds
2β4x
β Yes
β No (cached)
Snowflake
Cloud-based OLAP
~5β20 seconds
3β6x
β Yes
β No (batch)
MySQL
Row-based, OLTP
~50β120 seconds
~1x
β No
β No
Gaio DataOS was designed for speed, flexibility, and scalability. ClickHouse was the natural choice for the following reasons:
It's open-source, robust, and proven in production at scale (used by Yandex, Cloudflare, and others).
It allows Gaio to deliver low-latency queries across huge datasets with minimal resource overhead.
Its native SQL support, built-in compression, and distributed architecture perfectly match Gaio's mission: democratizing enterprise analytics without compromising on performance.
is a column-oriented database management system (DBMS) designed for real-time analytical processing (OLAP). It was built to deliver lightning-fast queries on massive volumes of data, making it ideal for modern data platforms like Gaio.