Boost your Knowledge with Glassflow
Stay informed about new features, explore use cases, and learn how to build real-time data pipelines with GlassFlow.
Load Test GlassFlow for ClickHouse: Real-Time Deduplication at Scale
Benchmarking GlassFlow: Fast, reliable deduplication at 20M events

Part 2: Why are duplicates happening and JOINs slowing ClickHouse?
Learn the root of the duplication and JOINs issues of Kafka to ClickHouse.
Part 3: ClickHouse ReplacingMergeTree and Materialized Views are not enough
Deep dive on limitations of ReplacingMergeTree and Materialized Views.
Part 4: Can Apache Flink be the solution?
Apache Flink isn't the solution for duplications and JOINs on ClickHouse.
Part 5: How GlassFlow will solve Duplications and JOINs for ClickHouse
Learn the details on how GlassFlow will solve Duplications and JOINs.
ClickHouse Glossary
ClickHouse Glossary: Key terms for engines, joins & real-time data
Move Data from Postgres to Snowflake in Real Time using CDC
Stream Postgres data to Snowflake in real time with CDC.
What is CDC?
How Change Data Capture (CDC) enables real-time data replication & integration.
What is a Machine Learning Pipeline? Approaches to Data Integration and Analysis
Learn what a machine learning pipeline is: best practices, use cases, and tools.
What Is Real-Time Analytics? Benefits, Use Cases, and Tools Explained
Explore what real-time analytics is.
Cleaned Kafka Streams for ClickHouse
Clean Data. No maintenance. Less load for ClickHouse.
GitHub Repo