BLOG

Boost your Knowledge with Glassflow

Stay informed about new features, explore use cases, and learn how to build real-time data pipelines with GlassFlow.

ClickHouse

Load Test GlassFlow for ClickHouse: Real-Time Deduplication at Scale

Benchmarking GlassFlow: Fast, reliable deduplication at 20M events

Written by Ashish Bagri06/06/2025, 08.07
Read more
hero about image
Pricing hero background image
hero about image
ClickHouse

Part 2: Why are duplicates happening and JOINs slowing ClickHouse?

Learn the root of the duplication and JOINs issues of Kafka to ClickHouse.

Written by Armend Avdijaj
hero about image
ClickHouse

Part 4: Can Apache Flink be the solution?

Apache Flink isn't the solution for duplications and JOINs on ClickHouse.

Written by Armend Avdijaj
hero about image
ClickHouse

ClickHouse Glossary

ClickHouse Glossary: Key terms for engines, joins & real-time data

Written by Meryem Cebeci
hero about image
Engineering

What is CDC?

How Change Data Capture (CDC) enables real-time data replication & integration.

Written by Armend Avdijaj
hero about image
Engineering

What is a Machine Learning Pipeline? Approaches to Data Integration and Analysis

Learn what a machine learning pipeline is: best practices, use cases, and tools.

Written by Meryem Cebeci

Cleaned Kafka Streams for ClickHouse

Clean Data. No maintenance. Less load for ClickHouse.

GitHub Repo