Latest Tech News

Stay updated with the latest in technology, AI, cybersecurity, and more


Why was Apache Kafka created?

Reading Time: 13 minutes

Intro - the Integration Problem

We talk all the time about what Kafka is, but not so much about why it is the way it is. What better way to find out than to dive into the original motivation for creating Kafka? Circa 2012, LinkedIn’s original intention with Kafka was to solve a data integration problem. LinkedIn used site activity data (e.g. someone liked this, someone posted this) for many things - tracking fraud/abuse, matching jobs to users, training ML models, basic feature

Load Test GlassFlow for ClickHouse: Real-Time Dedup at Scale

By Ashish Bagri, Co-founder & CTO of GlassFlow

TL;DR: We tested GlassFlow on a real-world deduplication pipeline with Kafka and ClickHouse. It handled 55,000 records/sec published by Kafka and processed 9,000+ records/sec on a MacBook Pro, with sub-0.12ms latency. No crashes, no message loss, no reordering. Even with 20M records and 12 concurrent publishers, it remained robust. Want to try it yourself? The full test setup