Shaurya's Blog

Data, ML and Software Engineering

Stream Processing with Apache Flink. Part-5
clickhouse
flink
java
real-time-analytics
Real-time processing of F1 event streams
Published on 2026-05-28
Racing insights with Airflow 3 and dbt
airflow
dbt
clickhouse
python
Exploring F1 race data with Airflow 3, dbt, and ClickHouse
Published on 2026-04-28
Accelerating Stream Processing in Production
distributed-systems
flink
fluss
java
kafka
We examine how RocksDB, Apache Fluss, and distributed caching layers address scaling challenges, providing both architectural guidance and practical configuration examples for production deployments.
Published on 2026-02-20
Architecting Trust: The Challenge of Data Quality Part 1
architecture
organizational
data-quality
databases
In this series of blog posts, I explore the multi-faceted challenge of building data quality into an organization’s data ecosystem.
Published on 2026-01-26
Kafka topics to Iceberg Tables
flink
kafka
iceberg
spark
real-time-analytics
In this post, we'll explore different approaches for ingesting Kafka data into Iceberg tables, examine strategies for managing schema evolution, and discuss when to choose one method over another based on your specific use case.
Published on 2025-11-30