About
Hi, I'm a data engineer with experience building large-scale data pipelines, real-time streaming systems, and cloud-native data infrastructure. This blog is where I share what I learn, from hands-on engineering challenges to opinions on the modern data stack.
Topics I write about include Apache Spark, Kafka, dbt, Airflow, data lakehouse architectures, cloud platforms (AWS, GCP, Azure), and emerging trends in data engineering.
Why this blog?
Writing helps me think clearly. I publish here to give back to the community that taught me so much, and to document my journey as a practitioner in this rapidly evolving field.
Get in touch
You can reach me via LinkedIn or find my open-source work on GitHub. I'm always happy to connect with fellow engineers.