A Primer on AWS Networking for Data EngineersawsnetworkingIn this post, I'll cover the basics of networking in AWS, including VPCs, subnets, security groups and route tables that every data engineer should be familiar with.Published On2025-02-08Read More →
Data Processing with PySpark, Delta Lake and AWS EMRawsdelta-lakesparkIn this post, we'll discuss data processing with PySpark using the delta lake format and deploying it on AWS Elastic MapReduce (EMR)Published On2024-06-27Read More →