Data Engineer passionate about building scalable data platforms and end-to-end data pipelines.
π‘ Data Engineer with 2+ years of experience designing cloud-native data pipelines and scalable data platforms.
π¨βπ» I specialize in building high-performance ETL/ELT workflows using PySpark and SQL within the Azure ecosystem.
π§ Strong focus on problem-solving (DSA in Python) and system design for scalable systems.
- β‘ Built scalable ETL pipelines using PySpark & Azure Databricks
- π Designed incremental data pipelines with schema enforcement
- ποΈ Implemented Medallion Architecture (Bronze β Silver β Gold)
- π Worked on large-scale distributed data processing and performance optimization
- π§ Regularly solving DSA problems on LeetCode using Python
PySpark β’ SQL β’ Apache Spark β’ Delta Lake
Azure Databricks β’ Azure Data Factory β’ ADLS / Blob Storage
Medallion Architecture β’ ETL / ELT Pipelines β’ Lakehouse
Git & GitHub β’ Databricks Workflows β’ Jupyter Notebooks
- π End-to-End Azure Data Engineering Projects
- π Real-world ETL pipeline implementations
- β‘ Databricks + PySpark use cases
- ποΈ Lakehouse architecture implementations
- π Data modeling & transformation workflows
- π§ Data Structure Implementations Daily LeetCode solutions in Python covering DSA patterns
- π Building production-grade data pipelines
- β‘ Exploring real-time data processing
- π Improving performance & scalability
- ποΈ Designing modern lakehouse architectures
- π§ Learning System Design (scalability, distributed systems)
- Databricks Certified Data Engineer Associate
- Infosys Certified Databricks Analyst
- Infosys Certified PySpark Professional
- Infosys Certified Cloud Beginner
- Infosys Certified Python Associate
Email: kshagun02@gmail.com
Linkedin: https://www.linkedin.com/in/shagun-khandelwal-96a94620a/
Skype: https://join.skype.com/invite/yfGOtrpFQKyd
π You can also find my blogs on:- https://medium.com/@kshagun02