🌱 I'm a Data and Analytics Engineer with a robust foundation in Computer Science Engineering. With 5 years of professional experience, I specialize in Python, PySpark, SQL, AWS, GCP, Kafka, OOP, core Machine Learning, and DE System Design. I currently serve as Senior Associate - Data Engineer at Definitive Healthcare.
🌱 My professional work includes developing:
▶️ An enterprise ETL website built with Python, PySpark, OOP, and system design principles, along with ETL pipelines using Python, PySpark, SQL, DuckDB, and Airflow
▶️ End-to-end (E2E) customer segmentation models using Python and Machine Learning
🌱 I am well versed in:
▶️ Python, PySpark, SQL, OOP, DE System Design
▶️ Data Modelling, ETL, Batch & Streaming data pipelines, Kafka, Data Orchestration (Airflow)
▶️ Cloud Services - GCP (GCS, BigQuery, Dataflow, Looker Studio) and AWS (S3, Lambda, SNS, Glue, Redshift, RDS, Athena)
▶️ Tools - Git, Bitbucket, VSCode, PyCharm, RStudio
🌱 In Data Analysis, I have worked on case studies and exploratory data analysis (EDA) using Python and SQL, with data visualization in Tableau.
🌱 I have built robust data pipelines for both batch and stream processing, and I have worked with multiple databases and data types on GCP and Snowflake.
🌱 Beyond the world of data and laptops, I have a passion for fitness. During my free time, I enjoy cooking and playing the violin.
- Earned the Google Cloud Digital Leader certification.
- Received the Team Award for exceptional performance in 2024.
- Awarded the Shining Star Award for outstanding performance in 2023 and 2024.
- Honored with the Stride the Tride Award for independently managing and developing a key tool during 2021-2022.