Data-Engineering, Data-Analysis Projects
Welcome to my data portfolio! Here, are projects with Python, SQL, R, PySpark, Tableau
Project Link
Tools
Project Description
⛓ dbt airflow pipeline
Python, Snowflake, Airflow, dbt
Developed a data pipeline to build fact and dimension tables for the snowlake database snowflake-sample-data
🏦 Spar Nord Bank Transaction
Python, AWS, EC2, RedShift, SQOOP
Developed a data pipeline utilizing ETL a batch ETL pipeline to read transactional data from RDS, transform and load it into target dimensions and facts on Redshift Data Mart
🗜️ Retail data Pipeline
Python, Kafka, AWS, EC2, Glue, Athena, S3
Developed a streaming pipeline, that reads data from producer and updates data into s3 followed by athena
Project Link
Project Description
Components Designed
Project Link
Area
Project Description
Libraries
👩🏻💻 Absenteesim Analysis
Programming
Analysing the reason and probabilities various conditions for maximum absenteeism in employees of a company.
pandas, numpy, scikit-learn
📺 NYC Airbnb Analysis
Data Wrangling & EDA
Analysis on multiple files to distinguish Airbnb prices across NYC
pandas
Project Link
Area of Analysis
Project Description
🛍 Serious SQL
Data Analysis Queries
Apprenticeship of SQL
👩💼 Employee Info
Employee Info Analysis
Analyis on Employee database implementing all the concepts of SQL
🎦 IMDB Movie
Data cleaning, transformation, Analysis
Analysis of RSVP Indian film production data of past 3 years Movie data to release a movie for global audience
EDA, Data Visualisation, Data Analysis
Project Link
Project Description
Netflix Dashboard
Netflox Movies and TV Show Analysis Analysis of Netflix data on recent movies and tv shows
IT Technical Issues Dashboard
IT Ticket Info Analysis Detailed analysis of tickets booked, resolved in time