Avinash - D - Resume (1) - 1

Mohammed Muteeb Basha

LinkedIn Profile | mbasha21@trine.edu | (248)-949-4251

EDUCATION

Trine University, Detroit, MI Dec 2021 – Mar 2023


Master of Science in Business Analytics 3.85 GPA
Related courses: DBMS, Time Series Analysis, Multivariate Analysis, Machine Learning, Business Intelligence.
SKILLS

• Big data/Cloud: AWS, Spark, Hadoop, Hive, Kafka, Sqoop, Jenkins, Docker, Kubernetes, Airflow, Ab Initio, Informatica
• Languages: Python, SQL, R, SAS, Unix Shell scripting, PySpark
• Databases: Oracle 11g (SQL, PL/SQL), PostgreSQL, MySQL, SQLite3, MongoDB.
• Other Skills/Tools: TortoiseSVN, Git, ServiceNow, WinSCP, SPSS, Tableau, Excel.
• Data science: Probability, Statistics, A/B testing, Chi-square test, Decision trees, Regression, Clustering.
EXPERIENCE

Synchrony Financial May 2020 – April 2021


Data Engineer
• Conceptualized, designed, developed, and productized new ETL pipelines on a big data stack of PySpark, SQL, Kafka,
and S3 to handle over 1 TB of data received via SFTP.
• Implemented and maintained multiple data pipeline DAGs in Airflow.
• Predicted issue response and resolution times by running linear regression on three years of ticket data and
presented the results in Tableau to the DWH teams and end users.
• The Client is a consumer financial services company that is the largest provider of private label credit cards in the US.
• Worked extensively on extraction, transformation, and loading (ETL) of OLTP data from sources such as databases, flat
files, and HDFS files, using the Hadoop environment and the Ab Initio tool.
• Deployed complex data pipelines, including Slowly Changing Dimensions (Types 1, 2, and 3), and altered existing data
flows and tables, increasing efficiency by 20%.
• Automated the new client acquisition process into the data warehouse and reduced the budget allocations by $25K.
• Wrote shell scripts for the SFTP process, file receipt and archival, and other automation such as table load checks,
and automated SQL table partitioning, reducing data modelers' effort by 50%.
• Deployed 10 fact tables and over 20 dimension tables with their respective pipelines to process roughly 200 GB of
data each weekday.
• Worked closely with Product Owners to design experiments measuring the effectiveness of product improvements.
ACADEMIC PROJECTS

Time series analysis of employment in the U.S. energy sector and COVID interaction: Python October 2022
• Applied basic regression and correlation techniques to US employment data and visualized the results.
• Derived the relationship between employment in the oil sector and oil prices, rig counts, and COVID.
Multivariate Analysis of Cryptocurrencies, Gold and Stock dataset: Python, R November 2022
• Applied regression, K-means clustering, and dimensionality reduction (PCA).
• Analyzed the relationship between cryptocurrencies and the traditional stock market, in contrast to gold.