ArimaGupta_DataEngineer_5YoE
ArimaGupta_DataEngineer_5YoE
ArimaGupta_DataEngineer_5YoE
Amazon Web Services: Client: Adani Ports and SEZ Jul 2021 – Sep 2021
S3, Lambda, EMR, CloudWatch, Project Goal: To perform analytics on business units - ports, logistics and dry cargo
SNS, Step function, AWS Glue Roles & Responsibilities
Translate business requirement to create pipeline from different sources such as Oracle
DB, flat files, Mercury DB, API end-points to PostgreSQL in RDS.
EDUCATION Enhanced & optimized existing pipelines to reduce turn around time.
Developed, scheduled & automated near-real time, daily run and monthly run jobs.
Bachelor of Technology – Tech stack: Python, Pandas, Lambda, EC2, SNS, CloudWatch, S3, RDS, PostgreSQL
Computer Science (85.6%)
PSIT College of Engineering, Client: Maruti Suzuki Limited May 2021 - Jul 2021
Uttar Pradesh (IN), 208020 Project Goal: To combine historical observations, current forecasts and statistical weather
2014 - 2018 forecasts to create single dataset via API calls for data scientist team
Roles & Responsibilities
Higher Secondary (90.6%) Worked on visual crossing APIs to fetch csv/json data hourly, daily of 1700+ cities.
St. Don Bosco College, Developed an automated pipeline to transform the data & write to curated layer in S3.
Uttar Pradesh (IN), 262701 Tech stack: Pyspark, Pandas, Step Function, EMR, CloudWatch, Jupyter Notebook
2012 - 2014
SENIOR SYSTEMS ENGINEER (Infosys Ltd., Pune, IN / Jul 2018– May 2021)
Secondary School (93.5%) Client: Semiconductor Manufacturing Company
St. Don Bosco College, Project Goal: To setup a near-real time data lake to analyze the manufactured chips.
Uttar Pradesh (IN), 262701 Roles & Responsibilities
2010 - 2012 Worked as developer and also on documentation of runbook manual, flowchart design,
and code guidelines checklist, technical design document.
Worked on challenges like performance tuning (80GB data on daily basis), rollback
strategies, stored procedure fixes, duplicate records and structured streaming issues
Tech Stack: Pyspark, EMR, Snowflake, S3, Step Function, Structured Streaming, SNS