Computer Science Graduate Student at UNC Charlotte with 3 Years of Experience working as a Data Engineer at Schlumberger. Currently working as a Part time Data Steward at Urban Institute - UNC Charlotte.
Experience
Data Engineer, Schlumberger, Pune - Maharashtra, INDIA
Data Steward, Urban Institute - UNC Charlotte, Charlotte - NC,USA
Key Projects :
Data Engineer, Schlumberger
Project #1: Reporter Dashboard- Technologies, Tools and Frameworks: OBIEE, TFS, Informatica, Oracle, Unix, Agile, Scrum
- Responsibilities:
- Plan design, develop, and test high quality end-to-end business intelligence solutions following agile methodology.
- Gathering business requirements, follow up on existing projects and ensure that deliveries attend to stakeholder needs.
- Act as Subject Matter Expert for finance and HR domain and primary point-of-contact for product development staff.
- Monitored and optimized database performance for 50% reduction in query to report time.
- Technologies, Tools and Frameworks: Hadoop, Spark, Hive, Azure Data Flows, Azure Data Factory, Azure Synapse, SSIS, Python, Shell, Scrum, Agile.
- Responsibilities:
- Developed scalable ETLs to transform using cloud data infrastructure.
- Optimized existing queries and ETLs to run on MPP databases to gain 70% performance improvement.
- Conduct PoC on Spark, Hadoop, and Hive for developing data processing, storage and analytics environment.
- Developed audit and control framework to execute, log and monitor status of data processing jobs.
- Automated restart of data processing pipelines, saving 40% time in manual interventions for restart.
- Contributed to the creation of a standard operational documentation for data migration projects to Cloud.
Data Steward, Urban Institute - UNC Charlotte
Project #1: Data Re-modeling- Technologies & Tools: R, Python, SQL, Tidyverse, Tableau, Shell scripting
- Responsibilities:
- Analyze existing raw data and design data model for optimal storage and data analysis.
- Create pipelines to automate raw data transformation and storage in data warehouse.
- Generate data quality reports to provide a holistic view of the available data.
- Contributing to the creation of an operation manuals for Urban Institute’s data access and usage.
- Technologies, Tools and Frameworks: R, Python, SQL, Tidyverse.
- Responsibilities:
- Perform entity resolution across multiple data sources to match and fetch data as per researcher requests.
- Developed a standard entity resolution framework to build a master index using fuzzy matching algorithms.
Education
BE in Computer Engineering
University of Pune, Pune, Maharashtra, India
Jun 2013 – May 2017
CGPA 3.7/4
Master of Science in Computer Science
University Of North Carolina, Charlotte, NC, USA
Jan 2021 - May 2022
CGPA 4/4
Technical Skills
- Programming Languages/Framework: Java, SQL, Python, R, Scala, HTML, CSS3, JavaScript
- Databases: Sql Server, Oracle, Azure Synapse, MongoDB, Hive
- Data Transformation Tools Informatica, SSIS, Azure Data Flows, Azure Data Factory, Hadoop, Spark
- Data Visualization/ Modelling Tools: OBIEE, PowerBI, SSAS, Tableau
- Tools/Lib/IDE: Eclipse, Azure DevOps, Android Studio, Excel, Jupyter Notebook, Git, TFS, UNIX, Confluence, Docker