Yashashree Patankar

Data Engineer

15+ years designing scalable Data Pipelines, Analytics Platforms, and Cloud-based Data Solutions.

About

Results-driven Data Engineer with 15+ years of experience turning complex, large-scale data challenges into reliable, high-performance solutions. From architecting end-to-end data pipelines and cloud data warehouses to delivering business intelligence that drives real decisions, I bring deep technical expertise and a strong track record across Fortune 500 environments including Oracle, CGI, and PwC.

I specialize in the full data engineering lifecycle — ingestion, transformation, quality, and delivery — with hands-on expertise in AWS big data services, Apache Spark, Airflow, Redshift, and Tableau. AWS certified and consistently recognized for bridging the gap between engineering rigor and business impact.

Experience

Oracle — Pleasanton, CA

Principal Member of Technical Staff

July 2021 — Present
CGI — San Francisco, CA

Senior Consultant

June 2014 — July 2021

Consultant

June 2012 — May 2013
  • Designed, developed and maintained data pipelines, models and analytics for a Business Intelligence product.
  • Engineered solutions for optimal extraction, transformation, and loading of large-scale data from a wide variety of data sources and created automated dataflows into data lakes and data warehouses.
  • Collaborated with cross-functional teams and users to understand data needs and translate them into technical specifications.
  • Optimized performance of critical SQL queries and resolved ETL issues by performing root cause analysis.
  • Deployed data quality checks and audit logging ensuring high quality of data.
  • Developed reports and visualizations using Tableau and Power BI to deliver valuable data-driven insights.
  • Managed scope of requirements and prioritized stories/tasks across distributed teams using agile approach.
PricewaterhouseCoopers — San Francisco, CA

Senior Associate

June 2013 — April 2014
  • Partnered with clients to gather business requirements and prioritize technical details for data analysis.
  • Developed scripts for data integration of disparate data sources, cleansed data and resolved data quality issues.
  • Developed application to capture, validate and format fraud detection data, reducing validation and search time by 50%.
  • Managed a cross-functional data analysis project with CTOs from 25 countries and coordinated changes in scope.
  • Analyzed finder's fee fraud from large data sets and created reports that provided insights.

Education

Master of Information Systems Management

Carnegie Mellon University — Pittsburgh, PA

Bachelor of Engineering in Production Engineering

Mumbai University — Mumbai, India

Skills

Languages

SQL Python Java HTML

Databases

Oracle PostgreSQL SQL Server MySQL Amazon RDS

Big Data

Airflow Spark Hive Hadoop Cassandra Redshift DynamoDB EMR Kinesis

Cloud

AWS EC2 S3 Lambda VPC CloudWatch

Tools

Tableau Power BI Docker Kubernetes Git JIRA Linux SAP BOBJ SSRS

Certifications

Amazon Web Services

AWS Certified Big Data — Specialty

2020

Amazon Web Services

AWS Certified Solutions Architect — Associate

2019

Oracle

Oracle Certified SQL Expert

2013

Sun Microsystems

Sun Certified Java Programmer

2007