Hi, I'm Ankur Wahi.
A self-driven, quick-starting, passionate programmer with a curious mind who enjoys solving complex and challenging real-world problems.
About
I have 15+ years of experience in the software industry across different domains, with expertise in architecture, engineering, strategic consulting, and IT program delivery. I have spent the last 7 years migrating analytical workloads to the cloud (AWS/GCP). In my current role at Google, I help customers migrate their data centers to GCP, build product demos, and specialize in building analytics on geospatial products like BigQuery and Google Earth Engine. I am passionate about developing complex applications that solve real-world problems impacting millions of users.
- Languages: Python, Bash
- Databases: MySQL, BigQuery, HBase, Elasticsearch
- Frameworks: Apache Spark, Airflow, Hadoop, Google Earth Engine, Streamlit, Gradio
- Tools & Technologies: Git, Docker, AWS, GCP
Experience
- Built a framework to run SQL over raster imagery
- Developed an app to track US crops at the county level
- Assist customers in migrating their data center to GCP
- Tools: Python, GCP, Streamlit, Google Earth Engine
- Migrate on-prem Hortonworks cluster to AWS
- AWS cost management
- Re-engineer data pipeline from MR to Spark and use Airflow for workflow orchestration
- Design and build pipelines between GCP and AWS
- Work with teams across Seagate factories and build cloud-based architecture for their needs
- Develop data pipelines on serverless architecture using Athena, Lambda and SQS
- Tools: Python, AWS, Spark, Presto, Airflow, GCP BigQuery
- Create big data product architecture roadmap
- Architect, design, deliver, and test complex data implementations
- Work with dev teams to improve data strategy, quality and governance
- Collaborate with business to drive adoption of big data solutions
- Design and implement a business analytics platform using Hive and Pig for Tableau reporting
- Design and build distributed search applications using Elasticsearch and HBase
- Use Kibana to build search visualization reports
- Tools: Hadoop, Hive, HBase, Elasticsearch, Kibana, Spark
- Manage technical analysis, design, development, and maintenance of all data warehouse applications
- Develop new functionality for Java apps using web services and Hibernate
- Coach, mentor and lead personnel within a technical team environment
- Set up a new disaster recovery environment for all applications
- Tools: Java, Unix, WebSphere, DataStage
- Design and implement a solution to replace an ageing Medicare Part D processing system
- Build the Transition Letter process by collating information across databases based on the rules defined in the patient's health plan
- Tools: DataStage, Unix, Oracle
Projects
An app to display crops grown in a US county in a given year
A Streamlit-based app that analyzes satellite imagery to show newly built structures
Skills
Languages and Databases
Python
Elasticsearch
HBase
MySQL
BigQuery
Bash
Frameworks
Apache Spark
Airflow
Streamlit
Gradio
Google Earth Engine
Hadoop
Cloud
AWS
GCP




