Cloud Architect & Data Engineer

Hi, I'm Ankur Wahi.

A

Self-driven, quick starter, and passionate programmer with a curious mind who enjoys solving complex and challenging real-world problems.

About Me

I have 15+ years of experience in the software industry across multiple domains, specializing in system architecture, backend engineering, and IT program delivery. For the last 7 years, I have focused on accelerating cloud transformations (AWS/GCP) and modernizing complex analytical workloads.

In my current role as a Cloud Engineer at Google Cloud, I act as an AI Engineering SME, partnering with enterprise customers to design and deploy advanced machine learning and AI solutions. I help organizations modernize their data ecosystems, architecting highly scalable analytics pipelines using BigQuery and integrating cutting-edge AI to drive intelligent business outcomes. I also leverage my expertise in geospatial analytics using BigQuery and Google Earth Engine to build innovative product demonstrations.

I am highly passionate about developing scalable, AI-enabled systems that solve real-world problems and deliver meaningful business value.

Quick Tech Summary

  • Languages: Python, Bash
  • Databases: MySQL, BigQuery, BigTable, Spanner, Elasticsearch
  • Frameworks: Spark, Airflow, Earth Engine, Streamlit, Gradio, FastAPI
  • Cloud & Tools: AWS, GCP, Git, Docker

Professional Experience

Cloud Engineer

Google
July 2021 - Present
  • Serve as a strategic technical advisor for GCP customers, designing scalable AI and machine learning architectures.
  • Architect high-performance analytics solutions using BigQuery to fuel data-driven decision-making.
  • Drive customer success by guiding enterprises from proof-of-concept to production for complex AI workloads.
BigQuery Spanner Agent platform Cloud Run ADK MCP

Solutions Architect

Seagate
Feb 2018 - Jan 2021
  • Successfully migrated legacy on-premises Hortonworks Hadoop clusters to AWS.
  • Optimized AWS cloud infrastructure costs through structured resource management.
  • Re-engineered core MapReduce data pipelines into modern Apache Spark/Airflow orchestration jobs.
  • Designed multi-cloud data pipelines bridging Seagate facilities across GCP and AWS.
  • Architected serverless pipelines using AWS Athena, Lambda, and SQS.
Python AWS Spark Presto Airflow GCP BigQuery

Associate Director

Cognizant
Sep 2017 - Feb 2018
  • Created comprehensive technical and architectural product roadmaps for enterprise Big Data solutions.
  • Architected, designed, and tested high-throughput distributed data pipelines.
  • Partnered with developers and stakeholders to establish data strategy, quality, and governance practices.
  • Implemented modern business intelligence layers with Hive, Pig, and Tableau.
  • Designed and deployed low-latency search applications using Elasticsearch and HBase.
Hadoop Hive HBase Elasticsearch Kibana Spark

Technical Architect

Capgemini
Mar 2010 - Sep 2014
  • Led technical design, development, and maintenance of Data Warehouse systems.
  • Developed backend APIs and application modules utilizing Java, Hibernate, and SOAP/REST Web Services.
  • Mentored and coached junior engineering talent in architectural best practices.
  • Spearheaded disaster recovery environments and strategies for core enterprise applications.
Java Unix Websphere Datastage

Consultant

Walgreens
Mar 2008 - Mar 2010
  • Designed and built custom data processing pipelines to replace aging Medicare Part D systems.
  • Developed Transition Letter batch process, aggregating member data across Oracle databases based on complex health plan logic.
Datastage Unix Oracle

Click or tap on the map to view in fullscreen

RPG Quest Career Journey Map of Ankur Wahi

Key Projects

e-Commerce Shopping Agent Project Image

e-Commerce Shopping Agent

Real-time, multimodal (voice & video) AI shopping assistant powered by Gemini Live, backed by Cloud Spanner Graph, Vector Search, and SQL.

React FastAPI Gemini Live API
BQ Transcript Analysis Project Image

BQ Transcript Analysis

Analyze customer service center calls and unearth issues in your supply chain.

FastAPI React BigQuery Gemini DataAgent ADK
US County Crops Project Image

US County Crops-App

Interactive spatial map displaying crops grown in any US county across specific years. Highlights crop NDVI index curves upon clicking dynamic points.

Javascript Earth Engine
Wildfire Project Image

Wildfire & Buildings

Multi-page Streamlit application that monitors structure construction by analyzing satellite scans. Correlates buildings affected by active wildfires.

Streamlit Python Earth Engine
SQL over Satellite Project Image

SQL over Satellite

Framework querying geospatial regions dynamically in BigQuery SQL and fetching underlying raster NDVI and temperature data from GEE.

GCP BigQuery Python Earth Engine
SQL over Doc Project Image

SQL over Doc

Framework connecting unstructured documents in GCS buckets to structured analytics queries, using Document AI parsers mapping directly inside BigQuery.

GCP BigQuery Python Doc AI

Technical Skills

Languages

Python
Bash

Databases

MySQL
BigQuery
BigTable
Spanner
Elasticsearch

Frameworks

Apache Spark
Airflow
Earth Engine
Streamlit
Gradio
FastAPI

Cloud & Tools

AWS
GCP
Git
Docker

Get In Touch

I'm always open to discussing data engineering workloads, cloud migrations, geospatial challenges, or interesting software engineering projects. Feel free to reach out!

RPG Career Journey Map Full View