Experience
Director of Data Platform (current) + Manager of Data Platform (prior)
Architected a real-time, ML-driven pricing system to support in-game price movement of all stocks in under 500 ms
Hired and managed a team of 8 data engineers and machine learning engineers to support company-wide data initiatives
Designed and implemented a real-time in-app analytics framework for tracking user page views and clicks
Led sprint planning and set quarterly roadmaps for company-wide data platform initiatives with C-level executives
Supported the data science function by building an ML platform where new real-time inference services can be tested and deployed without engineering guidance
Built a latency detection framework that identifies key system bottlenecks and alerts when workflows fall behind historical p80, p90, and p95 latency
Coordinated with the C-suite and the VP of Finance to manage yearly and quarterly tech infrastructure budgets and expectations while scaling new sports on the platform
Proposed and implemented an on-call rotation process for the wider engineering org using OpsGenie and CloudWatch
Data Engineer at Amazon
Designed a real-time data warehouse using AWS services such as Redshift, Lambda, Firehose, and SQS
Created and indexed a new Elasticsearch cluster to power live reporting to advertisers via the AmazonLive creator app
Redesigned data pipeline workflows that processed 500 TB/day of AmazonLive clickstream events for reporting
Led the AmazonLive data deletion initiative to comply with GDPR and CCPA advertising regulations, overseeing encryption of all pseudonymous customer data past 30 days
Oversaw an integration of AmazonLive clickstream events with internal and external robotic traffic datasets to filter out robotic and fraudulent traffic
Built an automated data quality check mechanism to detect ETL workflow outages using AWS Glue
Introduced Kinesis streams to synchronize clickstream data in real time across multiple data stores – e.g. Redshift, DynamoDB, Elasticsearch, S3
Data Engineer at ShopKeep
Researched and deployed a random forest classifier that ranks leads by probability of conversion and identifies the features that contribute to higher conversion, leading to a 2% increase in conversion rate
Designed data models to identify the least engaged customers based on days of transaction and days of login to the analytics platform, resulting in a change in customer service strategy
Developed custom Python ETL pipelines from various platforms – e.g. Salesforce, Zuora, and InContact – into AWS Redshift
Senior Analyst at Hyundai Capital
Built regression models to forecast the supply of lease-end vehicles
Led initiative to sell lease-end vehicles to Uber, Lyft, and Carvana
Options Trader at Optiver
Market maker for SPY and VIX options
Technologies
AWS
Redshift
RDS
Athena
Elasticsearch
DynamoDB
Lambda
ECS Fargate
MSK (Kafka)
Step Functions
Kinesis
Glue
CDK
QuickSight
Business Intelligence Tools
Tableau
Looker
Other Software
Terraform
Jira
Confluence
GitHub
Languages
Python
SQL
Java
TypeScript
Education
Master of Science @ CMU School of Computer Science
Software Architecture, Applied Machine Learning, Formal Methods, ML in Production, Distributed Systems
Bachelor of Science in Finance and Economics @ New York University Stern School of Business
Hobbies
Competitive amateur golfer
NBA machine learning betting model