Experience

Director of Data Platform (current) + Manager of Data Platform (prior)

  • Architected a real time ML driven pricing system to support in game price movement of all stocks in under 500ms

  • Hired and managed a team of 8 data engineers and machine learning engineers to support companywide data initiatives

  • Designed and implemented a real time in app analytics tracking framework for tracking user page views and clicks

  • Led sprint planning and planned quarterly roadmaps for company wide data platform initiatives with C level executives

  • Supported the data science function by building a ML platform where new real time inference services can be tested and deployed without engineering guidance

  • Built a latency detection framework that can identify key system bottlenecks and alert when workflows fall behind historical p80, p90, and p95 latency

  • Coordinated with the C suite and the VP of Finance to manage yearly and quarterly tech infrastructure budgets and expectations while scaling new sports on the platform

  • Proposed and implemented an on call rotation process for the wider engineering org using OpsGenie and Cloudwatch

Data Engineer at Amazon

  • Designed a real time data warehouse leveraging various AWS services such as Redshift, Lambda, Firehose, and SQS

  • Created and indexed a new Elasticsearch cluster to power live reporting to advertisers via the AmazonLive creator app

  • Redesigned data pipeline workflows that processed 500TB/day of AmazonLive clickstream events for reporting

  • Led AmazonLive data deletion initiative to conform with GDPR and CCPA advertising regulations overseeing encryption of all pseudonymous customer data past 30 days

  • Oversaw an integration of AmazonLive clickstream events with internal and external robotic traffic datasets to filter out robotic and fraudulent traffic

  • Built an automated data quality check mechanism to detect ETL workflow outages using AWS Glue

  • Introduced Kinesis streams to synchronize clickstream data in real time across multiple data stores – e.g. Redshift, DynamoDB, Elasticsearch, S3

Data Engineer at ShopKeep

  • Researched and deployed a random forest classifier algorithm that ranks leads by probability of conversion and identifies feature variables that contribute to higher conversion, leading to an increase in conversion rate by 2%

  • Designed data models to identify the least engaged customers based on days of transaction and days of login to analytics platform, resulting in a change in customer service strategy

  • Developed custom python ETL pipelines from various platforms – e.g. Salesforce, Zuora, and InContact – into AWS Redshift

Senior Analyst at Hyundai Capital

  • Generated models on forecasting supply of lease end vehicles using regression models

  • Led initiative to sell lease end vehicles to Uber, Lyft, and Carvana

Options Trader at Optiver

  • Market maker for SPY and VIX options

Technologies

AWS

  • Redshift

  • RDS

  • Athena

  • Elasticsearch

  • DynamoDB

  • Lambda

  • ECS Fargate

  • MSK (Kafka)

  • Step Functions

  • Kinesis

  • Glue

  • CDK

  • Quicksight

Business Intelligence Tools

  • Tableau

  • Looker

Other Software

  • Terraform

  • Jira

  • Confluence

  • GitHub

Languages

Python

SQL

Java

TypeScript

Education

Master of Science @ CMU School of Computer Science

  • Software Architecture, Applied Machine Learning, Formal Methods, ML in Production, Distributed Systems

Bachelor of Science in Finance and Economics @ New York University Stern School of Business

Hobbies

Competitive amateur golfer

NBA machine learning betting model