EXPERIENCE
Microsoft | Redmond, WA (Current)
- Principal Data Scientist (CoreAI) (Nov 2025 - Present)
- Forecast GitHub Copilot LLM traffic to guide third-party capacity planning
- Run what-if scenarios to confirm capacity plans and size new model launches
- Build agents that turn fast-changing repo metadata into maintained dimension tables
Apple | Cupertino, CA (2017 - 2025)
- Staff Data Scientist (Engineering) (Oct 2023 - Oct 2025)
- Forecasted demand for Private Cloud Compute, Apple’s GPU infrastructure for remote AI workloads
- Defined platform usage metrics that drove engineering and capacity decisions
- Worked with SREs and platform engineers to plan regional GPU deployments
- Built demand models from device growth, feature adoption, and latency data
- Investigated demand spikes and explained what changed and why
- Estimated workloads for new AI features using seed performance data before launch
- Senior Data Science Manager (Business Operations) (Oct 2021 - Sep 2024)
- Managed 4 data scientists and 5 contractors focused on cloud infrastructure costs
- Found expensive workloads and recommended fixes, cutting costs even as usage grew
- Mentored junior scientists, recruited new hires, and led cross-team projects
- Developed R packages, reviewed code, and advised on statistical methods
- Pushed for better documentation, reproducibility, and technical pragmatism
- Data Science Manager (Finance) (Oct 2020 - Sep 2021)
- Led a data science team analyzing infrastructure costs across finance and engineering
- Senior Data Scientist (Finance) (Oct 2018 - Sep 2020)
- Modeled storage growth to inform leadership infrastructure investment decisions
- Deployed time-series forecasts, dashboards, and reports used across finance and engineering
- Used difference-in-differences to estimate causal impact of service migrations on storage
- Authored 10+ R packages for telemetry and infrastructure data (e.g. graphiter, hubbler)
- Led recruiting, grew team from 1 to 7 data scientists
- Data Scientist (Finance) (Nov 2017 - Sep 2018)
- Modeled iCloud storage growth and its long-term costs
Transloc | Durham, NC (2016)
- Data Scientist (Sept 2016 - Dec 2016)
- Worked part-time, refining metrics available to the product team into understandable, actionable information using R
- Data Science Intern (May 2016 - Aug 2016)
- Developed products and packages that simplify working with and understanding transit data. Learned and experimented with implementing agile development philosophy for data science
Duke University | Durham, NC (2015 - 2017)
- Research Assistant, Durham Children’s Data Center (Jan 2017 – May 2017)
- Analyzed Durham County Social Services data for DCDC research projects with Dr. Ken Dodge, Dr. Beth Gifford, and Dr. Anna Gassman-Pines
- Research Assistant, Duke-UNC BECR Center (Jan 2015 – May 2016)
- Investigated food purchasing behavior, food-related health outcomes, and food assistance policy (SNAP/WIC). Uncovered trends to help BECR design behavioral nudges for improving food choices. Helped write papers and proposals, providing analytical results (graphics, tables, etc.)
- Teaching Assistant, Sanford School of Public Policy (Spring 2015)
- Co-taught PubPol 590: Applied Big Data Science Energy Data Analytics and Policy with Dr. Matthew Harding. Students learned introductory causal inference theory and data analysis with large datasets using Python. Designed homework assignments, projects, and exams. Goal was for students to finish with capacity to do basic energy company consulting
EDUCATION
- Ph.D. Public Policy & Economics – Aug 2019
Duke University, Sanford School of Public Policy (Durham, NC)
- M.A. Economics – Feb 2013
Georgetown University (Washington, DC)
- M.S. Applied Statistics – May 2011
California State University at Long Beach (Long Beach, CA)
- B.S. Applied & Computational Mathematics – Aug 2006
University of California at Irvine (Irvine, CA) Orange Coast Community College (Costa Mesa, CA)
PROGRAMMING
- R – Expert. 13+ years of experience. First love. CRAN package author and contributor.
- Python – Advanced. 7+ years. Current language. Pipelines, ETL (PySpark), agents. Taught courses with it. ♥ polars.
- SQL – Expert. 10+ years. Connoisseur of PostgreSQL, Presto/Trino, and HiveQL docs. Never nester.
- bash – Expert user, proficient developer. 10+ years. Mostly for processing raw text files. CLIs > GUIs.
- LLMs – Developer and daily user. 3+ years. Built production agents; Claude preferred for coding.
- Others – dbt, git, docker. Dabbled with Julia, Stan, and Scala. Reluctantly capable analytics engineer.
PACKAGES
- gtfsr – for mapping and validating GTFS data.
DISSERTATION
- “Doubled SNAP Dollars and Nudges: An Analysis of Two Pilot Programs Aimed at Increasing the Purchase of Healthy Foods”
- My dissertation used difference-in-difference-in-differences models and randomized controlled trials to measure the economic impacts of food policy interventions using transaction-level data from grocery stores and convenience stores. I analyzed controlled experiments to test hypotheses about consumer behavior and provided evidence-based recommendations to policymakers based on causal inference methods.
- The first pilot program was a financial incentive called “Double Up Food Bucks” that targeted SNAP participants, encouraging fresh produce purchases by effectively doubling purchasing power. The second pilot program consisted of three behavioral nudges designed to increase banana purchases in convenience store environments.
AWARDS AND HONORS
- Merit Based Fellowship (Economics), Georgetown University (2011 – 2015)
- 2010 STIPDG Outstanding Intern Award, US Department of Transportation (Summer 2010)
- Fletcher Jones Fellowship, University of California at Irvine (Awarded but chose not to pursue Ph.D.) (Fall 2006)
- Dean’s Honor List, University of California at Irvine (Winter 2005 – Spring 2006)
- Early Transfer (Academic Excellence), Orange Coast College to UCI (Winter 2005)
- President’s List for Academic Excellence, Orange Coast College (Fall 2003 – Fall 2004)
- Community College Scholarship Recipient, Hispanic Education Endowment Fund (HEEF) (Fall 2004)
FUN FACTS
- Dual citizen of Chile and the US
- Native Spanish-speaker
- Songwriter and lead vocalist in a SoCal pop punk band
- Goaltimate (aging ultimate frisbee) player
- Former AmeriCorps NCCC Team Leader
- Cooks a mean kimchi & onion frittata