Experience
Mountain View, CA
2023–2024
Staff Data Scientist, Lead, Core ML Data Science
- Assessed data quality at scale, measured rater performance, and automated the removal of anomalies from the training data for Google's large language models company-wide.
2017–2023
Staff Data Scientist, Google Research
- Led the development of a web app using a PaLM 2 language model to assist in the construction of complex SQL queries over a large data repository.
- Found previously undetected hits in a large-scale antimalarial drug screen via a novel application of machine learning to microscopy images of treated parasites.
- Devised and implemented an approach to counteracting batch effects in a cellular assay using optimal transport over Gaussian mixture models; code contributed to Optimal Transport Tools.
2014–2017
Staff Data Scientist, Verily / Google [x]
- Tech lead responsible for the development and implementation of Verily's pipelines for QA and processing of RNA-Seq data.
- Devised and implemented a novel algorithm for estimating levels of DNA contamination in RNA-Seq data.
2012–2014
Staff Data Scientist, Search Infrastructure
- Gave regular presentations to Senior Vice Presidents on the transition of Google's search users from desktop to mobile.
- Created long-term forecasts for global query volume to guide decisions on the placement of future data centers.
2007–2012
Staff Quantitative User Experience Researcher, Google AdWords
- Analyzed large datasets to provide insights that influenced AdWords and News product team decisions.
- Established a set of AdWords user metrics and developed the necessary infrastructure to gather data from diverse sources.
- Introduced a novel online help format for AdWords, secured resources for development, and showcased its effectiveness. The format was adopted by Gmail, AdSense, and Webmaster Tools.
- Won an internal business competition with an ad format tailored for an important niche market, guided its realization, and filed a related patent.
Collected Insight
San Francisco, CA; Raleigh, NC
2001–2007
Founder
- Core developer for Plone, an open-source content management system. Member of the board of directors of the Plone Foundation, 2004–2006.
- Created one of the leading graduate school guides based on data from government sources. Written in Ruby on Rails.
- Principal investigator for the Sigma Xi Postdoctoral Survey project, a national study of young scientists. Raised funds via grants from the Alfred P. Sloan Foundation and the Burroughs Wellcome Fund.
- Continued part-time 2007–2013.
4charity
San Francisco, CA
2000–2001
Senior Software Engineer
Redmond, WA
1999–2000
Researcher, Signal Processing Group, Microsoft Research
- Researched methods for the efficient storage and transmission of digital multimedia.
1998–1999
Researcher / Software Engineer, Semantic Platform Group
- Developed algorithms and software for analyzing and manipulating semantically annotated data.
Houston, TX
1997
Texas Instruments Visiting Assistant Professor, ECE Department
Hanover, NH
1996–1998
Assistant Professor, Mathematics Department
1994–1996
John Wesley Young Research Instructor, Mathematics Department
- Researched wavelet-based compression algorithms, multiwavelet constructions, joint source / channel coding algorithms for packet networks, and fast magnetic resonance image acquisition algorithms. Won two best paper awards.
Corporate Communication Group
New York, NY
1992
Senior Programmer, Interactive Technologies
- Designed and implemented an unencumbered, third-person virtual reality software platform in C++. Developed software for optical tracking of hand position, a sprite control object library, control software for video and audio hardware, and communications software to enable remote virtual interaction. Gold medalist in the New Media Magazine 1993 Multimedia Awards and at the 1992 New York Festivals, and silver medalist at the 1992 Association of Visual Communicators' CINDY awards. Patent