About Us

We are dedicated to being the go-to data layer for building AGI. The future of intelligence won’t be defined by compute alone; it will be shaped by the quality and richness of the data these systems learn from.

At Sumo, we believe data is the most under-invested and under-appreciated driver of AI progress. We give AI teams access to data that can’t be scraped from the open web and that makes the difference between incremental improvement and breakthrough capability.


Role Overview

At Sumo, we believe model quality starts with data quality. As Principal Scientist, you’ll lead the evaluation, curation, and optimization of the large-scale datasets that power real-world AI systems.

This is a senior role at the core of our mission: shaping how we define, measure, and ensure “high-quality data” in practice. You’ll design and apply statistical and ML-driven methods to assess diversity, reduce bias, and improve informativeness in training data. You’ll also provide leadership across research and engineering, establishing best practices for data quality while collaborating with top AI labs and startups alike.

This is an ideal role for someone with a strong research background—PhD or equivalent experience—who is passionate about advancing the frontier of data-centric AI and eager to have outsized impact in a fast-moving environment.


Key Responsibilities


Must Haves