Data Operations

Odyssey is building Hollywood-grade generative worlds, where beautiful scenery, characters, lighting, and motion can be both generated and directed. Our mission is to deliver a better way to create films and games.

Odyssey was founded in late 2023 by Oliver Cameron (ex-Cruise, Voyage, Y Combinator alum) and Jeff Hawke (ex-Wayve, Willow Garage, Oxford AI PhD), two veterans of self-driving cars and AI. They’ve since recruited a world-class team of experienced ML researchers from Cruise, Waymo, Wayve, Tesla, Microsoft, Meta, and NVIDIA; lead computer graphics researchers from EA, Ubisoft, and Valve; and technical artists behind Hollywood blockbusters like Dune, Godzilla, Avengers, and Jurassic World. Odyssey intends to become Hollywood’s AI team.

Odyssey has raised venture capital from GV, EQT Ventures, Air Street Capital, DCVC, Elad Gil, Garry Tan, Soleio, Jeff Dean, Kyle Vogt, Qasar Younis, Guillermo Rauch, Soumith Chintala, and researchers from OpenAI, DeepMind, Meta, Midjourney, and Pixar.

Who You Are

You have 5+ years of experience in ML data operations, logistics, or a related field.
You have a proven track record of constructing large, leading ML datasets.
You understand how to iteratively improve a dataset to best optimize for ML model performance.
You have strong project management skills, and have demonstrated the ability to decisively prioritize and coordinate across multiple teams.
You have excellent communication and interpersonal skills, with experience working closely with researchers.
You are familiar with state-of-the-art data tooling, data management, and machine learning infrastructure.

What You’ll Do

You’ll lead the end-to-end logistics of data acquisition and collection, ensuring that we’re hitting dataset volume targets for training runs.
You’ll own our real-world data collection efforts, building and leading a team of data collectors that travel the world to collect valuable real-world data.
You’ll own our synthetic data collection efforts, building and leading a team of artists who construct large-scale synthetic training data.
You’ll work closely with ML researchers and external vendors to detail data requirements, enabling large-scale data sourcing, labeling, and quality control.
You’ll develop and enforce stringent data quality standards, implementing QA processes to ensure that data meets ML-readiness benchmarks.

Data

London, United Kingdom

Santa Clara, CA

Share on: