Odyssey is pioneering the next frontier of artificial intelligence: generative worlds. By learning from the real-world, they’re training a new kind of visual generative model capable of generating cinematic worlds that you can direct—from scenery and characters to lighting and motion. Odyssey’s mission is to reinvent film, gaming, and more.
Odyssey was founded in late 2023 by Oliver Cameron (Cruise, Voyage) and Jeff Hawke (Wayve, Oxford AI PhD), two veterans of self-driving cars and AI. They’ve since recruited a world-class team of ML researchers from Cruise, Waymo, Wayve, Tesla, Microsoft, Meta, and NVIDIA; lead computer graphics researchers from EA, Ubisoft, and Valve; and technical artists behind Hollywood blockbusters like Dune, Godzilla, Avengers, and Jurassic World.
Odyssey has raised venture capital from GV, EQT Ventures, Air Street Capital, DCVC, Elad Gil, Garry Tan, Soleio, Jeff Dean, Kyle Vogt, Qasar Younis, Guillermo Rauch, Soumith Chintala, and researchers from OpenAI, DeepMind, Meta, Midjourney, and Pixar.
The Role
We're seeking an ML Platform engineer to design and implement our ML/data platform strategy. This role has different titles at many companies and will be a mix of building the infrastructure, tooling, and data pipelines that enable our researchers to efficiently work with 3D, visual, and structural data, conduct experiments, and seamlessly move models to production. You'll have significant autonomy in technical decisions and the opportunity to grow into a technical leadership role as we scale.
A Typical Week
- Design and implement scalable data pipelines for processing large-scale 3D assets
- Collaborate with ML researchers to optimize data preprocessing and training workflows
- Make key architectural decisions about our data platform infrastructure
- Improve our Kubernetes-based data processing, training, and serving infrastructure
Core Responsibilities
- Design and implement our Kubernetes-based ML data platform from the ground up
- Build scalable data pipelines that support both research experimentation and production deployment
- Create systems for dataset versioning, experiment tracking, and model lifecycle management
- Develop tools and interfaces that make it easy for researchers to find, enrich, and version complex 3D and visual data
- Establish best practices for reproducibility and production readiness
- Collaborate closely with ML researchers to understand and optimize their workflows
Technical Scope
- Work with large-scale 3D datasets, including 3D scenes, real-world scans, gaussian splats, video, and images. (PB-scale)
- Design and manage multi-node, multi-cloud Kubernetes clusters for distributed training
- Implement monitoring and observability for ML workflows
- Help design integration points with industry-standard creative software, e.g., Maya, Houdini, etc.
- Support real-time inference requirements for creative tools
Required Skills & Experience
- 5+ years of software engineering experience, with significant work in data platforms
- Strong Python development and system design expertise
- Deep experience with data pipeline development and ETL processes
- Production Kubernetes experience and container orchestration expertise
- Hands-on experience with data-oriented ML infrastructure tools (experiment tracking, feature stores, model registries)
- Proficiency with cloud platforms (AWS/GCP/Azure)
- Experience with data versioning and experiment tracking systems
- Understanding of ML workflows and researcher needs
Ideal Qualities
- Self-directed and comfortable with ambiguity
- Strong bias for action and pragmatic problem-solving
- Track record of extreme ownership in technical projects
- Excellent communication skills, especially with technical stakeholders
- Experience building systems from scratch in fast-paced environments
- Passion for enabling ML research and production excellence
Nice to Have
- Experience with video, image, or 3D data pipelines for ML/AI
- Experience with distributed computing frameworks (we use Ray) or workflow orchestration (we use Flyte)
- Familiarity with vector databases (e.g., LanceDB) and similarity search
- Background in computer graphics, VFX, or gaming industries
- Experience at AI/ML companies or research labs
- Contributions to open-source data/ML tools
- Experience building researcher-facing tools
Growth Opportunities
- Help build our data platform engineering team as we scale
- Define our technical strategy for data platform and infrastructure
- Establish key partnerships with data platform framework open source projects and vendors
- Shape our technical hiring strategy
- Deep engagement with the broader data and ML infrastructure community