What you'll do
- Design and implement scalable infrastructure for custom AI evaluation systems
- Build tools and platforms for agentic workflows and alignment techniques
- Develop data processing pipelines for evaluation datasets and feedback loops
- Create APIs and interfaces for real-world agent deployment and monitoring
- Optimize performance and cost of cloud-based ML workloads
- Contribute to open-source projects and research implementations
Qualifications
- Strong experience building large-scale LLM-powered or agentic systems in production environments
- Strong research taste, especially in evaluation design, post-training methods, and model improvement workflows
- Desire to own problems end-to-end and deliver measurable outcomes
- Ability to work directly with customers
- Obsession with detail, code quality, and clean, modular system design
- Strong ownership mentality and ability to thrive in a fast-paced startup environment
- Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent practical experience)
Our Core Values
- Customer Obsession - We start with the customer and work backwards. We aim to earn trust through consistent delivery, thoughtful listening, and by obsessing over customers.
- Intellectual Honesty - We operate with high trust and low ego. Ideas matter more than titles, and we communicate openly and directly while assuming good intent, even in strong disagreement.
- Bias for Action - We set high standards and move quickly to meet them. We prefer building and learning with customers over debating in the abstract, and we iterate based on real feedback.
- Extreme Ownership - We take responsibility for outcomes, not just tasks. Ownership means seeing problems through to completion and ensuring solutions truly work in practice.
Benefits and perks
- Competitive salary plus meaningful equity package
- Comprehensive medical benefits and generous PTO
- Flexible work arrangements
- Direct impact on company direction and technical decisions
- High ownership and the opportunity to make a career-defining impact
As a founding member, you’ll help define the technical foundation of NeoSigma. Your scope will grow with the company, from owning core systems end-to-end to shaping architecture, hiring, and engineering culture. This role has a natural path toward technical leadership or engineering management as the team scales.
About Us
NeoSigma is a product-driven research lab building the intelligence layer that helps close the feedback loop between your customers, products, and AI systems.
We are a small, intensely technical team of researchers and engineers who have trained frontier-scale models and widely used AI products and agents at MIT, Parallel Web, Essential AI, Apple, and Amazon.