About Albert Invent
Albert Invent is a cutting-edge AI-driven software company headquartered in Oakland, California, on a mission to empower scientists and innovators in chemistry and materials science to invent the future faster. Every day, scientists in 30+ countries use Albert to accelerate R&D with AI trained like a chemist, bringing better products to market, faster.
Job Description
The Data Engineer will design, build, and optimize scalable data infrastructure and pipelines that enable efficient data collection, transformation, and analysis across the organization. The role is central to driving data architecture decisions, ensuring data quality and availability, and empowering analytics, product, and engineering teams with reliable, well-structured data to support business growth and strategic decision-making.
Responsibilities:
- Design, build, and maintain scalable lakehouse and data warehouse solutions using platforms such as Databricks and Snowflake.
- Develop and optimize data ingestion and transformation pipelines (batch and/or streaming) to enable reliable, high-quality analytics and downstream consumption.
- Implement data modeling standards (e.g., dimensional, data vault, or medallion patterns) to support reporting, BI, and advanced analytics use cases.
- Develop and maintain SQL and NoSQL databases, ensuring high performance, scalability, and reliability.
- Collaborate with the API team and Data Science team to build robust data pipelines and automations.
- Optimize database queries and tune performance to enhance overall system efficiency.
- Implement and maintain data security measures, including access controls and encryption.
- Monitor database systems and troubleshoot issues proactively to ensure uninterrupted service.
- Develop and enforce data quality standards and processes to maintain data integrity.
- Create and maintain documentation for database architecture, processes, and procedures.
- Stay updated with the latest database technologies and best practices to drive continuous improvement.
- Use monitoring and visualization tools such as Grafana to track database performance and health.
Requirements:
- Bachelor's degree in Computer Science, Engineering, or equivalent experience.
- 4+ years of experience in data engineering, with a focus on large-scale data systems.
- Demonstrated hands-on experience with lakehouse platforms and modern data warehousing solutions, such as Databricks and Snowflake (or equivalent technologies).
- Strong programming skills in Python or JavaScript, with the ability to write efficient, maintainable code.
- Proven experience designing data models and access patterns across SQL and NoSQL ecosystems.
- Hands-on experience with technologies like SQL, DynamoDB, S3, and Lambda services.
- Proficiency in SQL stored procedures, with extensive expertise in MySQL schema design, query optimization, and resolvers, along with hands-on experience building and maintaining data warehouses.
- Familiarity with observability stacks (Prometheus, Grafana, OpenTelemetry) and debugging production bottlenecks.
- Understanding of cloud infrastructure (preferably AWS), including networking, IAM, and cost optimization.
- Excellent communication and collaboration skills to influence cross-functional technical decisions.
Why Join Albert Invent
- Joining Albert Invent means becoming part of a mission-driven, fast-growing global team at the intersection of AI, data, and advanced materials science.
- You will collaborate with world-class scientists and technologists to redefine how new materials are discovered, developed, and brought to market.
- The culture is built on curiosity, collaboration, and ownership, with a strong focus on learning and impact.
- You will have the opportunity to work on cutting-edge AI tools that accelerate real-world R&D and solve global challenges, from sustainability to advanced manufacturing, while growing your career in a high-energy environment.
For more details, please visit www.albertinvent.com