DevOps Lead Engineer

About Formant


Founded in 2017, Formant is the leading cloud-based platform for managing fleets of robots. Formant’s platform enables companies to operate, observe, and analyze robots from anywhere, meeting the growing need of robotics companies looking to deploy into new environments and scale their fleets.  

We are inspired by the unique opportunities and challenges of connecting robots to each other, people and the cloud. We believe that more effective infrastructure can help unlock the shared potential of humans and robots and accelerate the successful deployment of autonomous systems throughout society. 

As a team, we value humility, honesty, and creativity. We have a friendly, remote culture full of people from diverse backgrounds. We love solving problems and take pride in our craft.

About the role

As the DevOps Lead Engineer, you will play a pivotal role in driving the evolution of our software delivery processes by implementing and optimizing CI/CD pipelines. Your responsibilities will include orchestrating effective collaboration between development and operations teams, automating key processes, and leveraging cutting-edge technologies to enhance efficiency and reduce operational costs. The ideal candidate will have a deep understanding of cloud infrastructure, containerization, and a proven track record of designing and implementing scalable and resilient architectures. As a leader, you will be responsible for fostering a culture of continuous improvement, innovation, and cross-functional collaboration. If you are passionate about transforming software delivery practices and thrive in a fast-paced, collaborative environment, we encourage you to apply for this exciting opportunity.


If you are a seasoned DevOps professional looking to lead and shape our platform, we invite you to apply for the position of DevOps Lead Engineer.

What you'll do

  • Leadership

    • Lead for the DevOps team

    • Design, build, and maintain our cloud infrastructure using tools such as Kubernetes, Helm, Terraform, and Ansible

    • Work with different teams to design and implement infrastructure solutions that meet their needs

    • Promote and Implement best practices

  • Maintain

    • Monitor and maintain the health of our infrastructure and applications using tools such as Datadog, and Sentry

    • Troubleshoot and resolve complex issues, ensuring the reliability and performance of our systems

    • Participate in on-call rotation to support production systems and services

  • Optimize

    • Continuously evaluate and improve the infrastructure and deployment processes to ensure optimal performance, security, and scalability

    • Optimize infrastructure costs through analysis and implementation of cost-saving measures

    • Stay abreast of industry trends and integrate cutting-edge technologies to continuously improve our DevOps practices.

  • Automate

    • Build and maintain CI/CD pipelines 

    • Implement and manage deployment strategies such as Canary and Blue-Green to ensure safe and smooth releases of new features and updates

    • Automate manual processes and tasks to improve efficiency and reduce errors

    • Drive automation efforts to enhance efficiency across development and operations workflows

Qualifications

  • Bachelor’s degree in Computer Science or a related field

  • 8+ years of experience as a DevOps Engineer, Site Reliability Engineer, or equivalent

  • 5+ years of experience with Amazon Web Services (AWS) or Microsoft Azure

  • 3+ years of experience with production Kubernetes clusters

  • Advanced proficiency working with Linux systems

  • Advanced proficiency working with scripting languages such as Bash or Python

  • Advanced proficiency working with CI/CD pipelines such as Github Actions, GitLab CI/CD, Jenkins

  • Experience working with build systems such as Bazel or Buck

  • Experience with distributed systems and microservices architectures

  • Experience with on-premises infrastructure

  • Experience with monitoring tools such as Prometheus, Grafana, Datadog, and Sentry

  • Experience with networking concepts and protocols, such as TCP/IP and DNS

  • Familiarity with incident management tools such as PagerDuty

  • Familiarity with security and compliance best practices, such as SOC2 or ISO27001

  • Ability to work in a fast-paced, startup environment

  • Strong problem-solving skills and ability to work independently

  • Excellent communication and collaboration skills

Engineering-301

Remote (United States)

Share on:

Terms of servicePrivacyCookiesPowered by Rippling