About Luupli
Luupli is a social media app that has equity, diversity, and equality at its heart. We believe that social media can be a force for good, and we are committed to creating a platform that maximises the value that creators and businesses can gain from it, while making a positive impact on society and the planet. Luupli has been in internal testing since June 2024 and is preparing for commercial beta testing from December 2024, with the hope of launching fully in summer 2025.
Job Title: Site Reliability Platform Engineer
Our team is made up of passionate and dedicated individuals who are committed to making Luupli a success.
Role Description:
We are seeking a talented and experienced Site Reliability Engineer (SRE) to join our team. As an SRE, you will play a crucial role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure and services, primarily hosted on AWS. If you have a passion for problem-solving, a deep understanding of AWS services, hands-on experience with Terraform, and proficiency in scripting with Python or Bash, we invite you to apply for this exciting opportunity.
Role and Responsibilities:
1. Infrastructure Design and Automation:
- Collaborate with software engineering and operations teams to design, build, and maintain cloud-based infrastructure using AWS and Terraform.
- Implement and enhance infrastructure-as-code (IaC) practices using Terraform to ensure reproducibility and scalability of infrastructure components.
2. Monitoring and Incident Management:
- Develop and maintain monitoring solutions to proactively identify performance bottlenecks, system outages, and other potential issues.
- Participate in incident response and root cause analysis efforts to drive continuous improvement and prevent future incidents.
3. Reliability and Performance Optimization:
- Optimise system performance, reliability, and cost efficiency through continuous monitoring, performance tuning, and capacity planning.
- Identify opportunities to automate manual processes and improve system resilience.
4. Scripting and Automation:
- Utilise Python or Bash scripting to create and maintain automation tools for various operational tasks and deployments.
- Implement and improve continuous integration and continuous deployment (CI/CD) pipelines.
5. Security and Compliance:
- Collaborate with security teams to implement best practices for securing cloud infrastructure and services.
- Ensure compliance with relevant industry standards and regulations.
6. Deployment and Release Management:
- Support CI/CD pipelines for application deployments and updates.
- Contribute to the design and implementation of deployment strategies that promote zero-downtime releases.
7. Documentation and Knowledge Sharing:
- Maintain clear and up-to-date documentation for infrastructure configurations, processes, and incident resolution procedures.
- Participate in knowledge sharing with team members to enhance overall expertise and skill sets.
Requirements:
1. Education and Experience:
- Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent practical experience).
- Proven experience as a Site Reliability Engineer or in a similar role.
2. Technical Skills:
- Extensive experience with Amazon Web Services (AWS) and its core services (EC2, S3, RDS, IAM, etc.).
- Strong proficiency in infrastructure-as-code (IaC) tools, with a focus on Terraform.
- Proficient in scripting with Python or Bash for automation and operational tasks.
- Solid understanding of networking principles and protocols.
- Knowledge of CI/CD pipelines and related tools.
3. Problem-Solving and Analytical Abilities:
- Ability to diagnose and resolve complex technical issues in a fast-paced environment.
- Analytical mindset to proactively identify potential system weaknesses and performance bottlenecks.
4. Collaboration and Communication:
- Strong teamwork and collaboration skills to work effectively with cross-functional teams.
- Excellent verbal and written communication skills.
Compensation:
This is an equity-only position, offering a unique opportunity to gain a stake in a rapidly growing company and contribute directly to its success.
Engineering
Remote (United Kingdom)