Site Reliability Engineer Intern (f/m)
Ledger
Your mission
- As an SRE Intern, you will contribute to the continuous improvement of our systems and developer platform by taking on the following responsibilities:
- Assist in developing and maintaining our CI/CD pipelines on Kubernetes and AWS, ensuring reliability and efficiency.
- Automate infrastructure provisioning and configuration management using tools such as Terraform and ansible.
- Develop and implement monitoring and logging solutions to enhance observability and maintain the health and performance of our cloud infrastructure.
- Collaborate with the development team to deploy and manage web applications efficiently and securely.
- Contribute to the integration and customization of Backstage, enhancing the developer experience by building plugins, automating workflows, and streamlining tools.
- Participate in continuously improving DevOps practices, tools, and processes to support Ledger’s mission of delivering reliable and scalable systems.
You will learn
- This internship offers a unique opportunity to gain hands-on experience in the following areas:
- Work in a real-world DevOps and SRE environment alongside experienced professionals.
- Explore and use a wide range of AWS services, including EC2, S3, RDS, ECS, Lambda, and EKS.
- Build expertise in Infrastructure as Code (IaC) using Terraform to provision and manage cloud resources.
- Develop and implement effective monitoring and logging systems to maintain application reliability and performance.
- Gain deep knowledge of Backstage, an open-source developer portal, and learn how to customize and extend its capabilities.
You are:
- We are looking for a motivated and driven individual with the following qualifications:
- A Master’s student – this is non-negotiable, and you must be able to provide a 6-month internship agreement issued by your school.
- Fluent in English and French.
- Customer-focused with the ability to identify and understand both internal and external customer needs.
- Proficient in Unix/Linux environments and experience with tools such as Git, Python, and Terraform. Familiarity with Kubernetes, AWS cloud solutions, CI/CD tools (e.g., ArgoCD), and configuration management tools like Ansible is a plus.
- Knowledgeable about observability practices, with experience implementing and managing logging, monitoring, and alerting frameworks using tools like Datadog or Prometheus/Grafana/Loki.
- Strong problem-solving skills and a creative mindset for developing and implementing solutions to complex challenges.
- Collaborative, with experience working across teams and building strong relationships within an organization.
- Excellent in presentation and written communication, with a proactive approach to learning and improvement.
What's in it for you?
- Flexibility: Partial remote work possible
- Social: Frequent social events, snacks and drinks
- Transport: Ledger reimburses part of your preferred means of transportation
- Lunch vouchers with Swile
- Vacation: 1 day off for every full month of work, in addition to national holidays