Site Reliability Engineer
Offchain Labs
This job is no longer accepting applications
See open jobs at Offchain Labs.See open jobs similar to "Site Reliability Engineer" Blockchain Association.Who You Are
- Eager to dive into blockchain technology, even if it’s new territory
- Enjoy solving infrastructure problems in unconventional ways and thinking beyond standard patterns
- Use tools like k9s or ArgoCD for speed and abstraction, but comfortable dropping into YAML, logs, or low-level debugging when things go sideways
- Experienced with GitOps-style systems and treating both infrastructure and application delivery as code
- Have scaled deployment automation using patterns like ArgoCD ApplicationSets or similar tooling
- Curious about how things work under the hood and not satisfied with surface-level fixes
- Comfortable in Linux, fluent in shell scripting, and productive in languages like Python or Go
- Comfortable operating within a cloud platform (e.g., AWS, GCP, Azure), with a strong understanding of the underlying components making it easy to adapt to or migrate across providers
- Participated in an on-call rotation, responding to incidents, troubleshooting under pressure, and driving postmortems to improve system reliability over time
- Design systems with security in mind, applying principles like least privilege and threat modeling
- Bring a strong technical foundation, excellent problem-solving skills, and a genuine commitment to high-quality work
- Take ownership, collaborate openly, and contribute to a culture of clarity, curiosity, and continuous improvement
What You've Done
- Operated production Kubernetes clusters and built scalable, declarative infrastructure using Terraform or similar tools
- Deployed and maintained Kubernetes environments, managed system components, and troubleshot applications running on the platform
- Designed CI/CD workflows with ArgoCD, GitHub Actions, CodeBuild, or similar tools, covering both infra and app deployments
- Designed and operated observability systems using time-series metrics, logs, and dashboards with tools like Prometheus, Loki, Mimir, Grafana, and CloudWatch
- Diagnosed tough networking and storage issues across complex, distributed systems
- Implemented secure-by-default infrastructure and contributed to architecture reviews and threat models
- Automated operational workflows using scripting or programming in Python, Go, or Bash
- SREs come from a wide range of backgrounds. If you bring strong problem-solving skills, curiosity, and a drive to build reliable systems, we’d love to hear from you, even if your experience doesn’t perfectly match every bullet point
Perks:
- Remote-first global workforce + NY office
- Annual company offsite + team onsites
- Professional reimbursement program (facilitates industry conference attendance, certifications, and more)
- Medical, dental & vision coverage (US + some other countries)
- 401k retirement plan + company match (US only)
- Wellness stipend
- Home office set up / ergonomic equipment program
This job is no longer accepting applications
See open jobs at Offchain Labs.See open jobs similar to "Site Reliability Engineer" Blockchain Association.