Senior Site Reliability Engineer
Orgvue is a leading organizational design and planning software platform that captures the power of data visualization and modelling to build more adaptable and better-performing organizations. HR, finance and business leaders use Orgvue for actionable insight and analysis that helps them make faster workforce decisions in a constantly changing world.
Orgvue is used by the world’s largest and best-known enterprises and management consulting firms to visualize and confidently build the businesses they want tomorrow, today. The company is headquartered in London, with offices in Philadelphia, The Hague, Toronto, and Sydney.
We are seeking a Principal Site Reliability Engineer who will be a senior technical leader focused on scaling and hardening our AWS- and Kubernetes-based infrastructure.
Role
In this role you will work across product, platform, and operations teams to ensure our systems are reliable, observable, and resilient, even at scale.
This role combines hands-on technical capability with strategic vision, helping us build a world-class reliability culture and a robust engineering foundation for growth. We're looking for someone who has technical expertise, is a great communicator and enjoys collaborating across multiple teams.
Responsibilities
- Define and enforce SLOs, SLIs, and error budgets across critical services
- Craft and implement a cloud infrastructure and tooling strategy
- Work across the organization to level up SRE practices
- Help implement robust observability (metrics, logs, and traces) using our observability tool
- Guide the team in building automated, self-healing systems
- Own and evolve our incident response processes, including on-call practices and post-mortem culture
- Mentor engineers across the org on best practices in reliability, operational readiness, and scalable infrastructure
- Drive Infrastructure as Code (IaC) using Terraform, Kubernetes, CloudFormation and GitOps practices
- Collaborate closely with security, DevOps, and software teams to ensure compliance, scalability, and operational excellence
- Evaluate and introduce tools, patterns, and practices that improve the performance and reliability of our SaaS platform
Requirements
- Demonstrable experience leading SRE transformations
- Deep hands-on expertise with Kubernetes (EKS preferred) in production environments
- Strong experience with AWS core services (EC2, EKS, RDS, S3, ALB/NLB, IAM, CloudWatch, etc.)
- Expert in Infrastructure as Code using tools such as Terraform, with knowledge of GitOps workflows
- Strong background in observability: metrics, visualization, logging, and tracing
- Understanding of automation, SDLC, CI/CD pipelines, deployment automation, and blue/green or canary releases
- Proven experience with incident management, disaster recovery planning, root cause analysis, and post-incident reviews
Benefits
- Hybrid working - 1+ days a week in the London office
- Wellbeing: Sanctus Coaching, Virtual fitness sessions, Wellbeing webinars, Annual Wellbeing day
- Subsidised Gym Membership
- Private Medical Insurance (including Dental and Vision) and Life Assurance
- 25 days holiday (increasing to 30 days at a rate of 1 extra day per year)
- Employer pension contribution of 5% of your gross salary, if you contribute a minimum of 3%
- Season Ticket Loan
- Cycle to Work Scheme
- Annual Discretionary Bonus
'Here at Orgvue we promote individualism and a diverse workforce to build on our future success'