Principal Site Reliability Engineer
Orgvue is a leading organizational design and planning software platform that harnesses the power of data visualization and modelling to build more adaptable, better-performing organizations. HR, finance and business leaders use Orgvue for actionable insight and analysis that helps them make faster workforce decisions in a constantly changing world.
Orgvue is used by the world’s largest and best-known enterprises and management consulting firms to visualize and confidently build the businesses they want tomorrow, today. The company is headquartered in London, with offices in Philadelphia, The Hague, Toronto, and Sydney.
We are seeking a Principal Site Reliability Engineer who will be a senior technical leader focused on scaling and hardening our AWS- and Kubernetes-based infrastructure.
Role
In this role you will work across product, platform, and operations teams to ensure our systems are reliable, observable, and resilient, even at scale.
This role combines hands-on technical capability with strategic vision, helping us build a world-class reliability culture and a robust engineering foundation for growth. We're looking for someone who has technical expertise, is a great communicator and enjoys collaborating across multiple teams.
Responsibilities
- Define and enforce SLOs, SLIs, and error budgets across critical services
- Craft and implement a cloud infrastructure and tooling strategy
- Work across the organization to level up SRE practices
- Help implement robust observability (metrics, logs, and traces) using our observability tool
- Guide the team in building automated, self-healing systems
- Own and evolve our incident response processes, including on-call practices and post-mortem culture
- Mentor engineers across the org on best practices in reliability, operational readiness, and scalable infrastructure
- Drive Infrastructure as Code (IaC) using Terraform, Kubernetes, CloudFormation and GitOps practices
- Collaborate closely with security, DevOps, and software teams to ensure compliance, scalability, and operational excellence
- Evaluate and introduce tools, patterns, and practices that improve the performance and reliability of our SaaS platform
Requirements
- Demonstrable experience leading SRE transformations
- Deep hands-on expertise with Kubernetes (EKS preferred) in production environments
- Strong experience with AWS core services (EC2, EKS, RDS, S3, ALB/NLB, IAM, CloudWatch, etc.)
- Expertise in Infrastructure as Code using tools such as Terraform, with knowledge of GitOps workflows
- Strong background in observability: metrics, visualization, logging, and tracing
- Understanding of automation, SDLC, CI/CD pipelines, deployment automation, and blue/green or canary releases
- Proven experience with incident management, disaster recovery planning, root cause analysis, and post-incident reviews
Benefits
- Hybrid working - 1+ days a week in the London office
- Wellbeing: Sanctus Coaching, virtual fitness sessions, wellbeing webinars, annual wellbeing day
- Subsidised Gym Membership
- Private Medical Insurance (including Dental and Vision) and Life Assurance
- 25 days holiday (increasing to 30 days at a rate of 1 extra day per year)
- Summer Fridays (half-day Fridays for the months of July and August)
- Employer pension contribution of 5% of your gross salary, if you contribute a minimum of 3%
- Season Ticket Loan
- Cycle to Work Scheme
- Annual Discretionary Bonus
'Here at Orgvue we promote individualism and a diverse workforce to build on our future success'