Site Reliability Engineer (IT)

CGI
London

We are seeking an experienced and proactive Site Reliability Engineer (SRE) to join a team supporting multiple data product and platform groups. This role is focused on improving the reliability, scalability, observability, and operational performance of critical data-driven platforms and services across complex production environments. The successful candidate will work closely with engineering, platform, and support teams to strengthen monitoring and alerting capabilities, improve logging and traceability, troubleshoot production incidents, support deployments, and automate operational processes wherever possible. The environment includes Kubernetes, Helm, the ELK stack, and a strong focus on modern Site Reliability Engineering practices across cloud and platform services. This is a hands-on technical role suited to someone who thrives in fast-paced operational environments and is passionate about reliability engineering, automation, and continuous improvement. The role requires strong collaboration with both client stakeholders and engineering teams to ensure platform stability, operational excellence, and high service availability Candidate profile:

  • Support, maintain, and improve highly available production platforms and services across cloud and containerised environments.
  • Manage and support Kubernetes clusters and Helm-based deployments across multiple environments.
  • Implement and enhance monitoring, alerting, logging, and observability solutions to improve platform reliability and operational visibility.
  • Investigate incidents, analyse logs, identify root causes, and drive timely resolution of production issues.
  • Participate in incident response, post-incident reviews, and continuous operational improvement initiatives.
  • Automate operational tasks and repetitive support activities to reduce manual effort and improve platform efficiency.
  • Work closely with engineering and data platform teams to improve system resilience, scalability, deployment reliability, and operational maturity.
  • Develop and maintain operational documentation, support procedures, runbooks, and troubleshooting guides.
  • Contribute to reliability engineering practices including proactive monitoring, service health management, and operational readiness.
  • Support deployment activities, release processes, and production change management activities.
Required qualifications to be successful in this role
  • Strong commercial experience in Site Reliability Engineering, Platform Engineering, DevOps, or Production Support environments.
  • Strong hands-on experience with Kubernetes and Helm in enterprise or production environments.
  • Proven experience supporting mission-critical production platforms and operational support functions.
  • Strong hands-on experience with the ELK stack (Elasticsearch, Logstash, Kibana) for logging, monitoring, troubleshooting, and operational analysis.
  • Demonstrated capability in log analysis, incident investigation, troubleshooting, and root cause analysis.
  • Strong understanding and practical experience with core SRE practices including:
Monitoring and alerting Incident management and response Root cause analysis and post-incident reviews Automation and operational improvement Production support and reliability engineering
  • Experience working with data platforms, analytics platforms, or data product teams would be highly advantageous.
  • Experience with scripting and automation tools such as Bash, Python, or similar technologies is desirable.
  • Exposure to CI/CD pipelines, Infrastructure as Code, and cloud-native environments would be beneficial.
  • Strong communication, stakeholder engagement, and collaboration skills.
  • Ability to work effectively in fast-paced support environments and manage competing priorities under pressure.
Security Clearance
  • Resource must be willing and able to work onsite at the client location five days per week.
  • Candidate must already hold current HLC clearance (mandatory requirement).
  • Previous experience working within secure, government, defence, or highly regulated environments will be highly regarded.
  • Due to client security requirements, only candidates meeting the required clearance criteria will be considered.

#LI-CGISDI

Posted 2026-05-13

Recommended Jobs

Female Head of PE - Permanent Role | Barnet

Marchant Recruitment
Barnet, Greater London

A secondary school in Barnet is seeking a Female Head of PE to lead its Physical Education department from September. This is an opportunity for an experienced PE teacher or current middle leader …

View Details
Posted 2026-05-30

Senior Operations & Logistics Manager

RIXO
London

ABOUT RIXO: A rail of vintage dresses. A London flat. Two best friends who wanted to build the brand they couldn't find. That was 2015. Today, RIXO is six stores across London, New York, and Irela…

View Details
Posted 2026-05-01

EYFS Teacher - Rainham

Marchant Recruitment
Havering, Greater London

We are seeking a nurturing, enthusiastic, and dedicated EYFS Teacher to join a welcoming primary school in Rainham, starting as soon as possible. This is an excellent opportunity for a teacher who is…

View Details
Posted 2026-01-28

Private Family Solicitor

London

Top-of-market salary with no BD, no advocacy and exceptional work–life balance Join a boutique Legal 500 family law firm expanding into Birmingham Fast-track progression to Team Leader or Partn…

View Details
Posted 2025-12-15

Data Scientist (Junior/Senior)

Central London

We are looking for a talented and motivated Data Scientist (Junior or Senior) to join our client’s dynamic team. The ideal candidate will be responsible for data analysis and Python programming, with…

View Details
Posted 2026-04-06

Site Manager - Elite Secondary School - Kensington

Marchant Recruitment
London

A well-regarded Good secondary school in Kensington is seeking a reliable and experienced Site Manager to oversee the maintenance, safety, and day-to-day operations of the school site from April…

View Details
Posted 2026-03-27

Manager - Warehouse Automation & Solution Design

BearingPoint United Kingdom
London

BearingPoint is one of Europe’s leading independent, partner led technology services and business consulting organisation and a highly regarded strategic partner for our clients. We are looking fo…

View Details
Posted 2026-05-15

Digital Construction Manager - Tier 1 Contractor

Johnson BIM
London

A great opportunity to join a successful, BIM centric, major contractor.  Step up, take on responsibility for good BIM across multiple projects and progress your career with this expanding digital …

View Details
Posted 2026-05-28

Mandarin speaking Job - Solution Manager / Assistant Solution Manager - rj

People First Recruitment
Central London

Please follow us on WeChat to see all our Cantonese and Mandarin jobs, interview tips and London news: Your New Job Title: Mandarin speaking Solution Manager / Assistant Solution Manager …

View Details
Posted 2026-03-06

Receptionist - Outstanding Secondary School - Havering

Marchant Recruitment
Havering, Greater London

Receptionist – Outstanding Secondary School – Havering Start Date: As soon as possible Contract: Full-time, Permanent Salary: Paid to scale We are seeking a professional and welcoming …

View Details
Posted 2026-04-16