Site Reliability Engineer - Google CloudGoogle Cloud Platform (GCP) Managed Service specialistsUp to £100k DOE FRG are working with a specialist GCP Partner who are experts in Google Cloud Migration, Optimisation, Cost Optimisation and Managed Service. They are seeking a proactive SRE experienced in Google Cloud to take ownership of this function and build out within the business.
Role & Responsibilities- Ensure high availability and reliability of software/infrastructure, improve SLA's and improving reliability of systems
- Design, implement, maintain monitoring and alerting (mainly in Datadog) and GCP Cloud Monitoring
- Debug and troubleshooting
- On call rotation to respond o incidents
- Develop and maintain runbooks/playbooks for incident response
- Testing
- Develop and maintain Infrastructure as Code (Terraform) and Config Management (Ansible, Helm..)
Skills & Qualifications- GCP experience is mandatory for this vacancy, notable services include Cloud Run, BigQuery, GKE etc)
- 3+ years experience as an SRE
- CI/CD
- Experience setting up and managing monitoring solutions
This vacancy is for UK based candidates only, no sponsorship provided, please do not apply if you are not a resident of the UK or do not have right to work in the UK without sponsorship. Office travel is once or twice a month.
Please apply below for more information or reach out directly to s.mckay@frgconsulting.com