We are partnering with a leading organisation in the data and analytics space to recruit an experienced Senior Site Reliability Engineer. This is an opportunity to join a highly collaborative, technically strong SRE function working on large‑scale, cloud‑native platforms that support high‑volume, high‑speed data services.
The team is expanding due to increased workload, and this role will become the eighth member of an established, supportive engineering group. You’ll play a key part in driving cloud automation, improving system reliability, and supporting critical production environments.
Key Responsibilities
- Build, maintain, and improve AWS cloud infrastructure
- Develop automation using Terraform, Ansible, and Python
- Support incident response and troubleshoot performance issues
- Deliver routine maintenance, including patching and upgrades
- Enhance CI/CD pipelines (GitLab CI, GitHub CI)
- Contribute to Agile ceremonies and take ownership of user stories
- Implement new technologies and solutions to improve system reliability
What You Will Bring
- Strong commercial experience with AWS (essential)
- Solid understanding of Linux systems (RHEL, CentOS or similar)
- Scripting skills, ideally Python
- Hands‑on experience with Terraform and/or Ansible
- Proficiency with Docker
- Exposure to CI/CD tooling and Agile ways of working
- Background in software engineering, systems engineering, or previous SRE roles
- Minimum 4 years’ experience in a relevant technical discipline
Please note, this role is not suitable for candidates with Windows‑only experience or Engineers without hands‑on AWS or Linux exposure.
Remote working is supported, with an on-site presence in Nottingham, ideally once per week preferred.