Senior SRE / DevOps Engineer
Location: Remote
Duration: 3 months initially (strong possibility of extension)
Start: ASAP
Project Language: English
Rate: £550 (outside of IR35)
Tech Stack: Kubernetes, AWS, Terraform, Helm.
I am working with an international client that is looking for a Kubernetes expert for a 3-month project.
Site Reliability Engineers (SREs) are a fundamental component of their mission to empower their developers with a solid platform that embraces the 5 S's:
Speed, Stability, Security, Scalability, Sustainability.
The ideal candidate has a strong system administration background and has worked heavily with Kubernetes over the past few years.
About the role:
- Set up a secure access model
- Increase development speed by giving developers better insight into cluster state and application status
- Reduce time to recover by ensuring the right people have the access they need to debug problems
- Increase productivity for tech leads by reducing load on people with production access
- Migrate from plain Kubernetes manifests to Helm
- Reduce differences between clusters to improve testability and stability
- Increase development speed by providing "blueprints" that developers can use (reduce time to cluster)
- Reduce review time and remove the requirement for an SRE to review each PR
- Set up SLI/SLO measurements and alerting
- Increase visibility of cluster stability to support data-driven development
- Reduce technical debt
- Reduce onboarding time for new engineers
About you:
- Experience with Terraform.
- At least 4 years working with AWS
- At least 2 years of hands-on Kubernetes experience (ideally Kubernetes certified).
- Experience with Metrics, Monitoring, Logging & Alerting
- A proactive approach to spotting problems, areas for improvement, and performance bottlenecks
