Our client is in the search for a Site Reliability Engineer to help scale the largest data analytics software in recent years. Our client is a data analytics company with VC-Backing from Google Venture amongst other big Sand Hill Road investors! They are in the search for an experience engineer who truly understands the SRE paradigm and is able to demonstrate a proven track record of working in a high available, high data traffic environment!
As the Site Reliability Engineer, you will be part of a small SRE team and you will be responsible for managing, scaling, and automating, production Kubernetes clusters. As an SRE will also be writing automation code for managing, expanding, measuring, pod operations, network configuration, node replacement. You will not be an operator, rather you will be an experience software engineer focused on operations. You will be conducting many activities such as deep diving troubleshooting around Kubernetes and Docker.
The Site Reliability Engineer will have extensive experience in the field automation and containerisation, amongst other highly sought-after skills/experience:
- You must have hands experience on Docker & Kubernetes production clusters.
- Making huge contributions in the Docker or Kubernetes open source community is a huge advantage!
- You must have strong development/automation skills and therefore, must be comfortable writing Python, Golang or Java code
- You must have in-depth understanding of the internals of Kubernetes cluster management, orchestration, networking, ingress, and Docker packaging!
If you are a highly experienced Site Reliability Engineer based in San Francisco and is looking for an onsite SRE role with an incredibly innovative start-up then apply today and a member of the Harrison Clarke team will be in touch!