Stratostaff Kenya
Site Reliability Engineer at Stratostaff
Job Description
Key Duties and Responsibilities
- Ensure the reliability, scalability, and high availability of quantum computing systems.
- Collaborate with development teams to design, deploy, and maintain quantum systems.
- Implement and maintain CI/CD pipelines using tools like Concourse, Tekton, and GitLab CI/CD.
- Monitor system performance with tools like Grafana, Sysdig, LogDNA, and Datadog, and troubleshoot and resolve issues.
- Develop and run monitoring, load, and stress tests to ensure system resilience.
- Implement security measures to safeguard system integrity, leveraging tools such as Vault.
- Respond to system alerts using PagerDuty and similar tools for swift issue resolution.
- Create and maintain comprehensive system documentation using GitHub for version control and collaboration.
Minimum Requirements:
- Bachelor’s degree in Computer Science, Engineering, or related field, or equivalent work experience.
- Proven experience as a Site Reliability Engineer or in a similar role in a software development setting.
- Proficiency in at least one of the following languages: Python, Go (Golang), JavaScript, TypeScript, C++, or Rust.
- Strong Linux skills, including command-line tools, shell scripting, and system diagnostics.
- Familiarity with fundamental DevOps tools like SSH, Git, and Makefiles.
- Experience with quantum computing systems and Qiskit.
- Knowledge of distributed systems and backend systems architecture.
- Experience with Red Hat, OpenShift, RHEL, and container technologies like Docker and Podman.
- Proficiency with Kubernetes and familiarity with service mesh technologies like Istio.
- Experience with GitOps and infrastructure as code tools such as ArgoCD, Ansible, and Terraform.
- Familiarity with “Pipelines as Code” principles and practices.
- Experience with monitoring, logging, and tracing tools such as Grafana, Sysdig, LogDNA, Datadog, OpenTelemetry, and Prometheus.
- Experience with templating languages like Jinja2.
- Familiarity with cloud platforms like IBM Cloud, AWS, GCP, or Azure.
Preferred Skills:
- Master’s degree in Computer Science, Engineering, or a related field.
- Experience in a quantum computing environment.
- Advanced knowledge of Quantum Information Science principles and technologies.
- Experience with Helm for managing Kubernetes applications.
- Familiarity with DevOps and Agile methodologies.
- Experience automating manual processes, and customizing and optimizing CI/CD pipelines.
- Knowledge of database technologies such as PostgreSQL, MySQL, MongoDB, and InfluxDB.
- Certifications related to Kubernetes, Red Hat, or other relevant technologies.