Stratostaff Kenya

Site Reliability Engineer at Stratostaff

00100, Nairobi Kenya
May 22, 2024
Apply Now
Deadline date:

Job Description

Site Reliability Engineer at Stratostaff

Key Duties and Responsibilities

  • Ensure the reliability, scalability, and high availability of quantum computing systems.
  • Collaborate with development teams to design, deploy, and maintain quantum systems.
  • Implement and maintain CI/CD pipelines using tools like Concourse, Tekton, and GitLab CI/CD.
  • Monitor system performance with tools like Grafana, Sysdig, LogDNA, Datadog, and troubleshoot and resolve issues.
  • Develop and execute monitoring, load, and stress testing to ensure system resilience.
  • Implement security measures to safeguard system integrity, leveraging tools such as Vault.
  • Respond to system alerts using PagerDuty and similar tools for swift issue resolution.
  • Create and maintain comprehensive system documentation using GitHub for version control and collaboration.

MINIMUM REQUIREMENTS:

  • Bachelor’s degree in Computer Science, Engineering, or related field, or equivalent work experience.
  • Proven experience as a Site Reliability Engineer or similar role in a software development setting.
  • Proficiency in at least one of the following languages: Python, Go (Golang), JavaScript, TypeScript, C++, or Rust.
  • Strong Linux skills, including command-line tools, shell scripting, and system diagnostics.
  • Familiarity with fundamental DevOps tools like SSH, Git, and Makefiles.
  • Experience with quantum computing systems and Qiskit.
  • Knowledge of distributed systems and backend systems architecture.
  • Experience with Red Hat, OpenShift, RHEL, and container technologies like Docker and Podman.
  • Proficiency with Kubernetes and familiarity with service mesh technologies like Istio.
  • Experience with GitOps and infrastructure as code tools such as ArgoCD, Ansible, and Terraform.
  • Familiarity with “Pipelines as Code” principles and practices.
  • Experience with monitoring, logging, and tracing tools such as Grafana, Sysdig, LogDNA, Datadog, OpenTelemetry, and Prometheus.
  • Experience with templating languages like Jinja2.
  • Familiarity with cloud platforms like IBM Cloud, AWS, GCP, or Azure.

Preferred Skills:

  • Master’s degree in Computer Science, Engineering, or a related field.
  • Experience in a quantum computing environment.
  • Advanced knowledge of Quantum Information Science principles and technologies.
  • Experience with Helm for managing Kubernetes applications.
  • Familiarity with DevOps and Agile methodologies.
  • Experience automating manual processes and customizing and optimizing CI/CD pipelines.
  • Knowledge of database technologies such as PostgreSQL, MySQL, MongoDB, and InfluxDB.
  • Certifications related to Kubernetes, Red Hat, or other relevant technologies.

Site Reliability Engineer at Stratostaff