← Back to all jobs
Deimos

Senior Site Reliability Engineer

Deimos

4h ago

0DevopsNigeriahimalayas
Professional-ServicesSenior-Site-Reliability-EngineerSenior-Site-Reliability-Engineering-ArchitectPrincipal-Site-Reliability-EngineerSenior-Reliability-EngineerSite-Reliability-Engineering-LeadStaff-Site-Reliability-Engineer-(SRE)Mid-level

Job Description

Deimos is a Cloud-native Developer and Security Operations technology services company. We help companies of all sizes adopt the Cloud for improved service delivery to their clients. We’re a fully remote African-based team of engineers who are passionate about implementing engineering best practices. We leverage the latest technologies while building globally competitive solutions for our clients. With Deimos being one of the two moons of Mars, we refer to ourselves as “Martians” who are on a mission to Mars, together. Our teams value the ability to learn and adapt to technology changes while appreciating solid foundational design and the craft of software engineering. As such our engineers enjoy working with various clients who have different problems to solve. If this sounds like you then you would be an ideal fit for our environment. However, you must be based in one of the countries we currently hire in which are as follows: Kenya, Ghana, Nigeria, South Africa, and Senegal.Role OverviewWe are looking for an experienced Senior Site Reliability Engineer to join our Professional Services team and deliver Software and DevSecOps projects. You will report to a Site Reliability Engineering Manager.SRE / DevOps is one of our core competencies. You will be part of a highly-skilled team that continuously innovates and delivers high value solutions to clients across various industries on all public clouds (AWS, Azure, GCP, etc). Technologies we work with daily include Kuberenetes, Helm, Terraform, GitOps, just to name a few.What you will be doingExperience with Azure DevOps and deployment pipelines.In-depth experience with Powershell.In-depth experience with YAML.Microsoft Azure Infrastructure experience essential.Experience with Source Code Management using Github.Experience with Version Control.Design and build advanced cloud-native infrastructureGuide technical discussions with clients and build technical roadmaps Collaborate with the Engineering Director(s) to (re)design architectureAssist the Site Reliability Manager with resource planningMentor other engineers and share knowledgeDocument processes and monitor performance metricsCollaborate with cross-functional teams to define, design, and ship new features.Constantly improve the stability, scalability, security, cost-effectiveness, and operational excellence of our clients' systems.Continuously discover, evaluate, and implement new technologies to maximize development efficiency and security.Conduct infrastructure planning, testing, and development.What you must haveMinimum NQF 7 – BSC/BCom/BTech in Information Technology, Information Systems Engineering or Computer Science or relevant equivalent. 3+ years’ experience in Azure Cloud & DevOps Practices.At least five or more years experience working in a DevOps/SRE team Extensive experience in DevOps/SRE, team management and collaborationAdvanced knowledge of best practices related to data encryption and cybersecurityAdvanced knowledge of the general DevOps/SRE landscape, architectures, and emerging technologiesCloud experience, preferably Azure, AWS and GCP.Experience in Observability Practices and Incident ManagementExtensive experience with Prometheus, Grafana, the Elastic Stack and all versions of Beats, especially within KubernetesExperience with Infrastructure as Code, preferably TerraformExperience with general automation and config management, preferably AnsibleExtensive experience building and maintaining Kubernetes clusters and workloadsStrong foundation of basic network and security conceptsAbility to build robust CICD pipelinesFamiliarity with relational and non-relational databasesSolid understanding of Linux operating systemsQualities & BehavioursExceptional interpersonal and communication skillsA zest for automation.Comfortable working as a remote team member.Ability to keep up to date with DevOps/SRE best practices, trends and innovation.Passionate about mentoring and growing technical skills within the team.Expected Output for the roleAutomate Azure infrastructure provisioning and configuration using PowerShell, YAML and Bicep. Monitor and troubleshoot issues in the Azure environment, including network, storage, and compute resources.Deploy and manage Azure Databricks infrastructure for data processing and analytics. Attend to support tickets, which may arise due to product components not functioning as expected. Develop and maintain technical support documentation of the product. Promote innovations to support business requirements through activities that test, pilot and implement innovative concepts. Responsible for support and troubleshooting DevOps tools and processes for stakeholdersAbout youFor us to achieve our ambitious vision together as a team, It is important for our Martians to lead at all levels, be self starters who take initiative and put their hands up for challenging tasks. A growth mindset is important to us and we encourage all our Martians to openly share knowledge, su