Senior Staff Engineer, SRE/DevOps, Product Software/ Ingénieur principal ou ingé
Equinix
3d ago
0$131k - $181kDevopsCanadahimalayas
Site-Reliability-EngineeringDevOps-EngineeringNetwork-Platform-EngineeringBackend-EngineeringInfrastructure-EngineeringSr-Staff-DevOps-EngineerSenior-Staff-Software-EngineerStaff-Site-Reliability-Engineer-(SRE)Senior-DevOps-EngineerSenior-Engineer---Cloud-&-DevOps-(P)Senior
Job Description
Who are we?Equinix is the world’s digital infrastructure company®, shortening the path to connectivity to enable the innovations that enrich our work, life and planet.A place where tech thinkers and future builders turn bold ideas into breakthrough experiences, we welcome your unique perspective.Help us challenge assumptions, uncover bias, and remove barriers—because progress starts with fresh ideas. You’ll find belonging, purpose, and a team that welcomes you—because when you feel valued, you’re empowered to do your best work.Job SummaryEquinix is the world’s digital infrastructure company, operating 280+ data centers across the globe and providing interconnections to all the key clouds and networks. Much of the internet that you know flows through our rapid interconnected network and data centers. As Equinix’s network continues to evolve and grow, it’s essential for us to move towards autonomous operations – the automation and orchestration of network management tasks without human intervention. To achieve this, we’ll need a robust unified network model that accurately represents the network via real time telemetry for observability, which can also be used for intent-based configurations of the network. The unified modeling team in Network Platform Engineering is building this unified network model which feeds Equinix Brain, a strategic company-wide initiative to up-level the way we manage and use our asset data to improve margins and customer experience.We are looking for an experienced Senior Staff Engineer to help us build and operate a highly scalable, available, and information-rich unified network model.ResponsibilitiesActively participate in the design, development, test, and operation of highly reliable services and software to model network entities and relationships that compose Equinix’s global networkProvide technical guidance and collaborate with stakeholders to identify network data and use cases that can enrich the unified network model to solve critical business problems that increase network reliability, visibility, awareness, and the ability to planDevelop solutions that leverage data from the unified network model to provide insights that enhance product capabilities for customers, aid operations teams to troubleshoot customer escalations and planning maintenances, and inform capacity planning teamsFacilitate cross-stakeholder discussions to ensure alignment on software requirements and design trade-offs, while considering performance, scalability, and reliability factorsFollow a proactive and collaborative approach to working with cross-functional teams, ensuring seamless integration of unified network model into Equinix Brain with other domains and Network-As-A-Service (NaaS) initiativesLead by example through direct contribution, and provide direction in establishing development and operational best practices and standard methodologies Participate in an on-call rotationQualifications7+ years of experience developing distributed, scalable, highly available software services using GolangBackground of working for SaaS, PaaS, IaaS, or cloud-based companies with prior experience of designing microservices and systems at scale with a focus on production readinessExperience with building software as a service, running services with 24x7 on-call rotationsProficient in data management systems and technologies: GCP Spanner, MongoDB, Redis, Neo4JExperience with containerization and orchestration technologies: Docker, Kubernetes, or other open-source alternativesExperience in working with network management protocols: gRPC, NetconfSolid understanding of networking concepts, protocols (e.g. ISIS, BGP, BMP, LLDP), and their applicationsStrong experience in building & operating highly reliable distributed systemsProficient in using continuous integration and continuous deployment technologies: Github Actions, ArgoCDExperience with public cloud (AWS, GCP, Azure) services and technologiesHands-on experience with observability stack (metrics, logs, traces) such as Grafana, Prometheus, ThanosExperience with agile software development practices including JIRA, peer reviews, Git, CI/CDExcellent problem-solving and analytical skills to troubleshoot and resolve distributed system issuesStrong written and verbal communication skills to effectively convey findings, recommendations, and technical details to various stakeholdersBachelor's degree in computer science or related technical fieldPreferredMaster's degree or PhD in Computer Science or a related technical fieldExcellent coding skills in GolangPrior experience building a network model or digital twinPrior experience using AI to make real-time decisions (on the network)French TranslationRésumé du posteEquinix, la société d’infrastructure numérique dans le monde, exploite plus de 280 centres de données à l’échelle planétaire et fournit des interconnexions à tous les nuages et réseaux importants. Une bonne partie de l’Internet comme vous le connaissez p
