← Back to all jobs
Equinix

Principal Engineer, Product Software

Equinix

5d ago

0$177k - $265kDevUnited Stateshimalayas
Principal-EngineerSoftware-EngineerPlatform-EngineerSite-Reliability-EngineerObservability-EngineerSenior

Job Description

Who are we?Equinix is the world’s digital infrastructure company®, shortening the path to connectivity to enable the innovations that enrich our work, life and planet.A place where tech thinkers and future builders turn bold ideas into breakthrough experiences, we welcome your unique perspective.Help us challenge assumptions, uncover bias, and remove barriers—because progress starts with fresh ideas. You’ll find belonging, purpose, and a team that welcomes you—because when you feel valued, you’re empowered to do your best work.Job SummaryThe Platform Tools & Delivery (PTD) organization is the unified platform engineering team within Core Product Services (CPS). We are responsible for the secure, scalable, and consistent delivery of Equinix's digital products. This role leads the technical vision and consolidation of observability signals and reliability standards across Equinix's global hybrid footprint for the engineering teams that build and run Equinix’s infrastructure, products, and services.ResponsibilitiesRequirements AnalysisInteracts with internal product management and engineering teams to understand product requirements and define the platform roadmapWorks with the Equinix Engineering Excellence (E3) team in the Equinix IT organization to find common points of acceleration and bidirectional consumption of servicesActs as a lead representative for Infrastructure P&S requirements in forums for enterprise-wide developer initiatives, plans, and architecturesSoftware ArchitectureDefines the platform reliability standards through the development of a comprehensive SLO/SLI frameworkDrives architectural consistency for observability across a hybrid footprint including 31 metros and multiple AWS regionsConsolidates all application observability signals onto a single platform (Grafana Cloud) to provide a single source of truthSoftware DesignProvides technical leadership for the design of the "Paved Path" regarding application assurance and reliability signalsEvaluates and recommends the consolidation of disparate, non-unified observability tools and parallel support systems in favor of unified, strategic solutionsDesigns integration strategies for identity and access management to ensure secure developer access to platform toolsDevelopment/CodingParticipates in the development of automated reliability signals and self-service observability toolsDrives project work and creates automation for the observability stack and application lifecycle toolsParticipates in peer reviews and technical integration efforts to ensure cross-functional alignment within the PTD and CPS organizationsTestingSets standards for application assurance, including vulnerability management and identity integration programsRecommends frameworks for measuring platform performance, such as Kubernetes API server uptime and provisioning delivery timeDevOpsArticulates the vision for a unified runtime that leverages both global on-premises footprints and cloud capabilitiesLeads the Observability Stack Unification charter as part of the broader CI/CD and platform consolidation effortUtilizes FinOps and financial observability reporting to provide cost attribution by product, team, and organizationSoftware Reliability & Support EngineeringDefines and publishes critical reliability metrics, including Mean Time to Detect (MTTD) and Mean Time to Repair (MTTR)Provides L4 technical escalation capacity to stabilize critical, high-toil servicesParticipates in on-call rotations for respective observability and operations areas to ensure 24/7 platform stabilityCustomer/Stakeholder EngagementServes as a technical liaison for internal product teams (the platform's customers) to understand concerns and prioritiesActs as a primary point of contact for technical perspectives and alignment with stakeholders in the Equinix product organization and the Equinix IT organizationTechnical Project ManagementWorks with Engineering Managers to define platform KPIs and project schedules for unification effortsProvides status reporting on the Observability Standard and other strategic consolidation projectsR&D/InnovationInvestigates and evaluates new observability technologies to reduce infrastructure toil for product teamsInfluences the organization’s technical objectives by identifying fruitful opportunities in areas like telemetry and proactive alertingQualificationsExperience: 10+ years in Platform Engineering, Site Reliability Engineering (SRE), or Observability-focused rolesEducation: Bachelor’s in Computer Science, Computer Engineering, or a related technical fieldTechnical Depth: Expert-level knowledge of Platform Engineering, Grafana Cloud, Observability concepts (Logs, Metrics, Traces, RUM, Synthetics, etc), and Operational Readiness. Competence with Kubernetes, ArgoCD, on-premises and cloud infrastructure (AWS), software engineering practices including CI/CD. Familiarity with Go development, cluster-api and the CNCF ecosystem is preferredThe targeted pay range