← Back to all jobs
Red Hat

Principal Software Engineer - AI Experiment Tracking (Ireland)

Red Hat

4h ago

0DevIrelandhimalayas
Principal-Software-EngineerMLOps-EngineerAI-ML-EngineerPlatform-EngineerAI-Experiment-TrackingDirector

Job Description

Are you ready to join a game-changing open-source AI platform that harnesses the power of hybrid cloud to drive innovation?The Red Hat OpenShift AI (RHOAI) team is looking for a Principal Software Engineer with Kubernetes and MLOps (Machine Learning Operations) experience to join our rapidly growing engineering team. Our focus is to create a platform, partner ecosystem, and community by which enterprise customers can solve problems to accelerate business success using AI. This is a very exciting opportunity to build and impact the next generation of hybrid cloud MLOps platforms, contribute to the development of the RHOAI product, participate in open source communities, and be at the forefront of the exciting evolution of AI. You’ll join an ecosystem that fosters continuous learning, career growth, and professional development. You will be contributing as a core developer for AI experiment tracking on OpenShift AI. You will work as part of an evolving development team to rapidly design, secure, build, test, and release new capabilities. This role is for an individual contributor who collaborates closely with other developers and cross-functional teams. You will have the opportunity to actively participate in both our downstream efforts as well as the upstream MLFlow project. You should have a passion for working in open-source communities and for developing solutions that integrate Red Hat, open-source, and partner technologies into a cohesive platform.At Red Hat, our commitment to open source innovation extends beyond our products - it’s embedded in how we work and grow. Red Hatters embrace change – especially in our fast-moving technological landscape – and have a strong growth mindset. That's why we encourage our teams to proactively, thoughtfully, and ethically use AI to simplify their workflows, cut complexity, and boost efficiency. This empowers our associates to focus on higher-impact work, creating smart, more innovative solutions that solve our customers' most pressing challenges.What you will doArchitect and lead implementation of new features and solutions for MLFlow on Red Hat OpenShift AIProvide technical vision and leadership on critical and high-impact projects, ensuring quality, scalability, and reliability across systemsInnovate in the MLOps domain by participating in upstream communities, particularly Kubeflow and MLFlowEstablish and champion quality engineering standards across teams, ensuring robust testing practices, CI/CD pipelines, and quality-first culture at scaleEnsure non-functional requirements including security, resiliency, performance, and maintainability are consistently metWrite and review complex test strategies, frameworks, and automation approaches that raise the bar for quality across the organizationContribute to a culture of continuous improvement by sharing recommendations and technical knowledge with team membersCollaborate with product management, other engineering, and cross-functional teams to analyze and clarify business requirementsCommunicate effectively with stakeholders and leadership to provide visibility and influence decision-makingGive thoughtful and prompt code reviews, modeling high standards of quality, maintainability, and designRepresent RHOAI in external engagements including industry events, customer meetings, and open source communitiesMentor, influence, and coach a distributed team of engineers, developing future technical leaders and instilling strong engineering disciplineExplore and experiment with emerging AI technologies relevant to software development, proactively identifying opportunities to incorporate new AI capabilities into existing workflows and toolingWhat you will bringAdvanced experience developing applications in Go or Python, or another programming languageAdvanced experience with AI experiment tracking tools such as MLFlow, Weights and Biases, or ClearMLAdvanced experience in Kubernetes, OpenShift, or other cloud-native technologiesExpertise in defining, scaling, and enforcing testing strategies, automation frameworks, and CI/CD pipelines across large, distributed systemsAbility to quickly learn and guide others on using new tools and technologies, including AI-assisted development toolsExperience with source code management tools such as GitProven ability to innovate and a passion for staying at the forefront of technology, including quality engineering best practicesExcellent system understanding and troubleshooting capabilities, with a focus on scalability, reliability, and performanceTechnical leadership acumen in a global team environment, including mentoring and coaching engineers at multiple levelsExcellent written and verbal communication skillsThe following will be considered a plus:Understanding of how Open Source and Free Software communities workExperience with development for public cloud services (AWS, GCE, Azure)Experience working with or deploying MLOps platformsAbout Red HatRed Hat is the world’s leading provide