Senior Staff Machine Learning Engineer, GenAI Platform
1d ago
0$293k - $410kDataUnited Stateshimalayas
Machine-Learning-EngineeringMLOpsLLMOpsGenAIAI-PlatformsSenior
Job Description
Reddit is a community of communities. It’s built on shared interests, passion, and trust, and is home to the most open and authentic conversations on the internet. Every day, Reddit users submit, vote, and comment on the topics they care most about. With 100,000+ active communities and approximately 121 million daily active unique visitors, Reddit is one of the internet’s largest sources of information. For more information, visit www.redditinc.com.Who We Are: The Machine Learning Platform team at Reddit is a high-impact team that owns the infrastructure that powers recommendations, content discovery, user and content quantification, while directly impacting other teams such as Growth, Ads, Feeds, and Core Machine Learning teams.What You’ll Do:As a Senior StaffSoftware Engineer, you will help define and lead the vision for Reddit’s large-scale GenAI Platform, shaping the strategy, architecture, and operating model that enable teams across the company to build, deploy, and scale generative AI products with confidence.Contribute to the design, implementation, and maintenance of the LLM Gateway, focusing on features like unified API endpoints for internal/externally hosted LLM, rate/token limit management, and intelligent failover mechanisms to boost uptime and reliability.Lead and execute the vision, strategy, and roadmap for Reddit’s large-scale GenAI Platform.Define the platform architecture and operating model that enable teams to build, deploy, and scale GenAI products reliably.Drive the strategy for a unified LAG Gateway supporting internally and externally hosted LLMs through consistent APIs and abstractions.Set the direction for core platform capabilities such as rate and token limit management, intelligent failover, and production resilience.Shape Reddit’s approach to an enterprise-grade RAG systemEstablish the strategic direction for agentic AI workflows and tool-use patterns across the platform.Own the end-to-end platform strategy from concept through production adoption and long-term evolution.Drive MLOps and LLMOps standards across CI/CD, testing, versioning, evaluation, and lifecycle management.Define best practices for observability, monitoring, governance, and operational excellence across GenAI systems.Partner across engineering, product, and leadership to align platform investments with company priorities and user needs.Champion platform thinking with a strong focus on scalability, reliability, performance, and developer experience.Influence technical direction across teams by turning emerging AI capabilities into a scalable platform strategy.Who You Might Be:10+ years of experience in ML Engineering, AI Platform Engineering, or Cloud AI Deployment roles.Have a track record of leading technical strategy and delivering AI platforms in cloud-based production environments at scale.Demonstrate strong execution by turning strategy into action, driving complex initiatives end to end, and consistently delivering high-quality platform outcomes.Bring deep experience operating Kubernetes and other orchestration systems in large-scale production environments.Deep experience with cloud-based technologies for supporting an ML platform, including tools like AWS, Google Cloud Storage, infrastructure-as-code (Terraform), and moreProficiency with the common programming languages and frameworks of ML, such as Go, Python, etc.Excellent communication skills with the ability to articulate technical AI concepts to non-technical stakeholdersStrong focus on scalability, reliability, performance, and developer experience. You are an undying advocate for platform users and have a deep intuition for the genAI product development lifecycle.Strong knowledge of model serving, inference pipelines, monitoring, and observability for AI systems is a plusBenefits:Comprehensive Healthcare Benefits and Income Replacement Programs401k with Employer MatchGlobal Benefit programs that fit your lifestyle, from workspace to professional development to caregiving supportFamily Planning SupportGender-Affirming CareMental Health & Coaching BenefitsFlexible Vacation & Paid Volunteer Time OffGenerous Paid Parental Leave Pay Transparency:This job posting may span more than one career level.In addition to base salary, this job is eligible to receive equity in the form of restricted stock units, and depending on the position offered, it may also be eligible to receive a commission. Additionally, Reddit offers a wide range of benefits to U.S.-based employees, including medical, dental, and vision insurance, 401(k) program with employer match, generous time off for vacation, and parental leave. To learn more, please visit https://www.redditinc.com/careers/.To provide greater transparency to candidates, we share base salary ranges for all US-based job postings regardless of state. We set standard base pay ranges for all roles based on function, level, and country location, benchmarked against similar stage growth companies. Final offer amounts a
