Model Serving Engineer
Bright Vision Technologies
3h ago
0DevUnited Stateshimalayas
Model-Serving-EngineeringModel-ServingModeling-EngineerAI-ML-Services-EngineerModel-Platform-EngineeringSoftware-EngineerMid-level
Job Description
Bright Vision Technologies is a software development company looking for a skilled Model Serving Engineer to design, build, and operate high-performance, highly reliable inference platforms for serving large machine learning models in production.RequirementsBachelor’s or Master’s degree in Computer Science or a related field.Six or more years of experience in distributed systems, infrastructure, or ML platform engineering.Strong proficiency in Python and a systems language such as Go, Rust, or C++.Deep experience operating high-throughput, low-latency services in production.Hands-on experience with LLM or large model inference frameworks such as vLLM or TensorRT-LLM.Strong understanding of GPU architecture, memory hierarchies, and accelerator utilization.Familiarity with Kubernetes, autoscaling, and modern cloud platforms.Experience with observability stacks including metrics, tracing, and structured logging.Solid grounding in performance engineering and capacity planning.Strong communication and incident response skills.BenefitsCompetitive base salary commensurate with experience, plus benefits.Originally posted on Himalayas
