← Back to all jobs
Bright Vision Technologies

Model Serving Engineer

Bright Vision Technologies

3h ago

0DevUnited Stateshimalayas
Model-Serving-EngineeringModel-ServingModeling-EngineerAI-ML-Services-EngineerModel-Platform-EngineeringSoftware-EngineerMid-level

Job Description

Bright Vision Technologies is a software development company looking for a skilled Model Serving Engineer to design, build, and operate high-performance, highly reliable inference platforms for serving large machine learning models in production.RequirementsBachelor’s or Master’s degree in Computer Science or a related field.Six or more years of experience in distributed systems, infrastructure, or ML platform engineering.Strong proficiency in Python and a systems language such as Go, Rust, or C++.Deep experience operating high-throughput, low-latency services in production.Hands-on experience with LLM or large model inference frameworks such as vLLM or TensorRT-LLM.Strong understanding of GPU architecture, memory hierarchies, and accelerator utilization.Familiarity with Kubernetes, autoscaling, and modern cloud platforms.Experience with observability stacks including metrics, tracing, and structured logging.Solid grounding in performance engineering and capacity planning.Strong communication and incident response skills.BenefitsCompetitive base salary commensurate with experience, plus benefits.Originally posted on Himalayas