Senior Software Engineer, Inference Platform
What this role actually needs.
About the Role
We’re looking for a Senior Engineer to help build the next-generation inference platform that supports embedding models used for semantic search, retrieval, and AI-native experiences in MongoDB Atlas. You’ll join the broader Search and AI Platform organization and collaborate with ML researchers and engi
Company context
MongoDB is the public document database company powering modern applications across cloud, on-prem, and edge.
Day-to-day expectations
MongoDB lists these responsibilities for the Senior Software Engineer, Inference Platform role.
- Design and build components of a multi-tenant inference platform integrated directly with MongoDB Atlas, supporting semantic search and hybrid retrieval
- Collaborate with AI engineers and researchers to productionize inference for embedding models and rerankers — enabling both batch and real-time use cases
- Contribute to platform capabilities such as latency-aware routing, model versioning, health monitoring, and observability
- Improve performance, autoscaling, GPU utilization, and resource efficiency in a cloud-native environment
- Work across product, infrastructure, and ML teams to ensure the inference platform meets the scale, reliability, and latency demands of Atlas users
- Gain hands-on experience with tools like vLLM and container orchestration with Kubernetes
What a strong candidate brings
These requirements are extracted from the source listing and normalized for UpJobz readers.
- 5+ years of experience building backend or infrastructure systems at scale
- Strong software engineering skills in languages such as Go, Rust, Python, or C++, with an emphasis on performance and reliability
- Experienced in cloud-native architectures, distributed systems, and multi-tenant service design
- Familiar with concepts in ML model serving and inference runtimes, even if not directly deploying models
- Knowledge of vector search systems (e.g., Faiss, HNSW, ScaNN) is a plus
- Comfortable working across functional teams, including ML researchers, backend engineers, and platform teams
Why this listing is more than a copied job post.
Senior Software Engineer, Inference Platform is framed against UpJobz source checks, country scope, compensation visibility, and work-authorization signals so candidates can make a faster go/no-go decision.
United States tech market
United States roles on UpJobz are filtered for high-tech relevance, source freshness, and actionable employer detail before they are allowed into SEO surfaces.
Compensation read
The employer source does not expose a reliable salary range, so candidates should ask for compensation early instead of waiting until late-stage interviews.
Work authorization read
Current extracted signal: Open to TN, H-1B, and OPT candidates already in the United States. UpJobz treats this as a search signal, not legal advice, and links visa-sensitive roles back to the relevant visa hub where possible.
Location read
On-site roles in Seattle should be compared against commute, local salary bands, and nearby employer demand.
Turn this listing into an application plan.
This is the first pass at the premium UpJobz layer: a fast brief that helps serious applicants move with more clarity.
Next moves
- Tailor your resume around AI and LLM work instead of sending a generic application.
- Use the first two bullets of your application to connect your background directly to the Senior Software Engineer, Inference Platform role. It is a high-signal on-site role in Seattle, and it is most realistic for TN, H-1B, and OPT candidates already in the United States.
- Open the role quickly if it fits and bookmark three similar jobs before you leave the page.
Interview themes
Watchouts
- Compensation is hidden, so get range clarity in the first recruiter conversation.
- State your work-authorization status up front (the listing is open to TN, H-1B, and OPT candidates already in the United States) so the recruiter does not have to infer it.
- Show concrete examples of succeeding in on-site environments.
Keywords to match against your background
Use these terms to decide whether your resume, portfolio, and recent projects line up with the role.
Apply through the employer source
Open the source listing from mongodb.com, confirm the role is still active, then apply on the employer or ATS page.
Source: mongodb.com · Source ID: 7467701 · Confidence: 90/100 · Last checked: May 7, 2026