Related skills
kubernetes ai opentelemetry vector databases mlflowπ Description
- Design and maintain GenAI infra: gateways, prompts, vector DBs, LLM tools.
- Implement secure access controls and authentication integrated by default.
- Develop observability, monitoring, and logging for GenAI workloads.
- Collaborate with teams to integrate GenAI infra with agent frameworks.
- Optimize infra for scalability, high availability, and cost efficiency.
π― Requirements
- Extensive experience building AI platform infra, Kubernetes, and container security.
- Expertise in observability and monitoring for real-time performance (OpenTelemetry, MLFlow).
- Experience with AI infrastructure components: vector databases, prompts/versioning stores, AI IDEs.
- Familiarity with vLLM, SGLang to host LLM workloads.
- CI/CD pipelines and automation for AI model deployment and platform operations.
- Strong knowledge of authentication and authorization frameworks integrated into AI platforms.
π Benefits
- US base salary 179000-199000 plus bonus and benefits.
- Total rewards package includes bonus and benefits.
- Remote work with comprehensive rewards package.
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest β finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!