Related skills
docker kubernetes tensorflow pytorch llms
Description
- Serve as a technical partner to success and sales teams.
- Conduct architecture reviews, proofs of concepts, and demos.
- Design tailored architectures for specific business use cases.
- Guide customers to optimal AI/ML cloud solutions.
- Optimize GPU workloads with CUDA or TensorRT.
- Build scalable AI apps using Kubernetes and NFS.
Requirements
- Proven professional experience with cloud infrastructure and AI/ML platforms.
- Expertise in AI/ML frameworks (TensorFlow, PyTorch) and Hugging Face.
- Experience deploying and fine-tuning LLMs (e.g., DeepSeek, Llama, Claude, GPT-4) and GenAI models.
- Linux, distributed systems, Kubernetes, NFS, Object Storage; GPU optimization (CUDA, TensorRT).
- Hands-on experience with vLLM and quantization methods (INT4, INT8, FP8) for efficient model deployment.
- Programming/development experience building AI-powered applications; post-sales/technical consultant experience.
Benefits
- We innovate with purpose and simplify cloud/AI.
- Career development with conference stipends and LinkedIn Learning.
- Well-being benefits and flexible time off.
- Equal opportunity employer; inclusive and respectful culture.
- Remote-friendly environment with global, collaborative teams.