Added
less than a minute ago
Location
Type
Full time
Salary
Upgrade to Premium to se...
Related skills
python tensorflow pytorch reinforcement learning jaxπ Description
- Architect and train teacher-student models for multimodal data.
- Build RL-based data discovery and scale RL training (PPO, DQN).
- Optimize real-time inference: deploy distilled/RL models with low latency.
- Research agentic systems: autonomous reasoning and chain-of-thought prompts.
- Drive production reliability of Omnitag data platform.
- Mentor ML scientists, data engineers, and junior engineers.
π― Requirements
- BS in Computer Science, ML, or related field, or equivalent experience.
- 6+ years hands-on ML engineering with post-training, optimization, deployment.
- Distillation/teacher-student training experience; loss functions and evaluation.
- RL experience in production or research: policy optimization, rewards, simulation.
- Expert in Python and ML frameworks (PyTorch, TensorFlow, or JAX).
- Software engineering fundamentals: testing, CI/CD, containerization, system design.
- Cloud deployment of ML models (AWS, GCP, Azure) and inference optimization.
- Bonus points: MS/PhD; agentic systems; robotics; multimodal learning.
π Benefits
- Hybrid schedule with in-office time in Boston, Pittsburgh, or Las Vegas, or fully remote.
- Medical, dental, vision; 401k with match; HSA; life and pet insurance.
- Global, diverse, inclusive culture with a people-first approach.
Meet JobCopilot: Your Personal AI Job Hunter
Automatically Apply to Engineering Jobs. Just set your
preferences and Job Copilot will do the rest β finding, filtering, and applying while you focus on what matters.
Help us maintain the quality of jobs posted on Empllo!
Is this position not a remote job?
Let us know!