This job is no longer available

The job listing you are looking has expired.
Please browse our latest remote jobs.

See open jobs →
← Back to all jobs

Site Reliability Infra Engineer (AI, LLM)

Added
26 days ago
Location
Type
Full time
Salary
Not Specified

Use AI to Automatically Apply!

Let your AI Job Copilot auto-fill application questions
Auto-apply to relevant jobs from 300,000 companies

Auto-apply with JobCopilot Apply manually instead
Save job

Binance - Site Reliability Infra Engineer (AI, LLM)

Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countries for our industry-leading security, user fund transparency, trading engine speed, deep liquidity, and an unmatched portfolio of digital-asset products. Binance offerings range from trading and finance to education, research, payments, institutional services, Web3 features, and more. We leverage the power of digital assets and blockchain to build an inclusive financial ecosystem to advance the freedom of money and improve financial access for people around the world.

We are looking for a seasoned SRE/ AI Engineer to design and improve our central Big Data infrastructure/services to the next stage, to ensure the data, services, and infrastructures are reliable, fault-tolerant, efficiently scalable, and cost-effective.

Why Binance

  • Shape the future with the world’s leading blockchain ecosystem
  • Collaborate with world-class talent in a user-centric global organization with a flat structure
  • Tackle unique, fast-paced projects with autonomy in an innovative environment
  • Thrive in a results-driven workplace with opportunities for career growth and continuous learning
  • Competitive salary and company benefits
  • Work-from-home arrangement (the arrangement may vary depending on the work nature of the business team)

Responsibilities

  • Engage in and improve the whole lifecycle of service, from inception and design, through to deployment, operation, and refinement.
  • Develop and maintain tools, re-designing capacity planning infrastructure for greater scalability.
  • Troubleshooting, diagnosing, fixing software issues, and ensuring data security.
  • Build production LLM systems to power business functions, from data to production, emphasizing automation and reproducibility.
  • Optimize and support LLM workloads in on-prem environments

Requirements

  • Have source code understanding of open-source data groups, such as HDFS, HBase, YARN, Spark, Flink, Airflow, Kyuubi, ZK, Kafka, etc.
  • In-depth understanding of Linux and computer networks.
  • Experience in at least one language (Python/Golang/Java, etc.).
  • Experience in profiling, benchmarking and optimizing ML applications
  • Self directed, self motivated and detail oriented with ability to come up with good design proposals and thorough analysis of production issues
  • Ability to thrive in a multi-functional team on high profile, critical projects
  • Minimum of 5 years of hands-on experience on backend or big data ecosystem.
  • Comfortable working in a high-velocity startup environment with evolving goals and systems
  • Middleware:
  • · Java code development experience, preferably Apache open source project developers
  • · Big data architecture experience, having built a multi-middleware-based technical architecture
  • · Experience in high availability, high performance, and resource optimization.
  • LLMOps:
  • · Experience in LLM-related project architecture design, code development, and performance tuning.
  • · Experience in LLM training and development of optimization-related platform tools.

About Binance

Binance is committed to being an equal opportunity employer. We believe that having a diverse workforce is fundamental to our success. By submitting a job application, you confirm that you have read and agree to our Candidate Privacy Notice.

Notes

Work-from-home arrangement (the arrangement may vary depending on the work nature of the business team)

Use AI to Automatically Apply!

Let your AI Job Copilot auto-fill application questions
Auto-apply to relevant jobs from 300,000 companies

Auto-apply with JobCopilot Apply manually instead
Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Remote Engineering Jobs. Just set your preferences and Job Copilot will do the rest—finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs →