Head of Platform/AI Cluster Management - System Integrator (San Francisco) Job at Hamilton Barnes Associates Limited, San Francisco, CA

  • Hamilton Barnes Associates Limited
  • San Francisco, CA

Job Description

Ready to lead innovation at the intersection of platforms and artificial intelligence?

Join a pioneering technology company driving advancements in cloud, AI, and data-driven solutions across global markets. The organization is recognized for fostering innovation, scalability, and collaboration through cutting-edge platforms that empower enterprises to evolve intelligently.

The team is hiring a Head of Platform/AI Cluster Management to oversee the strategic development, integration, and optimization of AI and platform initiatives. The role will focus on leading cross-functional teams, enhancing performance and scalability, and aligning technology strategy with long-term business goals.

Shape the future of intelligent platforms and transformative innovation. Apply now!

Responsibilities

  • Own the scheduler/runtime layer (Slurm, Kubernetes, Ray), including multi-tenancy, quotas, and GPU/host fleet management.
  • Lead cluster operations across images, CI/CD, repair/health, performance/telemetry, and incident response.
  • Deliver platform services that ensure workload SLOs and reliable runtime execution.
  • Define and implement namespace/tenancy design, node health automation, golden images, admission controls, on-call runbooks, and go-live gates.
  • Collaborate closely with infra, SRE, and network teams to optimize workload placement and cluster efficiency.
  • Provide hands-on expertise in NCCL behaviours, placement strategies, and congestion signal management.

Requirements

  • Deep expertise in cluster management, scheduling, and runtime environments for large-scale compute.
  • Hands-on background with Slurm, Kubernetes, Ray, or similar orchestration platforms.
  • Strong understanding of NCCL performance tuning, workload isolation, and congestion management.
  • Experience scaling multi-tenant, GPU-heavy clusters with strict SLOs.
  • Ability to thrive in a startup environment with full ownership over platform and cluster strategy.

Salary

  • $500,000 gross per year (Negotiable)

Job Tags

Full time
