Create Email Alert

Email Alert for

ⓘ There was an unexpected error processing your request.

Please refresh the page and try again.

If the problem persists, please contact us with your issue.

Email address is already registered

You can always manage your preferences and update your interests to ensure you receive the most relevant opportunities.

Would you like to [visit your alert settings] now?

Success! You're now signed up for Job Alerts

Get ready to discover your next great opportunity.

Similar Jobs

  • eTeam, Inc.

    AI Systems Engineer

    San Jose, CA, United States

    • Ending Soon

    Job Overview: We are seeking an AI Systems Engineer to join our IT compute platforms engineering team. The AI Systems Engineer is responsible for the design, development, and administration of High-Performance Computing (HPC) infrastructure, GPU clusters, and AI workload schedulers. ABOUT YOU: You have a passion for learning. You are passionate abo

    Job Source: eTeam, Inc.
  • NVIDIA

    Senior GPU Cluster Software Engineer_

    Santa Clara

    As a member of the System Software team, you'll be responsible for building profiling solutions for large-scale real world applications running on GPU compute clusters to make them work efficiently and improve the user experience for customer as well as engineers supporting the cluster. Much of our software development focuses on profiling varied s

    Job Source: NVIDIA
  • Zealogics

    HPC engineer

    San Jose, CA, United States

    Job Responsibilities Candidates should have good domain knowledge in High-Performance Computing, script language(Shell, Python), Linux administrator, operating systems (Linux, Windows), computer network Distributed file systems (Lustre/NFS), virtualization and containerization related experience is a plus Configuration and maintenance of the HPC co

    Job Source: Zealogics
  • NVIDIA Corporation

    Senior AI-HPC Storage Engineer

    Santa Clara, CA, United States

    Senior AI-HPC Storage Engineer page is loaded Senior AI-HPC Storage Engineer Apply locations US, CA, Santa Clara US, MA, Westford US, TX, Austin time type Full time posted on Posted 22 Days Ago job requisition id JR1977545 NVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in

    Job Source: NVIDIA Corporation
  • Support Revolution

    Manager, Solution Engineering

    San Jose, CA, United States

    • Ending Soon

    Press Tab to Move to Skip to Content Link Select how often (in days) to receive an alert: Create Alert Select how often (in days) to receive an alert: Location: San Jose, California, United States About Supermicro: Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterpr

    Job Source: Support Revolution
  • Advanced Micro Devices , Inc.

    AI Systems Engineer - HPC

    San Jose, CA, United States

    Overview: WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences the building blocks for the data center, artificial intelligence, PCs, gaming and embedded.

    Job Source: Advanced Micro Devices , Inc.
  • Cadence Design Systems

    IT InfiniBand/GPU -Sr Staff Systems Engineer

    San Jose, CA, United States

    At Cadence, we hire and develop leaders and innovators who want to make an impact on the world of technology. Cadence is looking for a Sr Staff Systems Engineer who accelerates strategic customer deployments and ensures on-time bring-up and deployment of HPC infrastructure and troubleshooting and supports technical roles supporting HPC, InfiniBa

    Job Source: Cadence Design Systems
  • Support Revolution

    Principal Product Engineer - Server Cluster

    San Jose, CA, United States

    • Ending Soon

    Press Tab to Move to Skip to Content Link Select how often (in days) to receive an alert: Create Alert Select how often (in days) to receive an alert: Principal Product Engineer - Server Cluster Location: San Jose, California, United States About Supermicro: Supermicro is a Top Tier provider of advanced server, storage, and networking solut

    Job Source: Support Revolution

HPC Cluster Engineer

Santa Clara, CA, United States

Are you ready to make your mark in the forefront of technological innovation? As an HPC Cluster Engineer , you'll play a pivotal role in shaping the future of AI, deep learning, and machine learning initiatives. Join us and leverage Nvidia's cutting-edge GPU technology to drive groundbreaking discoveries and revolutionize industries.

Sustainable Talent is thrilled to partner with Nvidia , a global powerhouse with over 25 years of trailblazing advancements in computer graphics, gaming, and accelerated computing.

This is a W-2 full-time contract based in Santa Clara, CA - Hybrid work option We offer competitive pay based on factors like experience, education, location, etc. and provide full benefits, PTO, and amazing company culture!

Additional locations: MA, Westford; US, NC, Durham; US, TX, Austin.

What you'll be doing:

You'll lead the charge in optimizing our Infiniband network and managing Lustre and GPFS storage solutions, ensuring seamless performance for our cutting-edge initiatives.

Your expertise in the SLURM job scheduler will be instrumental in orchestrating the smooth operation of our clusters, from scheduling tasks to managing resources efficiently.

As a Linux sysadmin guru, you'll be responsible for maintaining the stability and security of our systems, leveraging your deep understanding of Linux environments.

Harnessing the power of Ansible, you'll automate routine tasks and streamline operations, freeing up time for innovation and optimization.

Advanced python and bash scripting will drive automation efforts and enable dynamic solutions to complex challenges.

What We Need to See: Demonstrated experience with SLURM, coupled with a solid understanding of Infiniband networks and Lustre/GPFS storage systems, is essential.

A proven track record in Linux system administration, ensuring robustness and security in our computing environment.

Proficiency in Ansible is a must-have, enabling you to automate tasks and workflows efficiently.

Strong scripting abilities in Python and bash are critical for developing custom solutions and optimizing cluster performance.

Ways to Stand Out From the Crowd: Showcase your knowledge of best practices in HPC cluster operations, automation, and upgrades, setting you apart as a seasoned professional in the field.

Sustainable Talent is a M/F+, disabled, and veteran equal employment opportunity and affirmative action employer.

Apply

Create Email Alert

Create Email Alert

Email Alert for HPC Cluster Engineer jobs in Santa Clara, CA, United States

ⓘ There was an unexpected error processing your request.

Please refresh the page and try again.

If the problem persists, please contact us with your issue.

Email address is already registered

You can always manage your preferences and update your interests to ensure you receive the most relevant opportunities.

Would you like to [visit your alert settings] now?

Success! You're now signed up for Job Alerts

Get ready to discover your next great opportunity.