Create Email Alert

Email Alert for

ⓘ There was an unexpected error processing your request.

Please refresh the page and try again.

If the problem persists, please contact us with your issue.

Email address is already registered

You can always manage your preferences and update your interests to ensure you receive the most relevant opportunities.

Success! You're now signed up for Job Alerts

Get ready to discover your next great opportunity.

Similar Jobs

eTeam, Inc.

AI Systems Engineer

San Jose, CA, United States
- Ending Soon
Job Overview: We are seeking an AI Systems Engineer to join our IT compute platforms engineering team. The AI Systems Engineer is responsible for the design, development, and administration of High-Performance Computing (HPC) infrastructure, GPU clusters, and AI workload schedulers. ABOUT YOU: You have a passion for learning. You are passionate abo
Job Source: eTeam, Inc.
NVIDIA

Senior GPU Cluster Software Engineer_

Santa Clara
As a member of the System Software team, you'll be responsible for building profiling solutions for large-scale real world applications running on GPU compute clusters to make them work efficiently and improve the user experience for customer as well as engineers supporting the cluster. Much of our software development focuses on profiling varied s
Job Source: NVIDIA
Zealogics

HPC engineer

San Jose, CA, United States
Job Responsibilities Candidates should have good domain knowledge in High-Performance Computing, script language(Shell, Python), Linux administrator, operating systems (Linux, Windows), computer network Distributed file systems (Lustre/NFS), virtualization and containerization related experience is a plus Configuration and maintenance of the HPC co
Job Source: Zealogics
NVIDIA Corporation

Senior AI-HPC Storage Engineer

Santa Clara, CA, United States
Senior AI-HPC Storage Engineer page is loaded Senior AI-HPC Storage Engineer Apply locations US, CA, Santa Clara US, MA, Westford US, TX, Austin time type Full time posted on Posted 22 Days Ago job requisition id JR1977545 NVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in
Job Source: NVIDIA Corporation
Support Revolution

Manager, Solution Engineering

San Jose, CA, United States
- Ending Soon
Press Tab to Move to Skip to Content Link Select how often (in days) to receive an alert: Create Alert Select how often (in days) to receive an alert: Location: San Jose, California, United States About Supermicro: Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterpr
Job Source: Support Revolution
Advanced Micro Devices , Inc.

AI Systems Engineer - HPC

San Jose, CA, United States
Overview: WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences the building blocks for the data center, artificial intelligence, PCs, gaming and embedded.
Job Source: Advanced Micro Devices , Inc.
Cadence Design Systems

IT InfiniBand/GPU -Sr Staff Systems Engineer

San Jose, CA, United States
At Cadence, we hire and develop leaders and innovators who want to make an impact on the world of technology. Cadence is looking for a Sr Staff Systems Engineer who accelerates strategic customer deployments and ensures on-time bring-up and deployment of HPC infrastructure and troubleshooting and supports technical roles supporting HPC, InfiniBa
Job Source: Cadence Design Systems
Support Revolution

Principal Product Engineer - Server Cluster

San Jose, CA, United States
- Ending Soon
Press Tab to Move to Skip to Content Link Select how often (in days) to receive an alert: Create Alert Select how often (in days) to receive an alert: Principal Product Engineer - Server Cluster Location: San Jose, California, United States About Supermicro: Supermicro is a Top Tier provider of advanced server, storage, and networking solut
Job Source: Support Revolution

HPC Cluster Engineer

Santa Clara, CA, United States

Are you ready to make your mark in the forefront of technological innovation? As an HPC Cluster Engineer , you'll play a pivotal role in shaping the future of AI, deep learning, and machine learning initiatives. Join us and leverage Nvidia's cutting-edge GPU technology to drive groundbreaking discoveries and revolutionize industries.

Sustainable Talent is thrilled to partner with Nvidia , a global powerhouse with over 25 years of trailblazing advancements in computer graphics, gaming, and accelerated computing.

This is a W-2 full-time contract based in Santa Clara, CA - Hybrid work option We offer competitive pay based on factors like experience, education, location, etc. and provide full benefits, PTO, and amazing company culture!

Additional locations: MA, Westford; US, NC, Durham; US, TX, Austin.

What you'll be doing:

You'll lead the charge in optimizing our Infiniband network and managing Lustre and GPFS storage solutions, ensuring seamless performance for our cutting-edge initiatives.

Your expertise in the SLURM job scheduler will be instrumental in orchestrating the smooth operation of our clusters, from scheduling tasks to managing resources efficiently.

As a Linux sysadmin guru, you'll be responsible for maintaining the stability and security of our systems, leveraging your deep understanding of Linux environments.

Harnessing the power of Ansible, you'll automate routine tasks and streamline operations, freeing up time for innovation and optimization.

Advanced python and bash scripting will drive automation efforts and enable dynamic solutions to complex challenges.

What We Need to See: Demonstrated experience with SLURM, coupled with a solid understanding of Infiniband networks and Lustre/GPFS storage systems, is essential.

A proven track record in Linux system administration, ensuring robustness and security in our computing environment.

Proficiency in Ansible is a must-have, enabling you to automate tasks and workflows efficiently.

Strong scripting abilities in Python and bash are critical for developing custom solutions and optimizing cluster performance.

Ways to Stand Out From the Crowd: Showcase your knowledge of best practices in HPC cluster operations, automation, and upgrades, setting you apart as a seasoned professional in the field.

Sustainable Talent is a M/F+, disabled, and veteran equal employment opportunity and affirmative action employer.

Name	Expiration	Description
ATTBCookie*	2 years	These cookies are used to remember a user’s choice about cookies on thebigjobsite.com. Where users have previously indicated a preference, that user’s preference will be stored in these cookies.
last-search search redirect-stage original-keyword	1 day Session 1 hour 1 hour	These cookies are used by thebigjobsite.com to pass search data between our own pages.
datadome	1 year	DataDome is a cybersecurity solution to detect bot activity
jjap	1 days	Used to track if you have seen the Job Alerts prompt. Job Alerts is a service you can subscribe to to receive information about new jobs.

What job

...and where?

Similar Jobs

AI Systems Engineer

Senior GPU Cluster Software Engineer_

HPC engineer

Senior AI-HPC Storage Engineer

Manager, Solution Engineering

AI Systems Engineer - HPC

IT InfiniBand/GPU -Sr Staff Systems Engineer

Principal Product Engineer - Server Cluster

HPC Cluster Engineer

Share this job

Create Email Alert