Create Email Alert

Email Alert for

ⓘ There was an unexpected error processing your request.

Please refresh the page and try again.

If the problem persists, please contact us with your issue.

Email address is already registered

You can always manage your preferences and update your interests to ensure you receive the most relevant opportunities.

Would you like to [visit your alert settings] now?

Success! You're now signed up for Job Alerts

Get ready to discover your next great opportunity.

Similar Jobs

  • NVIDIA

    Senior DevOps and Automation Engineer

    Santa Clara, CA, United States

    • Ending Soon

    NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers

    Job Source: NVIDIA
  • ASRC Federal Holding Company

    Senior HPC Engineer

    Mountain View, CA, United States

    • Ending Soon

    Job Description ASRC Federal is searching for a Senior HPC Engineer to support Inuteq LLC which this role is fully telework ASRC Federal InuTeq provides High Performance Computing services throughout the HPC lifecycle for computational requirements, architecture, acquisition, and operations to federal government customers. Our employees embrace i

    Job Source: ASRC Federal Holding Company
  • NVIDIA

    Senior Software Engineer, DevOps and Infrastructure Automation

    Santa Clara, CA, United States

    • Ending Soon

    NVIDIA is searching for a Senior Software Engineer, DevOps Infrastructure Engineering and Automation engineer for the bringing up, development and prototyping a class of products and services for our Metropolis platforms on multi cloud environments and on-Prem. Data is the lifeblood of the modern city. Today, it is captured by over 500 million came

    Job Source: NVIDIA
  • NVIDIA Corporation

    Senior HPC Performance Engineer

    Santa Clara, CA, United States

    • Ending Soon

    Senior HPC Performance Engineer page is loaded Senior HPC Performance Engineer Apply locations US, CA, Santa Clara time type Full time posted on Posted 2 Days Ago job requisition id JR1977468 NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High Performance Computing and Visua

    Job Source: NVIDIA Corporation
  • Ampstek

    HPC Engineer

    Mountain View, CA, United States

    Title: HPC hardware development consultant Location: Mountain View CA- Onsite Domain: Automotive High-performance computing (HPC) Extensive experience (7+ years) in HPC hardware development and embedded systems, particularly in automotive applications. Design of computer PCBs for automotive systems. Work with component suppliers to select and suppo

    Job Source: Ampstek
  • Zealogics

    HPC engineer

    San Jose, CA, United States

    Job Responsibilities Candidates should have good domain knowledge in High-Performance Computing, script language(Shell, Python), Linux administrator, operating systems (Linux, Windows), computer network Distributed file systems (Lustre/NFS), virtualization and containerization related experience is a plus Configuration and maintenance of the HPC co

    Job Source: Zealogics
  • NVIDIA

    Senior AI-HPC Storage Engineer

    Santa Clara, CA, United States

    • Ending Soon

    NVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing. NVIDIA is a “learning machine” that constantly evolves by

    Job Source: NVIDIA
  • contextual ai

    DevOps Engineer

    Mountain View, CA, United States

    • Ending Soon

    Job Overview As a DevOps Engineer, you will play a crucial role ensuring the reliability, scalability, and performance of our RAG 2.0-based product and our HPC cluster. You will collaborate closely with our product and research engineering teams to design and implement processes and systems that ensure the stability and availability of our service

    Job Source: contextual ai

Senior DevOps and Automation Engineer - HPC

Santa Clara, CA, United States

NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars.

We are the GPU Communications Libraries and Networking team at NVIDIA. We deliver libraries like NCCL and NVSHMEM for Deep Learning and HPC applications. We are looking for a motivated DevOps and Automation Engineer to help us increase our execution efficiency. Most DL and HPC applications run on large clusters with high-speed networking (Infiniband, RoCE). This is an outstanding opportunity going beyond the traditional DevOps roles and responsibilities. Are you ready for to contribute to the development of innovative technologies and help realize NVIDIA's vision?

What You Will Be Doing

As a Senior Software Engineer in the GPU Communications Group, you will utilize your knowledge and expertise in high availability network software to create, enhance, and maintain our GPU communication solutions. You will:

Maintain and improve CI/CD systems (Gitlab, Github, Perforce)

Develop tools and automation to deploy testing on new systems and platforms, including cloud platforms (Azure, AWS, GCP, etc.)

Maintain internal cluster servers and Infiniband/RoCE networks

Collect a lot of performance data; build tools and infrastructure to visualize and analyze the information

Collaborate with a very dynamic team across multiple time zones

What We Need To See

B.S. or M.S. in Computer Science, or related field and 5+ years of relevant experience

Excellent C/C++ programming and debugging skills

Expert in a scripting language, preferably Python

Proficient with Linux fundamentals

Familiar with containers, cloud provisioning and scheduling tools (Docker, Docker Swarm, Kubernetes, SLURM, Ansible)

Adaptability and passion to learn new areas and tools

Flexibility to work and communicate effectively across different teams and timezones

Ways To Stand Out From The Crowd

Experience conducting performance benchmarking and developing infrastructure on HPC clusters. Prior system administration experience, esp for large clusters

Good understanding of Infiniband/RoCE networks and experience debugging network configuration issues

Familiarity with CUDA programming and/or GPUs. Experience with Deep Learning Frameworks such PyTorch, TensorFlow. Deep understanding of technology and passionate about what you do

The base salary range is 144,000 USD - 270,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits . NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

#J-18808-Ljbffr

Apply

Create Email Alert

Create Email Alert

Email Alert for Senior DevOps and Automation Engineer - HPC jobs in Santa Clara, CA, United States

ⓘ There was an unexpected error processing your request.

Please refresh the page and try again.

If the problem persists, please contact us with your issue.

Email address is already registered

You can always manage your preferences and update your interests to ensure you receive the most relevant opportunities.

Would you like to [visit your alert settings] now?

Success! You're now signed up for Job Alerts

Get ready to discover your next great opportunity.