HPC Operations Manager – Hardware Engineering

Santa Clara, CA, United States

Time type: Full time

Posted: 7 days ago

Job requisition ID: JR1975474

Widely considered to be one of the technology world’s most desirable employers, NVIDIA is an industry leader with groundbreaking developments in High-Performance Computing, Artificial Intelligence and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables outstanding creativity and discovery, and powers what were once science fiction inventions, from artificial intelligence to autonomous cars.

We are now looking for a highly motivated HPC Operations Manager to join this multifaceted and innovative infrastructure team to craft global and dynamic HPC clusters used by NVIDIA’s hardware design teams. We are looking for leaders to help us grow and evolve a reliable computing environment to enable our hardware designers to build the next generation of GPUs and SoCs.

What You'll be Doing:

A large part of the day-to-day job is collaborating with partners to develop and drive programs around storage, networking, and compute in our growing fleet of data centers.

Lead, cultivate, and mentor a multi-national team of sysadmins and DevOps engineers in support of the chip design teams.

Ensure the highest reliability of HPC clusters. Develop critical metrics and program schedules to measure program health, predictability, and achievements.

Identify failures, lead retrospective analysis, and help to develop improvement action plans. Build standard methodologies that cut through complexity, can be used across NVIDIA, and influence other partners for continuous improvement.

Evaluate the latest technologies (hardware and cloud computing) and recommend future evolution of the infrastructure. Plan deployments and refreshes of hardware (compute, storage, network equipment) and the associated software stack (e.g., OS).

Work multi-functionally with hardware engineering leaders to support their future chip design needs, understand their workflow characteristics, and engineer an efficient HPC environment. Work with IT and engineering infrastructure teams on the different subsystems that comprise the computing environment.

Lead all aspects of the HPC scheduler (LSF), set/adjust policy, ensure delivery of forecasted compute demand to each hardware division, and drive high utilization.

Track software licensing servers and drive efficient license utilization.

Develop and manage program schedules, milestones and deliverables. Adjust in the face of a highly fluid customer product roadmap.

Regularly communicate program status and key issues to senior management at NVIDIA’s headquarters. Accurately represent the importance of issues and call out issues appropriately. Be the evangelist of data-driven project management.

What We Need to See:

B.S. or M.S. in Computer Science, Computer Engineering, Information Science (or equivalent experience)

15+ years of overall experience

5+ years managing IT infrastructure teams of 10+ people

10+ years of experience running Linux servers, NFS storage, and Ethernet networks

Knowledge of HPC schedulers (IBM LSF preferred)

Knowledge of hardware design workflows (EDA tools and methodology)

Experience using project management and capacity planning software

Datacenter operations (rack and stack, maintenance)

Ways to stand out from the crowd:

HPC storage (e.g., NetApp, Pure Storage, Lustre, ZFS, Isilon)

InfiniBand (operations, debugging, performance tuning)

Software development, especially in a DevOps context

Knowledge of relational databases, data lakes, metrics/visualization/analytics platforms

Deploying and maintaining FlexLM-based software license servers

Established relationships with enterprise-level equipment suppliers

The base salary range is 272,000 USD - 419,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

NVIDIA pioneered accelerated computing to tackle challenges no one else can solve. Our work in AI and the metaverse is transforming the world's largest industries and profoundly impacting society.
