Create Email Alert

Email Alert for

ⓘ There was an unexpected error processing your request.

Please refresh the page and try again.

If the problem persists, please contact us with your issue.

Email address is already registered

You can always manage your preferences and update your interests to ensure you receive the most relevant opportunities.

Would you like to [visit your alert settings] now?

Success! You're now signed up for Job Alerts

Get ready to discover your next great opportunity.

Similar Jobs

  • XPeng Motors

    Staff AI Infrastructure Engineer

    Santa Clara, CA, United States

    XPeng Motors is one of China's leading smart electric vehicle (EV) companies. We design, develop, and manufacture smart EVs that are seamlessly integrated with advanced Internet, AI and autonomous driving technologies. We are committed to in-house R&D and intelligent manufacturing to create a better mobility experience for our customers. We strive

    Job Source: XPeng Motors
  • Plume Design Inc

    Staff Infrastructure Engineer

    Palo Alto, CA, United States

    • Ending Soon

    Life at Plume At Plume, we believe that technology isn't about moving faster, it's about making life’s moments better. Which is why we’ve built the world's first, and only, open and hardware-independent service delivery platform for smart homes, small businesses, enterprises, and beyond. Our SaaS platform uses WiFi, advanced AI, and machine learni

    Job Source: Plume Design Inc
  • Coupang

    Staff Engineer, Security Infrastructure

    Mountain View, CA, United States

    • Ending Soon

    Job Overview: As a Staff Engineer on the Security Infrastructure team, you will build the platform that enables Coupang to win and grow our customers’ confidence while rapidly expanding and scaling our services. The Security Infrastructure team builds core security services and libraries used by all Coupang services to secure themselves and

    Job Source: Coupang
  • Plume Design Inc

    Staff Cloud Infrastructure Engineer

    Palo Alto, CA, United States

    Opportunity We’re looking for infrastructure engineers who have an affinity with Networking, Cloud Governance, and Security, in addition to Organizational skills. You will be participating in next-generation networking and security policy automation, as well as maintaining and continuously improving our existing Production systems. If you have been

    Job Source: Plume Design Inc
  • Dexterity Inc

    Staff Infrastructure Software Engineer

    Redwood City, CA, United States

    • Ending Soon

    Staff Infrastructure Software Engineer About Dexterity At Dexterity, we believe robots can positively transform the world. Our breakthrough technology frees people to do the creative, inspiring, problem-solving jobs that humans do best by enabling robots to handle repetitive and physically difficult work. We’re starting with warehouse automation, w

    Job Source: Dexterity Inc
  • Walmart

    Staff, Systems and Infrastructure Engineer

    Sunnyvale, CA, United States

    • Ending Soon

    About Team: GTP - Collaboration Tools Building the right technology foundation for Infrastructure & platforms is vital to success at the scale of Walmart. Our team builds and maintains the foundational technologies that support the tech organization. Included in this are data platforms, enterprise architecture, DevOps, cloud computing, and infrastr

    Job Source: Walmart
  • Plume Design Inc

    Staff Cloud Infrastructure Engineer

    Palo Alto, CA, United States

    • Ending Soon

    Life at Plume At Plume, we believe that technology isn't about moving faster, it's about making life’s moments better. Which is why we’ve built the world's first, and only, open and hardware-independent service delivery platform for smart homes, small businesses, enterprises, and beyond. Our SaaS platform uses WiFi, advanced AI, and machine learni

    Job Source: Plume Design Inc
  • Guardant Health

    Staff HPC Infrastructure Engineer

    Palo Alto, CA, United States

    • Ending Soon

    Company Description Guardant Health is a leading precision oncology company focused on helping conquer cancer globally through use of its proprietary tests, vast data sets and advanced analytics. The Guardant Health oncology platform leverages capabilities to drive commercial adoption, improve patient clinical outcomes and lower healthcare costs a

    Job Source: Guardant Health

Staff AI Infrastructure Engineer

Santa Clara, CA, United States

XPeng Motors is one of China's leading smart electric vehicle (EV) companies. We design, develop, and manufacture smart EVs that are seamlessly integrated with advanced Internet, AI and autonomous driving technologies. We are committed to in-house R&D and intelligent manufacturing to create a better mobility experience for our customers. We strive to transform smart electric vehicles with technology and data, shaping the mobility experience of the future.

We are looking for a talented AI/ML Infrastructure Engineer to join our team. In this role, you will have the opportunity to improve productivity for our researchers by enhancing the entire stack. Your primary duty will be to identify and resolve infrastructure gaps to provide reliable, efficient, and scalable solutions.

Job Responsibilities:

Identify and resolve infrastructure gaps to ensure reliable, efficient, and scalable solutions

Develop advanced AI/ML infrastructure solutions that enhance the efficiency of our skilled ML teams

Design and implement solutions for critical areas, including distributed storage systems, scheduling systems, high availability capabilities, and core reliability issues within our large-scale GPU clusters

Monitor and optimize the performance of our AI/ML infrastructure, ensuring high availability, scalability, and efficient resource utilization

Develop and deploy automation tools, monitoring solutions, and operational strategies to streamline infrastructure management and reduce manual tasks

Work with various teams, including ML developers, data engineers, and DevOps professionals, to create a cohesive and integrated AI/ML infrastructure ecosystem

Minimum Skill Requirements:

Bachelor's degree in Computer Science, Engineering, or related technical field

5-8+ years of experience in software engineering, with a strong background in developing and managing large-scale distributed systems, ideally within the AI/ML infrastructure domain

Proficiency in programming languages such as Python, Go, or C++, with knowledge of cloud computing platforms like AWS, Azure, etc.

Strong communication and collaboration abilities, effective in working with diverse teams and individuals

Preferred Skill Requirements: In-depth understanding of AI/ML workflows, including model training, data processing, and inference pipelines

Practical experience with containerization technologies (i.e., Docker, Kubernetes), automation tools (i.e., Ansible, Terraform), and monitoring solutions (i.e., Prometheus, Grafana)

Exceptional problem-solving skills, capable of analyzing complex systems, identifying bottlenecks, and implementing scalable solutions

A passion for continuous learning and staying abreast of new technologies and best practices in the AI/ML infrastructure space

What do we provide:

A fun, supportive and engaging environment

Opportunity to make significant impact on the transportation revolution by the means of advancing autonomous driving

Opportunity to work on cutting edge technologies with the top talent in the field

Competitive compensation package

Snacks, lunches and fun activities

The base salary range for this full-time position is $180,000-$300,000, in addition to bonus, equity and benefits. Our salary ranges are determined by role, level, and location. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position across all US locations. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training.

We are an Equal Opportunity Employer. It is our policy to provide equal employment opportunities to all qualified persons without regard to race, age, color, sex, sexual orientation, religion, national origin, disability, veteran status or marital status or any other prescribed category set forth in federal or state regulations.

Apply

Create Email Alert

Create Email Alert

Email Alert for Staff AI Infrastructure Engineer jobs in Santa Clara, CA, United States

ⓘ There was an unexpected error processing your request.

Please refresh the page and try again.

If the problem persists, please contact us with your issue.

Email address is already registered

You can always manage your preferences and update your interests to ensure you receive the most relevant opportunities.

Would you like to [visit your alert settings] now?

Success! You're now signed up for Job Alerts

Get ready to discover your next great opportunity.