Create Email Alert

Email Alert for

ⓘ There was an unexpected error processing your request.

Please refresh the page and try again.

If the problem persists, please contact us with your issue.

Email address is already registered

You can always manage your preferences and update your interests to ensure you receive the most relevant opportunities.

Would you like to [visit your alert settings] now?

Success! You're now signed up for Job Alerts

Get ready to discover your next great opportunity.

Similar Jobs

  • Etched

    ML Compiler Backend Engineer

    Cupertino, CA, United States

    • Ending Soon

    5+ years of experience writing production-grade software Strong grasp of computer architecture, data structures, system software, and machine learning fundamentals Able to write production-grade code in C++ and in Python Experience with modern compiler IRs, including at least one of (LLVM, MLIR, Relay) Experience with PyTorch Responsibilities Des

    Job Source: Etched
  • Waymo

    Senior ML Compiler Engineer, Compute

    Mountain View, CA, United States

    • Ending Soon

    Senior ML Compiler Engineer, Compute Mountain View, California, United States New York, New York, United States Waymo is an autonomous driving technology company with a mission to make it safe and easy for people and things to get where they're going. Since our start as the Google Self-Driving Car Project in 2009, Waymo has been focuse

    Job Source: Waymo
  • Samsung Electronics GmbH

    Principal Engineer, AI/ML Software Compiler

    San Jose, CA, United States

    • Ending Soon

    Principal Engineer, AI/ML Software Compiler Job Title Principal Engineer, AI/ML Software Compiler Job Location SSI San Jose Main Office Category Engineering - Software Job Type Full-time Job # 41938 Advancing the World’s Technology Together Our technology solutions power the tools you use every day--including smartphones, electric vehicle

    Job Source: Samsung Electronics GmbH
  • Conductor

    Principal Engineer, AI/ML Software Compiler

    San Jose, CA, United States

    • Ending Soon

    What You’ll Do The AGI (Artificial General Intelligence) Computing Lab is dedicated to solving the complex system-level challenges posed by the growing demands of future AI/ML workloads. Our team is committed to designing and developing scalable platforms that can effectively handle the computational and memory requirements of these workloads whi

    Job Source: Conductor
  • AMD

    Sr. Staff ML Compiler Engineer

    San Jose, CA, United States

    WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinn

    Job Source: AMD
  • Conductor

    Staff Engineer, AI/ML Software Compiler

    San Jose, CA, United States

    What You’ll Do The AGI (Artificial General Intelligence) Computing Lab is dedicated to solving the complex system-level challenges posed by the growing demands of future AI/ML workloads. Our team is committed to designing and developing scalable platforms that can effectively handle the computational and memory requirements of these workloads whil

    Job Source: Conductor
  • Advanced Micro Devices , Inc.

    Sr. Staff ML Compiler Engineer

    San Jose, CA, United States

    WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpin

    Job Source: Advanced Micro Devices , Inc.
  • Conductor

    Senior Engineer, AI/ML Software Compiler

    San Jose, CA, United States

    What You’ll Do The AGI (Artificial General Intelligence) Computing Lab is dedicated to solving the complex system-level challenges posed by the growing demands of future AI/ML workloads. Our team is committed to designing and developing scalable platforms that can effectively handle the computational and memory requirements of these workloads whil

    Job Source: Conductor

ML Compiler Frontend Engineer

Cupertino, CA, United States

ML Compiler Frontend Engineer

Etched is building the hardware for superintelligence.

GPUs and TPUs are flexible AI chips that can run many kinds of models: CNNs, RNNs, LSTMs, and more. But today, almost all AI workloads, from ChatGPT to self-driving cars, are done on one model architecture: transformers. Using flexible AI chips for transformers is very inefficient: Etched is building a single-purpose chip exclusively for transformer inference. We only support transformers, but in exchange our chips have an order of magnitude more throughput and lower latency than an H100. With Etched, you can build products that would be impossible with GPUs, like tree-of-thought agents and ultra-low-latency audio chat bots.

Etched is looking for exceptional ML compiler frontend engineers to join our team and build production-grade integrations with today’s transformer libraries. The ideal candidate has experience working closely with LLMs in products and also understands how efficient inference works under the hood.

Responsibilities:

Design and develop our integrations with current transformer-specific inference libraries, like TensorRT-LLM, TransformerEngine, Hugging Face TGI, and vLLM.

Provide feedback to the firmware, compiler, and hardware teams based on compiler development work

Ensure the software we expose to customers is reliable and production-grade as soon as our servers begin to ship

Requirements:

5+ years of experience writing production-grade software.

Able to write production-grade code in Python

Experience with LLMs to build products

Experience with at least one of TensorRT, TensorRT-LLM, Transformer Engine, or vLLM

Great understanding of how companies working with LLMs build their inference stacks

1+ year of work experience at a cloud provider

Deeply creative and able to think from first principles

Desired qualifications:

Experience with hardware design and development

Proficiency with GPU programming

Experience working in hardware simulation/emulation environments.

Benefits:

Competitive salary and equity package

Full medical, dental, and vision packages, with 100% of premium covered

Work with world-class people and state-of-the-art AIs everyday

Etched is committed to fair and equitable compensation practices. Compensation is determined based on your qualifications and experience. Compensation packages also include generous equity in Etched.

We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or other legally protected statuses.

#J-18808-Ljbffr

Apply

Create Email Alert

Create Email Alert

Email Alert for ML Compiler Frontend Engineer jobs in Cupertino, CA, United States

ⓘ There was an unexpected error processing your request.

Please refresh the page and try again.

If the problem persists, please contact us with your issue.

Email address is already registered

You can always manage your preferences and update your interests to ensure you receive the most relevant opportunities.

Would you like to [visit your alert settings] now?

Success! You're now signed up for Job Alerts

Get ready to discover your next great opportunity.