Create Email Alert

Email Alert for

ⓘ There was an unexpected error processing your request.

Please refresh the page and try again.

If the problem persists, please contact us with your issue.

Email address is already registered

You can always manage your preferences and update your interests to ensure you receive the most relevant opportunities.

Would you like to [visit your alert settings] now?

Success! You're now signed up for Job Alerts

Get ready to discover your next great opportunity.

Similar Jobs

  • ByteDance

    Software Engineer, Inference Frameworks

    San Jose, CA, United States

    • Ending Soon

    Responsibilities Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Helo, and Resso, as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and cr

    Job Source: ByteDance
  • TikTok

    Software Engineer, Inference

    San Jose, CA, United States

    • Ending Soon

    Responsibilities TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul and Tokyo. Why Join Us Creation is the core of TikTok's purpose. Our platform is built to help imagin

    Job Source: TikTok
  • Advanced Micro Devices , Inc.

    Staff Software Development Engineer - AI. C++, Inference

    San Jose, CA, United States

    • Ending Soon

    WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpin

    Job Source: Advanced Micro Devices , Inc.
  • Black Sesame Technologies

    Sr. Engineer, AI Framework Software

    San Jose, CA, United States

    Design the software systems, including layout and flow charts, according to common NN frameworks towards BST AI SOCs (computer chips); Define scalable neural network development tools/applications to enable effective mapping and optimization of network models; Develop automated model conversion flow features that are user friendly and generic, suit

    Job Source: Black Sesame Technologies
  • Apple, Inc.

    HTTP Frameworks Software Engineer

    Cupertino, CA, United States

    • Ending Soon

    Summary Posted: May 2, 2024 Role Number: 200550085 Seeking a strong C, C++, Swift, or Obj-C developer for Apple's HTTP protocol implementation (HTTP/1.1, HTTP/2, HTTP/3). The group is responsible for the client-side HTTP code that powers Safari and WebKit, iCloud, App Store, Music, and the vast majority of other Apple and 3rd party apps and serv

    Job Source: Apple, Inc.
  • Nuro

    Software Engineer, Application Framework

    Mountain View, CA, United States

    • Ending Soon

    Who We Are Nuro exists to better everyday life through robotics. The company’s custom electric autonomous vehicles are designed to bring the things you need—from produce to prescriptions—right to your home. Nuro’s autonomous, goods-focused solution can give you valuable time back and more freedom to do what you love. This convenient, eco-friendly

    Job Source: Nuro
  • Kuraray America, Inc.

    Senior Software Engineer, TensorRT Inference

    Santa Clara, CA, United States

    Do you want to be at the forefront of deep learning technology development that powers everything from Stable Diffusion to Chat-GPT? We are looking for incredible Windows Software Engineers on the TensorRT team to help us build industry-leading deep learning software for NVIDIA GPUs. As a part of our TensorRT team, you will work with the most excit

    Job Source: Kuraray America, Inc.
  • Waymo

    Staff Software Engineer, Inference, ML Platform

    Mountain View, CA, United States

    Staff Software Engineer, Inference, ML Platform Mountain View, California, United States Waymo is an autonomous driving technology company with a mission to make it safe and easy for people and things to get where they're going. Since our start as the Google Self-Driving Car Project in 2009, Waymo has been focused on building the Waymo Driv

    Job Source: Waymo

Software Engineer, Inference Frameworks

San Jose, CA, United States

Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Helo, and Resso, as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.

Why Join Us

Creation is the core of ByteDance's purpose. Our products are built to help imaginations thrive. This is doubly true of the teams that make our innovations possible.

Together, we inspire creativity and enrich life - a mission we aim towards achieving every day.

To us, every challenge, no matter how ambiguous, is an opportunity; to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always.

At ByteDance, we create together and grow together. That's how we drive impact - for ourselves, our company, and the users we serve.

Join us.

Our team was established to help realize our company vision, building a global platform for creation and communication. We are doing world-class work in machine learning, computer vision, natural language processing, speech and audio, and knowledge, and transferring our work into products, which hundreds of millions of users worldwide use. As a vital AI infrastructure for the company, our machine learning system integrates our most up-to-date R&D results in AI algorithms and systems. Come and join us, you will get the chance of building large-scale machine learning systems and working with the best AI system and algorithm researchers and engineers.

What You'll Be Doing

1. Responsible for developing and optimizing LLM inference framework.

2. Responsible for GPU and CUDA Performance optimization to create an industry-leading high-performance LLM inference engine.

1. Bachelor's degree or above, major in computer/electronics/automation/software, etc., with experience in ML engineering optimization preferred

2. Proficient in C/C++, proficient in algorithms and data structures, familiar with Python

3. Proficient in GPU high-performance computing optimization technology on CUDA, in-depth understanding of computer architecture, familiar with parallel computing optimization, memory access optimization, low-bit computing, etc.

4. Understand the basic principles of deep learning algorithms, be familiar with the basic architecture of neural networks and understand deep learning training frameworks such as Pytorch and TensorFlow

5. Familiar with TensorRT-LLM, ORCA, VLLM, etc.

6. Knowledge of LLM models, experience in accelerating LLM model optimization is preferred

ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.

ByteDance Inc. is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at [email protected].

#J-18808-Ljbffr

Apply

Create Email Alert

Create Email Alert

Email Alert for Software Engineer, Inference Frameworks jobs in San Jose, CA, United States

ⓘ There was an unexpected error processing your request.

Please refresh the page and try again.

If the problem persists, please contact us with your issue.

Email address is already registered

You can always manage your preferences and update your interests to ensure you receive the most relevant opportunities.

Would you like to [visit your alert settings] now?

Success! You're now signed up for Job Alerts

Get ready to discover your next great opportunity.