Create Email Alert

Email Alert for

ⓘ There was an unexpected error processing your request.

Please refresh the page and try again.

If the problem persists, please contact us with your issue.

Email address is already registered

You can always manage your preferences and update your interests to ensure you receive the most relevant opportunities.

Success! You're now signed up for Job Alerts

Get ready to discover your next great opportunity.

Similar Jobs

HeyGen

Research Engineer, Tech Lead

San Francisco, CA, United States
- Ending Soon
The mission at HeyGen is to make visual storytelling accessible to all. HeyGen is the fastest-growing generative AI platform for anyone who wants to create videos for their business. Come join our team in revolutionizing the way videos are created. Learn more at www.heygen.com Research Engineer, Tech Lead at HeyGen Elevating AI Frontiers in a Hig
Job Source: HeyGen
HeyGen

Research Engineer, Tech Lead

San Francisco, CA, United States
Research Engineer, Tech Lead at HeyGen Elevating AI Frontiers in a High-Growth Startup Environment HeyGen is a dynamic and rapidly expanding startup at the intersection of artificial intelligence and practical technology solutions. We are on the hunt for an adaptive and visionary Research Scientist with a Tech Lead focus to catalyze our growth tr
Job Source: HeyGen
HeyGen, Inc.

Research Scientist, Tech Lead

San Francisco, CA, United States
Research Scientist, Tech Lead at HeyGen Elevating AI Frontiers in a High-Growth Startup Environment HeyGen is a dynamic and rapidly expanding startup at the intersection of artificial intelligence and practical technology solutions. We are on the hunt for an adaptive and visionary Research Scientist with a Tech Lead focus to catalyze our growth
Job Source: HeyGen, Inc.
Perplexity AI

AI Inference Engineer

San Francisco, CA, United States
- Ending Soon
We are looking for an AI Inference to join our growing team. Our current stack is Python, C++, TensorRT-LLM, Kubernetes. You will have the opportunity to work on large-scale deployment of machine learning models for real-time inference. Responsibilities Develop APIs for AI inference that will be used by both internal and external customers Bench
Job Source: Perplexity AI
Perplexity AI

AI Inference Engineer

San Francisco, CA, United States
- Ending Soon
We are looking for an AI Inference to join our growing team. Our current stack is Python, C++, TensorRT-LLM, Kubernetes. You will have the opportunity to work on large-scale deployment of machine learning models for real-time inference. Responsibilities Develop APIs for AI inference that will be used by both internal and external customers Benchma
Job Source: Perplexity AI
Waymo

Tech Lead Manager, Planner Research

San Francisco, CA, United States
- Ending Soon
Tech Lead Manager, Planner Research Mountain View, California, United States San Francisco, California, United States Waymo is an autonomous driving technology company with the mission to be the most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver-The World's
Job Source: Waymo
Anthropic Limited

Software Engineer, Inference

San Francisco, CA, United States
About the role: Our Inference team builds the service that generates outputs from our models in production. This service is the key driver of our efficiency, latency and reliability. As an engineer on this team, you’ll work on improving those metrics by solving complex distributed-systems problems across all layers of our stack. You may be a goo
Job Source: Anthropic Limited
Anthropic

Software Engineer, Inference

San Francisco, CA, United States
About Anthropic Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the
Job Source: Anthropic

Research Inference, Tech Lead

San Francisco, CA, United States

The Platform ML team builds the ML side of our state-of-the-art internal training framework used to train our cutting-edge models. We work on distributed model execution as well as the interfaces and implementation for model code, training, and inference.

Our priorities are to maximize training throughput (how quickly we can train a new model) and researcher throughput (how quickly we can develop new models) with the goal of accelerating progress towards AGI. We frequently collaborate with other teams to speed up the development of new capabilities.

About the Role

We are looking for an experienced Technical Lead to lead critical work on our shared internal inference stack and grow the team. Our inference stack is primarily built by the Applied AI engineering team and we will improve and extend it for research use cases.

In this role, you will:

Get SOTA throughput for our most important research models.

Reduce the time it takes to get efficient inference for new model architectures.

Collaborate closely with Applied AI engineering to maximize the benefits of our shared internal inference stack.

Create a diverse, equitable, and inclusive culture that makes all feel welcome while enabling radical candor and the challenging of group think.

You might thrive in this role if you:

Have experience with ML systems, particularly high scale distributed training or inference for modern LLMs.

Have familiarity with the latest AI research and working knowledge of how these systems are efficiently implemented.

Have lead large scale engineering projects end-to-end.

Are an expert in core HPC technologies: InfiniBand, MPI, CUDA, OpenAI Triton.

Deep understanding of GPU/hardware accelerators, networking performance and how to maximize multi-device inference throughput (including overlap of compute and communication).

Have a humble attitude, an eagerness to help your colleagues, and a desire to do whatever it takes to help the team succeed.

About OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.

We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or any other legally protected status.

For US Based Candidates: Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via thislink .

At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.

#J-18808-Ljbffr

Name	Expiration	Description
ATTBCookie*	2 years	These cookies are used to remember a user’s choice about cookies on thebigjobsite.com. Where users have previously indicated a preference, that user’s preference will be stored in these cookies.
last-search search redirect-stage original-keyword	1 day Session 1 hour 1 hour	These cookies are used by thebigjobsite.com to pass search data between our own pages.
datadome	1 year	DataDome is a cybersecurity solution to detect bot activity
jjap	1 days	Used to track if you have seen the Job Alerts prompt. Job Alerts is a service you can subscribe to to receive information about new jobs.

What job

...and where?

Similar Jobs

Research Engineer, Tech Lead

Research Engineer, Tech Lead

Research Scientist, Tech Lead

AI Inference Engineer

AI Inference Engineer

Tech Lead Manager, Planner Research

Software Engineer, Inference

Software Engineer, Inference

Research Inference, Tech Lead

Share this job

Create Email Alert