Senior Software Engineer — Backend (Distributed Systems)
Redwood City, CA, United States
As a Senior AI/ML Engineer, you'll build systems to power large-scale machine learning and foundation model (e.g. large language model) workloads. You’ll work closely with other engineers, product managers, and field team members to ensure that Snorkel Flow users working with different data modalities (e.g. text, PDF, image) and different use cases can build high quality training datasets, integrate with the latest foundation model technology to build and adapt models, and take advantage of state-of-the-art error analysis and development automation.
Main Responsibilities
Own the architecture, design, development, and operations of large-scale systems designed for AI/ML tasks including distributed compute systems, data management systems, data engineering workflow systems, and end user experiences
Recognize and act on opportunities to integrate the latest foundation model and related technologies to power user workflows
Prototype, optimize, and maintain scalable back-end services that will power new ML and foundation model development workflows
Design extensible and testable interfaces between internal services including the underlying storage and data models
Be an engaged team player in a customer-focused cross-functional environment where you will feel excited to take on whatever is most impactful for the company and product
Work a hybrid schedule with one or two days per week in our Redwood City HQ and work remotely with "No Meeting" Tuesdays and Thursdays
Required Qualifications
4+ years experience in delivering distributed and ML systems and services in a production setting for cloud-native applications
Experience with distributed compute frameworks and deep learning frameworks
Ability to design and build efficient scalable data storage, compute, and retrieval systems for AI/ML tasks
Strong communication and coding skills with emphasis on designing for scale and robustness
Experience owning the delivery of large multi-person projects
Preferred Qualifications
8+ years of professional software engineering experience
Experience with architecting and developing production web-scale systems (monitoring, telemetry, performance, reliability, triage and debug)
Strong development and debugging skills in Python
Experience working with foundation models (e.g. large language models)
Experience developing enterprise software products for machine learning and/or data science applications
The salary range for our Tier 1 locations of San Francisco, Seattle, Los Angeles & New York is $191,000.00 - $225,000.00.
#J-18808-Ljbffr