Infrastructure Engineer, Observability
San Francisco, CA, United States
Nearly every company in the world runs on custom software: Gartner estimates that up to 50% of all code is written for internal use. This is the operational software for refunding orders, underwriting loans, onboarding employees, analyzing transactions, and providing customer support. But most companies don’t have adequate resources to properly invest in these tools, leading to a lot of old and clunky internal software or, even worse, users still stuck in manual and spreadsheet flows.
At Retool, we’re on a mission to bring good software to everyone. We’re building a new type of development platform that combines the benefits of traditional software development with a drag-and-drop UI editor and AI, making it dramatically faster to build internal tools. We believe that the future of software development lies in abstracting away the tedious and repetitive tasks developers waste time on, while creating reusable components that act as a force multiplier for future developers and projects. The result is not just productivity, but good software by default. And that’s a mission worth striving for.
Today, our customers span from small startups building their first operational tools to Fortune 500 companies building mission-critical apps for thousands of users across their business. Interested in joining us? Let us know!
WHY WE'RE LOOKING FOR YOU:
Retool started as a way to address obstacles with internal tools and has grown into a company that solves internal tooling for thousands of companies, from one-person startups to S&P 500 enterprises. We’ve done a lot with a little: we have a rapidly growing engineering team and a laundry list of features and foundational infrastructure pieces we want to tackle.
Retool is in an exciting hyper-growth phase, and we need infrastructure engineers to tackle the challenges of rapid scaling. These challenges are unique in both scope and technical complexity as we grow the company and the product together.
WHAT YOU'LL DO:
In this role, you will be a founding member of our Observability team! You will build, integrate, and evangelize observability platforms and solutions for our products and internal systems. You will drive adoption of these solutions and ensure they deliver value for the company. In delivering these solutions, you will leverage automation and orchestration tooling, along with infrastructure-as-code patterns.
Your core responsibility in this role is to build and deploy observability solutions that make our products highly available, scalable, reliable, and observable, and that delight our customers.
IN THIS ROLE, YOU'LL:
Help build a great product that improves productivity of engineers across the globe by several orders of magnitude
Design and build observability solutions via collection frameworks, delivery, analysis, and visualization of metrics, logs, and traces
Work with engineers, designers, product managers and customer support to instrument and implement observability into our products and internal apps
Build orchestration and automation tooling around off-the-shelf solutions (e.g. Datadog), as well as custom solutions that meet our unique needs
Be involved in the development of scalable, distributed software systems that support a globally distributed customer base
Coach and mentor other SREs and SWEs; provide leadership in iteratively defining and refining development processes as the team grows
THE SKILLSET YOU'LL BRING:
7+ years of related professional experience, with 2+ years in a lead role for a mission-critical platform with high-availability requirements
Experience with containerization (e.g. Docker, Kubernetes), infrastructure as code (e.g. Terraform) and observability (e.g. Datadog, Stackdriver, Wavefront, Grafana) stacks
A strong understanding of system availability, resiliency, and recoverability
Comfortable being a hands-on individual contributor, while at the same time hiring and scaling the team
Strong organizational skills, high attention to detail, and the ability to work independently with minimal supervision
Ability to thrive in a high-energy, high-growth, fast-paced, entrepreneurial environment, and willingness to learn new skills and implement new technologies
BONUS POINTS:
Familiarity with GitHub, CI/CD, DevOps
Familiarity with React and React Native for frontend web and mobile application development
Experience with observability platforms and tools such as Datadog, New Relic, and Dynatrace
For candidates based in San Francisco, the annual base salary range is listed below. This salary range may be inclusive of several career levels at Retool and will be narrowed during the interview process based on a number of factors such as (but not limited to) scope and responsibilities, the candidate’s experience and qualifications, and location.
Additional compensation in the form of equity and/or commission or bonuses depends on the position offered. Retool provides a comprehensive benefit plan, including medical, dental, vision, and 401(k). Pay and benefits are subject to change at any time, consistent with the terms of any applicable compensation or benefit plans.
$127,500 — $234,100 USD
Retool offers generous benefits to all employees and a hybrid work location. For more information, please visit the benefits and perks section of our careers page!
Retool is currently set up to employ all roles in the US and specific roles in the UK. To find roles that can be employed in the UK, please refer to our careers page and review the indicated locations.