Software Engineer (L4) - Resilience Engineering
Netflix
At Netflix, our mission is to entertain the world. Together, we are writing the next episode - pushing the boundaries of storytelling, global fandom and making the unimaginable a reality. We are a dream team obsessed with the uncomfortable excitement of discovering what happens when you merge creativity, intuition and cutting-edge technology. Come be a part of what’s next.
Who we are
Resilience Engineering’s purpose is to help other teams at Netflix understand the outcome of a change (e.g. a code change), before they are deployed to production.
Our tools help teams build confidence that their services will perform as expected when changes are introduced, while also providing early insights into any unintended consequences, such as increased error rates or latency, that could affect the Netflix experience for our users.
A majority of critical services leverage our platform and tooling to confidently and safely deliver changes to production on a day to day basis.
You can learn more about what we do from the Evolution of Chaos presentation.
Where we work
We are a distributed team, with members working both remotely and near Netflix offices. We collaborate asynchronously and meet in person quarterly.
What you could work on
We have two major focuses that you could work on in 2026:
Expand our canary approach to new use cases, including libraries and functions.
Increase confidence that Netflix’s most critical applications are resilient under different expected and unexpected conditions.
Some of the larger initiatives we will be focusing on are:
Extending canaries to effectively support use cases where a single change needs to be validated across a large number of targets (e.g. library changes).
Expanding canary support to new resource types, like functions.
Scaling out chaos testing to recurrently validate fallback behaviour on behalf of application owners.
About you
You are curious and enjoy solving ambiguous, open-ended problems.
You collaborate effectively across teams, using communication skills to influence product direction.
You are product-focused, driving the ideation, design, development, and implementation to turn user’s pain-points into simple and elegant solutions.
You have experience to distributed systems including how to debug them and how they can fail.
Netflix provides comprehensive benefits including Health Plans, Mental Health support, a 401(k) Retirement Plan with employer match, Stock Option Program, Disability Programs, Health Savings and Flexible Spending Accounts, Family-forming benefits, and Life and Serious Injury Benefits. We also offer paid leave of absence programs. Full-time hourly employees accrue 35 days annually for paid time off to be used for vacation, holidays, and sick paid time off. Full-time salaried employees are immediately entitled to flexible time off. See more details about our Benefits here.
Netflix is a unique culture and environment. Learn more here.
Inclusion is a Netflix value and we strive to host a meaningful interview experience for all candidates. If you want an accommodation/adjustment for a disability or any other reason during the hiring process, please send a request to your recruiting partner.
We are an equal-opportunity employer and celebrate diversity, recognizing that diversity builds stronger teams. We approach diversity and inclusion seriously and thoughtfully. We do not discriminate on the basis of race, religion, color, ancestry, national origin, caste, sex, sexual orientation, gender, gender identity or expression, age, disability, medical condition, pregnancy, genetic makeup, marital status, or military service.
Job is open for no less than 7 days and will be removed when the position is filled.