As engineers we expect our systems and applications to be reliable. And we often test to ensure that at a small scale or in development. But when you scale up, the assumption that conditions will remain stable is wrong. Reliability at scale does not mean eliminating failure, failure is inevitable. It matters when it impacts our users and it matters how we handle it.
Ana will talk about the practice of Chaos Engineering and how we can proactively embrace failure as we scale our systems.
Ana Margarita is currently working as a Senior Chaos Engineer at Gremlin, helping companies avoid outages by running proactive chaos engineering experiments. Before Gremlin, she has worked at various-sized companies including Google, Uber, SFEFCU, and Miami-based startup. Ana is an internationally recognized speaker and has spoken at: AWS re:Invent, KubeCon, DockerCon, DevOpDays, AllDayDevOps, Write/Speak/Code, and many others. Catch her tweeting at @Ana_M_Medina about traveling, diversity in tech, and mental health.
Have questions for Ana?
Please submit them in this thread. Ana would love to answer them!
Haven’t signed up for the free conference yet?
Grab your free tickets here