Betsy Beyer,Chris Jones,Jennifer Petoff,Niall Richard Murphy

Site Reliability Engineering

Notify me when the book’s added
To read this book, upload an EPUB or FB2 file to Bookmate. How do I upload a book?
Building and operating distributed systems is fundamental to large-scale production infrastructure, but doing so in a scalable, reliable, and efficient way requires a lot of good design, and trial and error. In this collection of essays and articles, key members of the Site Reliability Team at Google explain how the company has successfully navigated these deep waters over the past decade. You'll learn how Google continuously monitors and deploys some of the largest software systems in the world, how its Site Reliability Engineering team learns and improves after outages, and how they balance risk-taking vs reliability with error budgets.
This book is currently unavailable
834 printed pages
Have you already read it? How did you like it?
👍👎

Impressions

  • Bobby Marleyshared an impression7 years ago
    👍Worth reading
    🎯Worthwhile

    Must read for developers, operations and architects. Read and use as a reference.

  • Dauren Chapaevshared an impression4 years ago
    👍Worth reading
    💡Learnt A Lot
    🎯Worthwhile
    💤Borrrriiinnng!

Quotes

  • Bobby Marleyhas quoted7 years ago
    Finally, use the project management style that suits the project in its current state.
  • Bobby Marleyhas quoted7 years ago
    A matrix of all possible combinations of disasters with plans to address each of these disasters permits you to sleep soundly for at least one night; keeping your recovery plans current and exercised permits you to sleep the other 364 nights of the year.
  • Bobby Marleyhas quoted7 years ago
    The cost of failure is education.
    Devin Carraway

On the bookshelves

fb2epub
Drag & drop your files (not more than 5 at once)