System design issue

I was asked the following question during an interview with a cloud computing company, and I did not answer it well. Any suggestions on how to approach it would be greatly appreciated.

Our company has hundreds of millions of users, and we expect zero downtime in production. Explain the programming practices and methods that help improve redundancy and fault tolerance for front-end, middle-tier, and back-end services, including database services.

+3
2 answers

This is a fairly broad question. If they expect zero downtime, tell them to forget about it, or to put all of their profits into building redundancy. Now, if they just want "five nines" (99.999% uptime), then we can talk. :)
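To put the "five nines" figure in perspective, here is a quick back-of-the-envelope calculation of the yearly downtime each availability target allows. This is only a minimal sketch: the function name `allowed_downtime_minutes` and the 365-day year are assumptions made for illustration.

```python
MINUTES_PER_YEAR = 365 * 24 * 60  # assumes a non-leap year


def allowed_downtime_minutes(availability_percent: float) -> float:
    """Return the yearly downtime budget, in minutes, for an uptime target."""
    return MINUTES_PER_YEAR * (1 - availability_percent / 100)


if __name__ == "__main__":
    for target in (99.0, 99.9, 99.99, 99.999):
        minutes = allowed_downtime_minutes(target)
        print(f"{target}% uptime -> {minutes:.2f} minutes/year")
```

At 99.999% the budget is roughly 5.26 minutes per year, which is tight but finite, unlike literal zero downtime.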

Generally, you can answer these questions with the usual canned bit about creating a stable, automated build environment that includes extensive unit testing. Using design patterns such as MVC can help with testing. Perform regular security audits. This is much more than a development issue: it is also a question of network and server architecture, maintaining secondary and tertiary data centers, and so on. These questions really give you a chance to impress the interviewer.
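As one small illustration of redundancy at the code level, a client can try several replicas of a service in turn instead of depending on a single one. The sketch below is not taken from the answer: the `call_with_failover` helper and the `primary` and `secondary` callables are hypothetical stand-ins for requests to redundant data centers.

```python
from typing import Callable, Sequence, TypeVar

T = TypeVar("T")


def call_with_failover(replicas: Sequence[Callable[[], T]]) -> T:
    """Try each replica in order and return the first successful result.

    `replicas` is a hypothetical list of zero-argument callables, each one
    standing in for a request to a redundant copy of the same service.
    """
    errors = []
    for replica in replicas:
        try:
            return replica()
        except Exception as exc:  # a real client would catch narrower errors
            errors.append(exc)
    raise RuntimeError(f"all {len(replicas)} replicas failed: {errors}")


def primary() -> str:
    # Simulated outage of the first data center.
    raise ConnectionError("primary data center unreachable")


def secondary() -> str:
    return "response from secondary"


if __name__ == "__main__":
    print(call_with_failover([primary, secondary]))
```

Because the caller only sees an error when every replica fails, a single data center outage stays invisible to users. A production version would likely add timeouts and health checks as well.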

0

[...] Joel [...]

  • [...]

[...] 100% [...]

  • [...]
  • O.S.
  • Dumb System Admin DBA
  • Dumb Janitor

([...] SQL Server [...] Windows)

  • [...]
  • NICs
  • [...]

[...]

+2

Source: https://habr.com/ru/post/1752139/

