System Design

note: over engineering always sucks, either it will kill your app, your job or you. in short it's a bad thing and try to avoid it. maybe you asked how, well it's try to explain but sometime i stuck with it too. in simple word if I can say don't make things over complicated in other words don't use a gun to kill a mouse

Question & Answer

Q1- what is Design Pattern ?
A1- Design Pattern prefer to how you organize the code. it will focus on how organize the code

Q2- what is MVC ?
A2- MVC (Module View Controller) - As per name MVC this system will have a module (to communicate with the database), a controller (to handle user request), and a view to render (to render a response)

here are few key point in a MVC
- it will be single repo
- deploy on a single server
MVC is a design pattern.

Q3- what is Software Architecture ?
A3- Software Architecture is the structure of the code. software architecture follow how different system connect/communicate

Q4- what is 3-tier Architecture ?
A4- in 3-tier Architecture we will break all there component from MVC and separate them into different server. so your model (database) will be host in different server, as well as your view (frontend) and your controller (backend).
example: let's say you are using mongoDB as database so your model is on mongodb's server (aws/azure/ or what cluster you select), and you host your frontend which is on reactJS is host on vercel and for database you maybe use any cloud providing service - here you can see our system is divided on code level as well on hosting level as well.

3-tier is an Architecture.
the beauty of 3-tier is if you want to boots your frontend or backend you can host more server to the specific end, and yes same for database

Q5- what is System Design ?
A5- System Design is the process of solving a specific problem.

Q6- what is a Monolith ?
A6- if a single serve is doing everything and project is wrap in one source control then it's a monolith application
example: you write a app in js and deploy it in a server and maybe it have a database as well and the database is also in the same server and this single server is managing your app completely then it's a monolith application

Q7- what is Micro Service ?
A7- Micro Service is when you break your system into smaller chunks (at architecture level).
example: having different application (different codebase/server/repo) for user module, payment module and other stuff

the beautify of micro service is as we see in the 3-tier we can add frontend server or backend but the problem is - let's say your application is a e-commerce store and a lot of people spend more time on searching as compare to payment page so maybe you can host 2 server for searching and 1 for payment service because searching service get more traffic while in 3-tier you have to upgrade the server for complete application

Q8- what is distributed computing ?
A8- in short, if any system run in different computer, for example a server running in different ec2 instances of AWS.

Q9- What is Availability ?
A9- let's say you have a system and it's run one a single machine, and it this system break (either code error or something wrong with system) your system will no longer available for the people on internet. so this system have 0 availability, let's say you host your system into 2 server so even if one server is down you can use second server for the people who want to access your system. and this is what we called availability

Q10- What is High Availability ?
A10- As we discuss in last question that having a backup is availability but we all know that maybe backup system will go down as well, so we measure the availability and decide if the server is high available or not
we measure availability in 9, so 1-9 would be 90% which mean in one year this service could be down in 36.5 days, second we have 2-9 which is 99% which mean service will be down 3.65 days in one year (all this are worst case scenario) and so on as 3-9 to 4-9
the bench mark is 4-9 (99.99%) which mean less then one hour per year

Q11- What is Reliability ?
A11- in simple words, does the app do what it's suppose to do.

Q12- Availability vs Reliability ?
A12- Availability is if the service is available to access and Reliability is if service functional as it should be functional.

Q13- What is Consistency ?
A13- Consistency is something that everyone see same thing, (this only something if you work with distributive system).

in monolith Architecture you have one db and everyone seeing data from that database so it will have 100% Consistency
in micro service all your services have their own database so their is a chance data might be inconsistency

Q14- What is Strong Consistency ?
A14- If you change some data in one server and all server is showing same data as we update it's strong consistency.

How to apply it
- it's simple in your first server where you update the data, data will stay in pending state until it get conformation that all the server have same data then it will update the data in the first server.
- prefer for banking, medical stuff, military
+ point: everything is consistence
- point: take a little time

Q15- What is Eventual Consistency ?
A15- If you change some data in one server and other server get this data in some time it's eventual consistency.

How to apply it
- it's simple first you update the first server which get the data request and after that or maybe after x amount of time or every hour you update data in other servers as well.
- prefer for social media, basic crud app
+ point: current server update quickly
- point: other server take time to update

Q16- What is Scalability ?
A16- Scalability is if your app can handle workload and consistency from a spiky traffic. in simple word if you have more resource then your spiky traffic then yes your app is scalable. but there is some other way to manage scalability if your server go beyond the available resource. there are two ways to manage it

Vertical Scaling:
- In vertical scaling you will increase the resources of the existing server. but still stay 1 server (also know as scaling up)
Horizontal Scaling
- In horizontal Scaling you will add more number of server. you go from one to many (also know as scaling down)

Q17- What is Elasticity ?
A17- In Elasticity we will only increase the workload when their is a increase if work. so if you get more user there will be more server and if number of user decline server will be delete as well. (available in cloud)

Q18- What is Single Point Of Failure - (SPOF) ?
A18- Single Point Of Failure - (SPOF) is if your service failed because you're server down and you don't have any backup.

Q19- What is Redundancy ?
A19- Redundancy is having a extra layer of backup if something go down. One important thing to remember redundancy is more cost because now you need to manage multiple servers

Q20- What is Fault Tolerance ?
A20- We have Fault Tolerance by removing Single Point Of Failure - and how by adding redundancy - so basically adding redundancy and removing single point of failure is Fault Tolerance

Q21- What is Database Replication ?
A21- Database Replication is adding redundancy in the database - in other words creating a replica of existing as backup

Q22- What is Database Cluster ?
A22- Database Cluster is having multiple database replica -

Q23- What is Synchronous and Asynchronous Replication ?
A23- Synchronous Replication is Strong Consistency and Asynchronous Replication is Eventual Consistency.

Q24- What is Latency ?
A24- Time while getting the response. it can be on database consistency, either loading a landing page or something other.

Q25- What is ACID - (Atomicity, Consistency, isolation, durability) ?
A25- Well it can be a little tricky to define:

Atomicity: either thing you do in your system will be success or failed
Consistency: means you can't create change which will break existing rules - common example deleting something which have foreign key reference
Isolation: transition should be isolated
Durability: permanent - once the data save it stay even if server down

Q26- What is Database Partitioning?
A26- Database Partitioning is dividing a table into 2, there are 2 ways to do this:

Vertical Partitioning:
- divide a table's by columns
Horizontal Partitioning:
- dividing a table's data by rows

Q27- What is Auto Scaling ?
A27- Auto Scaling is work with scaling down (horizontal scaling) - in auto scaling we config the system so it will add or delete new server automatically (as per traffic) - also refer as elasticity.

Q28- What is Props and Cons of using Single Server ?
A28- few props and cons listed below:

Props:
- it simple to build
- it simple to scale (just add more hardware - or Scale Up)
- only recommended for small app - practice or learning purpose
Cons:
- you can not scale separately
- no separation of concerns (focus on code)
  - hard to manage - specially in large team
- it's a Single Point Of Failure (SPOF)
- easy target for DDOS

Q29- What is Props and Cons of using 3-tier Architecture ?
A29- few props and cons listed below:

Props:
- super common to use
- biggest win now you can scale up or down all three services independently
- fronted:
  - you can use serverless Services
Cons:
- more code (separate frontend and backend and yes database)
- add complexity
- handle networking
- still limited to vertical scaling

Q30- What is Serverless ?
A30- Serverless is when cloud provider manage all the under the layer things of the server and you only pay for the usage and don't worry about scalability.

Q31- What is Props and Cons of using micro service Architecture ?
A31- few props and cons listed below:

Props:
- remove SPOF
- easy to horizontal vertical
- achieved high availability
- support auto scaling - elasticity
Cons:
- add load balancing
- no caching
- expensive to manage
- add more complexity

Q32- what is load balancer ?
A32- load balancer help me manage vertical scaling - and manage it as per traffic

Q33- What is cache ?
A33- let's understand it as you visit the website and to see the website frontend make a request to backend and backend make request to database and then database return to backend and then backend return to frontend and then you see the data - so cache come in place to minimized all this request and save value into one of 3 places - you can save data on frontend to eliminate backend request or save data on backend to eliminate database request, so what will happened now:

frontend caching:
- you visit the website first time and it take required time to load and then save everything into client browser cache and next time you visit you will see website loading instance (because of saving in cache)
- use for save harmless values to save on frontend
  - example:
    - user theme
    - images
    - user info
backend caching:
- now say you visit same website and all the images is same on frontend but your it fetch horoscope from backend which again come from database. but we now that horoscope value will change after one day so if you visit same website in same day backend will not fetch horoscope information from database and just return save value from backend server
  - example:
    - LOV = (list of values)
    - anything which will take some time to update

Q34- What is CDN - (Content delivery Network) ?
A34- CDN is same as Cache for frontend. let's say you have server in london and people from india visit. indian people request will travel from india to all the way london to fetch the website. and it will take time. so what we can do with this. let's say you can have access to multiple server in the world so you deploy your website in all the server and now user will get response from nearest server. a cdn will do the exactly this and deploy your website to multiple server and you get a faster website

CDN is a service so you probably use it rather then creating your own
Edge Worker - look for distributed backend

Q35- What is Props and Cons of using a CDN ?
A35- few props and cons listed below:

Props:
- make website fast (increase speed and reduce latency)
- easy to implement
- easy to make app global
- mostly CDN have security to
- good for SEO
- good for images, videos and downloads
Cons:
- add complexity
- only work with client-side application (so if you have a website with server side pages this is not for you)
- costly
Use Case:
- use for script (html, css, js)
- use for photos
- use for videos

Q36- What is Props and Cons of Cache ?
A36- few props and cons listed below:

Props:
- less db hits
- faster speed
- it's cheaper to manage
Cons:
- increase complexity
- increase development effort

Q37- How can I put a server on web ?
A37- In general there are 2 ways

In House:
- If you want to host your server in house just go buy a server (machine), config the ip and you good to go.
- Props:
  - it's cheaper to manage
- Cons:
  - space (you have a limit memory)
  - hardware limitation (it's hard to update hardware of an in-house server)
Using Cloud:
- If you want to host your application on cloud just simple go to any cloud provider and buy the server
- Props:
  - forget about hardware layer
- Cons:
  - costly in start (for one to two servers)

Q38- If you have to scale a web application what should you choice Scale up or Scale Out ?
A38- Well for this particular question we have no perfect answer: the answer will depend on which server you are using:

frontend:
- you have few great option:
  - Serverless
  - CDN
backend:
- there are key point which required when you want to scale your backend (api) server
  - it should be stateless
  - next we will move to scaling
    - Vertical Scaling
      - there are 4 things you can scale on vertical scaling
        
        CPU
        
        RAM
        
        Disk/Storage
        
        Network
      - so we have few step to follow
        
        adjust the instance
        
        update the thing which make issue, like CPU, RAM, Storage, or Network
      - Props:
        
        it simple
        
        easy to start
      - Cons:
        
        it have single point of failure
    - Horizontal Scaling
      - start with scaling into 2 nodes if you already scale vertical enough
      - Props:
        
        it have High Availability
      - Cons:
        
        add complexity
        
        need load balancer
    - Hybrid Scaling
      - start with increase the instance in vertical to get a decent instance then add 2 node to remove SPOF, now at this stage you can decide either you need horizontal or vertical scaling
database:
- we can scale our data both Vertical and horizontal
  - Vertical Scaling
    - just increase the node and you good to go
    - Props:
      - easy to do
    - Cons:
      - have single point of failure
  - Horizontal Scaling
    - create a stand by database (only work if anything happen with primary)
    - Props:
      - remove the single point of failure
    - Cons:
      - add complexity
  - Hybrid Scaling
    - have a Write node and create at least 2 read node to manage traffic

Q39- How we can scale horizontal ?
A39- Well three are few ways to do that:

Same Data Center:
- let's say AWS have server center in mumbai and you chose both your node (in horizontal scaling) in mumbai and in same data center so it mean your server is hosted in same place, it will remove SPOF but it will take a little long time for someone from US (because server is still far)
- Props:
  - increase High Availability
  - add scalability
  - remove Single Point Of failure
- Cons:
  - still slow for someone how access your service from far (no impact on latency)
Same Region But Different Data Center:
- now you again on AWS and chose different data center in same region, well if you are only working in specific region then this will work just fine otherwise same problem for above, other region will face latency
Different Region:
- now you go in different region, it will help with latency and you can scale few server a little more up then other as per traffic
  - make sure you assign traffic to closet region
    - for this you have to use geolocation routing (route 53 manage geolocation in AWS)
  - and each server again have multiple nodes in same/different data center to increase availability and remove SPOF
  - it will look something like this
```
|- client Request
    |- Geo Location (DNS)
        |- Region abc-1
            |- availability zone (data center) az-1 (optional)
                |- load balancer
                    |- node1 (your server)
                    |- node2 (your server)
            |- availability zone (data center) az-2 (optional)
                |- node1 (your server)
        |- Region abc-2
            |- availability zone (data center) az-1 (optional)
                |- load balancer
                    |- node1 (your server)
                    |- node2 (your server)
```
    - so as you see in each region we can have multiple nodes inside a load balancer ( which is optional ) and you can have as many as you want and user request will first go to geo location (DNS) then the nearest region and inside the region in the available node
    - it will increase complexity a lot more
    - it any region go down you can route to other region (just beware of laws of the region)
- Props:
  - improve latency
  - improve high availability
  - remove SPOF
- Cons:
  - add complexity
  - higher cost
  - still in global
Different continent:
- going global may bring some concern
  - localization requirement
    - there are multiple laws in how to save and what to save in the database to beware of that
- a part of that everything go same as in different region

Q40- What is CAP Theorem ?
A40- CAP Theorem is consist of 3 things:

Consistency:
- in simple all your database node should be consistence
- in other words, having most recent data in all db nodes
Availability:
- in simple if you get some data from your database it should give a response
- in other words, getting a response with data
Partition:
- in simple words, communication in two nodes get disrupted but still working fine
- two node didn't stay sync for some know/unknown reasons but still provide response in CAT Theorem you have to chose between two from three of this, and Partition is import because we need response so we will choice between Consistency or Availability
AP - (Availability & Partition):
- in this situation you might get updated data from node one and node two might send stale data
CP - (Consistency & Partition):
- in this situation you might didn't get response from one to two node but every response is up to date
CA - (Consistency & Availability) - (only work in theory):
- in this situation you might get data some time which is up to date and some time you don't because we don't have Partition here and that's why in only work in theory

Q41- How you can scale a Database ?
A41- Let's break this into 2:

Relational Database:
- Vertical Scaling:
  - first make sure you have strong/powerful node before going for horizontal scaling
- Caching:
  - second thing you need to use caching to minimize database hits
- Data Warehouse:
  - we have data warehouse were you have one primary database only for transaction (OLTP) and another for analytics purpose (OLAP)
  - data warehouse
- Horizontal Scaling:
  - start with a standby node for high availability
    - Replication:
      - we need to replicate data from one node to another continuously
      - we use write ahead log (wal) to achieve replica
    - Separate Replication:
      - you can have one backup for writing as mention above and other for reading to minimize the load on master node, this may very case to case.
      - and yes if you want you can have standup nodes for read node which is optional
    - Sharding:
      - here let's say we have multiple nodes so we don't replicate everything to every node but only one, with this way you can increase writing capacity of the database but it will increase the complexity, keeping things consistence and we ofter use services for this with relational databases.
Non-Relational Database:
- in noSQL you might use
  - all this 3 didn't need scaling because they use for temporary storing data
    - redis
    - elasticSearch
    - clientSide (indexdb)
- Sharding:
  - sharding is easy in noSQL but using noSQL with bring it's own challenges as well

Key words

Stale = use for old(incorrect) data
node = a compute/server in multi computer system
Database Partitioning at node level is know as sharding
DDOS - Distributed Denial-of-Service = is a cyber crime that flood a website or server with traffic to make it inaccessible
Horizontal Scaling = Scaling Out
Vertical Scaling = Scaling up
Diminishing Returns = in simple you will not double return if you invest double resource
region = is a specific area
SPOF = Single Point Of failure
OLTP = Online transaction Process
OLAP = Online Analytics Process
wal = Write Ahead Log

Tips

For Database:
- start with vertical scale as much as possible before going to horizontally
- or have multiple server, multi read and single write
- you can use haproxy for horizontal scaling
For Backend:
- start with split service into smaller services and then move to micro services

Resources

I start my journey using this cool stuff so shout to them:

System Design by Caleb Curry

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

System Design

Question & Answer

Key words

Tips

Recommended

Book

Cloud Provider

Resources

About

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

System Design

Question & Answer

Key words

Tips

Recommended

Book

Cloud Provider

Resources

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!