System Design Interview Questions

Introduction

The process of establishing system aspects such as modules, architecture, components and their interfaces, and data for a system based on specified requirements is known as systems design. It is the process of identifying, creating, and designing systems that meet a company's or organization's specific objectives and expectations. Systems design is more about system’s analysis, architectural patterns, APIs, design patterns, and glueing it all together than it is about coding. Because your application will be able to handle the architectural load, designing your system adequately for the requirements of your application will eliminate unnecessary costs and maintenance efforts, as well as provide a better experience for your end-users.

It's impossible to overlook system design when it comes to tech interviews! In the interview, almost every IT giant, whether it's Facebook, Amazon, Google, or another, asks a series of questions based on System Design concepts like scalability, load balancing, caching, and so on. So without any further adieu, let us go through the most frequently asked interview questions on System Design.

Interview Questions

System Design Concepts Interview Questions

1. What is CAP theorem?

CAP(Consistency-Availability-Partition Tolerance) theorem says that a distributed system cannot guarantee C, A and P simultaneously. It can at max provide any 2 of the 3 guarantees. Let us understand this with the help of a distributed database system.

Consistency: This states that the data has to remain consistent after the execution of an operation in the database. For example, post database updation, all queries should retrieve the same result.
Availability: The databases cannot have downtime and should be available and responsive always.
Partition Tolerance: The database system should be functioning despite the communication becoming unstable.

The following image represents what databases guarantee what aspects of the CAP Theorem simultaneously. We see that RDBMS databases guarantee consistency and Availability simultaneously. Redis, MongoDB, Hbase databases guarantee Consistency and Partition Tolerance. Cassandra, CouchDB guarantees Availability and Partition Tolerance.

Image Not Showing Possible Reasons

The image file may be corrupted
The server hosting the image is unavailable
The image path is incorrect
The image format is not supported

Learn More →

2. How is horizontal scaling different from vertical scaling?

Vertical scaling refers to the concept of upgrading the resource capacity such as increasing RAM, adding efficient processors etc of a single machine or switching to new machine with more capacity. The capability of the server can be enhanced without need for code manipulation.

Horizontal scaling refers to addition of more computing machines to the network that shares the processing and memory workload across distributed network of devices. In simple words, more instances of server are added to the existing pool and the traffic load is distributed across these devices in an efficient manner.

This has been demonstrated in the image below:

Image Not Showing Possible Reasons

The image file may be corrupted
The server hosting the image is unavailable
The image path is incorrect
The image format is not supported

Learn More →

Horizontal scaling is different from vertical scaling in following ways:

Category	Horizontal Scaling	Vertical Scaling
Load Balancing	Requires load balancing for distributing request traffic across multiple machines	Since there is just one single machine, load balancer is not required.
Failure Resilience	This is more resistant to application failure because if one server fails, traffic is routed to other server.	This is more prone to failure as there is only one machine and failure of this results in failure of entire application.
Machine Communication	Since there are multiple machines being involved, it is very much necessary to have network communication.	Vertical scaling makes use of inter-process communication within the machine which makes it quite fast.
Data Consistency	There exists possibilites of data inconsistencies here because there are different machines for handling different requests which might result in data being out of sync.	As there is only one machine, there is no issue of data inconsistency.
Limitations	Since this scaling requires multiple servers, there might be concerns on budget and space but the scaling of the application can be done as much as needed based on the business needs.	Vertical scaling has a limit on the capacity of the resources that are achievable. If the resources are scaled up above this limit, then the application might crash and result in downtime.

3. What do you understand by load balancing? Why is it important in system design?

Load balancing refers to the concept of distributing incoming traffic efficiently across group of various backend servers. These servers are called as server pool. The modern day websites are designed to serve millions of requests from clients and return the responses in a fast and reliable manner. In order to serve these requests, addition of more servers is required. In such scenario, it is essential to distribute request traffic efficiently across each server so that they do not face undue loads. Load balancer acts as a traffic police cop facing the requests and routes them across the available servers in a way that not a single server is overwhelmed which could possibly degrade the application performance.

Image Not Showing Possible Reasons

The image file may be corrupted
The server hosting the image is unavailable
The image path is incorrect
The image format is not supported

Learn More →

When a server goes down, the load balancer redirects traffic to remaining available servers. When a new server gets added to the configuration, the requests are automatically redirected to it. Following are the benefits of load balancers:

They help to prevent requests from going to unhealthy or unavailable servers.
Helps to prevent resources overloading
Helps to eliminate single point of failure since the requests are routed to available servers whenever a server goes down
Requests sent to the servers are encrypted and the responses are decrypted. It aids in SSL termination and removes need to install X.509 certificates on every server.
Load balancing impacts system security and allows continuous software updates for accomodating changes in the system.

4. What do you understand by Latency, throughput, and availability of a system?

Performance is an important factor in system design as it helps in making our services fast and reliable. Following are the three key metrics for measuring the performance:

Latency: This is the time taken in milliseconds for delivering a single message.
Throughput: This is the amount of data successfully transmitted through a system in given amount of time. It is measured in bits per second.
Availability: This determines the amount of time a system is available to respond to requests. It is calculated: System Uptime / (System Uptime+Downtime)

5. What do you understand by Sharding?

Sharding is a process of splitting large logical dataset into multiple databases. It also refers to horizontal partitioning of data as it will be stored on multiple machines. By doing so, a sharded database becomes capable of handling more requests than a single large machine. Consider an example - in the following image, assume that we have around 1TB of data present in the database, when we perform sharding, we divide the large 1TB data into smaller chunks of 256GB into partitions called shards.

Image Not Showing Possible Reasons

The image file may be corrupted
The server hosting the image is unavailable
The image path is incorrect
The image format is not supported

Learn More →

Sharding helps to scale databases by helping to handle increased load by providing increased throughput, storage capacity and ensures high availability.

6. How is NoSQL database different from SQL databases?

Category	SQL	NoSQL
Model	Follows relational model.	Follows non-relational model.
Data	Deals with structured data.	Deals with semi-structured data
Flexibility	SQL follows strict schema.	NoSQL deals with dynamic schema and is very flexible.
Transactions	Follows ACID (Atomicity, Consistency, Isolation, Durability) properties.	Follows BASE (Basic Availability, Soft-state, Eventual consistency) properties.

7. How is sharding different from partitioning?

Database Sharding - Sharding is a technique for dividing a single dataset among many databases, allowing it to be stored across multiple workstations. Larger datasets can be divided into smaller parts and stored in numerous data nodes, boosting the system's total storage capacity. A sharded database, similarly, can accommodate more requests than a single system by dividing the data over numerous machines. Sharding, also known as horizontal scaling or scale-out, is a type of scaling in which more nodes are added to distribute the load. Horizontal scaling provides near-limitless scalability for handling large amounts of data and high-volume tasks.

Database Partitioning - Partitioning is the process of separating stored database objects (tables, indexes, and views) into distinct portions. Large database items are partitioned to improve controllability, performance, and availability. Partitioning can enhance performance when accessing partitioned tables in specific instances. Partitioning can act as a leading column in indexes, reducing index size and increasing the likelihood of finding the most desired indexes in memory. When a large portion of one area is used in the resultset, scanning that region is much faster than accessing data scattered throughout the entire table by index. Adding and deleting sections allows for large-scale data uploading and deletion, which improves performance. Data that is rarely used can be uploaded to more affordable data storage devices.

The following table lists the differences between sharding and partitioning:

Partitioning	Sharding
A partition is a logical database's split into separate, independent portions. Database partitioning is commonly used for load balancing, manageability, performance, and availability.	Sharding is a type of partitioning and is also referred to as horizontal partitioning. Sharding can also be defined as replicating the schema and then dividing the data based on a shard key.
The advantages of partitioning include all that of sharding since sharding is a type of partitioning. Besides this, partitioning includes the benefits of vertical partitioning as well which involves dividing the schema of the database.	The advantages of sharding include the following: 1. Increased Read/Write Throughput – Distributing the dataset across several shards increases both read and write operation capacity, as long as read and write operations are limited to a single shard. 2. Increased Storage Capacity – Boosting the number of shards allows for near-infinite scalability by increasing overall total storage capacity. 3. High Availability - Every piece of data is copied since each shard is a replica set. Moreover, because the data is dispersed, even if an entire shard goes down, the database as a whole remains partially functional, with separate shards hosting different parts of the schema.

Partitioning

Sharding

A partition is a logical database's split into separate, independent portions. Database partitioning is commonly used for load balancing, manageability, performance, and availability.

Sharding is a type of partitioning and is also referred to as horizontal partitioning. Sharding can also be defined as replicating the schema and then dividing the data based on a shard key.

The advantages of partitioning include all that of sharding since sharding is a type of partitioning. Besides this, partitioning includes the benefits of vertical partitioning as well which involves dividing the schema of the database.

The advantages of sharding include the following: 1. Increased Read/Write Throughput – Distributing the dataset across several shards increases both read and write operation capacity, as long as read and write operations are limited to a single shard. 2. Increased Storage Capacity – Boosting the number of shards allows for near-infinite scalability by increasing overall total storage capacity. 3. High Availability - Every piece of data is copied since each shard is a replica set. Moreover, because the data is dispersed, even if an entire shard goes down, the database as a whole remains partially functional, with separate shards hosting different parts of the schema.

A system is said to be scalable if there is increased performance in proportional to the resources added. Generally performance increase in terms of scalability refers to serving more work units. But this can also mean being able to handle larger work units when datasets grow. If there is a performance problem in the application, then the system will be slow only for a single user. But if there is a scalability problem, then the system may be fast for a single user but it can get slow under heavy user load on the application.

9. What are the various Consistency patterns available in system design?

Consistency from the CAP theorem states that every read request should get the most recently written data. When there are multiple data copies available, there arises a problem of synchronizing them so that the clients get fresh data consistently. Following are the consistency patterns available:

Weak consistency: After a data write, the read request may or may not be able to get the new data. This type of consistency works well in real time use cases like VoIP, video chat, realtime multiplayer games etc. Example, when we are on a phone call, if we lose network for a few seconds, then we lose information about what was spoken during that time.
Eventual consistency: Post data write, the reads will eventually see the latest data within milliseconds. Here, the data is replicated asynchronously. These are seen in DNS and email systems. This works well in highly available systems.
Strong consistency: After a data write, the subsequent reads will see the latest data. Here, the data is replicated synchronously. This is seen in RDBMS and file systems and are suitable in systems requiring transactions of data.

10. What is Caching? What are the various cache update strategies available in caching?

Caching refers to the process of storing file copies ina. temporary storage location called cache which helps in accessing data more quickly thereby reducing site latency. Cache can only store limited amount of data. Due to this, it is important to determine cache update strategies that is best suited for the business requirements. Following are the various caching strategies available:

Cache-aside: In this strategy, our application is responsible to write and read data from the storage. Cache interaction with the storage is not direct. Here, the application looks for an entry in cache, if the result is not found, then the entry is fetched from the database and is added to the cache for further use. Memcached is an example for using this update strategy.
Image Not Showing Possible Reasons
- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More →
Cache-aside strategy is also known as lazy loading because only the requested entry will be cached thereby avoiding unnecessary caching of the data. Some of the disadvantages of this strategy are:
- In cases of cache miss, there would be a noticeable delay as it results in fetching data from database and then caching it.
- Chances of data being stale is more if it is updated in database. This can be reduced by defining time-to-live parameter which forces update of cache entry.
- When a cache node fails, it will be replaced by new, empty node which results in increased latency.
Write-through: In this strategy, cache will be considered as the main data store by the system and the system reads and writes data into it. The cache then updates the database accordingly as shown in the database.
- The systems adds or updates the entry in the cache.
- The cache synchronously writes entry to the database.
  This strategy is overall a slow operation because of the synchronous write operation. However, the subsequent reads of the recently written data will be very fast. This strategy also ensures that the cache is not stale. But, there are chances that the data written in the cache might never be read. This issue can be reduced by providing appropriate TTL.
Write-behind (write-back): In this strategy, the application does the following steps:
- Add or update entry in the cache
- Write the entry into the data store asynchronously for improving the write performance.
  This is demonstrated in the image below:
  
  The main disadvantage of this method is that there are chances of data loss if cache goes down before the contents of the cache are written into the database.
Refresh-ahead: Using this strategy, we can configure the cache to refresh the cache entry automatically before its expiration.

This cache strategy results in reduced latency if it can predict accurately what items are needed in future.

11. What do you understand by Content delivery network?

Content delivery network or in short CDN is globally distributed proxy server network that serves content from locations closeby to the end users. Usually in websites, static files like HTML, CSS, JS files, images and videos are served from CDN.
Using CDN in delivering content helps to improve performance:

Since users receive data from centres close to them as shown in the image below, they dont have to wait for long.
Load on the servers is reduced significantly as some of the responsibility is shared by CDNs.

There are two types of CDNs, they are:

Push CDNs: Here, the content is received by the CDNs whenever changes occur on the server. The responsibility lies on us for uploading the content to CDNs. Content gets updated to the CDN only when it is modified or added which inturn maximises storage by minimising the traffic. Generally, sites with lesser traffic or content works well using push CDNs.
Pull CDNs: Herem new content is grabbed from the server when first user requests the content from the site. This leads to slower request for the first time till the content gets stored/cached on the CDN. These CDNs minimizes space utilized on CDN but can lead to redundant traffic when expired files are pulled before they are changed. Websites having heavy traffic work well when used with pull CDNs.

12. What do you understand by Leader Election?

In a distributed environment where there are multiple servers contributing to the availability of the application, there can be situations where only one server has to take lead for updating third party APIs as different servers could cause problems while using the third party APIs. This server is called as primary server and the process of choosing this server is called as leader election. The servers in the distributed environment has to detect when the leader server has failed and appoint other one to become a leader. This process is mostly suitable in high availability and strong consistency based applications by using a consensus algorithm.

13. How do you answer system design interview questions?

Ask questions to the interviewer for clarification: Since the questions are purposefully vague, it is advised to ask relevant questions to the interviewer to ensure that both you and the interviewer are on the same page. Asking questions also shows that you care about the customer requirements.
Gather the requirements: List all the features that are required, what are the common problems and system performance parameters that are expected by the system to handle. This step helps the interviewer to see how well you plan, expect problems and come up with solutions to each of them. Every choice matters while designing a system. For every choice, atleast one pros and cons of the system needs to be listed.
Come up with design: Come up with high level design and low level design solution for each of the requirements decided. Discuss on the pros and cons of the design. Also discuss how they are beneficial to the business.

The primary objective of system design interviews is to evaluate how well a developer can plan, prioritize, evaluate various options to choose the best possible solution for a given problem.

14. What are some of the design issues in distributed systems?

Following are some of the issues found in distributed systems:

Heterogeneity: Internet allows the applications to run over a heterogeneous collection of computers and networks. There would be different types of network and the differences are masked by the usage of standard Internet protocols for communicating with each other. This becomes an issue while designing distributed applications
Openness: Openness represents the measure by which a system can be extended and re-implemented in different ways. In distributed systems, it specifies the degree to which new sharing services can be added and made available for client usage.
Security: The information maintained in distributed systems need to be secure as they are valuable to the users. The confidentiality, availability and integrity of the distributed systems has to be maintained and this sometimes becomes a challenge.
Scalability: A system is scalable if it remains effective when there is significant increase in the request traffic and resources. Designing a distributed system involves planning well in advance how well the system can be made scalable under varying user loads.
Failure Handling: In distributed environment, the failures are partial, meaning if some components fail, others would still function. It becomes challenging to handle these failures as it involves identifying right components where the failures occur.

System Design Interview Questions for Experienced

1. Design a global chat service like Whatsapp or a facebook messenger.

What are some of the required features?

Allow users to chat over the internet.
Provide support for one-on-one and group chats.
Messages needs to be stored for better viewing.
Messages needs to be encrypted for security purposes.

What are some of the common problems that can be encountered?

What would happen to a message if it is sent without internet connection?
Will encrypting and decrypting increase the latency?
How are the messages sent and notified to the device?

Possible tips for consideration:

Split database schema into multiple tables such as user table, chat table, message table etc.
Make use of web sockets for bi-directional communication between the device and the server.
Make use of push notifications for notifying the members even if they are online.

2. How do you design a URL shortening service like TinyURL or bit.ly?

TinyURL or bit.ly takes long URL and generates new unique short URL. These systems are also capable of taking the shortened URL and returning original full URL.

What are some of the Required Features?

Generate short URL having length shorter than the original URL.
Store the original URL and map it to the shortened one.
Allow redirects in the shortened URLs.
Support custom names for short URLs.
Handle multiple requests at same time.

What are some of the Common Problems encountered?

What if two users input the same custom URL?
What happens if there are more user load than expected?
How do you regulate the database storage space?

Possible tips for consideration:

Concept of hashing can be used for linking original and new URLs.
REST API can be used for balancing high traffic and handling front-end communication
Multithreading concept for handling multiple request at same time.
NoSQL databases for storing original URLs

3. Design a forum-like systems like Quora, Reddit or HackerNews.

These sites are meant for posting questions and answering them, showing newsfeed highlighting popular questions based on tags and related topics.

What are some of the Required Features?

Users should be able to create public posts and apply tags to them.
Posts should be sortable based on tags.
Post comments in real-time by users.
Display posts on newsfeed based on followed tags.

What are some of the Common Problems encountered?

Should it be just a web application?
Where to store the uploaded images and links?
How can you determine the related tags?
How can you distribute posts across a server network?

Possible tips for consideration:

Check on using SQL database for mapping relational data between users, posts, comments, likes, tags, posts etc.
Incorporate multithreading and load balancer for supporting high traffic.
Make use of sharding for distributing the data across different systems.
Incorporate machine learning algorithms for finding correlation between the tags.

4. Design Facebook's newsfeed system.

Facebook's newsfeed allows user to see what is happening in their friends circle, liked pages and groups followed.
What are some of the Required Features?

Generate newsfeed using posts from other system entities that the user follows.
Newsfeed posts can be of text, image, audio or video format.
Append new posts to the user's newsfeed in close to real-time.

What are some of the Common Problems encountered?

What happens if the new post sees lot of latency to get appended to the news feed?
Can the algorithm handle sudden user load?
What posts should take priority for displaying in the news feed?

Possible tips for consideration:

Evaluate the process of fanout for publishing posts to the followers
Check how sharding can be achieved efficiently for handling heavy user load. The feed data of a user shouldnt be put into multiple servers. Instead, sharding can be done on user ids.

5. Design a parking lot system?

What are some of the Required Features?

Parking lot can have multiple levels where each level has multiple rows for parking spots.
Parking lot can support parking for cars, buses, motorcycles hence spots can be of multiple sizes.
Consider the parking lot capacity at the time of designing the system.
Design appropriate pricing for each parking spot.

What are some of the Common Problems encountered?

What should happen to the parking lot system if every spot is occupied?
Assigning parking lot spot of smaller size to vehicles of bigger size.

Possible tips for consideration:

Think of an algorithm for assigning appropriate parking spot to a vehicle
Think of different entities required for designing the system

6. How do you design a recommendation system?

Recommendation systems are used for helping users identify what they want efficiently by assisting them by offering various choices and alternatives based on their history or interests.

What are some of the Required Features?

Discuss what kind of recommendation system is required - whether it is for movies, e-commerce websites, songs etc.

What are some of the common problems encountered?

Figure out how to recommend fresh and relevant content in real-time.

Possible tips for consideration:

Discuss how to use Eval component for understanding the working of the system
Discuss how to train collaborative filtering approach

7. Design an API Rate Limiter system for GitHub or Firebase sites.

API Rate Limiters limit the API calls that a service recieves in a given time period for avoiding request overload. This question can start with coding algorithm on a single machine to distributed network.

What are some of the Required Features?

What is the required request count per hour or second? Let us assume that the requirement can be 10 requests per second.
Should the limiter notify the user if the requests are blocked?
The limiter should handle traffic suitable according to the scale.

What are some of the common problems encountered?

How to measure the requests per given time?
How to design the rate limiter for distributed system when compared to local system?

Possible tips for consideration:

Evaluate usage of sliding time windows for avoiding hourly resets.
Try using counter integer instead of request for saving space.

What are some of the Required Features?

Users should be able to upload, delete, share and download files over the web.
File updates should be synced across multiple devices.

What are some of the common problems encountered?

Where to store the files?
How can you handle updates? Should the files be re-uploaded or just the modified version has to be updated?
How to handle updation of two documents at same time?

Possible tips for consideration:

Consider using chunking for splitting files into multiple sections for supporting re-uploads of a particular section rather than the whole file.
Make use of cloud storages for storing the files.

9. Design a type-ahead search engine service.

This service partially completes the search queries by displaying n number of suggestions for completing the query that the user intended to search.

What are some of the Required Features?

Service has to match partial queries with popularly searched queries.
The system has to display n number of suggestions (say 5, for example) based on the written query.
The suggestions has to be updated based on the query updation.

What are some of the common problems encountered?

How to update the suggestions without much latency?
How to determine the most likely suggestion?
Are the suggestions adapting to the user's search results?
When does the suggestions appear? Is it updated on the fly or once user stops writing?

Possible tips for consideration:

Evaluate usage of natural language processing for anticipating next characters.
Markov chain rule for ranking the probabilities of top queries.

10. Design Netflix.

Netflix is a video streaming service.

What are some of the Required Features?

Uninterrupted video streaming to be made available for the users.
Likes and reviews of videos
Recommend new videos
Support high traffic of users

What are some of the common problems encountered?

Is it acceptable to have lags while uploading videos?
What happens if many users are accessing same video concurrently?

Possible tips for consideration:

Make use of cloud technology to store and transmit video data
There are three components of Netflix: OC (Content Delivery Network), Backend database, Client device for accessing the application.

11. Design Tic-Tac-Toe game.

Tic-tac-toe game involves two players where one player chooses 0 and other player chooses X for marking the cells. The player who fills a row/column/diagonal with their selected character wins.

What are some of the Required Features?

Support 2 player game where one player can be a computer.
Design algorithm to calculate the win and loss results.

What are some of the common problems encountered?

What happens if both players play optimally?
How to decide the winning strategy?

Possible tips for consideration:

If one player is a computer, then make use of rand() method for ensuring moves are completely random.

12. Design a traffic control system.

Generally, in a traffic control system, we see that the lights transition from RED To GREEN, GREEN to ORANGE and then to RED.

What are some of the Required Features?

Transition traffic lights based on the conventions.

What are some of the common problems encountered?

Determine the time interval for which the state of the traffic lights has to change.
What happens in worst case scenarios where the state is wrongly shown?

Possible tips for consideration:

Make use of state design pattern and scheduling algorithms for transition of state from one color to other

13. Design Web Crawler.

The Web crawler is a search engine-related service like Google, DuckDuckGo and are used for indexing website contents over the Internet for making them available for every result.

What are some of the Required Features?

Design and develop Scalable service for collecting information from entire web and fetching millions of web documents.
Fresh data has to be fetched for every search query.

What are some of the common problems encountered?

How to handle the updates when users are typing very fast?
How to prioritize dynamically changing web pages?

Possible tips for consideration:

Look into URL Frontier Architecture for implementing this system.
Know how crawling is different from scraping

14. Design ATM system.

ATMs are used for depositing and withdrawing money from the customers. It is also useful for checking the account balance.

What are some of the required features?

Each user should have atleast one bank account that is linked to the card for performing transactions.
ATM to authenticate user based on 4 digit PIN associated with the card.
User to perform only one transaction at a given time.

What are some of the common problems encountered?

What happens during transaction timeout?
What happens if the money is deducted from the bank account but the user hasnt received it from the machine?

Possible tips for consideration:

Divide the problem into different entities like Card, Card Reader etc and establish relationship between each of the entities.

15. Design Uber, Ola or Lyft type of systems.

These platforms help user request rides and the driver picks them up from the location and and drop at the destination selected by the user.

What are some of the required features?

Real-time service for booking rides
Should have capability of assigning rides that lets user reach the destination fast.
Show ETA (Estimated Time of Arrival) of the driver after booking the ride and once the ride has been started, show the ETA of the vehicle arriving the destination.

What are some of the common problems encountered?

How to store geographical locations for drivers always on move?
How to assign drivers to the customers efficiently?
How do you calculate the ETA of the driver arrival or the destination arrival?

Possible tips for consideration:

Make use of microservices concept with fast databases for booking rides faster.
Evaluate Dispatch System for assigning drivers to the users.

Conclusion

In this article, we have covered the most frequently asked interview questions on System Design. The key element to clear a System Design interview is that you should have a clear understanding of the approach that you are taking while designing a particular system. For instance, in a system, if you choose to store the data in a No SQL database, you should be clear with the reason that made you choose a No SQL database over a SQL database. You should be clear with the differences between SQL and No SQL databases. In other words, every proposition of yours must be backed by some logical reasoning. This will give you an edge in your interviews.

MCQs For Practice:

Question 1: In a System Design interview question, which of the following options would be the correct sequence to follow?

Statements:
Statement I: Specifying the key features to be included
Statement II: Discussing each feature one by one
Statement III: Clarifying any doubts with regards to the question asked
Statement IV: Clarifying if any other feature needs to be incorporated

Options:
Option A: I, II, III, IV
Option B: III, I, IV, II
Option C: IV, II, I, III
Option D: III, IV, II, I

Correct Answer:
Option B: III, I, IV, II

Question 2: Which strategy can help you configure the cache to refresh the cache entry automatically before its expiration?

Options:
Option A. Refresh-ahead
Option B. Cache aside
Option C. Refresh-through
Option C. Refresh-back

Correct Answer:
Option A. Refresh-ahead

Question 3: Which of the following options can be a design issue in a distributed system?

Options:
Option A: Scalability
Option B: Fault-tolerance
Option C: Clustering
Option D: All of the Above

Correct Answer:
Option D: All of the above

Question 4: Which of the following is not a cache update strategy available in caching?

Options:
Option A: Cache-aside
Option B: Write-through
Option C: Write-behind
Option D: Refresh-ahead
Option E: None of the above

Correct Answer:
Option E: None of the above

Question 5: Which of the following options is correct about horizontal scaling and vertical scaling?

Options:
Option A: Horizontal scaling requires load balancing whereas vertical scaling does not require a load balancer.
Option B: Horizontal scaling is more resistant to application failure as compared to vertical scaling.
Option C: Horizontal scaling may lead to data inconsistencies whereas this is not the case with vertical scaling.
Option D: All of the above.

Correct Answer:
Option D: All of the above.

Question 6: Which of the following options is not a consistency pattern available in system design?

Options:
Option A: Weak Consistency
Option B: Eventual Consistency
Option C: Strong Consistency
Option D: Permanent Consistency

Correct Answer:
Option D: Permanent Consistency

Question 7: Which of the following options is an important factor in determining the performance of a system?

Options:
Option A: Latency
Option B: Throughput
Option C: Availability
Option D: All of the above

Correct Answer:
Option D: All of the above

Question 8: Which of the following options is correct about load balancing?

Options:
Option A: Load balancing is responsible for preventing requests to go to unhealthy servers.
Option B: Load balancing helps to prevent resource overloading.
Option C: Load balancing aids in SSL termination and the need to install X.509 certificates on every server.
Option D: All of the above.

Correct Answer:
Option D: All of the above.

Question 9. Which of the following is not true about strong consistency pattern?

Options:
Option A. After a data write, the subsequent reads will see the latest data.
Option B. The data is replicated asynchronously.
Option C. It is suitable in systems requiring transactions of data.
Option D. All of the above

Correct answer:
Option B. The data is replicated asynchronously.

Question 10: Which of the following statements is/are true about sharding?

Options:
Option A. Sharding refers to horizontal partitioning of data as it will be stored on multiple machines.
Option B. A sharded database becomes capable of handling more requests than a single large machine.
Option C. Sharding helps to scale databases by helping to handle the increased load by providing increased throughput, storage capacity and ensuring high availability.
Option D. All of the above

Correct Answer:
Option D. All of the above.

Guest McCoy2022/02/02 07:28:26

Can we add a question : What is the difference between Sharding and Partitioning (Edited)

Guest Rodgers2022/02/02 07:29:03

Concepts Interview Questions

Can we start with : What is CAP theorem ? (Edited)

Guest Bowers2022/02/02 07:29:35

What do you understand by

Prior to this question, we need to understand vertical and horizontal scale, then we talk about sharing as it a horizontal scale (Edited)

Guest Alvarez2022/02/02 07:30:06

#### 11. What do you understand by Content delivery network?

before this question pls introduce : what is caching? (Edited)

Guest Myers2022/02/02 07:30:52

What do you understand by Sharding

Post this question : (Edited)

SukanyaPai

2022/02/03 03:49:29

done (Edited)

2022/02/03 03:50:44

System Design Interview Questions

Introduction

Interview Questions

System Design Concepts Interview Questions

1. What is CAP theorem?

2. How is horizontal scaling different from vertical scaling?

3. What do you understand by load balancing? Why is it important in system design?

4. What do you understand by Latency, throughput, and availability of a system?

5. What do you understand by Sharding?

6. How is NoSQL database different from SQL databases?

7. How is sharding different from partitioning?

8. How is performance and scalability related to each other?

9. What are the various Consistency patterns available in system design?

10. What is Caching? What are the various cache update strategies available in caching?

11. What do you understand by Content delivery network?

12. What do you understand by Leader Election?

13. How do you answer system design interview questions?

14. What are some of the design issues in distributed systems?

System Design Interview Questions for Experienced

1. Design a global chat service like Whatsapp or a facebook messenger.

2. How do you design a URL shortening service like TinyURL or bit.ly?

3. Design a forum-like systems like Quora, Reddit or HackerNews.

4. Design Facebook's newsfeed system.

5. Design a parking lot system?

6. How do you design a recommendation system?

7. Design an API Rate Limiter system for GitHub or Firebase sites.

8. How do you design global file storage and file sharing services like Google Drive, Dropbox etc?

9. Design a type-ahead search engine service.

10. Design Netflix.

11. Design Tic-Tac-Toe game.

12. Design a traffic control system.

13. Design Web Crawler.

14. Design ATM system.

15. Design Uber, Ola or Lyft type of systems.

Conclusion

MCQs For Practice:

Read more

Exception Handling Interview Questions

LoadRunner Interview Questions

Numpy Interview Questions

Data Modelling Interview Questions