Library Check-in / Check-out System Design

Goal

Design a library check-in/check-out system that allows a user to:

Check-out a book.
Return a book that they have checked-out.
Add a book to the system.

Design should list:

Any microservices used.
Physical and logical design.
AWS services that would be used if the system needs to be a SaaS solution
Assumption or design constraints.
Architectural principles that you would consider if the system is mission-critical.
Any sequence diagrams (as needed).

1. Domain research
2. Assumptions and design constraints
3. High-level logical design
4. Models
5. Sequence diagrams
6. Physical design
7. Potential optimizations
8. Performance considerations
9. Monitoring and observability
10. Architectural principles if the system is mission-critical
11. Design scope
12. Final note

1. Domain research

Before thinking about system design or infrastructure, it is important to understand how libraries usually operate in practice.

1.1 How libraries usually operate

Libraries manage both book titles and physical copies.
Availability of a book title depends on whether a copy is currently available, not only on whether the title exists in the catalog.
A single title may have multiple copies, and each copy may have a different status.
Many libraries operate across multiple branches, so copies may belong to a specific location and may be returned to a different branch if it is allowed.
The main day-to-day activities are adding books, checking books out, and accepting returns.
Libraries typically have different user roles, such as members, librarians, and administrators.
When a book is borrowed, the library needs to track the borrower, checkout date, due date, and return status.
Users usually search by title, author, subject, or International Standard Book Number (ISBN), but they borrow a specific copy.
Libraries care a lot about search and discoverability. A member or staff may search by: title, author, subject, ISBN, category

1.2 Sizing assumptions

Scope	Indicative stats
Worldwide library landscape	Libraries exist in the tens of thousands globally, with circulation volumes in the hundreds of millions to billions of items per year across all library systems combined.
Typical tenant in this design	1 to 50 branches
Typical tenant catalog size	10,000 to 1,000,000 titles
Typical tenant physical inventory	20,000 to 2,000,000 copies
Typical tenant circulation volume	A few hundred to a few thousand checkouts/returns per day
Traffic pattern	Noticeable spikes during school terms, weekends, and opening hours

2. Assumptions and design constraints

Functional assumptions

The system tracks physical copies of books, not just titles.
A title can have multiple copies.
A user checks out a single available copy.
A returned book must correspond to an active checkout.
Only librarians or staff can add books/copies.
Library members can search, check availability, and check out books.
A library tenant may operate one or more branches.

Non-functional assumptions

Correctness matters more than extreme scale.
The most important rule is: the same copy must not be checked out twice at the same time.
The system may be used as a SaaS product by multiple library organizations.

Design constraints

Checkout and return should be transactional.
Return should be idempotent where possible.
The system should keep an audit trail of inventory-changing operations.
In SaaS mode, tenants must be isolated from each other.
The design should reflect normal day-to-day library operations without unnecessary complexity.

3. High-level logical design

This design is centered on the main functional requirements of the exercise, while also covering supporting capabilities such as search, audit, and platform concerns at a practical level without making them the main focus of the solution.

The initial design uses one modular service plus one background worker for audit processing, rather than separate business microservices.

3.1 Components diagram

The following diagram shows the dependency view of the main service, its internal modules, and the surrounding SaaS platform components.

flowchart LR
    Client[Client applications]

    subgraph Service["Book Service"]
        API["API handlers"]
        Catalog["Catalog module"]
        Circulation["Circulation module"]
        Auth["Auth module"]
        Audit["Audit module"]
    end

    subgraph SaaS["SaaS platform"]
        DB[(Relational DB)]
        Identity[Identity Provider]
        AuditStore[Long-term audit storage - for example S3]
    end

    Client --> API
    API --> Catalog
    API --> Circulation
    API --> Auth
    API --> Audit

    Catalog --> DB
    Circulation --> DB
    Auth --> DB
    Audit --> DB

    Auth --> Identity
    Audit --> AuditStore

    Catalog --> Circulation
    Circulation --> Audit

3.2 Functional scope

Main requirements:

Add a book title
Add a book copy
Checkout a book copy
Return a book copy

Additional nice to have capabilities:

Search books by title, author, ISBN, or category

SaaS and platform requirements:

Tenant onboarding
User registration
Authentication and authorization
Audit logging

3.3 Flows

#	Flows	Design details
	Main flows
1	Add a book title	Staff adds a book title with metadata such as title, author, ISBN, publisher, and category. The title becomes searchable in the catalog and can later be used to create one or more physical copies.
2	Add a book copy	Staff creates a physical copy for an existing book title. Each copy is associated with a branch and a unique barcode. The copy starts with an `available` status and can then participate in circulation flows.
3	Checkout a book copy	A copy can be checked out only if its current status is `available`. The system validates the user, confirms availability, updates the copy status, creates the loan record, writes an audit event, and commits the transaction. The copy row is locked during the operation to prevent two users from checking out the same copy at the same time.
4	Return a book copy	The system finds the active loan for the copy and validates that it belongs to the requesting user. Librarians can return any active loan. It then marks the loan as returned, updates the copy status back to `available`, writes an audit event, and commits the transaction. The return flow should be idempotent so that retries do not create duplicate return operations.
	Search, SaaS and platform flows
5	Search books	Users can search books by title, author, ISBN, or category. Search is an important supporting capability for real usage, but it is not the main focus of this exercise compared to add, checkout, and return flows.
6	Tenant onboarding	A new library organization is registered as a tenant. The system creates the tenant record, a default branch, and the initial admin account. The tenant admin then manages branches and users inside that tenant.
7	User registration	Tenant admins create or invite librarians and members. Each user is attached to a specific tenant. Users may also be associated with a branch if needed. The application keeps its own user profile and role information.
8	Authentication and authorization	Use an out-of-the-box identity provider such as Amazon Cognito for sign-in, password flows, token issuance, and MFA if needed. The application still keeps its own `users` table for tenant membership, branch association, library role, and local account status. Authorization stays in the application layer. Role model: `member` can search the catalog, check availability, check out books, return books, and access their own account; `librarian` can add books/copies and support circulation operations; `admin` can manage users, branches, and tenant-level settings.
9	Audit logging	The system records key operational events such as book creation, checkout, return, and tenant/user administration changes. Operational events are first written into an outbox table inside the main transactional database. A separate background process reads events from the outbox table and publishes them to long-term audit storage. This uses the outbox pattern to keep the core business transaction and audit event creation consistent.

The flows are described in more detail in section 5, Sequence diagrams.

3.4 Main service (Book Service)

This design uses one backend service split into clear logical parts, rather than multiple deployable microservices.

3.4.1 Why one service is used here

A single service is a better fit for this problem because the domain is still compact and the most important workflows are highly transactional.

Checkout and return are the critical operations in the system. Keeping the related logic in one service makes the behavior easier to implement, test, and reason about.

This approach also reduces operational complexity:

one deployment unit
one codebase
one main transactional boundary
easier debugging and monitoring

A multi-service design would introduce extra complexity early:

service-to-service communication
more deployment and monitoring overhead
more complicated failure handling
harder cross-service consistency

For this system, that complexity is not necessary at the beginning. A modular service with clear internal boundaries gives a cleaner balance between simplicity and correctness.

If the system grows later, the natural evolution path would be to separate:

tenant/user management
catalog
circulation

3.4.2 Logical parts of the service

3.4.2.1 API handlers

Handles:

receiving client requests
validating request shape and input
routing requests to the correct internal module
returning responses to clients

3.4.2.2 Catalog management

Handles:

adding book titles
storing metadata such as title, author, and ISBN
searching and listing books

3.4.2.3 Circulation management

Handles:

physical copies
availability
checkout
return
loan lifecycle

3.4.2.4 Audit

Handles:

writing audit events into the outbox table
background processing of outbox events
delivery of audit records into audit storage

This follows the outbox pattern: the business flow writes its audit event into the same relational database transaction, and a separate background process later publishes that event to audit storage.

3.5 SaaS and platform integration

3.5.1 Tenant and user management

Handles:

tenant onboarding
branch management
user registration
roles and permissions
identity provider integration

3.5.2 Audit storage

Handles:

long-term storage for audit events after they are published from the outbox

4. Models

4.1 Core models

Model	Fields
BookTitle	`book_id` — primary key `tenant_id` `isbn` `title` `author` `publisher` `category`
BookCopy	`copy_id` — primary key `tenant_id` `book_id` `barcode` — unique within tenant `branch_id` `status` (`available`, `checked_out`, `lost`, `maintenance`)
Tenant	`tenant_id` — primary key `name` — unique key `status` `plan` `created_at`
Branch	`branch_id` — primary key `tenant_id` `name` — unique key within tenant `address` `status` `created_at`
User	`user_id` — primary key `tenant_id` `branch_id` `name` `email` — unique key within tenant `role` (`member`, `librarian`, `admin`) `status` `created_at`
Loan	`loan_id` — primary key `tenant_id` `copy_id` `user_id` `checkout_at` `due_at` `returned_at` `status` (`active`, `returned`, `overdue`)
InventoryEvent	`event_id` — primary key `tenant_id` `copy_id` `event_type` `actor_id` `timestamp` `metadata` `delivery_status`

4.2 Key model relationships

A Tenant owns branches, users, books, copies, loans, and events.
A Branch belongs to a tenant.
A User belongs to a tenant and may optionally be associated with a branch.
A BookTitle belongs to a tenant and may have many physical copies.
A BookCopy belongs to one title and one branch.
A Loan connects a user to a specific copy.
An InventoryEvent acts as the outbox table for important operational events.

4.3 Model notes

Title vs copy

A title and a copy are different things:

BookTitle = "The Hobbit"
BookCopy = physical copy with barcode BUL-001-8842

That separation keeps checkout logic correct.

Data integrity rules

a book copy can have at most one active loan at a time
barcode should be unique within a tenant
checkout and return update both BookCopy and Loan in the same transaction

Tenant and user model

each library organization is a separate tenant
each tenant owns its own branches, users, books, copies, and loans
all business data is scoped by tenant_id
tenant admins manage users and branches only inside their own tenant
user roles are member, librarian, and admin

Branch model

a tenant can have one or more branches
a copy belongs to a branch
a user may optionally be attached to a home branch
branch-level restrictions can be added if the operating model requires them

5. Sequence diagrams

This section focuses on the main business flows of the system. SaaS administration flows such as tenant onboarding and user setup are acknowledged in the design, but they are intentionally kept out of scope here so the sequence diagrams stay focused on the core library operations.

5.1 Common authentication and authorization flow

sequenceDiagram
    actor User
    box rgba(33, 150, 243, 0.15) Book Service
        participant API as API handlers
        participant Auth as Auth module
    end
    participant Identity as Identity Provider
    participant DB as Relational DB

    User->>API: request with identity token
    API->>Auth: validate token + role + tenant
    Auth->>Identity: validate identity token
    alt invalid or expired token
        Identity-->>Auth: token invalid
        Auth-->>API: unauthorized
        API-->>User: 401 Unauthorized
    else valid token
        Identity-->>Auth: identity valid
        Auth->>DB: resolve user, tenant, role
        alt user or tenant not found / inactive
            DB-->>Auth: invalid authorization context
            Auth-->>API: forbidden
            API-->>User: 403 Forbidden
        else authorization context found
            DB-->>Auth: authorization context
            Auth-->>API: ok
        end
    end

5.2 Add a book title

sequenceDiagram
    actor Librarian
    box rgba(33, 150, 243, 0.15) Book Service
        participant API as API handlers
        participant Auth as Auth module
        participant Catalog as Catalog module
        participant Audit as Audit module
    end
    participant DB as Relational DB

    Librarian->>API: POST /books
    API->>Auth: common auth flow (5.1)
    Auth-->>API: ok
    API->>Catalog: create book title
    alt invalid request payload
        Catalog-->>API: validation error
        API-->>Librarian: 400 Bad Request
    else valid request
        Catalog->>DB: insert BookTitle
        Catalog->>Audit: add title_created event
        Audit->>DB: insert InventoryEvent(title_created)
        Catalog-->>API: book title created
        API-->>Librarian: 201 Created
    end

5.3 Add a book copy

sequenceDiagram
    actor Librarian
    box rgba(33, 150, 243, 0.15) Book Service
        participant API as API handlers
        participant Auth as Auth module
        participant Circulation as Circulation module
        participant Audit as Audit module
    end
    participant DB as Relational DB

    Librarian->>API: POST /books/{bookId}/copies
    API->>Auth: common auth flow (5.1)
    Auth-->>API: ok
    API->>Circulation: add copy
    alt book title not found
        Circulation-->>API: title not found
        API-->>Librarian: 404 Not Found
    else valid title
        Circulation->>DB: insert BookCopy(status=available)
        Circulation->>Audit: add copy_added event
        Audit->>DB: insert InventoryEvent(copy_added)
        Circulation-->>API: copy created
        API-->>Librarian: 201 Created
    end

5.4 Checkout a book copy

sequenceDiagram
    actor Member
    box rgba(33, 150, 243, 0.15) Book Service
        participant API as API handlers
        participant Auth as Auth module
        participant Circulation as Circulation module
        participant Audit as Audit module
    end
    participant DB as Relational DB

    Member->>API: POST /loans/checkout
    API->>Auth: common auth flow (5.1)
    Auth-->>API: ok
    API->>Circulation: checkout(userId, copyId)
    Circulation->>DB: begin transaction
    Circulation->>DB: lock BookCopy row
    Circulation->>DB: verify status = available
    alt copy not available
        Circulation->>DB: rollback
        Circulation-->>API: copy unavailable
        API-->>Member: 409 Conflict
    else copy available
        Circulation->>DB: update BookCopy.status = checked_out
        Circulation->>DB: insert Loan(active)
        Circulation->>Audit: add checked_out event
        Audit->>DB: insert InventoryEvent(checked_out)
        Circulation->>DB: commit
        Circulation-->>API: success
        API-->>Member: 200 OK
    end

5.5 Return a book copy

sequenceDiagram
    actor Member
    box rgba(33, 150, 243, 0.15) Book Service
        participant API as API handlers
        participant Auth as Auth module
        participant Circulation as Circulation module
        participant Audit as Audit module
    end
    participant DB as Relational DB

    Member->>API: POST /loans/return
    API->>Auth: common auth flow (5.1)
    Auth-->>API: ok
    API->>Circulation: return(copyId)
    Circulation->>DB: begin transaction
    Circulation->>DB: find active Loan for copy
    alt no active loan found
        Circulation->>DB: rollback
        Circulation-->>API: active loan not found
        API-->>Member: 404 Not Found
    else loan belongs to different member
        Circulation->>DB: rollback
        Circulation-->>API: forbidden
        API-->>Member: 403 Forbidden
    else active loan found for requester
        Circulation->>DB: lock BookCopy row
        Circulation->>DB: update Loan.status = returned
        Circulation->>DB: set returned_at
        Circulation->>DB: update BookCopy.status = available
        Circulation->>Audit: add returned event
        Audit->>DB: insert InventoryEvent(returned)
        Circulation->>DB: commit
        Circulation-->>API: success
        API-->>Member: 200 OK
    end

5.6 Publish audit events from outbox

sequenceDiagram
    participant Audit as Audit module
    participant DB as Relational DB
    participant AuditStore as Long-term audit storage

    Audit->>DB: poll InventoryEvent outbox rows
    DB-->>Audit: pending events
    Audit->>AuditStore: publish audit records
    Audit->>DB: mark events as delivered

6. Physical design

The system is deployed as a small AWS-based SaaS application with one main stateless service, one background worker, and one relational database.

6.1 Deployment view

Client applications access the system through Route 53, CloudFront, and an Application Load Balancer.
The main Book Service runs as multiple ECS Fargate tasks distributed across multiple Availability Zones.
A separate background worker also runs on ECS Fargate and processes outbox/audit events asynchronously.
The transactional database is Amazon Aurora PostgreSQL deployed in Multi-AZ mode.
Amazon Cognito is used for authentication and token issuance.
Amazon S3 can be used for long-term audit export, reports, and backups.
AWS Secrets Manager stores database credentials and application secrets.
Amazon CloudWatch collects logs, metrics, and alarms.

6.2 Network layout

The Application Load Balancer is deployed in public subnets.
The Book Service, background worker, and Aurora PostgreSQL run in private subnets.
Only the load balancer is publicly exposed.
Internal components communicate over private network paths.

6.3 Scaling and availability

The Book Service scales horizontally because it is stateless.
The background worker can scale independently from the main API service.
Aurora PostgreSQL provides high availability through Multi-AZ deployment.
Read replicas can be added later if reporting or catalog search becomes heavier.

6.4 Why this physical design fits the problem

This deployment keeps the system simple while still supporting the most important requirement: correct and reliable checkout/return transactions. It also matches the modular-monolith approach of the logical design, avoids unnecessary platform complexity, and gives a clear path for future scaling.

6.5 Physical deployment diagram

flowchart LR
    User[Client Applications] --> DNS[Route 53]
    DNS --> CF[CloudFront]
    CF --> ALB[Application Load Balancer]

    subgraph VPC[AWS VPC]
        subgraph PublicSubnets[Public Subnets]
            ALB
        end

        subgraph PrivateSubnets[Private Subnets]
            App1[Book Service - ECS Fargate]
            App2[Book Service - ECS Fargate]
            Worker[Audit Worker - ECS Fargate]
            DB[(Aurora PostgreSQL)]
        end
    end

    Cognito[Amazon Cognito]
    CW[CloudWatch]
    S3[Amazon S3]

    ALB --> App1
    ALB --> App2

    App1 --> DB
    App2 --> DB

    App1 --> Cognito
    App2 --> Cognito

    App1 --> CW
    App2 --> CW
    Worker --> CW

    Worker --> DB
    Worker --> S3

7. Potential optimizations

Possible improvements that could be introduced later include:

keeping book titles in a shared global catalog instead of storing title metadata separately for each library tenant
storing only tenant-specific copy, branch, loan, and circulation data inside each library tenant scope
reusing common title metadata across libraries when multiple libraries carry the same books
adding caching for frequently accessed catalog and availability queries
introducing read replicas for reporting or heavier search workloads

A practical optimization here is to separate book title metadata from tenant-specific inventory. In that model, book titles could be stored once in a common catalog shared across all libraries, while each library would still keep its own copies, branches, loans, and users.

Search optimization

If search becomes more important, the following fields are good candidates for indexing or dedicated search optimization:

BookTitle.title
BookTitle.author
BookTitle.isbn
BookTitle.category
BookCopy.barcode
BookCopy.status
BookCopy.branch_id
User.email
Loan.user_id
Loan.copy_id
Loan.status

8. Performance considerations

The most performance-sensitive parts of this system are the circulation flows.

Main considerations

checkout and return should remain fast and transactional
book copy lookup by barcode or copy id should be indexed
book title search should support common lookups such as title, author, and ISBN
tenant_id should be part of important filtering and indexing strategy
audit logging should not slow down the core circulation path significantly

Read vs write profile

catalog browsing and search are read-heavy
checkout and return are write-sensitive but lower in volume than search
tenant and user management are relatively low-volume administrative operations

Practical performance measures

index copy, loan, and tenant-scoped lookup fields
cache frequently accessed catalog queries if needed
use read replicas for heavier read traffic or reporting
keep the checkout and return path as short as possible
monitor database lock contention around circulation operations

9. Monitoring and observability

A production version of this system should make the most important operational signals visible.

Core metrics

checkout success and failure rate
return success and failure rate
add-title and add-copy success rate
request latency by endpoint
database latency and connection health
lock contention during circulation operations

Logs and audit visibility

application logs for request and error tracking
structured logs for important business actions
audit event visibility for checkout, return, book creation, and tenant/user administration changes

Alerts

spike in failed checkout or return requests
database latency above threshold
audit write failures
authentication failures above normal baseline

10. Architectural principles if the system is mission-critical

If this becomes mission-critical, I would focus on the following:

1. Inventory correctness first

A book copy must never appear available and checked out at the same time.

2. Strong transactional boundaries

Checkout and return should have strong transactional boundaries.

3. Idempotency

Retries should not corrupt inventory state.

4. Auditability

Every inventory-changing event should be traceable.

5. High availability

multi-AZ database
multiple stateless service instances
outbox-backed async processing

6. Backup and recovery

regular database backups and point-in-time recovery should be enabled
audit data stored in long-term storage should be retained independently from the transactional database
recovery procedures should be documented and tested

7. Graceful degradation

If non-critical background audit processing is delayed, checkout/return should still succeed.

8. Security and tenant isolation

strict authn/authz
tenant filters everywhere
encryption at rest and in transit
secrets in managed stores

9. Observability

Track:

checkout failures
lock contention
return failures
DB latency
tenant-specific error rates

11. Design scope

This solution is intentionally centered on the core requirements of the exercise:

add a book title
add a book copy
checkout a book copy
return a book copy

It uses one modular backend service, one relational database, and a background worker for audit processing. The design can evolve later if scale or product scope grows.

12. Final note

A modular design around Catalog, Circulation, and tenant-aware access control, backed by PostgreSQL, provides a good balance of simplicity and correctness for this problem. The most important technical requirement is preventing invalid inventory state while keeping the system easy to operate and extend.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
README.md		README.md

Folders and files

Latest commit

History

Repository files navigation

Library Check-in / Check-out System Design

Goal

Contents

1. Domain research

1.1 How libraries usually operate

1.2 Sizing assumptions

2. Assumptions and design constraints

Functional assumptions

Non-functional assumptions

Design constraints

3. High-level logical design

3.1 Components diagram

3.2 Functional scope

3.3 Flows

3.4 Main service (Book Service)

3.4.1 Why one service is used here

3.4.2 Logical parts of the service

3.4.2.1 API handlers

3.4.2.2 Catalog management

3.4.2.3 Circulation management

3.4.2.4 Audit

3.5 SaaS and platform integration

3.5.1 Tenant and user management

3.5.2 Audit storage

4. Models

4.1 Core models

4.2 Key model relationships

4.3 Model notes

Title vs copy

Data integrity rules

Tenant and user model

Branch model

5. Sequence diagrams

5.1 Common authentication and authorization flow

5.2 Add a book title

5.3 Add a book copy

5.4 Checkout a book copy

5.5 Return a book copy

5.6 Publish audit events from outbox

6. Physical design

6.1 Deployment view

6.2 Network layout

6.3 Scaling and availability

6.4 Why this physical design fits the problem

6.5 Physical deployment diagram

7. Potential optimizations

Search optimization

8. Performance considerations

Main considerations

Read vs write profile

Practical performance measures

9. Monitoring and observability

Core metrics

Logs and audit visibility

Alerts

10. Architectural principles if the system is mission-critical

1. Inventory correctness first

2. Strong transactional boundaries

3. Idempotency

4. Auditability

5. High availability

6. Backup and recovery

7. Graceful degradation

8. Security and tenant isolation

9. Observability

11. Design scope

12. Final note

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Packages