Assignment 1 — Negacyclic NTT in JAX

In this assignment, you will implement a negacyclic Number Theoretic Transform (NTT) in JAX and measure its performance. The goal is to keep the code correct and push performance as far as you can.

You are free (and encouraged) to use any AI tools, both for learning the algorithm and for optimizing your code. AI will certainly help, but you will likely need to understand and improve its output to reach top performance.

Useful links

Algorithm specifics

The NTT is the FFT, but for exact modular arithmetic instead of complex numbers. That makes it a core tool for fast polynomial multiplication, especially in cryptography.
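To make "polynomial multiplication modulo x^N + 1" concrete, here is a minimal schoolbook sketch of the product that the negacyclic NTT is meant to accelerate. The function name `negacyclic_mul_schoolbook` is hypothetical (it is not part of the assignment API), and the sketch assumes a small q so that every intermediate fits in int32.

```python
import jax.numpy as jnp

def negacyclic_mul_schoolbook(a, b, q):
    """O(N^2) product of two length-N coefficient vectors modulo x^N + 1 and q.

    Hypothetical helper for illustration only; assumes q is small enough
    that N * q fits comfortably in int32.
    """
    n = a.shape[0]
    i = jnp.arange(n)
    deg = i[:, None] + i[None, :]          # degree of each term a[i] * b[j]
    prod = (a[:, None] * b[None, :]) % q   # all pairwise coefficient products
    # x^N is congruent to -1, so terms of degree >= N wrap around with a sign flip.
    sign = jnp.where(deg >= n, -1, 1)
    out = jnp.zeros(n, dtype=prod.dtype)
    out = out.at[(deg % n).ravel()].add((sign * prod).ravel())
    return out % q

# Example with N = 4, q = 17: x * x^3 = x^4, which is -1 modulo x^4 + 1.
a = jnp.array([0, 1, 0, 0], dtype=jnp.int32)
b = jnp.array([0, 0, 0, 1], dtype=jnp.int32)
result = negacyclic_mul_schoolbook(a, b, 17)
```

A transform-based approach replaces this quadratic loop with forward transforms, a pointwise product, and an inverse transform.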

The forward negacyclic NTT for polynomials modulo x^N + 1:

$y[k] = \sum_{n=0}^{N-1} x[n] \cdot \psi^{(2k+1)n} \pmod{q}$

Where:

N is the transform size
q is a prime modulus where (q - 1) is divisible by 2N
psi is a primitive 2N-th root of unity modulo q, so psi^N ≡ -1 (mod q)
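The definition above can be evaluated directly as a slow O(N^2) reference, which may be handy for checking a fast implementation on tiny inputs. This is only a sketch: the name `naive_negacyclic_ntt` and its signature are hypothetical, and the toy parameters (N = 4, q = 17, psi = 2) are chosen so that all intermediates fit in int32. It is not the layout or API used by the provided tests.

```python
import jax.numpy as jnp

def naive_negacyclic_ntt(x, q, psi):
    """Direct evaluation of y[k] = sum_n x[n] * psi^((2k+1) n) mod q.

    x has shape (B, N). Hypothetical reference helper; assumes q is small
    enough that N * q * q fits in int32.
    """
    n_size = x.shape[-1]
    k = jnp.arange(n_size)
    n = jnp.arange(n_size)
    # psi has order 2N, so exponents can be reduced modulo 2N.
    exps = ((2 * k[:, None] + 1) * n[None, :]) % (2 * n_size)
    # Exact table of psi^e mod q, built with Python integers.
    table = jnp.array([pow(psi, e, q) for e in range(2 * n_size)], dtype=jnp.int32)
    w = table[exps]                                   # W[k, n], shape (N, N)
    prods = (x[:, None, :].astype(jnp.int32) * w[None, :, :]) % q  # (B, N, N)
    return jnp.sum(prods, axis=-1) % q                # (B, N)

# Toy parameters: N = 4, q = 17 (q - 1 = 16 is divisible by 2N = 8), psi = 2.
assert pow(2, 4, 17) == 17 - 1   # psi^N is congruent to -1 mod q
x = jnp.array([[1, 0, 0, 0], [0, 1, 0, 0]], dtype=jnp.int32)
y = naive_negacyclic_ntt(x, 17, 2)
```

For the second row (x[1] = 1, all else zero) the output is simply psi^(2k+1) mod q for each k.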

Your function must handle inputs shaped (B, N) (batch dimension B). psi_powers and twiddles are inputs to your implementation. The tests pass the tables from provided.precompute_tables, but you can transform them once in prepare_tables or use your own layout internally. No auto-conversion is required or expected.

You may use any correct NTT algorithm.

Optional hook: prepare_tables(q=..., psi_powers=..., twiddles=...) can precompute or convert tables once. The benchmark calls this before timing, so its cost is excluded.

What to do

Your implementation goes in student.py — that's the only file you edit.

Implement ntt in student.py.
Implement mod_add, mod_sub, and mod_mul in student.py.
Keep the public API unchanged so tests and benchmarks still run.
Focus on speed. Correctness is required, but the goal is fast code.

Your implementation should run unmodified on both CPU and GPU backends. We suggest testing correctness locally first. Once you have a working version, move to a GPU to test and optimize performance.

Performance tips

Keep everything in JAX and JIT the hot path.
Precompute or convert tables once in prepare_tables.
Modular arithmetic using % may not be the fastest approach.
Use uint32/uint64 carefully to avoid overflow and extra conversions.
You can even use Pallas or other JAX lowering tools for additional performance.
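As one illustration of the tip about avoiding %, modular addition and subtraction can be done with a single conditional correction. The sketch below uses hypothetical names (`mod_add_sketch`, `mod_sub_sketch`) and an assumed argument order; the required signatures in student.py may differ. It assumes both inputs are already reduced to [0, q) and that q < 2^31, so that a + b and a + q cannot overflow uint32.

```python
import jax
import jax.numpy as jnp

@jax.jit
def mod_add_sketch(a, b, q):
    """(a + b) mod q without %, assuming 0 <= a, b < q and q < 2**31."""
    s = a + b                          # at most 2q - 2, fits in uint32
    return jnp.where(s >= q, s - q, s)

@jax.jit
def mod_sub_sketch(a, b, q):
    """(a - b) mod q without %, assuming 0 <= a, b < q and q < 2**31."""
    # When a < b, add q before subtracting so the result stays non-negative.
    return jnp.where(a >= b, a - b, a + q - b)

q = jnp.uint32(17)
a = jnp.array([3, 16, 0], dtype=jnp.uint32)
b = jnp.array([5, 16, 1], dtype=jnp.uint32)
added = mod_add_sketch(a, b, q)
subbed = mod_sub_sketch(a, b, q)
```

Multiplication is harder because the product of two 32-bit residues needs a wider intermediate; that is where the uint32/uint64 tip above comes into play.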

Setup

Install uv if you don't have it:

# macOS / Linux
curl -LsSf https://astral.sh/uv/install.sh | sh

# Windows (PowerShell)
powershell -ExecutionPolicy ByPass -c `
  "irm https://astral.sh/uv/install.ps1 | iex"

Then from this directory:

bash scripts/setup.sh

scripts/setup.sh installs CPU JAX by default. If an NVIDIA driver is present, it installs the matching CUDA wheels. It does not modify your shell.

For GPU requirements, see: https://docs.jax.dev/en/latest/installation.html

Running tests

uv run pytest
uv run pytest --logn 10 --batch 4

Running benchmarks

uv run python -m tests.benchmark
uv run python -m tests.benchmark --tests --logn 10 --batch 4
uv run python -m tests.benchmark --bench --logn 12 --batch 4

With no flags, it runs both tests and the benchmark.

Options:

uv run python -m tests.benchmark --tests            # only run tests
uv run python -m tests.benchmark --bench            # only run benchmark
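If you want a quick timing of your own outside the provided benchmark, keep in mind that JAX compiles on the first call and dispatches work asynchronously. The sketch below is a hypothetical helper (`time_jitted` is not part of the assignment and is not how tests.benchmark is necessarily implemented) that does one warm-up call and waits for results before stopping the clock.

```python
import time
import jax
import jax.numpy as jnp

def time_jitted(fn, *args, repeats=10):
    """Average wall-clock seconds per call of fn(*args), excluding compilation."""
    jax.block_until_ready(fn(*args))   # warm-up call triggers JIT compilation
    start = time.perf_counter()
    for _ in range(repeats):
        out = fn(*args)
    jax.block_until_ready(out)         # wait for asynchronously dispatched work
    return (time.perf_counter() - start) / repeats

# Example: time a trivial jitted function on a (4, 1024) batch.
double = jax.jit(lambda v: v * 2)
seconds = time_jitted(double, jnp.ones((4, 1024), dtype=jnp.uint32))
```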

Submission

bash scripts/make_submission.sh

This runs the tests and produces code.zip. Upload code.zip to Brightspace.
