Torch2Gemmini

This project provides an end-to-end MLIR-based compilation and acceleration flow that integrates Torch-MLIR, Buddy-MLIR, and Chipyard for running PyTorch workloads on Gemmini accelerator implemented on an FPGA. The pipeline lowers high-level tensor operations from PyTorch through MLIR dialects in Buddy into Gemmini-compatible accelerator code. The repository also includes setup scripts and integration flows required to connect the software stack with the underlying FPGA deployment environment.

git clone https://github.com/NMJ0/Torch2Gemmini.git

click on the steps to see dropdown

Steps to set up forked CHIPYARD(https://github.com/NMJ0/chipyard.git)

Clone the forked chipyard repository

git clone https://github.com/NMJ0/chipyard.git

Build chipyard ecosystem

cd chipyard
./build-setup.sh riscv-tools

If any error related to firtool comes -- Steps to install firtools
(this is what worked for me, you can try to set up firtools in your own way)

# Step 1: Remove the broken firtool
rm /home/noel/chipyard/.conda-env/bin/firtool   ## change to your system's path

# Step 2: Download full CIRCT distribution (includes libraries)
cd /tmp
rm -rf circt-dist
mkdir circt-dist && cd circt-dist
wget -q -O - https://github.com/llvm/circt/releases/download/firtool-1.56.1/circt-full-shared-linux-x64.tar.gz | tar -zx

# Step 3: Copy entire directory to conda environment
cp -r firtool-1.56.1/* $CONDA_PREFIX/

# Step 4: Verify - check both binary and libraries exist
ls -la $CONDA_PREFIX/bin/firtool
ls -la $CONDA_PREFIX/lib/libCIRCT*.so

# Step 5: Test
firtool --version

Steps to set up forked buddy-mlir (https://github.com/NMJ0/buddy-mlir.git)

Install dependencies

sudo apt install flatbuffers-compiler libflatbuffers-dev libnuma-dev

Clone and initialize the forked buddy-mlir repo

git clone https://github.com/NMJ0/buddy-mlir.git
cd buddy-mlir
git submodule update --init llvm

pip install requirements

pip install -r requirements.txt

Build and test LLVM

cd buddy-mlir
mkdir llvm/build
cd llvm/build

cmake -G Ninja ../llvm \
    -DLLVM_ENABLE_PROJECTS="mlir;clang;openmp" \
    -DLLVM_TARGETS_TO_BUILD="host;RISCV" \
    -DLLVM_ENABLE_ASSERTIONS=ON \
    -DOPENMP_ENABLE_LIBOMPTARGET=OFF \
    -DCMAKE_BUILD_TYPE=RELEASE \
    -DMLIR_ENABLE_BINDINGS_PYTHON=ON \
    -DPython3_EXECUTABLE=$(which python3)

ninja check-clang check-mlir omp

Build BUDDY

cd buddy-mlir
mkdir build
cd build

cmake -G Ninja .. \
    -DMLIR_DIR=$PWD/../llvm/build/lib/cmake/mlir \
    -DLLVM_DIR=$PWD/../llvm/build/lib/cmake/llvm \
    -DLLVM_ENABLE_ASSERTIONS=ON \
    -DCMAKE_BUILD_TYPE=RELEASE \
    -DBUDDY_MLIR_ENABLE_PYTHON_PACKAGES=ON \
    -DPython3_EXECUTABLE=$(which python3)

ninja
ninja check-buddy

Steps to set up forked torch-mlir (https://github.com/NMJ0/torch-mlir.git)

chmod +x torch_mlir_setup.sh
./torch_mlir_setup.sh

FPGA Programming Guide

For tools such as Vivado, use your own systems path.Paths in my system are used in these instructions.This is only an example; depending on the fpga used, you might need to change some variables.

1. Enable the Chipyard Environment

Navigate to the Chipyard directory and enable the environment.

(base) noel@sjd:~$ cd chipyard/
(base) noel@sjd:~/chipyard$ source env.sh 
(/home/noel/chipyard/.conda-env) noel@sjd:~/chipyard$

This activates the Chipyard Conda environment required for building and running tools.

2. Enable Vivado 2018 NFS Server

Mount the hardware tools and enable Vivado 2018.

(/home/noel/chipyard/.conda-env) noel@sjd:~/chipyard/fpga$ hwmountnfs
[sudo] password for noel: 
(/home/noel/chipyard/.conda-env) noel@sjd:~/chipyard/fpga$ v18
(/home/noel/chipyard/.conda-env) noel@sjd:~/chipyard/fpga$ which vivado 
/home/noel/tools/hw_tools/Vivado/Vivado/2018.3/bin/vivado

This ensures the correct Vivado version is active.

3. Generate Bitstream for Rocket + Gemmini

Generate the FPGA bitstream using the configuration defined in Configs.scala.

(/home/noel/chipyard/.conda-env) noel@sjd:~/chipyard/fpga$ make SUB_PROJECT=nexysvideo CONFIG=RocketGemminiNexysVideo10MHzConfig bitstream

The generated bitstream will be placed in the generated-src directory.

4. Build ELF Binaries for C Programs

Navigate to the Gemmini software tests and build the binaries.

(/home/noel/chipyard/.conda-env) noel@sjd:~/chipyard/generators/gemmini/software/gemmini-rocc-tests$ ./build.sh

All C programs are compiled into ELF binaries and stored in the build directory.

5. Connect and Program the FPGA

Steps to program the FPGA:

Open Vivado Hardware Manager
Connect both:

UART cable
FPGA programming cable

Click Auto Connect
Click Program Device
Select the bitstream file from:

/home/noel/chipyard/fpga/generated-src

Program the device.

Important: Reset the CPU before running each program.(There is a CPU Reset button on the FPGA )

6. Identify the Connected USB Port

Check which UART port corresponds to the FPGA.

(/home/noel/chipyard/.conda-env) noel@sjd:~/chipyard$ ls /dev/ttyUSB*
/dev/ttyUSB0

7. Running Programs on the FPGA

Running Hello World on the Rocket Core

(/home/noel/chipyard/.conda-env) noel@sjd:~/chipyard$ ~/chipyard/.conda-env/riscv-tools/bin/uart_tsi   +tty=/dev/ttyUSB0 +baudrate=115200   +selfcheck software/baremetal-ide/build/examples/chipyard-tests/hello.elf

Running the Gemmini Template Test

This program multiplies a matrix with an identity matrix and verifies that the output matches the input.

(/home/noel/chipyard/.conda-env) noel@sjd:~/chipyard$ ~/chipyard/.conda-env/riscv-tools/bin/uart_tsi   +tty=/dev/ttyUSB0 +baudrate=115200   +selfcheck generators/gemmini/software/gemmini-rocc-tests/build/bareMetalC/template-baremetal

Running Matrix Addition on Rocket + Gemmini

(/home/noel/chipyard/.conda-env) noel@sjd:~/chipyard$ ~/chipyard/.conda-env/riscv-tools/bin/uart_tsi   +tty=/dev/ttyUSB0 +baudrate=115200   +selfcheck generators/gemmini/software/gemmini-rocc-tests/build/bareMetalC/matrix_add-baremetal

Build and compilation flow

move to the target folder

cd matmul  ## or mlp mnist/Lenet

To generate MLIR file from torch

make gen_mlir

lowering the mlir file

make lower-std #uses only cpu
make lower-gemmini #uses gemmini

linking the object file with the c wrapper

make link-std  # cpu only
make link-gemmini  #gemmini

running on spike simulator

make run-std #cpu only
make run-gemmini #gemmini

running on FPGA (Do not forget to click cpu reset button on the FPGA after each run)

make run-uart-std #cpu only
make run-uart-gemmini #gemmini

Documentation

Acknowledgement

This work was completed as part of the Undergraduate Research Credit (UGRC) program at IIT Madras under the guidance of Dr.Karthik Sankaranarayanan.

Name		Name	Last commit message	Last commit date
Latest commit History 49 Commits
Lenet		Lenet
MLP_mnist		MLP_mnist
matmul		matmul
README.md		README.md
report.pdf		report.pdf
slides.pdf		slides.pdf
torch_mlir_setup.sh		torch_mlir_setup.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Torch2Gemmini

1. Enable the Chipyard Environment

2. Enable Vivado 2018 NFS Server

3. Generate Bitstream for Rocket + Gemmini

4. Build ELF Binaries for C Programs

5. Connect and Program the FPGA

6. Identify the Connected USB Port

7. Running Programs on the FPGA

Build and compilation flow

Documentation

Acknowledgement

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Torch2Gemmini

1. Enable the Chipyard Environment

2. Enable Vivado 2018 NFS Server

3. Generate Bitstream for Rocket + Gemmini

4. Build ELF Binaries for C Programs

5. Connect and Program the FPGA

6. Identify the Connected USB Port

7. Running Programs on the FPGA

Build and compilation flow

Documentation

Acknowledgement

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages