A course project on optimization of the DCT algorithm in C and ARM Assembly.
Updated Aug 19, 2020. Language: C.
VectorFFT: A permutation-free mixed-radix FFT library that beats Intel MKL and FFTW on every tested size. Hand-tuned AVX2/AVX-512 codelets for 18 radixes, zero-allocation in-place execution, and a wisdom-based planner. 1.1x–4.9x faster than Intel MKL, 1.3x–12.8x faster than FFTW. Pure C, no dependencies.
For more info please visit https://en.wikipedia.org/wiki/Fast_Walsh%E2%80%93Hadamard_transform.
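As a rough illustration of the butterfly operation behind the transform linked above, here is a minimal sketch of an in-place, unnormalized fast Walsh-Hadamard transform in C. The function name `fwht` and its signature are assumptions made for this example and are not taken from any listed repository; the length `n` is assumed to be a power of two.

```c
#include <stddef.h>

/* Hypothetical helper (name and signature are illustrative only).
 * In-place, unnormalized fast Walsh-Hadamard transform in natural
 * (Hadamard) order. Assumes n is a power of two. */
void fwht(double *a, size_t n)
{
    /* h is the distance between the two inputs of each butterfly. */
    for (size_t h = 1; h < n; h *= 2) {
        /* Process the array in blocks of 2*h elements. */
        for (size_t i = 0; i < n; i += 2 * h) {
            for (size_t j = i; j < i + h; ++j) {
                /* Butterfly: replace the pair (x, y) with (x + y, x - y). */
                double x = a[j];
                double y = a[j + h];
                a[j] = x + y;
                a[j + h] = x - y;
            }
        }
    }
}
```

Calling `fwht(a, 8)` on the array `{1, 0, 1, 0, 0, 1, 1, 0}` leaves `{4, 2, 0, -2, 0, 2, 0, 2}` in `a`. Because this sketch omits normalization, applying it twice returns the original input scaled by `n`.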
Reference: Introduction to Algorithms (Third Edition)