Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
4970631 | Integration, the VLSI Journal | 2017 | 9 Pages |
Abstract
Discrete Fourier Transform (DFT) is widely used in almost all fields of science and engineering. Meanwhile, modern applications processing big data, such as images and sound, require increasingly complex features, such as the long and non-power-of-two hardware DFT and floating-point operations with wide ranges and high effective resolutions. In this paper, we propose a method to extend the matrix-factorization-based DFT algorithm for performing non-power-of-two DFTs of length N equal to the product of coprime numbers. Based on this algorithm, we also present a new DFT architecture synthesizer with high portability, called AutoNFT, to generate hardware DFT in a fully parallel structure. The architecture also contains a high-performance floating-point core to work at 1Â GHz. DFTs generated by AutoNFT can run at 500 Mhz using a 40Â nm industry library. This technology can handle 115 billion fixed-point samples per second on 256-point DFT and 13.5 billion floating-point samples per second on 30-point DFT.
Related Topics
Physical Sciences and Engineering
Computer Science
Hardware and Architecture
Authors
Gan Feng, Lan Yao, Song Chen,