heal.abstract |
Many high-end applications dictate the efficient implementation of Digital Signal Processing (DSP) algorithms. The development of innovative hardware architectures with the main purpose of accelerating the execution of such algorithms has been proven to be area and power efficient.
In the scope of this diploma thesis, a new architecture is proposed, specifically designed for the accelerated mapping of DSP algorithms and other computation intensive applications. As the core processing unit of this architecture, two distinctive modules are introduced, the Unified Double Multiplier-Adder (UDMA) and a modified variation of the UDMA that implements a two-stage pipeline. These two units are flexible arithmetic circuits optimally designed to reduce their critical delay, which can process data internally in Carry Save representation while using 16-bit binary I/O.
These certain operations, which mainly consist of multiplications and additions or subtractions, are common in DSP algorithms. Thus, by properly chaining the execution of these operations the accelerated mapping of DSP kernels can be achieved.
The UDMA-based architecture is different to the ones that have already been proposed, in that it targets the optimization of both the architectural and arithmetic levels of design, and that it seeks to take advantage of the efficient grouping of two multiplications together, which is the focal point of the UDMA‟s design. Its performance is evaluated by comparing the latency, area, and power consumption figures with the highly efficient FCU-based architecture. |
el |