In this paper, a novel 2D IDCT architecture, based on the energy compaction property of the 2D DCT, is proposed. This architecture performs 2D IDCT directly on the 2D DCT data set, avoiding the need for the transposition memory. We derive a recursion equation from the definition of the 2D IDCT algorithm and use it to implement a wavefront array processor. The wavefront array processor consists of highly regular, parallel and pipelined processing elements which are suitable for VLSI implementation. This implementation also utilizes the sparseness property of the 2D DCT coefficients to reduce the computational complexity. It is shown that the proposed architecture achieves a high throughput rate, (15+m) clock cycles per 2D DCT data set, where m is the number of the non-zero DCT coefficients. Another important aspect of this architecture is that it provides an efficient way to control the trade-off between visual quality of the reconstructed image and computational complexity.