The 080 CPU allows 64 bit mem accesses while the blitter does only 16 bit. Since the blitter and CPU memory accesses are serialised in the V4 due to long mem bursts, the blitter always loses when compared with a CPU-only routine regardless of the memory bandwidth.
|