RTG should make a huge difference. The chunky-to-planar (C2P) conversion is quite CPU intensive and is no longer required, and frame data no longer needs to be copied into slow Chip RAM before being displayed.
With RTG + AHI (if you have a sound card), everything can live in 32-bit Fast RAM close to the accelerator.
If you don't notice a difference... take a look at your RAM priorities.
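
To give a feel for why the chunky-to-planar step costs so much CPU time, here is a rough sketch of a naive conversion in C. It is only an illustration, not what any real game or driver uses (real C2P routines are typically hand-tuned 68k assembly). The function name `c2p_naive` and the layout are my own assumptions: 8-bit chunky pixels, 8 separate (non-interleaved) bitplanes, leftmost pixel in the most significant bit.

```c
#include <stddef.h>
#include <stdint.h>

/*
 * Naive chunky-to-planar conversion (illustrative sketch only).
 *
 * chunky : one byte per pixel, 8 bits deep
 * count  : number of pixels, assumed to be a multiple of 8
 * planes : 8 output buffers, each at least count / 8 bytes long
 *
 * Bit p of every pixel ends up in plane p. Within each output byte,
 * the leftmost pixel of the group of 8 lands in the top bit.
 */
void c2p_naive(const uint8_t *chunky, size_t count, uint8_t *planes[8])
{
    for (size_t i = 0; i < count; i += 8) {
        for (int p = 0; p < 8; p++) {
            uint8_t out = 0;
            for (int x = 0; x < 8; x++) {
                /* Pick bit p of pixel x and move it to bit (7 - x). */
                out |= (uint8_t)(((chunky[i + x] >> p) & 1u) << (7 - x));
            }
            planes[p][i / 8] = out;
        }
    }
}
```

Every single pixel gets touched once per bitplane, so the inner statement runs 8 times per pixel, and on a planar display the results then still have to be written out to Chip RAM. With RTG the chunky buffer can go to the graphics card as it is, so this whole step disappears.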