First, small correction. I missed fillscratch, so that's 5sec for fill+copy and 7sec for code within FillPolygon2D...
How about if you test these scenarios:
- completely skip blitter code in the interrupt handler (only handle intena bits),
- only handle clear screen and similar (skip poly related blits),
- instead of actual poly blits, use size=(1<<6)+1 D=0 blits (minimal size blits, so you keep triggering interrupts).
- anything similar that comes to your mind.
So you can see how much overhead you have from pure interrupts, interrupts with minimal blits, etc. and then compare.
|