Interesting topic!
I am still learning Assembler and also thought about doing my own specialized CopyMem.
Quote:
Originally Posted by PeterK
WinUAE
Yeah, I also think that WinUAE is not (always) a good way to do benchmarks.
Quote:
Originally Posted by PeterK
using MOVE.W instead of MOVE.L
That's a point I am wondering about. On the 020+, MOVE.L is the fastest. But what about the 000 and 010? How do two MOVE.W compare to one MOVE.L, given that the 000 and 010 have to split every 32-bit access into two 16-bit accesses?
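To make the comparison concrete, this is roughly what I mean. It is only a sketch: using A0 as source and A1 as destination is my assumption, and the cycle counts in the comments are what I read from the 68000 instruction timing tables, ignoring wait states and chip RAM contention.
Code:
; Variant A: one long move per 4 bytes
        MOVE.L  (A0)+,(A1)+     ; 20 cycles on the 000 (3 reads, 2 writes)
; Variant B: two word moves per 4 bytes
        MOVE.W  (A0)+,(A1)+     ; 12 cycles (2 reads, 1 write)
        MOVE.W  (A0)+,(A1)+     ; 12 cycles, so 24 for the pair
Going by the tables alone, the MOVE.L should save one instruction fetch per 4 bytes on the 000, but I have not measured this on real hardware and have not checked the 010 timings.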
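And on the 010, with its special loop mode, is it a good idea to unroll mini loops? For example (register usage is just an assumption for illustration), going from
Code:
LOOP:   MOVE.L  (A0)+,(A1)+     ; A0 = source, A1 = destination
        DBRA    D0,LOOP         ; D0 = number of longwords - 1
to the unrolled version
Code:
LOOP:   MOVE.L  (A0)+,(A1)+
        MOVE.L  (A0)+,(A1)+
        MOVE.L  (A0)+,(A1)+
        MOVE.L  (A0)+,(A1)+
        DBRA    D0,LOOP         ; D0 = (number of longwords / 4) - 1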