I saw that in Supercars 2
Code:
move.l a0,(a4)+ ;mask
move.l a1,(a4)+ ;data0
move.l a2,(a4)+ ;scn0
wouldn't it be faster to use
Code:
movem.l a0-a2,(a4)+
(and would it be equivalent) ? Maybe there's a threshold where movem is faster. Is it 2, 3, or more registers?