16 July 2008, 14:25 | #1 |
Registered User
Join Date: Mar 2008
Location: Poland
Posts: 159
|
Copper timing
Hello,
I'm trying to adapt Minimig's copper to behave like Amiga's one. The current solution written by Dennis is based on AHRM description and it's not perfect. I've done some tests on my A4k and WinUAE 1.5.0 to figure out the actual behaviour. But by chance I've found difference in UAE's and AGA's copper operation. The attached pictures show the results of the same program run on A4k and under WinUAE 1.5.0. The difference is noticeable when using 6 bitplanes in low resolution or 3 bitplanes in high resolution (I believe the difference is also present when using 5 bitplanes but my tests don't cover this case, they should use different alligment between wait and bitplane dma fetch cycles). It seems that WAIT and SKIP require 4 FREE even cycles. I only tested WAIT's which had been already true. Can somebody confirm this? What happens on the bus during these two additional cycles? Also there is a question for which I don't know the answer yet: in which cycle does the actual comparision take place? The description of copper MOVE's in WinUAE's sources are not understable by me . It's been written that these MOVE's require 3 or 4 cycles (depending if they move data to Agnus or other chip) but from my tests I can see that changing background colour (moves to Denise) takes 8 lores pixels (if all even cycles are free) and it's only 2 even color clock cycles. Where is the catch? I hope Toni will enlighten me . I'm looking forward to see any explanation. |
16 July 2008, 15:04 | #2 |
WinUAE developer
Join Date: Aug 2001
Location: Hämeenlinna/Finland
Age: 49
Posts: 26,505
|
That copper taking more time -comment most likely isn't true in real hardware (old test case works now without it but i fear it breaks something else without real fix..)
Could you attach your test program (including source) and I can use my logic analyzer to solve this problem. I have been planning to do this but I haven't got good test cases. Until now I guess Note: program needs to be ECS compatible, it is too difficult to connect LA probes to A1200 surface mount chips |
16 July 2008, 15:25 | #3 |
Registered User
Join Date: Mar 2008
Location: Poland
Posts: 159
|
The program is very dirty and bangs hardware directly. I quickly modified one of my test programs which I wrote 15 years ago. I run it from Asm-One without any system software loaded. It runs on OCS but under AGA it switches fetchmode to standard one. Thanks for help. I am really curious what happens on the bus
|
16 July 2008, 16:03 | #4 |
WinUAE developer
Join Date: Aug 2001
Location: Hämeenlinna/Finland
Age: 49
Posts: 26,505
|
Line 64 (+part of line 65) in LogicPort project file attached. I think LogicPort software (http://www.pctestinstruments.com/) works without the device and can load project files (that include captured data)
Software only has csv export and it isn't very visual I'll attach other lines after you have confirmed you can load attached lpf-file. Trigger condition was vsync + 64*hsync. Note that hsync isn't same as hpos=0, real hpos happens when there is 0x3c on the bus. (few cycles earlier) (I haven't examined the result yet, later..) EDIT: only RGA captured, data bus would mean 16 more probes, there is enough channels left but.. |
16 July 2008, 17:14 | #5 |
Registered User
Join Date: Mar 2008
Location: Poland
Posts: 159
|
Big thanks for very quick answer.
I confirm I can view the captured data with downloaded software. As I expected the RGA bus is idle during additional cycles of WAIT. Now I'm doing further copper timing research. Still don't know when actual comparision takes place. My Minimig's copper code is late in WAIT's about 8 lowres pixels. I would be interested in more captured data files if you have any and be so kind to share them (I mean not only copper but blitter would be interesting although I haven't touched it yet). |
16 July 2008, 17:50 | #6 |
Registered User
Join Date: Mar 2008
Location: Poland
Posts: 159
|
@Toni
I would like to see a capture of a line where all slots are taken by bitplane dma and wait is over during that line but further copper list processing is postponed until display dma is inactive (the last case in my program). It will answer some of my questions. Thanks once again. |
16 July 2008, 17:56 | #7 |
WinUAE developer
Join Date: Aug 2001
Location: Hämeenlinna/Finland
Age: 49
Posts: 26,505
|
Can you list the lines that are needed?
btw, I was going to setup data bus capture but I noticed I run out of probes. (found 10, need 6 more..) It seems I have lost some of them.. Blitter is as documented in HRM (channel cycle diagrams). Nothing special, fortunately or unfortunately It can use any available DMA slot. |
16 July 2008, 18:31 | #8 |
Registered User
Join Date: Mar 2008
Location: Poland
Posts: 159
|
Line $D2 would be the best.
Does it mean that blitter can use odd slots if they are not used by bitplanes, sprites, audio, disk nor refresh? |
16 July 2008, 18:53 | #9 | |
WinUAE developer
Join Date: Aug 2001
Location: Hämeenlinna/Finland
Age: 49
Posts: 26,505
|
Ok. I can do multiple lines as long as I know which ones. I am not going to do 20+ lines
Quote:
|
|
17 July 2008, 11:18 | #10 |
WinUAE developer
Join Date: Aug 2001
Location: Hämeenlinna/Finland
Age: 49
Posts: 26,505
|
More lines attached. (note that previous file was incorrect, it was line 66..)
Line $d2 and $40 from your test program. Other files include following tests: "moves" = move to 180, move to 182, move to 180... "waits" = wait line 1, wait line 1, wait line 1 etc.. (skip,skip,skip was 100% identical) "jumps" = coplc2 points to move to copjmp2. (copjmp2, copjmp2, copjmp2..) Still need to know which "free" cycles are really free or used but null word in rga.. (Is there signal that shows when "Agnus" bus is active?) |
17 July 2008, 16:43 | #11 |
Registered User
Join Date: Mar 2008
Location: Poland
Posts: 159
|
I noticed that the previously captured data was not from line $40 and assumed that it was from line $42 or $44 (those lines are probably indistinguishable).
In the meantime I discovered another undocumented feature: after write to COPJMP2 there are two additional cycles before first instruction from new location is fetched. I have also made tests with A4000 and WinUAE and found another difference considering this issue (I know that WinUAE takes these cycles into consideration). See atached pictures and source code. The code jumps to new location after performing 4 MOVE's. It is done in 5 bitplane display mode so there are 3 free even cycles and one taken by plane 5 dma. In four lines the same sequence repeats with different start position. Could you capture line $42, $44, $46 and $48? I'm interested in the first cycle after COPJMP2 is written. It seems that the first cycle after write to COPJMPx is just one even cycle (free or not, in your captured file there is a read to COPINS) and the second one must be a free cycle (with RGA NULL). As far as I know there is no way to determine if a given cycle is a real free dma cycle or not, such a logic is deeply burried inside Agnus. |
17 July 2008, 17:12 | #12 |
Global Moderator
Join Date: Nov 2001
Location: Derby, UK
Age: 48
Posts: 9,355
|
/me is confused, or should I say..
/me doesn't understand |
17 July 2008, 17:32 | #13 | ||
WinUAE developer
Join Date: Aug 2001
Location: Hämeenlinna/Finland
Age: 49
Posts: 26,505
|
Quote:
Latch chips between CPU and Agnus data bus is probably the best place.. or some signal that prevents CPU's memory accesses. btw, I can't do more tests until saturday or sunday. Quote:
|
||
17 July 2008, 18:18 | #14 |
Registered User
Join Date: Mar 2008
Location: Poland
Posts: 159
|
Thanks for your help. There is a signal from Agnus to Gary called /DBR, when it's high chip bus is available for CPU.
|
20 July 2008, 16:11 | #15 |
WinUAE developer
Join Date: Aug 2001
Location: Hämeenlinna/Finland
Age: 49
Posts: 26,505
|
Same tests plus "empty line" done + "/BLIT" signal included (which appears to be "agnus wants the bus"-signal. Used Gary pin 15)
I can see /DBS in Agnus pinout but I don't see it marked on any schematics.. I also tried /REGEN (which should be "RGA in use" but it was totally quiet..) This test confirms that writing to COPJMP does need extra memory cycle which is not used for anything.. (copper_jumps.lpf) COPJMP cycle usage seems to be: 1:fetch copper instruction (normal move cycle 1) 2:write to copjmpx (normal move cycle 2) 3:fetch next instruction(write to copins. I really need to see data in data bus..) 4:use another cycle for nothing (copy internal pointer?) 5:continue normally More tests soon.. |
20 July 2008, 17:01 | #16 | |
Registered User
Join Date: Mar 2008
Location: Poland
Posts: 159
|
"/BLIT" is another name for "/DBR". In all functional descriptions in Technical Reference and Service Manuals for A500/A2000 the name "/DBR" is used but in schematics it's named "/BLIT". It prevents CPU from getting bus access in current cycle.
"/REGEN" is an output from CPU address decoder. It's only active when CPU wants to access custom registers. Quote:
As for 3: it apears that this cycle doesn't have to be a write to copins. It can also be a bitplane dma fetch. As for 4: it apears that this cycle must be a free bus cycle. If it's taken by bitplane dma another cycle is required. |
|
20 July 2008, 17:12 | #17 | ||
WinUAE developer
Join Date: Aug 2001
Location: Hämeenlinna/Finland
Age: 49
Posts: 26,505
|
Quote:
Quote:
|
||
21 July 2008, 17:17 | #18 | |
WinUAE developer
Join Date: Aug 2001
Location: Hämeenlinna/Finland
Age: 49
Posts: 26,505
|
Quote:
Bitplane dma can use "cycle 3". BUT blitter CAN'T use it.. (it becomes "8C" cycle again) Test case 1: No bitplane dma. Continuous COPJMP2's. D=A blit (and nasty bit set). -> 08C 000 08A 074 08C 000 1FE 074 (and repeats) Test case 2: 3 planes + hires. Continuous COPJMP2's. D=A blit (and nasty bit set) -> 112 114 110 xxx (and repeats) where xxx = 8C 8A 1FE 8C 8A 1FE (blitter didn't get any cycles) It seems Agnus DMA scheduler is weird |
|
21 July 2008, 17:40 | #19 | |
Registered User
Join Date: Mar 2008
Location: Poland
Posts: 159
|
Quote:
The "blitter-nasty" mode keeps only the CPU off the bus, not the copper. |
|
21 July 2008, 18:24 | #20 |
WinUAE developer
Join Date: Aug 2001
Location: Hämeenlinna/Finland
Age: 49
Posts: 26,505
|
Yes but the point is that copper does not need that cycle for anything but it still "steals" it if it is available. But you are right, it most likely depends on dma priority levels.
|
Currently Active Users Viewing This Thread: 1 (0 members and 1 guests) | |
Thread Tools | |
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Combining copper scrolling with copper background | phx | Coders. Asm / Hardware | 16 | 13 February 2021 12:41 |
Blitter fill timing | leonard | Coders. Asm / Hardware | 42 | 01 September 2014 11:00 |
ACA1230 timing fix | bx20 | support.Hardware | 17 | 24 August 2011 09:58 |
Even more sound timing issues... | andreas | support.WinUAE | 11 | 30 November 2005 11:23 |
A1200 timing fixes? | icewizard2k5 | support.Hardware | 2 | 28 February 2005 09:37 |
|
|