English Amiga Board


Go Back   English Amiga Board > Coders > Coders. General

 
 
Thread Tools
Old 09 July 2024, 21:17   #161
abu_the_monkey
Registered User
 
Join Date: Oct 2020
Location: Bicester
Posts: 2,066
I can run it on the stock A4000 040 tomorrow evening if that helps.
abu_the_monkey is offline  
Old 09 July 2024, 22:06   #162
Karlos
Alien Bleed
 
Karlos's Avatar
 
Join Date: Aug 2022
Location: UK
Posts: 4,712
Quote:
Originally Posted by abu_the_monkey View Post
I can run it on the stock A4000 040 tomorrow evening if that helps.
It does. It would be very interesting if the shift normalisation is dramatically faster than the delta lookup since my expectation was that the lookup should be hitting the datacache at least every 15 out of 16 lookups. If that's not the case, I have some headscratching to do.
Karlos is offline  
Old 09 July 2024, 22:12   #163
Karlos
Alien Bleed
 
Karlos's Avatar
 
Join Date: Aug 2022
Location: UK
Posts: 4,712
The more I see of the 68060, the more I wish I'd gotten one back in the day.
Karlos is offline  
Old 10 July 2024, 11:37   #164
grond
Registered User
 
Join Date: Jun 2015
Location: Germany
Posts: 1,938
Quote:
Originally Posted by Karlos View Post
It does. It would be very interesting if the shift normalisation is dramatically faster than the delta lookup since my expectation was that the lookup should be hitting the datacache at least every 15 out of 16 lookups. If that's not the case, I have some headscratching to do.
I think the effect may be less because all the graphics data will have pushed out anything from the audio mixing by the next time you get into audio mixing. Then there will be a lot of reads from memory when you do the next bit of mixing before the cache can actually kick in.
grond is offline  
Old 10 July 2024, 11:43   #165
Karlos
Alien Bleed
 
Karlos's Avatar
 
Join Date: Aug 2022
Location: UK
Posts: 4,712
Quote:
Originally Posted by grond View Post
I think the effect may be less because all the graphics data will have pushed out anything from the audio mixing by the next time you get into audio mixing. Then there will be a lot of reads from memory when you do the next bit of mixing before the cache can actually kick in.
Well, that might be true in the game, but this is a standalone test program. There's nothing else to get into the datacache.

The way I'm doing stuff, I have all cache aligned data structures. We use move16 to transfer from sample data into a fetch buffer that we then operate on. The fetch and accumulation buffers are adjacent and cache aligned and so should be hot as soon as you've processed the first few samples. The only quasi random lookups are for converting the 8 bit samples to some volume scaled 16-bot one. This is what the delta encoding aims to improve. Almost all of the delta values are small enough to fit into a pair of cache lines. Only the first sample of 16 has to do a proper lookup.

The move16 operations are intended to avoid trashing the cache. The source data should not enter the cache and the destination location will be cached. We can even pre hint that if needed.

I am curious to see how these all perform on a faster 68040 implementation with local fast memory. The 3640 is useful for a worst case analysis since it has a relatively slow interface into memory.

Last edited by Karlos; 10 July 2024 at 11:54.
Karlos is offline  
Old 10 July 2024, 17:50   #166
Karlos
Alien Bleed
 
Karlos's Avatar
 
Join Date: Aug 2022
Location: UK
Posts: 4,712
At the risk of seeming obsessed,I think I am going to add a "generic" version to this code that does not use any 040/060 specific operations (i.e. doesn't use the fetch buffer) and just reads the frame data directly from the sample source. We might get away with using the same mixer on 030, albeit with fewer input channels.

For that to be affordable, I'd probably want to have shift-only normalisation too.
Karlos is offline  
Old 10 July 2024, 21:20   #167
abu_the_monkey
Registered User
 
Join Date: Oct 2020
Location: Bicester
Posts: 2,066
as requested.
Attached Thumbnails
Click image for larger version

Name:	tkg_sound_040_10-07-2024.jpg
Views:	23
Size:	400.9 KB
ID:	82678  
abu_the_monkey is offline  
Old 10 July 2024, 21:22   #168
Karlos
Alien Bleed
 
Karlos's Avatar
 
Join Date: Aug 2022
Location: UK
Posts: 4,712
Quote:
Originally Posted by abu_the_monkey View Post
as requested.
Ta pet. I'll have a look after the footie
Karlos is offline  
Old 10 July 2024, 21:28   #169
Karlos
Alien Bleed
 
Karlos's Avatar
 
Join Date: Aug 2022
Location: UK
Posts: 4,712
A quick glance suggests it's only slightly faster, for what could be a pretty significant reduction in quality.
Karlos is offline  
Old 10 July 2024, 21:49   #170
abu_the_monkey
Registered User
 
Join Date: Oct 2020
Location: Bicester
Posts: 2,066
Quote:
Originally Posted by Karlos View Post
A quick glance suggests it's only slightly faster, for what could be a pretty significant reduction in quality.
would it be better on an actual accelerator as opposed to a plain commodore processor card?
abu_the_monkey is offline  
Old 10 July 2024, 22:11   #171
Karlos
Alien Bleed
 
Karlos's Avatar
 
Join Date: Aug 2022
Location: UK
Posts: 4,712
Quote:
Originally Posted by abu_the_monkey View Post
would it be better on an actual accelerator as opposed to a plain commodore processor card?
I expect it to perform better, but I don't know how the trends will be affected. If everything is working as expected, all the code and important lookup data will be running from the CPU cache and the only memory access will be for the source sound data and for writing back to chip. The thought does occur, what if your test is dominated entirely by just fetching the source data? That should scale totally linearly with channel count.
Karlos is offline  
Old 11 July 2024, 09:08   #172
Karlos
Alien Bleed
 
Karlos's Avatar
 
Join Date: Aug 2022
Location: UK
Posts: 4,712
The 3640 results are a bit perplexing to me. I should do a null version that doesn't do any mixing or normalisation, but does fetch the source data and does push silence to chip. That'll give a baseline for all the data movement. If it's not very different then we can say it's memory interface issues
Karlos is offline  
Old 11 July 2024, 10:40   #173
daxb
Registered User
 
Join Date: Oct 2009
Location: Germany
Posts: 3,315
Maybe you should start a new thread for the 68040 testing issue. I guess most 040 user don't know or follow this thread. If my A1200 with Apollo 1240/40 would have network access I had already run the test.
daxb is offline  
Old 11 July 2024, 10:57   #174
Karlos
Alien Bleed
 
Karlos's Avatar
 
Join Date: Aug 2022
Location: UK
Posts: 4,712
I didn't want to spam the forums with TKG specific threads
Karlos is offline  
Old 11 July 2024, 12:01   #175
patrik
Registered User
 
patrik's Avatar
 
Join Date: Jan 2005
Location: Umeå
Age: 44
Posts: 954
Quote:
Originally Posted by Karlos View Post
I didn't want to spam the forums with TKG specific threads
Interesting stuff and while originating in TKG, it is does feel applicable to general coding/optimization/68k cpu difference discussion etc.
patrik is offline  
Old 11 July 2024, 12:37   #176
Karlos
Alien Bleed
 
Karlos's Avatar
 
Join Date: Aug 2022
Location: UK
Posts: 4,712
Quote:
Originally Posted by patrik View Post
Interesting stuff and while originating in TKG, it is does feel applicable to general coding/optimization/68k cpu difference discussion etc.
Do I hear a volunteer, old friend?
Karlos is offline  
Old 11 July 2024, 16:10   #177
patrik
Registered User
 
patrik's Avatar
 
Join Date: Jan 2005
Location: Umeå
Age: 44
Posts: 954
Quote:
Originally Posted by Karlos View Post
Do I hear a volunteer, old friend?
If you are asking for a 68040 machine with local fastram, I am afraid I have no such machine available . I have some 68000, 68030 and 68060 machines up and running.
patrik is offline  
Old 11 July 2024, 17:19   #178
Karlos
Alien Bleed
 
Karlos's Avatar
 
Join Date: Aug 2022
Location: UK
Posts: 4,712
You guys are really going to make me power up my 1200 BlizzPPC040 tower, aren't you?

If the blue smoke comes out, I'm holding you all responsibile...

Who am I kidding, I didn't even put the HD back in when I imaged it a couple of years back
Karlos is offline  
Old 11 July 2024, 17:50   #179
patrik
Registered User
 
patrik's Avatar
 
Join Date: Jan 2005
Location: Umeå
Age: 44
Posts: 954
Quote:
Originally Posted by Karlos View Post
You guys are really going to make me power up my 1200 BlizzPPC040 tower, aren't you?
Looking forward to some BlizzPPC040 action
patrik is offline  
Old 11 July 2024, 17:54   #180
Karlos
Alien Bleed
 
Karlos's Avatar
 
Join Date: Aug 2022
Location: UK
Posts: 4,712
Quote:
Originally Posted by patrik View Post
Looking forward to some BlizzPPC040 action
I really don't think that'd be wise, tbh
Karlos is offline  
 


Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)
 
Thread Tools

Similar Threads
Thread Thread Starter Forum Replies Last Post
Slow A4000 after overhaul Screechstar support.Hardware 57 11 July 2023 23:02
Amiga Font Editor overhaul buggs Coders. Releases 19 09 March 2021 17:39
Escom A1200 overhaul Ox. Amiga scene 8 26 August 2014 08:54
Will Bridge Practice series needs an overhaul mk1 HOL data problems 1 02 April 2009 21:55

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT +2. The time now is 02:20.

Top

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2024, vBulletin Solutions Inc.
Page generated in 0.14956 seconds with 16 queries