English Amiga Board


Go Back   English Amiga Board > Other Projects > project.Amiga File Server

 
 
Thread Tools
Old 14 January 2023, 19:44   #61
jbl007
Registered User
 
Join Date: Mar 2013
Location: Leipzig/Germany
Posts: 466
Quote:
Originally Posted by TCD View Post
'Collection/Compilation/Fred_TheGang_2014_Amiga_Collection/Amiga - Not TOSEC/Demos - DMS' contained 291 DMS files. After converting them to ADF 4 files remained that couldn't be converted to ADF. That makes 287 ADF files. Out of those only 1 (one) wasn't in TOSEC.
Nice.

Did you delete those?
Should files from collections be deleted?


I spotted some 17-byte-sized .dms/.lha files. Should probably be removed.
Attached Files
File Type: txt 17ByteJunk.txt (5.8 KB, 75 views)
jbl007 is offline  
Old 15 January 2023, 07:34   #62
TCD
HOL/FTP busy bee
 
TCD's Avatar
 
Join Date: Sep 2006
Location: Germany
Age: 46
Posts: 31,612
Quote:
Originally Posted by jbl007 View Post
Did you delete those?
Should files from collections be deleted?
The problem with these collections is that people got the files in those collections from the same sources. A cracked game or a demo got spread and ended up in multiple collections. In the end I think that it's more important to keep the unique files and not x copies of the same file in various collections.

Quote:
Originally Posted by jbl007 View Post
I spotted some 17-byte-sized .dms/.lha files. Should probably be removed.
Thank you I'll have a look at those.
TCD is offline  
Old 15 January 2023, 10:49   #63
chip
Registered User
 
Join Date: Oct 2012
Location: Italy
Age: 49
Posts: 2,942
Are you saying you want to keep an unique copy of each file ?

Collections are many, it's hard job to remove all the dupes i guess
chip is offline  
Old 15 January 2023, 13:00   #64
TCD
HOL/FTP busy bee
 
TCD's Avatar
 
Join Date: Sep 2006
Location: Germany
Age: 46
Posts: 31,612
Quote:
Originally Posted by chip View Post
Are you saying you want to keep an unique copy of each file ?

Collections are many, it's hard job to remove all the dupes i guess
Well, yes and yes Realistically we'll never get to 'only one version of each unique file', but if we can find duplicates we should remove them.
TCD is offline  
Old 15 January 2023, 16:34   #65
TCD
HOL/FTP busy bee
 
TCD's Avatar
 
Join Date: Sep 2006
Location: Germany
Age: 46
Posts: 31,612
Quote:
Originally Posted by jbl007 View Post
I spotted some 17-byte-sized .dms/.lha files. Should probably be removed.
The contents of those files are "Nuked by Byteandi" btw. I assume that is a virus?

Last edited by TCD; 15 January 2023 at 16:50.
TCD is offline  
Old 15 January 2023, 18:26   #66
Dan
Registered User
 
Dan's Avatar
 
Join Date: Nov 2004
Location: Germany
Posts: 629
Hmm, here is a suggestion:
if you delete files from a collection, maybe add a text file with deleted filenames and crc ?
In that way, if people want to assemble the collection, they would have the information needed, because the duplicates are found somewhere else.
Dan is offline  
Old 15 January 2023, 18:59   #67
TCD
HOL/FTP busy bee
 
TCD's Avatar
 
Join Date: Sep 2006
Location: Germany
Age: 46
Posts: 31,612
Maybe once Turran has his script for MD5 sorted this can be done, but I won't do it manually. Just to make sure that people understand that it was an idea in the main thread (http://eab.abime.net/showpost.php?p=...postcount=3166) and I checked if it is possible and makes sense (as in are there a lot of duplicate files). So far I've deleted a few files that can be restored if it's really important that the collections stay intact.
TCD is offline  
Old 16 January 2023, 10:38   #68
chip
Registered User
 
Join Date: Oct 2012
Location: Italy
Age: 49
Posts: 2,942
Collections should remain intact IMHO

I tell this as a collector

For me there's no problem in your final decision, i have already two backups of the various collections

I just want to say my personal point of view
chip is offline  
Old 28 January 2023, 20:49   #69
rygar
Registered User
 
Join Date: Nov 2007
Location: Poland
Posts: 1,329
There are a lot of duplicates in both IPF catalogs:
Non TOSEC IPFs - Official
Non TOSEC IPFs - Unofficial
rygar is offline  
Old 28 January 2023, 21:38   #70
TCD
HOL/FTP busy bee
 
TCD's Avatar
 
Join Date: Sep 2006
Location: Germany
Age: 46
Posts: 31,612
Yep, already removed over 2000 of those (there are actually three folders besides the official TOSEC ones that need to be checked) I'm cleaning the IPF folders up and will make a thread about the missing official ones afterwards.
TCD is offline  
Old 29 January 2023, 10:43   #71
TCD
HOL/FTP busy bee
 
TCD's Avatar
 
Join Date: Sep 2006
Location: Germany
Age: 46
Posts: 31,612
Speaking of removing files: I've uploaded a list of duplicates to the root of the server called 'Duplicate_Files.txt'. If a file you are looking for seems to be missing on the server please have a look at the file and check if it wasn't a duplicate of another file.

As an example: You are looking for the file '17bit-1574b.dms'. The file isn't available but in the list of duplicates you'll find this entry when searching for the file:
Code:
== File with md5 d2e91da94549e9554b1287ef5b5b161d, 902776 bytes, found in 2 locations ==
Collection/Compilation/Fred_TheGang_2014_Amiga_Collection/Amiga - Not TOSEC/Demos - DMS/Odyssey (1991-12)(Alcatraz)(Disk 2 of 5)[TP#1].dms
Collection/Compilation/BTTR/17bit/dms/15xx/17bit-1574b.dms
So the file 'Odyssey (1991-12)(Alcatraz)(Disk 2 of 5)[TP#1].dms' is the exact same file as '17bit-1574b.dms'.
TCD is offline  
Old 03 February 2023, 09:30   #72
TCD
HOL/FTP busy bee
 
TCD's Avatar
 
Join Date: Sep 2006
Location: Germany
Age: 46
Posts: 31,612
After a bit over two months I'll call it done for now. I'll still remove some duplicates in the next weeks, but the main cleaning up is over. Feel free to report any bits that you spot that could use some TLC and I'll get to it.
TCD is offline  
Old 06 February 2023, 13:24   #73
Turran
Moderator
 
Turran's Avatar
 
Join Date: May 2012
Location: Stockholm / Sweden
Age: 49
Posts: 1,575
Once again, BIG thanks. Looks like little over 500GiB has been cleared up.
Turran is offline  
Old 12 February 2023, 13:05   #74
peo
Registered User
 
Join Date: Dec 2008
Location: Ursviken
Posts: 137
Quote:
Originally Posted by chip View Post
Collections should remain intact IMHO

I tell this as a collector

For me there's no problem in your final decision, i have already two backups of the various collections

I just want to say my personal point of view

Maybe the collections could be made into 'virtual collections': keep unique files (index, texts, images etc) belonging to the collection itself, and then link in the actual content from other places on the FTP server (if downloading links are supported)..


Another way to do it would be to create a wget/curl compatible file list for batch downloading a collection as it was put together.
peo is offline  
 


Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)
 
Thread Tools

Similar Threads
Thread Thread Starter Forum Replies Last Post
Guess the Screenshot - 2023 malko EAB's competition 1210 14 February 2024 10:53
Amiga Ireland 2023 Daedalus Amiga scene 19 30 March 2023 16:04
Super League 2023: Round 1 - Xenon 2 lifeschool EAB's competition 11 29 January 2023 01:23
Super League 2023: Round 1 - Cast Your Votes lifeschool EAB's competition 27 08 January 2023 01:15
Passione Amiga #11 (January 2023) is published! passioneamiga News 0 16 December 2022 11:25

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT +2. The time now is 20:09.

Top

Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2024, vBulletin Solutions Inc.
Page generated in 0.52916 seconds with 16 queries