
Request for Alternative Download Methods for the ISIA Food-500 dataset #1

Open
jyp-studio opened this issue Aug 21, 2024 · 3 comments


@jyp-studio

I am currently downloading the external open-source datasets required for your database. However, the download speed from the dataset provider's server for the ISIA Food-500 dataset is extremely slow: a single file is estimated to take around 4 days and, with 10 files in total, the entire download would take approximately 40 days, which is excessively long.

Given that you've already downloaded this dataset during the creation of your database and that they are all open-source, I would greatly appreciate it if you could provide an alternative method for accessing these files. For example, uploading the compressed datasets to Google Drive or another faster and more reliable hosting service would be highly beneficial.

Thank you for considering this request. Your assistance in this matter would be invaluable and would greatly expedite my work with your database.

@michaeledeprospo

An alternative download for ISIA Food-500 would be greatly appreciated if possible!

@jyp-studio
Author

Thank you for your response. I have already downloaded the dataset, but I need to run database_generation.py to verify that it was downloaded correctly, so I haven't closed the issue yet. You might still want to prepare the dataset in case there are any missing files in what I downloaded.

Currently, I am encountering three main issues:

  1. I placed all the datasets in the src folder, at the same level as database_generation.py, and unzipped all of them. However, when running database_generation.py, an error occurs indicating that the datasets cannot be found. This might be due to the code checking for the datasets using len(os.listdir(path)). To address this, I manually changed the initial paths of the datasets from None to the corresponding paths. I'm not sure if this is the correct approach.

  2. When executing the function download_file(path) with the URL http://atvs.ii.uam.es/atvs/AI4Food-NutritionDB/AI4Food-NutritionDB.txt, I encountered the following error:

    requests.exceptions.ConnectionError: HTTPConnectionPool(host='atvs.ii.uam.es', port=80): 
    Max retries exceeded with url: /atvs/AI4Food-NutritionDB/AI4Food-NutritionDB.txt 
    (Caused by NewConnectionError('<urllib3.connection.HTTPConnection object at 0x7f721edeabe0>: 
    Failed to establish a new connection: [Errno 110] Connection timed out'))
    

    It seems I cannot connect to your server; I also tried opening atvs.ii.uam.es/atvs/ directly in a browser and couldn't reach it either. Could you please check this issue on your end?

  3. Additionally, in database_generation.py, there is an issue on line 195 where correspondence_file is used before it is defined, resulting in a "local variable referenced before assignment" error.
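
For reference, the three issues above could be patched along the following lines. This is only a minimal sketch based on assumptions about database_generation.py: the names `resolve_dataset_path` and the retry parameters of `download_file`, the `opener` hook, and the `None` initialisation of `correspondence_file` are my own inventions, not the script's actual API.

```python
import os
import time
import urllib.request

# Issue 1: instead of hand-editing hard-coded None paths, resolve each
# dataset directory by checking that it exists and is non-empty
# (mirroring the len(os.listdir(path)) check mentioned above).
def resolve_dataset_path(*candidates):
    for path in candidates:
        if path is not None and os.path.isdir(path) and len(os.listdir(path)) > 0:
            return path
    raise FileNotFoundError(f"No non-empty dataset directory among {candidates!r}")

# Issue 2: wrap the download in a timeout plus simple exponential-backoff
# retries, so a transient "Connection timed out" does not abort the run.
def download_file(url, dest, retries=3, timeout=30, opener=urllib.request.urlopen):
    last_exc = None
    for attempt in range(retries):
        try:
            with opener(url, timeout=timeout) as resp:
                data = resp.read()
            with open(dest, "wb") as f:
                f.write(data)
            return dest
        except OSError as exc:  # covers timeouts and refused/reset connections
            last_exc = exc
            time.sleep(2 ** attempt)
    raise last_exc

# Issue 3: bind correspondence_file before any conditional branch reads it,
# so a "referenced before assignment" error can no longer occur.
correspondence_file = None
```

The `opener` parameter exists only so the retry logic can be exercised without network access; `urllib.request.urlopen` is the default.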

Thank you for your attention to these issues.

@zs1314 commented Sep 17, 2024

@jyp-studio Hello! The official link seems to be dead; may I ask where you got the dataset from? Or could you help me obtain it? I'm willing to pay. Thanks!
