-
Notifications
You must be signed in to change notification settings - Fork 98
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Download of basebackup always stalls #359
Comments
Hi, 2020-10-16 11:00:09,131%BasebackupFetcher MainThread ERROR Download stalled for 120.13373475382105 seconds, aborting downloader Thanks. |
There shouldn't be any generic issue with this as we've done very large amount of restorations across all major cloud providers and haven't been seeing this. If this is reproducible then you should check out what's happening on network level. |
Hi, On the line https://github.com/aiven/pghoard/blob/master/pghoard/rohmu/object_storage/google.py#L60
Debugging, including on a machine/network in the CGP itself, I realized that the problem occurs when a machine has> 2 GB of RAM, because enters the condition "if get_total_memory () <2048 else 1024 * 1024 * 50" DOWNLOAD_CHUNK_SIZE = 1024 * 1024 * 5 if get_total_memory () <2048 else 1024 * 1024 * 50 That is, the problem occurs when DOWNLOAD_CHUNK_SIZE = 50MB First I tested with DOWNLOAD_CHUNK_SIZE = 1024 * 1024 * 5 and the download was successful! The maximum value that the download works is DOWNLOAD_CHUNK_SIZE = 1024 * 1024 * 25, that is, 25 MB Is there an automated test that runs on a machine with> 2GB of RAM? Cheers |
Yes. It would probably make sense to add an optional configuration parameter that can be used to set the chunk size. 50 MiB performs better than 5 MiB so it is preferable when download performance is important and as mentioned we haven't seen any issues with this but in general 50 MiB is fairly large chunk size and setting smaller one via config would be reasonable, especially if the machine is otherwise somehow memory constrained. |
Hello,
I'm facing the issue where I'm not able to download a basebackup using
pghoard_restore
command since download always stalls.Restore command:
The appropriate backup was selected, but nothing happens.
ps auxf
shows thatpghoard_restore
creates 9 additional processes but the download progress is constantly 0%, which after 3 x 2 minutes turns to fail.Command output:
pghoard.conf
:The text was updated successfully, but these errors were encountered: