i4i instance type cluster fails to restart #376
Comments
What is |
Hi @nchammas, yes, it's an official Amazon Linux 2 image.
If you have a known working combination of instance type and AMI, I can try it to check whether the problem is related to the AMI or to the instance type.
Can you show me where exactly you are seeing that? I am not able to find any mention of this AMI in the official listing from Amazon. I just tried to launch, stop, and then start a cluster using
I can only use AMIs in eu-central-1, and I can't find the one you are mentioning in that region.
I'm not sure where
    aws ec2 describe-images \
        --region eu-central-1 \
        --owners amazon \
        --filters \
            "Name=name,Values=amzn2-ami-hvm-*-gp2" \
            "Name=root-device-type,Values=ebs" \
            "Name=virtualization-type,Values=hvm" \
            "Name=architecture,Values=x86_64" \
        --query \
            'reverse(sort_by(Images, &CreationDate))[:100].{CreationDate:CreationDate,ImageId:ImageId,Name:Name,Description:Description}'
Please also try a different instance type, like
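For reference, the listing command above can be narrowed to return only the newest matching image ID, which may be easier to paste into a Flintrock config. This is a minimal sketch, not something taken from the thread: the function name `latest_al2_ami` and the `AWS_CLI` override (there only to allow a dry run with `echo`) are assumptions made for illustration. The filters are the same ones used above.

```shell
# Sketch: print the ID of the newest Amazon Linux 2 AMI matching the filters above.
# AWS_CLI is a hypothetical override for dry runs; it defaults to the real `aws` CLI.
latest_al2_ami() {
  # $1 = region, e.g. eu-central-1
  "${AWS_CLI:-aws}" ec2 describe-images \
    --region "$1" \
    --owners amazon \
    --filters \
      "Name=name,Values=amzn2-ami-hvm-*-gp2" \
      "Name=root-device-type,Values=ebs" \
      "Name=virtualization-type,Values=hvm" \
      "Name=architecture,Values=x86_64" \
    --query 'reverse(sort_by(Images, &CreationDate))[0].ImageId' \
    --output text
}

# Usage (needs AWS credentials):
#   latest_al2_ami eu-central-1
```

Taking index `[0]` of the reversed, date-sorted list selects the most recently created image; `--output text` prints the bare ID without JSON quoting.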
Hi, thanks for the suggestion. Indeed, the problem is related to the instance type.
It would be nice to understand what is different about the storage configuration of the i4i. However, it's not a big issue for me; I can use other instance types.
Andrea
I will leave the issue open and re-title it to focus on this storage-related problem. Flintrock should handle it more gracefully, even if we don't support it.
Hi,
I'm having issues when stopping and restarting the cluster.
Stopping works fine (i.e. flintrock stop my-cluster).
However, when I try to start it again (flintrock start my-cluster), the instances fail 1 of the 2 sanity checks, cannot be reached even with console SSH login, and the cluster won't start.
I'm guessing it is something related to the ephemeral storage because (as you can see from the system log below) the instance goes into a "recovery mode" due to some errors about an ext4 partition not being found.
Do you have any guess?
Thanks for your kind help.
Andrea
Here is a more complete log file. After it, you can also find my Flintrock config.
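For anyone trying to reproduce this, the stop/start cycle and the collection of the system log can be scripted. The sketch below is a hedged illustration, not part of the original report: the function names and the `FLINTROCK` and `AWS_CLI` overrides (there only to allow dry runs) are hypothetical, and the instance ID in the usage comment is a placeholder. It may also help to keep in mind that i4i instances come with NVMe instance store volumes, whose contents do not persist across a stop and start. That is consistent with the ephemeral-storage guess above, although the thread does not confirm the root cause.

```shell
# Sketch: reproduce the stop/start cycle and gather evidence from a node that fails to boot.
# FLINTROCK and AWS_CLI are hypothetical overrides for dry runs; they default to the real CLIs.

restart_cluster() {
  # $1 = cluster name, e.g. my-cluster
  "${FLINTROCK:-flintrock}" stop "$1" &&
    "${FLINTROCK:-flintrock}" start "$1"
}

instance_status() {
  # $1 = instance ID, $2 = region; reports the system and instance status checks
  "${AWS_CLI:-aws}" ec2 describe-instance-status \
    --region "$2" \
    --instance-ids "$1" \
    --include-all-instances
}

console_log() {
  # $1 = instance ID, $2 = region; prints the captured console (system log) output
  "${AWS_CLI:-aws}" ec2 get-console-output \
    --region "$2" \
    --instance-id "$1" \
    --query Output \
    --output text
}

# Usage (placeholder instance ID; needs AWS credentials and a Flintrock config):
#   restart_cluster my-cluster
#   instance_status i-0123456789abcdef0 eu-central-1
#   console_log i-0123456789abcdef0 eu-central-1
```

Fetching the console output through the API is useful here because the affected instances cannot be reached over SSH after the restart.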