-
Notifications
You must be signed in to change notification settings - Fork 117
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
VPC in subnet without internet gateway #375
Comments
Could you elaborate on this, please? What exactly is |
no_proxy is an environment variable that defines which domains should not go through the proxy. Thanks |
Can you share the error that Flintrock spits out during the health check? Can you also share an example config, including for |
Previously, I encountered a connection timeout error, but I've since resolved it (refer to the details below). If it's necessary for your troubleshooting, I can revert to the previous code version and execute it again to capture the error output. Please inform me if that is required. Here’s how I addressed the issue: I modified the health_check function within the services.py file to include the master host in the no_proxy environment variable. You can view the change at this GitHub link: I appended the master host to the no_proxy environment variable within the def health_check(self, master_host: str): function by adding the following line:
The rationale for this modification is as follows: In enterprise settings with a private VPC and subnets, internet access from EC2 instances is typically routed through a proxy server. However, internal VPC traffic, such as pinging another instance or accessing AWS internal DNS, should bypass the proxy to avoid being blocked. The no_proxy environment variable is used to specify destinations that should not be routed through the proxy. For the health check, the master host's IP address could be any within the VPC. My solution was to add the entire VPC CIDR block to the no_proxy variable. Python's urllib, however, does not support CIDR blocks in no_proxy. In my opinion there could be different options, in case you will consider to handle these cases:
Let me know if I can help or if you need more info. |
Hi,
I'm trying to launch a cluster on a private vpc without internet access (internet accessible only by a proxy).
I setup the proxy and the no_proxy variables with the user data script.
However the hadhoop health check is failing during installation.
This is because of the proxy. The ec2 private ip address (which is called by the health check) should go inside the no_proxy.
I have added the full vpc cidr block in the no_proxy env variable however urllib do not support cidr block in no_proxy.
I have 2 ideas but I'm here to get your suggestions:
Can this be handled in other ways?
Thanks
Andrea
The text was updated successfully, but these errors were encountered: