Skip to content

Conversation

@dsouzajude
Copy link
Contributor

Background:
The nat instances have dual functionality. It serves as a gateway for entercard auths and hence has to be configured to associate with itself elastic ips (eip). It also functions as a nat for the instances in the private zone.

Because of this, it handles failovers for nat and eip when any of the nats goes down. If the eip failover breaks, it will not be assigned a public ip and hence the instance won't have access to the internet. Because of this, the nat functionality would also break. When this happens, the impact is that we won't receive entercard auths and also the instances connected to the failed zones will be down and eventually services in these zones will also go down.

Solution:
Separate eip and nat "failover" functionality. Execute re-route related functionality only on nat clusters and eip related functionality only on eip clusters.

References https://github.com/wrapp/whaleshark/pull/55

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants