Recover from transient gossip failures #1446

Merged: 1 commit, Sep 20, 2016

Commits on Sep 19, 2016

  1. Recover from transient gossip failures

    Currently, if a node experiences a transient gossip failure, the
    recovery process depends on other nodes propagating the information
    indirectly. If these transient failures affect all the nodes that
    this node has in its memberlist, then this node is permanently cut
    off from the gossip channel. Added node state management code in
    networkdb to address these problems by trying to rejoin the cluster
    via the failed nodes when there is a failure. This also requires
    adding new messages, called node event messages, to differentiate
    between node leave and node failure.
    
    Signed-off-by: Jana Radhakrishnan <[email protected]>
    mrjana committed Sep 19, 2016
    716810d