
Repair broken etcd node

Posted Over 6 years ago. Visible to the public. Repeats.

If an etcd node is no longer a member of the remaining etcd cluster, or fails to connect to it, you need to remove the node from the cluster and then add it again:

  1. Stop etcd on the broken node: sudo service etcd stop
  2. Delete the data on the broken node: sudo rm -r /var/lib/etcd/data/*
  3. Delete the WAL data on the broken node: sudo rm -r /var/lib/etcd/wal/*
  4. Follow the etcd runtime-configuration instructions: remove the broken node from the cluster, add it again, and update the etcd config on the broken node with the parameters printed by the add command.
  5. Start etcd again on the broken node: sudo service etcd start
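
The steps above can be sketched as a small dry-run script. It only prints the commands instead of running them, because steps 2 and 3 delete data irreversibly. The member ID, member name and peer URL are hypothetical example values, and the etcdctl calls assume the v3 command syntax (older v2 clients take the peer URL as a positional argument instead of --peer-urls). The member commands have to be run against a healthy member of the cluster, not on the broken node.

```shell
#!/bin/sh
# Dry-run sketch: prints the repair commands instead of executing them.
# Arguments (hypothetical example values, not taken from this card):
#   $1 = member ID of the broken node, as shown by "etcdctl member list"
#   $2 = member name of the broken node
#   $3 = peer URL of the broken node
print_repair_plan() {
  cat <<EOF
# On the broken node:
sudo service etcd stop
sudo rm -r /var/lib/etcd/data/*
sudo rm -r /var/lib/etcd/wal/*

# On a healthy member (etcdctl v3 syntax assumed):
etcdctl member list
etcdctl member remove $1
etcdctl member add $2 --peer-urls=$3

# On the broken node, after updating the etcd config:
sudo service etcd start
EOF
}

print_repair_plan 8e9e05c52164694d infra3 http://10.0.1.13:2380
```

The add command typically prints variables such as ETCD_NAME, ETCD_INITIAL_CLUSTER and ETCD_INITIAL_CLUSTER_STATE="existing"; these are the parameters to copy into the etcd config on the broken node before starting it again.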

Even if etcd logging is configured to write to /var/log/etcd/etcd.log, on newer hosts (Ubuntu focal) the output may end up only in the systemd journal. In that case, check systemctl status etcd (or journalctl -u etcd) for error messages.

Claus-Theodor Riegg
Last edit
4 months ago
Deleted user #4941
License
Source code in this card is licensed under the MIT License.