Frontend HA Fencing Error OpenNebula 5.2


(surya) #1

Hello,
I am using OpenNebula 5.2 on CentOS and am trying to set up Frontend HA.

I ran this command:
pcs stonith create fence_server1 fence_ilo_ssh pcmk_host_list=controller1 ipaddr=192.XXX.XX.XX login="…" passwd="…" pcmk_reboot_action="reboot" secure=yes delay=30 op monitor interval=20s

When I check pcs status, it shows:

Cluster name: opennebula
Stack: corosync
Current DC: controller1 (version 1.1.19-8.el7_6.2-c3c624ea3d) - partition with quorum
Last updated: Mon Dec 24 11:33:59 2018
Last change: Mon Dec 24 11:33:46 2018 by root via cibadmin on controller1

2 nodes configured
6 resources configured

Online: [ controller1 controller2 ]

Full list of resources:

Cluster_VIP (ocf::heartbeat:IPaddr2): Started controller1
opennebula (systemd:opennebula): Started controller1
opennebula-sunstone (systemd:opennebula-sunstone): Started controller1
opennebula-gate (systemd:opennebula-gate): Started controller1
opennebula-flow (systemd:opennebula-flow): Started controller1
fence_server1 (stonith:fence_ilo_ssh): Starting controller1

Failed Actions:

* fence_server1_start_0 on controller2 'unknown error' (1): call=51, status=Error, exitreason='',
    last-rc-change='Mon Dec 24 11:33:46 2018', queued=0ms, exec=12256ms

Daemon Status:
corosync: active/enabled
pacemaker: active/enabled
pcsd: active/enabled

How can I solve this problem?


(Kristian Feldsam) #2

Why are you using corosync? OpenNebula supports HA using Raft. Read the docs: https://docs.opennebula.org/5.6/advanced_components/ha/index.html


(Kristian Feldsam) #3

If you still want to use corosync, check the logs; they should show why the start action failed.
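
A rough sketch of what that log check could look like, since the failed start was reported on controller2. The helper names fence_log_lines and fence_status_check are made up for illustration, and the default log path is an assumption about a typical CentOS 7 Pacemaker install; adjust it to wherever your cluster actually logs. The second helper runs the fence agent by hand, outside Pacemaker, with the same address and credentials given to pcs stonith create, which usually prints a more specific error than 'unknown error'.

```shell
#!/bin/sh
# Sketch only: helper names and the default log path are assumptions.

# Print log lines that mention a given resource.
# $1 = resource name, $2 = log file (optional)
fence_log_lines() {
    resource="$1"
    logfile="${2:-/var/log/cluster/corosync.log}"  # assumed default location
    grep -F -- "$resource" "$logfile"
}

# Query power status through the fence agent directly, bypassing Pacemaker.
# $1 = iLO address, $2 = login, $3 = password
fence_status_check() {
    fence_ilo_ssh --ip="$1" --username="$2" --password="$3" --ssh --action=status
}
```

For example, on controller2 run fence_log_lines fence_server1, then fence_status_check with your iLO address and credentials. If the manual status check fails too, the problem is likely with the iLO connection or login rather than with the cluster. Once the cause is fixed, pcs resource cleanup fence_server1 clears the failed action so Pacemaker retries the start.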