All nodes for resource dummy1 are unavailable, unclean or...

SLES11 HAE Problem: All nodes for resource dummy1 are unavailable,
unclean or shutting down.

I have setup a 2 node SLES11 SP1 HAE cluster (config see below).
Whenever I ifdown nic0 on node1 followed by ifup on node1, then ifdown
nic0 on node2 followed by ifup on node2, certain resources (e.g. dummy1
which is the built-in class:ocf/provider:heartbeat/type:dummy resource)
will not be able to start, leaving the following message in
/var/log/messages: “All nodes for resource dummy1 are unavailable,
unclean or shutting down.” followed by “Node2 pengine: [27355]: info:
native_color: Resource dummy1 cannot run anywhere”.

Any help is greatly appreciated, thanks

Bruno

Versions

rpm -qa ‘(pacemaker|corosync|resource-agents)’

resource-agents-1.0.3-0.3.2
corosync-1.2.1-0.5.1
pacemaker-1.1.2-0.2.1

/var/log/messages

sfd90211:~ # o /var/log/messages |grep -i ‘All nodes for resource’
Oct 18 14:50:13 Node2 pengine: [27355]: debug: native_assign_node: All
nodes for resource dummy1 are unavailable, unclean or shutting down
(Node2: 0, -1000000)
Oct 18 14:50:13 Node2 pengine: [27355]: info: native_color: Resource
dummy1 cannot run anywher

CIB whole Config



<crm_config>
<cluster_property_set id=“cib-bootstrap-options”>







</cluster_property_set>
</crm_config>


<instance_attributes id=“nodes-node1”>

</instance_attributes>


<instance_attributes id=“nodes-node2”>

</instance_attributes>




<meta_attributes id=“group_fd-meta_attributes”>


</meta_attributes>




<instance_attributes id=“san_disk_fd-instance_attributes”>



</instance_attributes>
<meta_attributes id=“san_disk_fd-meta_attributes”>


</meta_attributes>


<meta_attributes id=“clusterip_fd-meta_attributes”>


</meta_attributes>



<instance_attributes id=“clusterip_fd-instance_attributes”>

</instance_attributes>





<meta_attributes id=“apache_fd-meta_attributes”>


</meta_attributes>



<meta_attributes id=“clusterip_fd2-meta_attributes”>


</meta_attributes>



<instance_attributes id=“clusterip_fd2-instance_attributes”>

</instance_attributes>


<meta_attributes id=“my_sandisk_cluster-meta_attributes”>

</meta_attributes>

<instance_attributes id=“failover-ip-instance_attributes”>

</instance_attributes>



<meta_attributes id=“failover-ip-meta_attributes”>

</meta_attributes>





<instance_attributes id=“san_disk_fd2-instance_attributes”>



</instance_attributes>
<meta_attributes id=“san_disk_fd2-meta_attributes”>

</meta_attributes>



<meta_attributes id=“pingdclone-meta_attributes”>

</meta_attributes>

<instance_attributes id=“pingd-instance_attributes”>


</instance_attributes>






<meta_attributes id=“dummy1-meta_attributes”>

</meta_attributes>



<rsc_location id=“my_sandisk_cluster_on_connected_node”
rsc=“my_sandisk_cluster”>




</rsc_location>
<rsc_location id=“dummy1_on_connected_node” rsc=“dummy1”>




</rsc_location>

<op_defaults>
<meta_attributes id=“op_defaults-options”>

</meta_attributes>
</op_defaults>
<rsc_defaults>
<meta_attributes id=“rsc-options”>


</meta_attributes>
</rsc_defaults>


<node_state crm-debug-origin=“do_update_resource” crmd=“online”
expected=“member” ha=“active” id=“node1” in_ccm=“true” join=“member”
shutdown=“0” uname=“node1”>

<lrm_resources>
<lrm_resource class=“ocf” id=“san_disk_fd”
provider=“heartbeat” type=“Filesystem”>
<lrm_rsc_op call-id=“2”
crm-debug-origin=“do_update_resource” crm_feature_set=“3.0.2”
exec-time=“60” id=“san_disk_fd_monitor_0” interval=“0”
last-rc-change=“1318941311” last-run=“1318941311”
op-digest=“bfadd3a6afb308eb046b879e2070bcb9” op-status=“0”
operation=“monitor” queue-time=“0” rc-code=“7”
transition-key=“4:0:7:5fc58646-b53f-47df-9796-e1764e3f7fd7”
transition-magic=“0:7;4:0:7:5fc58646-b53f-47df-9796-e1764e3f7fd7”/>
</lrm_resource>
<lrm_resource class=“ocf” id=“clusterip_fd2”
provider=“heartbeat” type=“IPaddr2”>
<lrm_rsc_op call-id=“5”
crm-debug-origin=“do_update_resource” crm_feature_set=“3.0.2”
exec-time=“70” id=“clusterip_fd2_monitor_0” interval=“0”
last-rc-change=“1318941311” last-run=“1318941311”
op-digest=“199253622b2b5dda4457f146efe66ad1” op-status=“0”
operation=“monitor” queue-time=“0” rc-code=“7”
transition-key=“7:0:7:5fc58646-b53f-47df-9796-e1764e3f7fd7”
transition-magic=“0:7;7:0:7:5fc58646-b53f-47df-9796-e1764e3f7fd7”/>
</lrm_resource>
<lrm_resource class=“ocf” id=“clusterip_fd”
provider=“heartbeat” type=“IPaddr2”>
<lrm_rsc_op call-id=“3”
crm-debug-origin=“do_update_resource” crm_feature_set=“3.0.2”
exec-time=“80” id=“clusterip_fd_monitor_0” interval=“0”
last-rc-change=“1318941311” last-run=“1318941311”
op-digest=“199253622b2b5dda4457f146efe66ad1” op-status=“0”
operation=“monitor” queue-time=“0” rc-code=“7”
transition-key=“5:0:7:5fc58646-b53f-47df-9796-e1764e3f7fd7”
transition-magic=“0:7;5:0:7:5fc58646-b53f-47df-9796-e1764e3f7fd7”/>
</lrm_resource>
<lrm_resource class=“ocf” id=“apache_fd” provider=“heartbeat”
type=“apache”>
<lrm_rsc_op call-id=“4”
crm-debug-origin=“do_update_resource” crm_feature_set=“3.0.2”
exec-time=“100” id=“apache_fd_monitor_0” interval=“0”
last-rc-change=“1318941311” last-run=“1318941311”
op-digest=“f2317cad3d54cec5d7d7aa7d0bf35cf8” op-status=“0”
operation=“monitor” queue-time=“0” rc-code=“7”
transition-key=“6:0:7:5fc58646-b53f-47df-9796-e1764e3f7fd7”
transition-magic=“0:7;6:0:7:5fc58646-b53f-47df-9796-e1764e3f7fd7”/>
</lrm_resource>
<lrm_resource class=“ocf” id=“pingd:0” provider=“pacemaker”
type=“pingd”>
<lrm_rsc_op call-id=“8”
crm-debug-origin=“do_update_resource” crm_feature_set=“3.0.2”
exec-time=“30” id=“pingd:0_monitor_0” interval=“0”
last-rc-change=“1318941312” last-run=“1318941312”
op-digest=“f9e5022a59cac98812bcfea6a34348b4” op-status=“0”
operation=“monitor” queue-time=“1000” rc-code=“7”
transition-key=“10:0:7:5fc58646-b53f-47df-9796-e1764e3f7fd7”
transition-magic=“0:7;10:0:7:5fc58646-b53f-47df-9796-e1764e3f7fd7”/>
<lrm_rsc_op call-id=“10”
crm-debug-origin=“do_update_resource” crm_feature_set=“3.0.2”
exec-time=“20” id=“pingd:0_start_0” interval=“0”
last-rc-change=“1318941312” last-run=“1318941312”
op-digest=“f9e5022a59cac98812bcfea6a34348b4” op-status=“0”
operation=“start” queue-time=“0” rc-code=“0”
transition-key=“28:0:0:5fc58646-b53f-47df-9796-e1764e3f7fd7”
transition-magic=“0:0;28:0:0:5fc58646-b53f-47df-9796-e1764e3f7fd7”/>
<lrm_rsc_op call-id=“11”
crm-debug-origin=“do_update_resource” crm_feature_set=“3.0.2”
exec-time=“10” id=“pingd:0_monitor_15000” interval=“15000”
last-rc-change=“1318941312” last-run=“1318941312”
op-digest=“efd4237723a9695275fd88201a420089” op-status=“0”
operation=“monitor” queue-time=“0” rc-code=“0”
transition-key=“30:1:0:5fc58646-b53f-47df-9796-e1764e3f7fd7”
transition-magic=“0:0;30:1:0:5fc58646-b53f-47df-9796-e1764e3f7fd7”/>
</lrm_resource>
<lrm_resource class=“ocf” id=“dummy1” provider=“heartbeat”
type=“Dummy”>
<lrm_rsc_op call-id=“9”
crm-debug-origin=“do_update_resource” crm_feature_set=“3.0.2”
exec-time=“40” id=“dummy1_monitor_0” interval=“0”
last-rc-change=“1318941312” last-run=“1318941312”
op-digest=“f2317cad3d54cec5d7d7aa7d0bf35cf8” op-status=“0”
operation=“monitor” queue-time=“1000” rc-code=“7”
transition-key=“11:0:7:5fc58646-b53f-47df-9796-e1764e3f7fd7”
transition-magic=“0:7;11:0:7:5fc58646-b53f-47df-9796-e1764e3f7fd7”/>
</lrm_resource>
<lrm_resource class=“ocf” id=“failover-ip”
provider=“heartbeat” type=“IPaddr”>
<lrm_rsc_op call-id=“6”
crm-debug-origin=“do_update_resource” crm_feature_set=“3.0.2”
exec-time=“40” id=“failover-ip_monitor_0” interval=“0”
last-rc-change=“1318941312” last-run=“1318941312”
op-digest=“199253622b2b5dda4457f146efe66ad1” op-status=“0”
operation=“monitor” queue-time=“1000” rc-code=“7”
transition-key=“8:0:7:5fc58646-b53f-47df-9796-e1764e3f7fd7”
transition-magic=“0:7;8:0:7:5fc58646-b53f-47df-9796-e1764e3f7fd7”/>
</lrm_resource>
<lrm_resource class=“ocf” id=“san_disk_fd2”
provider=“heartbeat” type=“Filesystem”>
<lrm_rsc_op call-id=“7”
crm-debug-origin=“do_update_resource” crm_feature_set=“3.0.2”
exec-time=“70” id=“san_disk_fd2_monitor_0” interval=“0”
last-rc-change=“1318941312” last-run=“1318941312”
op-digest=“bfadd3a6afb308eb046b879e2070bcb9” op-status=“0”
operation=“monitor” queue-time=“1000” rc-code=“7”
transition-key=“9:0:7:5fc58646-b53f-47df-9796-e1764e3f7fd7”
transition-magic=“0:7;9:0:7:5fc58646-b53f-47df-9796-e1764e3f7fd7”/>
</lrm_resource>
</lrm_resources>

<transient_attributes id=“node1”>
<instance_attributes id=“status-node1”>

</instance_attributes>
</transient_attributes>
</node_state>


b400bhb

b400bhb’s Profile: http://forums.novell.com/member.php?userid=111651
View this thread: http://forums.novell.com/showthread.php?t=446975

b400bhb,

It appears that in the past few days you have not received a response to your
posting. That concerns us, and has triggered this automated reply.

Has your problem been resolved? If not, you might try one of the following options:

  • Visit http://support.novell.com and search the knowledgebase and/or check all
    the other self support options and support programs available.
  • You could also try posting your message again. Make sure it is posted in the
    correct newsgroup. (http://forums.novell.com)

Be sure to read the forum FAQ about what to expect in the way of responses:
http://forums.novell.com/faq.php

If this is a reply to a duplicate posting, please ignore and accept our apologies
and rest assured we will issue a stern reprimand to our posting bot.

Good luck!

Your Novell Product Support Forums Team
http://forums.novell.com/

Hi

Can you post the output of “crm configure show” too? It’s better
readable than the xml-file…


amo_vzug

amo_vzug’s Profile: http://forums.novell.com/member.php?userid=25342
View this thread: http://forums.novell.com/showthread.php?t=446975

Thank you for your reply, amo_vzug

I have rebuilt the whole cluster config and the problem has not
reoccured since then. This post can be regarded as closed (don’t know
how to close a post otherwise)

Regards, bb


b400bhb

b400bhb’s Profile: http://forums.novell.com/member.php?userid=111651
View this thread: http://forums.novell.com/showthread.php?t=446975