configured resource disappear

System1 · October 18, 2012, 1:24pm

I have 2 node cluster SLES 11 sp2
My problem is that many times I added resources, which disappear after
restart servers. Also I can see resources already deleted by meself.
I use pacemaker GUI to manage resources.
What am I doingo wrong?

Jens-U · October 18, 2012, 2:12pm

Hi Adam,

seems like some sort of issue with cluster db sync:

can you confirm that prior to the reboot, both nodes have the same content?
does this happen when a single node reboots, or do both nodes need to reboot for this to occur?
after the according reboot, is there anything in the logs that maybe hints at some roll-back of the database?

Regards,
Jens

system · October 18, 2012, 3:02pm

UÂ¿ytkownik “jmozdzen” jmozdzen@no-mx.forums.suse.com napisaÂ³ w wiadomoÂ¶ci
news:jmozdzen.5kmo9b@no-mx.forums.suse.com…[color=blue]

Hi Adam,

seems like some sort of issue with cluster db sync:

can you confirm that prior to the reboot, both nodes have the same
content?

does this happen when a single node reboots, or do both nodes need to
reboot for this to occur?

after the according reboot, is there anything in the logs that maybe
hints at some roll-back of the database?

Regards,
Jens

–
jmozdzen

jmozdzen’s Profile: http://forums.suse.com/member.php?userid=51
View this thread: http://forums.suse.com/showthread.php?t=1921
[/color]
Both node have the same content.
When I go out with node1 from the cluster changes are in place, the same
when I only go out from the cluster with node2. I can do that many times,
and nothing disappear
When I stop corosync on both, and start, I can observe that my last last
changes disappear.

Jens-U · October 18, 2012, 4:53pm

Hi Adam,

in addition to checking the logs during cluster start, you may want to check in what state the cluster DB is while both nodes are down. My guess is that for a currently unknown reason, the cluster disregards the lastest DB version and resorts to the previous one. Might be disk space, md5 problem or something else.

Regards,
Jens

system · October 22, 2012, 10:08pm

It is happening on my 2 diffrent clusters instalations. One is in my lab,
another is at the client side. I am using vmware virtual machines as nodes.
Client is using Dell servers with EMS SAN Storage. Both have the same
version of sles, and ha.

UÂ¿ytkownik “jmozdzen” jmozdzen@no-mx.forums.suse.com napisaÂ³ w wiadomoÂ¶ci
news:jmozdzen.5kmvo0@no-mx.forums.suse.com…[color=blue]

Hi Adam,

in addition to checking the logs during cluster start, you may want to
check in what state the cluster DB is while both nodes are down. My
guess is that for a currently unknown reason, the cluster disregards the
lastest DB version and resorts to the previous one. Might be disk space,
md5 problem or something else.

Regards,
Jens

–
jmozdzen

jmozdzen’s Profile: http://forums.suse.com/member.php?userid=51
View this thread: http://forums.suse.com/showthread.php?t=1921
[/color]

Jens-U · October 23, 2012, 6:05pm

Hi Adam,

how do you take out the cluster nodes - maybe the (latest) change hasn’t made it to persistence yet? Does this happen if you simply stop & start cluster services on both nodes?

Regards,
Jens

system · October 24, 2012, 11:16am

When I stop both nodes using rcopenais stop, they are stoping without
errors.
Then I start them, and latest changes to cluster disapper.

UÂ¿ytkownik “jmozdzen” jmozdzen@no-mx.forums.suse.com napisaÂ³ w wiadomoÂ¶ci
news:jmozdzen.5kw8pb@no-mx.forums.suse.com…[color=blue]

Hi Adam,

how do you take out the cluster nodes - maybe the (latest) change
hasn’t made it to persistence yet? Does this happen if you simply stop &
start cluster services on both nodes?

Regards,
Jens

–
jmozdzen

jmozdzen’s Profile: http://forums.suse.com/member.php?userid=51
View this thread: http://forums.suse.com/showthread.php?t=1921
[/color]

Jens-U · October 24, 2012, 12:44pm

Hi Adam,

When I stop both nodes using rcopenais stop, they are stoping without errors.
Then I start them, and latest changes to cluster disapper.

Then “lost disk writes” or alike don’t seem to be the problem.

When AIS restarts and loads the cluster db, do you see any indication in the log that it tries and fails to load the lastest (in terms of prior to stopping AIS) db, thus resorting to an older copy?

If you have a support contract, this might be a good time to open an incident with Novell/SuSE.

Regards,
Jens

Topic		Replies	Views
On 2-node Cluster: Resource restart when other node reboot SLES High Availability Extension	4	261	July 5, 2012
2-Node Cluster: Resources restart when other node reboots SLES High Availability Extension	3	319	December 17, 2013
Both nodes in OCFS2 cluster keep rebooting SLES High Availability Extension	2	424	June 15, 2015
second node rebooted when first node comes back online SLES High Availability Extension	8	989	December 15, 2015
Automatic failback of cluster resources after reboot SLES High Availability Extension	1	769	September 26, 2020

configured resource disappear

–
jmozdzen

–
jmozdzen

–
jmozdzen

configured resource disappear

– jmozdzen

– jmozdzen

– jmozdzen

Related topics

–
jmozdzen

–
jmozdzen

–
jmozdzen