Fwd: Re: [HECnet] More clustering fun

Mark Wickens mark at wickensonline.co.uk
Fri Sep 16 12:37:23 PDT 2011



On Fri, 16 Sep 2011, Peter Coghlan wrote:


2) The cluster id and cluster password are different on both nodes.


Whats the cluster id?

The cluster id is 4.

I've confirmed that the authorize file is the same on both the alpha and vax nodes:

$$ diff dsa0:[sys0.syscommon.sysexe]cluster_authorize.dat -
_$$ $4$dkb400:[sys0.syscommon.sysexe]cluster_authorize.dat
Number of difference sections found: 0
Number of difference records found: 0

DIFFERENCES /IGNORE=()/MERGED=1-
      DSA0:[SYS0.SYSCOMMON.SYSEXE]CLUSTER_AUTHORIZE.DAT;1-
      $4$DKB400:[SYS0.SYSCOMMON.SYSEXE]CLUSTER_AUTHORIZE.DAT;1


The cluster group number and the cluster password must be the same
on all nodes in the cluster. The easiest way to achieve this is
to copy sys$common:[sysexe]cluster_authorize.dat from one to the
other.

I set the right values when I ran CLUSTER_CONFIG_LAN.COM on the VAX.


If the cluster group numbers are different, you will end up
trying to form two different clusters.

If the passwords are different, the results will probably be
something like you are seeing, nodes failing to completely
join the cluster. There may also be some messages in the
operator.log


It's weird that by changing the VOTES to 0 on the satellite and running an AUTOGEN that this behaviour is experienced. Before that the satellite boots off the served disk no problem.

Indeed, the 'master' disk which is a local drive in the VAXstation boots into the cluster no problem, but again, with VOTES = 1.

Regards, Mark



More information about the Hecnet-list mailing list