Cluster Fail Over

  • Thread starter Thread starter Guest
  • Start date Start date
G

Guest

Hi,
Here is my situation (Maybe someone can help.)

I have 2 Win2k Adv servers SP4 in a cluster with 2 groups: Cluster group and
DB group[Oracle group].
I have been testing cluster performance and fail over as follows:
Test1:
1) Both groups resides on 1 node [N1].
2) From within cluster admin UI I have opened Cluster group, selected Quorum
drive, right click, and selected the Initiate Failure. At the 4th time
Cluster group failed over BUT the DB group failed over as well. Cluster
service was recycled.
Test2:
3) Each group resides on different node.
4) Initiating failure will fail only the group which resource was
initialized as failure.

Question\Problem: In case where all groups reside on one node [Test 1] why
does DB group Fail Over when we initiate a failure on the Cluster group
quorum drive Only?

Thanks. IS.
 
Hi there,

When you initiate the failure on the quorum disk, you would affect the functionality of the cluster itself.
Quorum being the central location where your cluster resource arbitrations being determined, has to be online all the time. If you initiate a failure of that quorum disk, that node would "think" that it has
problem communicating with that important resouce. Thus to avoid any possible problem, it too initiate the failure of any resources that it has on that node. This caused the failover of your DB
group.

Here is a good explaination of Quorum:
The quorum resource plays a crucial role in the operation of the cluster. In every cluster, a single resource is designated as the quorum resource. A quorum resource can be any resource with the
following functionality:


- It offers a means of persistent arbitration. Persistent arbitration
means that the quorum resource must allow a single node to gain
physical control of the node and defend its control. For example, Small
Computer System Interface (SCSI) disks can use "Reserve and Release"
commands for persistent arbitration.

- It provides physical storage that can be accessed by any node in the
cluster. The quorum resource stores data that is critical to recovery
after there is a communication failure between cluster nodes.


On the other hand, if you initiate a failure of other disk than the quorum, this should not cause the quorum disk to failover.

As a test I would suggest that you do a move group instead.

Some good articles for this would be:
Failover/Failback Policies on Microsoft Cluster Server
http://support.microsoft.com/default.aspx?scid=kb;en-us;197047

Quorum Drive Configuration Information
http://support.microsoft.com/default.aspx?scid=kb;en-us;280345


I hope this clarify the confusion.

Thank you,

Alex Chandra [MSFT]
Microsoft Corp.

--

This posting is provided "AS IS" with no warranties, and confers no rights.

Please do not send meil directly to this alias. This alias is for newsgroup purposes only.



--------------------
Thread-Topic: Cluster Fail Over
thread-index: AcS6Bd35zGkgMtBCQAy6ISnwsoCJtw==
X-WBNR-Posting-Host: 192.198.152.98
From: "=?Utf-8?B?SVM=?=" <[email protected]>
Subject: Cluster Fail Over
Date: Sun, 24 Oct 2004 13:13:03 -0700
Lines: 24
Message-ID: <[email protected]>
MIME-Version: 1.0
Content-Type: text/plain;
charset="Utf-8"
Content-Transfer-Encoding: 7bit
X-Newsreader: Microsoft CDO for Windows 2000
Content-Class: urn:content-classes:message
Importance: normal
Priority: normal
X-MimeOLE: Produced By Microsoft MimeOLE V6.00.3790.0
Newsgroups: microsoft.public.win2000.advanced_server
NNTP-Posting-Host: TK2MSFTNGXA03.phx.gbl 10.40.1.29
Path: cpmsftngxa10.phx.gbl!TK2MSFTNGXA03.phx.gbl
Xref: cpmsftngxa10.phx.gbl microsoft.public.win2000.advanced_server:27188
X-Tomcat-NG: microsoft.public.win2000.advanced_server

Hi,
Here is my situation (Maybe someone can help.)

I have 2 Win2k Adv servers SP4 in a cluster with 2 groups: Cluster group and
DB group[Oracle group].
I have been testing cluster performance and fail over as follows:
Test1:
1) Both groups resides on 1 node [N1].
2) From within cluster admin UI I have opened Cluster group, selected Quorum
drive, right click, and selected the Initiate Failure. At the 4th time
Cluster group failed over BUT the DB group failed over as well. Cluster
service was recycled.
Test2:
3) Each group resides on different node.
4) Initiating failure will fail only the group which resource was
initialized as failure.

Question\Problem: In case where all groups reside on one node [Test 1] why
does DB group Fail Over when we initiate a failure on the Cluster group
quorum drive Only?

Thanks. IS.
 
Back
Top