[Gluster-devel] HA translator total failure in 2.0.0rc1

gluster-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Gluster-devel] HA translator total failure in 2.0.0rc1

From:	Daniel Maher
Subject:	[Gluster-devel] HA translator total failure in 2.0.0rc1
Date:	Thu, 15 Jan 2009 11:18:32 +0100
User-agent:	Thunderbird 2.0.0.19 (X11/20090105)

Hello all,

As noted in this email to the gluster-users list :
http://zresearch.com/pipermail/gluster-users/20090114/001389.html

I've got a simple and reproducible scenario to crash a Gluster clientusing the HA translator to access two AFR'd servers. The scenario isidentical to that described by Krishna Srinivas on the gluster-devellist on 08-01-2008 :

http://lists.gnu.org/archive/html/gluster-devel/2009-01/msg00059.html

       Client
         |
         HA
        /  \
       /    \
    AFR1    AFR2
     |        |
 Server1    Server2

Basically, if i stop glusterfsd on Server1, HA on Client switches toAFR2 as expected ; however, when i re-enable glusterfsd on Server1, thenstop glusterfsd on Server2, one of two things occurs :1. Client stops communicating entirely with the cluster (transportendpoint not connected), or

2. Client recovers and continues communicating with AFR1.
It appears to be random as to which one actually occurs.

If the client recovers and continues to communicate, and i re-enableglusterfsd on Server2, Client stops communicating immediately with thecluster - every time, guarunteed.


There are therefore two key questions :

1. In the first component, why doesn't the client switch gracefullybetween available subvolumes ?2. In the second component, why does re-enabling apreviously-unavailable subvolume crash the client ?

All relevant details are in the mail to the gluster-users list, linkedabove.


Any ideas what's going on here ?


--
Daniel Maher <dma+gluster AT witbe DOT net>

[Prev in Thread]

Current Thread

[Next in Thread]

[Gluster-devel] HA translator total failure in 2.0.0rc1, Daniel Maher <=

Prev by Date: Re: [Gluster-devel] problem with DHT
Next by Date: Re: [Gluster-devel] xfs, fstab, glusterfs
Previous by thread: [Gluster-devel] 1.3.12 segfault
Next by thread: [Gluster-devel] Open Shared Root GlusterFS Patches and HowTo
Index(es):
- Date
- Thread