
Re: [Gluster-devel] Re; Load balancing ...


From: gordan
Subject: Re: [Gluster-devel] Re; Load balancing ...
Date: Wed, 30 Apr 2008 12:52:55 +0100 (BST)
User-agent: Alpine 1.10 (LRH 962 2008-03-14)

On Wed, 30 Apr 2008, Gareth Bult wrote:

> Sorry, I'm trying to follow this but I'm coming a little unstuck ..
>
> Am I right in thinking the rolling hash / rsync solution would involve syncing the file "on open" as per the current system .. and in order to do this, the server would have to read through the entire file in order to create the hashes?
> (indeed it would need to do this on two servers to create hashes for comparison?)

Yes.
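
To make the cost concrete, here is a minimal sketch of a fixed-block variant of the rolling hash / rsync idea (illustrative Python, not GlusterFS code; BLOCK_SIZE and the helper names are assumptions). The point is that each replica has to read every byte of the file just to produce the digests, before any delta can be computed:

    import hashlib

    BLOCK_SIZE = 128 * 1024  # assumed block size

    def block_digests(path):
        """Read the whole file and return one digest per block."""
        digests = []
        with open(path, 'rb') as f:
            while True:
                block = f.read(BLOCK_SIZE)
                if not block:
                    break
                digests.append(hashlib.md5(block).hexdigest())
        return digests

    def changed_blocks(local_path, remote_digests):
        """Indices of blocks that differ and would need transferring.
        Assumes both replicas are the same length, for brevity."""
        local = block_digests(local_path)
        return [i for i, (a, b) in enumerate(zip(local, remote_digests))
                if a != b]

(Real rsync also uses a rolling weak checksum so it can cope with insertions that shift block boundaries; the full-file read cost is the same either way.)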

> So .. as a rough benchmark .. assume 50MB/sec for a standard / modern SATA drive, opening a crashed 20G file is going to take 400 seconds or over six minutes ... ? (which would also flatten two servers for the duration)

It would certainly be beneficial in cases where the network is slow (e.g. WAN replication).
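
As a back-of-envelope check on those figures (assumed rates, purely illustrative):

    # 20 GiB file scanned at ~50 MB/s vs replaying 10 MB of changes.
    FILE_SIZE    = 20 * 1024**3       # crashed file
    DISK_RATE    = 50 * 1024**2       # sequential SATA read
    JOURNAL_SIZE = 10 * 1024**2       # changes to replay

    full_scan = FILE_SIZE / DISK_RATE      # ~410 s, on each server involved
    replay    = JOURNAL_SIZE / DISK_RATE   # ~0.2 s

    print(f"full hash pass: {full_scan:.0f} s, replay: {replay:.1f} s")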

> Whereas a journal replay of 10M is going to take < 1s and be effectively transparent.
> (I'm guessing this could also be done at open time ??)

A journal per se wouldn't work, because that implies a fixed size and write-ahead logging. What would be required here is more like snapshot-style undo logging.
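
A minimal sketch of what per-server, snapshot-style undo logging would involve (an assumption about the scheme, not actual GlusterFS code; uses Unix-only os.pread/os.pwrite): before each write, the old contents of the affected region are copied into a separate log for every replica that is currently offline.

    import os

    def logged_write(fd, offset, data, undo_logs):
        """Write `data` at `offset`, first saving the old bytes into
        one undo log per offline replica."""
        old = os.pread(fd, len(data), offset)
        for log in undo_logs:                 # one open log file per offline server
            log.write(offset.to_bytes(8, 'little'))
            log.write(len(old).to_bytes(8, 'little'))
            log.write(old)
        os.pwrite(fd, data, offset)

Every application write is duplicated once per offline replica, which is the amplification described in point 2 below.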

The problem with this is that you have to:

1) Categorically establish, for the file being checked, whether each server is connected and up to date, and only start logging once a server has disconnected. This involves overhead.

2) For each server that is down at the time, every other server has to start writing the snapshot-style undo logs (which have to be kept per server) for all the files being changed. This effectively multiplies the disk write traffic on every working, up-to-date server by the number of offline servers.

The problem that then arises is that the fast(er) resyncs of small changes come at the cost of a massive slowdown in normal operation whenever multiple servers are down. As the number of servers grows, this rapidly stops being a workable solution.
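
To put a rough number on that amplification (illustrative figures only):

    # Each live server duplicates its write stream once per offline replica.
    write_rate      = 10    # MB/s of normal application writes
    offline_servers = 3

    per_server_disk_io = write_rate * (1 + offline_servers)   # 40 MB/s
    print(f"each live server now writes {per_server_disk_io} MB/s to disk")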

Gordan



