Re: [Gluster-devel] regarding flush taking full inode locks

gluster-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Gluster-devel] regarding flush taking full inode locks

From:	Anand Avati
Subject:	Re: [Gluster-devel] regarding flush taking full inode locks
Date:	Fri, 02 Nov 2012 16:59:04 -0700
User-agent:	Mozilla/5.0 (X11; Linux x86_64; rv:10.0.4) Gecko/20120422 Thunderbird/10.0.4

On 11/02/2012 06:44 AM, Pranith Kumar Karampuri wrote:

hi Avati,
     There was some reason Flush is made into a transaction with full file 
inode-locks for some optimization. I remember you saying that the optimization 
is not valid anymore. Could you please let us know the history of this. 
Basically VMs are hanging in some cases because of the flush fop while 
self-heal in progress. We need to evaluate if this full inode-lock can be 
removed or not.

Pranith.


The reason flush() was made a transaction was twofold:

1. before piggyback-changelog optimization was introduced, there existedanother similar optimization in a much cruder form where changelog wouldbe written on first write, and cleared on close() [i.e, flush()].

2. write-behind (back then) was not intelligent enough to hold offissuing a flush downwards till previous writes were fulfilled.

So flush was doing a full file inodelk in order to confirm that allprevious writes were complete, and that unsetting the changelog would besafe. It also helped the other side of the problem, that the nextarriving write would wait till flush unlocked, and was guaranteed to seeitself as the first write (and would set the changelog again) - (yes, wewere really using inodelk() to synchronize on inode_ctx flag instead ofa mutex)


That had a bunch of issues -

1. flush() is not reliable. Sometimes you don't get flushes. Sometimesyou get them twice, or more. They loosely correlate with close(), notstrict enough to base changelog clearing behavior on.

2. It was too simplistic and the code was such that if any write failurehappened, handling it was very messy and almost always the onlychangelog we would be left with was a FOOL-FOOL state (unless there wereno failure at all).

The current piggyback-changelog fixes all those issues. We don't needflush to be a transaction anymore.

We still need to synchronize flush() against lk() for keeping up POSIXconformance, but I feel that is a separate (open) problem.


Avati

[Prev in Thread]

Current Thread

[Next in Thread]

Re: [Gluster-devel] regarding flush taking full inode locks, Anand Avati <=

Prev by Date: [Gluster-devel] failure to get the build log failure
Next by Date: [Gluster-devel] Questions on SSL in 3.4.0qa2
Previous by thread: [Gluster-devel] failure to get the build log failure
Next by thread: [Gluster-devel] Questions on SSL in 3.4.0qa2
Index(es):
- Date
- Thread