Re: [PATCH 12/13] block: Don't poll in bdrv_replace_child

qemu-block

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [PATCH 12/13] block: Don't poll in bdrv_replace_child_noperm()

From:	Hanna Reitz
Subject:	Re: [PATCH 12/13] block: Don't poll in bdrv_replace_child_noperm()
Date:	Mon, 14 Nov 2022 21:22:32 +0100
User-agent:	Mozilla/5.0 (X11; Linux x86_64; rv:91.0) Gecko/20100101 Thunderbird/91.11.0

On 08.11.22 13:37, Kevin Wolf wrote:

In order to make sure that bdrv_replace_child_noperm() doesn't have to
poll any more, get rid of the bdrv_parent_drained_begin_single() call.

This is possible now because we can require that the child is already
drained when the function is called (it better be, having in-flight
requests while modifying the graph isn't going to end well!) and we
don't call the parent drain callbacks more than once.

The additional drain calls needed in callers cause the test case to run
its code in the drain handler too early (bdrv_attach_child() drains
now), so modify it to only enable the code after the test setup has
completed.

Signed-off-by: Kevin Wolf <kwolf@redhat.com>
---
  include/block/block-io.h     |  8 ++++
  block.c                      | 72 +++++++++++++++++++++++++-----------
  block/io.c                   |  2 +-
  tests/unit/test-bdrv-drain.c | 10 +++++
  4 files changed, 70 insertions(+), 22 deletions(-)

I find this change complicated. I understand it’s the point of theseries, but I find it difficult to grasp. But I guess there can be nodrain series without such a patch.

As usual, I was very skeptical of the code at first, and over timeslowly realized that I’m mostly confused by the comments, and the codeseems fine. Ah, well.


[...]

diff --git a/block.c b/block.c
index 5f5f79cd16..12039e9b8a 100644
--- a/block.c
+++ b/block.c


[...]

@@ -2414,12 +2428,20 @@ static TransactionActionDrv bdrv_replace_child_drv = {
   *
   * Note: real unref of old_bs is done only on commit.
   *
+ * Both child and new_bs (if non-NULL) must be drained. new_bs must be kept
+ * drained until the transaction is completed (this automatically implies that
+ * child remains drained, too).

I find “child” extremely ambiguous. The problem is that there generallyis no way to drain a BdrvChild object, is there? You can only drain theBDS in it, which then drains the parent through the BdrvChild object. Historically, I don’t think there was ever a place where we cared aboutthe BdrvChild object between the two to be drained, was there? I mean,now there apparently is, in bdrv_child_attach_common(), but that’s adifferent story.

So the problem is that “draining a BdrvChild object” generally appearsin the context of bdrv_parent_drained_*() functions, i.e. actuallyfunctions draining the parent. Which makes it a bit confusing to referto a BdrvChild object just as “child”.

I know that “child” here refers to the variable (or does it not?), butthat is why I really prefer marking variables that are just plainEnglish words, e.g. as @child or `child`, so it’s clear they are a nameand not a noun.

In any case, because the concept is generally to drain the `child->bs`instead of the BdrvChild object directly, I understand the comment tomean: “Both the old child (`child->bs`) and `new_bs` (if non-NULL) mustbe drained. `new_bs` must be kept drained until the transaction iscompleted. This implies that the parent too will be kept drained untilthe transaction is completed by the BdrvChild object `child`.”

Or am I misunderstanding something, and the distinction between `child`and `child->bs` and the parent node is important here? (Would be good toknow. :))

+ *
   * The function doesn't update permissions, caller is responsible for this.
   */
  static void bdrv_replace_child_tran(BdrvChild *child, BlockDriverState 
*new_bs,
                                      Transaction *tran)
  {
      BdrvReplaceChildState *s = g_new(BdrvReplaceChildState, 1);
+
+    assert(child->parent_quiesce_counter);
+    assert(!new_bs || new_bs->quiesce_counter);
+
      *s = (BdrvReplaceChildState) {
          .child = child,
          .old_bs = child->bs,
@@ -2818,6 +2840,12 @@ static void bdrv_replace_child_noperm(BdrvChild *child,

This function now has its callers fulfill kind of a complicatedcontract. I would prefer that to be written out in a doc comment,especially because it sounds like the assertions can’t cover everything(i.e. callers that remove a child are required to have stopped issuingrequests to that child, but they are free to do that in any way theywant, so no assertion will check for it here).

      int new_bs_quiesce_counter;

assert(!child->frozen);

+    /*
+     * When removing the child, it's the callers responsibility to make sure
+     * that no requests are in flight any more. Usually the parent is drained,
+     * but not through child->parent_quiesce_counter.
+     */

When I see a comment above an assertion, I immediately assume it isgoing to describe what the assertion checks. Unless I’mmisunderstanding something (again?), this comment actually describeswhat the assertion *does not* check. I find that confusing, especiallybecause the comment leads with “it’s the caller’s responsibility”, whichto me implies “and that’s why we check it here in this assertion”,because assertions are there to verify that contracts are met.

The assertion verifies that the parent must be drained (through @child),unless the child is removed, which case isn’t covered by the assertion. That “isn’t covered” is then described by the comment, right?

I’d prefer the comment to lead with describing what the assertion doescheck, and then transitioning to “But in case the child is removed, weignore that, and just note that it’s the caller’s responsibility to...”.

Also, the comment doesn’t explicitly say why we don’t check it in theassertion. It says “usually” and “child->parent_quiesce_counter”, whichimplies “can’t get any information from child->parent_quiesce_counter,and regardless, callers can do what they want do achieve quiescing inregards to this child, so there’s nothing we can check”. It feels likewe can just say outright that there’s an informal contract that we can’tformally verify here, but callers naturally still must adhere to it. Itwould be interesting to know (and note) why that is, though, i.e. why wecan’t have parents be drained through the BdrvChild object for the childthat is being removed.

I understand the intention behind the assertion to be: “We require theparent not to have in-flight requests to the BdrvChild objectmanipulated here. In most cases, we verify that by requiring the parentbe drained through this BdrvChild object. However, when a child isbeing removed, we skip formal verification, because we leave callersfree in deciding how to ensure that no requests are in flight. Usually,they will still have the parent be drained (even if not through thisBdrvChild object), but we don’t require that.”

I may well be wrong, but then it would be good for a comment to correctme. :)

(Interestingly, because bdrv_replace_child_noperm() no longer pollsitself, it can’t know for sure that `child->parent_quiesce_counter > 0`means that there are no requests in flight.)

+    assert(!new_bs || child->parent_quiesce_counter);
      assert(old_bs != new_bs);
      GLOBAL_STATE_CODE();


[...]

@@ -2865,9 +2875,6 @@ static void bdrv_replace_child_noperm(BdrvChild *child,
      /*
       * If the old child node was drained but the new one is not, allow

This now also covers the case where there was no old child node, but theparent was simply drained via an empty BdrvChild by the caller.

       * requests to come in only after the new node has been attached.
-     *
-     * Update new_bs_quiesce_counter because bdrv_parent_drained_begin_single()
-     * polls, which could have changed the value.
       */
      new_bs_quiesce_counter = (new_bs ? new_bs->quiesce_counter : 0);
      if (!new_bs_quiesce_counter && child->parent_quiesce_counter) {
@@ -3004,6 +3011,12 @@ static BdrvChild 
*bdrv_attach_child_common(BlockDriverState *child_bs,
      }

bdrv_ref(child_bs);

+    /*
+     * Let every new BdrvChild start drained, inserting it in the graph with
+     * bdrv_replace_child_noperm() will undrain it if the child node is not
+     * drained. The child was only just created, so polling is not necessary.

I feel like this is hiding some complexity. Unless I missed something,draining a BdrvChild always meant draining the parent. But here, itabsolutely does not mean that, and maybe that deserves a big warning sign?

Beginning a drain without poll means quiescing. You assert that therecan be no requests to the new child, which I agree on[1]. Thecombination of no new requests coming in, and no requests being there atthis point is what being drained means. So @new_child is indeed “drained”.

But the parent isn’t drained, because it isn’t polled. There may stillbe requests in flight to its other children. That’s really interesting,and I found it extremely confusing until I wrote ten paragraphs in replyhere and scrapped most of them again. Whenever I find this to be myreaction to something, I really wish for a detailed comment thatexplains the situation.


I would like the comment to:

- Expand on what “only just created” means. As it’s written, that couldmean relying on a race condition. At which point would the parent beable to send requests? (I assume either the .attach() inbdrv_replace_child_noperm(), or when this function returns, whichevercomes first. (The former always comes first.))- Say in more detail that calling bdrv_parent_drained_begin_single()without polling will quiesce the parent, preventing new requests fromappearing.- Note that because there are no requests in flight, and because no newrequests can then appear, the BdrvChild is drained.- Note that the parent is only quiesced, not drained, and may still haverequests in flight to other children, but naturally we don’t care aboutthem.

I feel like the comment tries to hide all that complexity simply byavoiding the word “parent”.

[1] As far as I can piece together, no requests to the new child canhave started yet, because this function creates the BdrvChild object, sobefore it is returned to the caller (or BdrvChildClass.attach() iscalled in bdrv_replace_child_noperm()), the block driver won’t generaterequests to it.


Hanna

+     */
+    bdrv_parent_drained_begin_single(new_child, false);
      bdrv_replace_child_noperm(new_child, child_bs);
BdrvAttachChildCommonState *s = g_new(BdrvAttachChildCommonState, 1);

[Prev in Thread]

Current Thread

[Next in Thread]

Re: [PATCH 08/13] stream: Replace subtree drain with a single node drain, (continued)
- [PATCH 09/13] block: Remove subtree drains, Kevin Wolf, 2022/11/08
  - Re: [PATCH 09/13] block: Remove subtree drains, Vladimir Sementsov-Ogievskiy, 2022/11/09
  - Re: [PATCH 09/13] block: Remove subtree drains, Hanna Reitz, 2022/11/14
- [PATCH 11/13] block: Remove ignore_bds_parents parameter from drain functions, Kevin Wolf, 2022/11/08
  - Re: [PATCH 11/13] block: Remove ignore_bds_parents parameter from drain functions, Vladimir Sementsov-Ogievskiy, 2022/11/09
  - Re: [PATCH 11/13] block: Remove ignore_bds_parents parameter from drain functions, Hanna Reitz, 2022/11/14
- [PATCH 12/13] block: Don't poll in bdrv_replace_child_noperm(), Kevin Wolf, 2022/11/08
  - Re: [PATCH 12/13] block: Don't poll in bdrv_replace_child_noperm(), Emanuele Giuseppe Esposito, 2022/11/11
  - Re: [PATCH 12/13] block: Don't poll in bdrv_replace_child_noperm(), Hanna Reitz <=
    - Re: [PATCH 12/13] block: Don't poll in bdrv_replace_child_noperm(), Kevin Wolf, 2022/11/17
- [PATCH 13/13] block: Remove poll parameter from bdrv_parent_drained_begin_single(), Kevin Wolf, 2022/11/08
  - Re: [PATCH 13/13] block: Remove poll parameter from bdrv_parent_drained_begin_single(), Hanna Reitz, 2022/11/14
- Re: [PATCH 00/13] block: Simplify drain, Stefan Hajnoczi, 2022/11/10
- Re: [PATCH 00/13] block: Simplify drain, Emanuele Giuseppe Esposito, 2022/11/11

Prev by Date: Re: [PATCH 13/13] block: Remove poll parameter from bdrv_parent_drained_begin_single()
Next by Date: Re: [PATCH 02/13] test-bdrv-drain: Don't yield in .bdrv_co_drained_begin/end()
Previous by thread: Re: [PATCH 12/13] block: Don't poll in bdrv_replace_child_noperm()
Next by thread: Re: [PATCH 12/13] block: Don't poll in bdrv_replace_child_noperm()
Index(es):
- Date
- Thread