From: Max Reitz
Subject: Re: [PATCH for-6.0? 1/3] job: Add job_wait_unpaused() for block-job-complete
Date: Fri, 9 Apr 2021 11:31:35 +0200
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:78.0) Gecko/20100101 Thunderbird/78.8.0

On 08.04.21 18:55, John Snow wrote:
On 4/8/21 12:20 PM, Max Reitz wrote:
block-job-complete can only be applied when the job is READY, not when
it is on STANDBY (ready, but paused).  Draining a job technically pauses
it (which makes a READY job enter STANDBY), and ending the drained
section does not synchronously resume it, but only schedules the job,
which will then be resumed.  So attempting to complete a job immediately
after a drained section may sometimes fail.
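
For illustration, the failure looks roughly like this on the QMP wire
(job name hypothetical; the error text follows the usual shape of
job.c's state-table check, but may not be verbatim):

    -> { "execute": "block-job-complete",
         "arguments": { "device": "mirror0" } }
    <- { "error": { "class": "GenericError",
         "desc": "Job 'mirror0' in state 'standby' cannot accept command verb 'complete'" } }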

That is bad, at least because users cannot really work around this
nicely: a job may be paused and resumed at any time, so waiting for the
job to be in the READY state and then issuing block-job-complete poses
a TOCTTOU problem.  The only way around it would be to issue
block-job-complete until it no longer fails due to the job being in the
STANDBY state, but that would not be nice.

We can solve the problem by allowing block-job-complete to be invoked on
jobs that are on STANDBY, if that status is the result of a drained
section (not because the user has paused the job), and that section has
ended.  That is, if the job is on STANDBY, but scheduled to be resumed.

Perhaps we could actually just directly allow this, seeing that mirror
is the only user of ready/complete, and that mirror_complete() could
probably work under the given circumstances, but there may be many side
effects to consider.

It is simpler to add a function job_wait_unpaused() that waits for the
job to be resumed (under said circumstances), and to make
qmp_block_job_complete() use it to delay job_complete() until then.

Buglink: https://bugzilla.redhat.com/show_bug.cgi?id=1945635
Signed-off-by: Max Reitz <mreitz@redhat.com>
---
  include/qemu/job.h | 15 +++++++++++++++
  blockdev.c         |  3 +++
  job.c              | 42 ++++++++++++++++++++++++++++++++++++++++++
  3 files changed, 60 insertions(+)

diff --git a/include/qemu/job.h b/include/qemu/job.h
index efc6fa7544..cf3082b6d7 100644
--- a/include/qemu/job.h
+++ b/include/qemu/job.h
@@ -563,4 +563,19 @@ void job_dismiss(Job **job, Error **errp);
   */
  int job_finish_sync(Job *job, void (*finish)(Job *, Error **errp), Error **errp);
+/**
+ * If the job has been paused because of a drained section, and that
+ * section has ended, wait until the job is resumed.
+ *
+ * Return 0 if the job is not paused, or if it has been successfully
+ * resumed.
+ * Return an error if the job has been paused in such a way that
+ * waiting will not resume it, i.e. if it has been paused by the user,
+ * or if it is still drained.
+ *
+ * Callers must be in the home AioContext and hold the AioContext lock
+ * of job->aio_context.
+ */
+int job_wait_unpaused(Job *job, Error **errp);
+
  #endif
diff --git a/blockdev.c b/blockdev.c
index a57590aae4..c0cc2fa364 100644
--- a/blockdev.c
+++ b/blockdev.c
@@ -3414,6 +3414,9 @@ void qmp_block_job_complete(const char *device, Error **errp)
          return;
      }
+    if (job_wait_unpaused(&job->job, errp) < 0) {
+        return;
+    }

After which point, we assume we've transitioned back to either RUNNING or READY, and

      trace_qmp_block_job_complete(job);
      job_complete(&job->job, errp);

This function checks the usual state table for permission to deliver/perform the verb.
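
(That check is roughly the following shape, simplified from
job_apply_verb() in job.c and not a verbatim copy:)

    static int job_apply_verb(Job *job, JobVerb verb, Error **errp)
    {
        assert(verb >= 0 && verb < JOB_VERB__MAX);
        /* the table maps (verb, current status) to allowed/forbidden */
        if (JobVerbTable[verb][job->status]) {
            return 0;
        }
        error_setg(errp, "Job '%s' in state '%s' cannot accept command verb '%s'",
                   job->id, JobStatus_str(job->status), JobVerb_str(verb));
        return -EPERM;
    }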

      aio_context_release(aio_context);
diff --git a/job.c b/job.c
index 289edee143..1ea30fd294 100644
--- a/job.c
+++ b/job.c
@@ -1023,3 +1023,45 @@ int job_finish_sync(Job *job, void (*finish)(Job *, Error **errp), Error **errp)
      job_unref(job);
      return ret;
  }
+
+int job_wait_unpaused(Job *job, Error **errp)
+{
+    /*
+     * Only run this function from the main context, because this is
+     * what we need, and this way we do not have to think about what
+     * happens if the user concurrently pauses the job from the main
+     * monitor.
+     */
+    assert(qemu_get_current_aio_context() == qemu_get_aio_context());
+
+    /*
+     * Quick path (e.g. so we do not get an error if pause_count > 0
+     * but the job is not even paused)
+     */
+    if (!job->paused) {
+        return 0;
+    }
+
+    /* If the user has paused the job, waiting will not help */
+    if (job->user_paused) {
+        error_setg(errp, "Job '%s' has been paused by the user", job->id);
+        return -EBUSY;
+    }
+

Or the job has encountered an error, if that error policy is set. It may be more accurate to say that the job is currently paused/halted (for some reason) and is awaiting an explicit unpause instruction.

"Job '%s' has been paused and needs to be explicitly resumed with job-resume", maybe?

Job '%s' has been paused and needs to be [explicitly] resumed
[by the user] [with job-resume]

Some combo of those runes.

Sounds good. I think I’ll go for “Job '%s' has been paused and needs to be explicitly resumed”.

+    /* Similarly, if the job is still drained, waiting will not help either */
+    if (job->pause_count > 0) {
+        error_setg(errp, "Job '%s' is blocked and cannot be unpaused", job->id);
+        return -EBUSY;
+    }
+

This leaks an internal state detail out to the caller. In which circumstances does this happen?

Hm.  Now that you ask.

The circumstance would be a concurrent drain in some other IO thread. Probably the IO thread the job runs in? I don’t know any other thread that could concurrently drain, because this function runs in the main thread, and there shouldn’t be any drain in the background.

If it is another IO thread, waiting would indeed help, so there would not be a need to error out.

Perhaps it’s possible to have a background drain in the main thread? I don’t think so, though...

Do we expect it to?

I can’t say I do.

As the user: Why is it blocked? Can I unblock it? Do I wait?

Waiting would be the strategy.

Perhaps we should bite the bullet, drop the condition and indeed just wait regardless of pause_count.
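
(That is, something like the following untested sketch, using the error
message agreed on above:)

    int job_wait_unpaused(Job *job, Error **errp)
    {
        /* as in the patch: main context only */
        assert(qemu_get_current_aio_context() == qemu_get_aio_context());

        if (job->user_paused) {
            error_setg(errp, "Job '%s' has been paused and needs to be "
                       "explicitly resumed", job->id);
            return -EBUSY;
        }

        /* No pause_count check: if the job is still drained, simply
         * wait for the drained section to end and the job to resume. */
        AIO_WAIT_WHILE(job->aio_context, job->paused);

        return 0;
    }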

+    /*
+     * This function is specifically for waiting for a job to be
+     * resumed after a drained section.  Ending the drained section
+     * includes a job_enter(), which schedules the job loop to be run,
+     * and once it does, job->paused will be cleared.  Therefore, we
+     * do not need to invoke job_enter() here.
+     */
+    AIO_WAIT_WHILE(job->aio_context, job->paused);
+
+    return 0;
+}


Looks about right to me, but you'll want Kevin's look-see for the finer details, of course.

My concern is that this adds a wait of an indefinite period to the job_complete command. We mitigate this by checking for some other internal state criteria first, and then by process of elimination deduce that it's safe to wait, as it will (likely) be very quick.

Do we open the door for ourselves to get into trouble here, either by a state we are forgetting to rule out (you'd have added it if you knew the answer to this) or a hypothetical future change where we forget to update this function?

Well.  Who knows.

The alternatives I see are:

(A) Let drained_end wait for the block jobs to be resumed. There are some details to consider there; I had some discussion around this with Kevin on Tuesday. For example, should every drained_end wait for all jobs involved to be resumed? (That would mean waiting for concurrent drained_ends, too.) Would the drained_end_counter be the right tool for the job? (None of this is unsolvable, I guess, but it would mean having another discussion.)

It would also mean that you basically just move the wait to wherever the drained_end occurs, for example to qmp_transaction(). Now, every drained_end is preceded by a drained_begin that always has to wait, so it probably isn’t bad. OTOH, if qmp_transaction() were allowed to wait for a job to be resumed, I think we can allow the same for qmp_block_job_complete().

(And there’s the fact that most of the time, not having the block job running after drained_end poses no problem. This is the first time I’m aware of an actual problem, so I think it would be preferable to wait only on the rare occasions where we have to.)

(B) Have block-job-complete be kind of asynchronous. We talked about that on IRC yesterday, and the main problem seems to be that we don’t know what we’d do with errors. We could only emit them via an event, or let the whole job fail, both of which seem like bad solutions.

(C) Perhaps mirror’s completion function works just fine when the job is paused (we just would have to skip the job_enter()). I don’t know. Someone™ would need to find out.
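
(Concretely, (C) would presumably end up looking something like this in
block/mirror.c; completely unverified:)

    /* Purely hypothetical sketch of (C); nobody has verified that
     * mirror's completion logic is safe to run on a paused job. */
    static void mirror_complete(Job *job, Error **errp)
    {
        MirrorBlockJob *s = container_of(job, MirrorBlockJob, common.job);

        /* (the function's existing checks and setup would stay as they
         * are; only the wakeup at the end changes) */
        s->should_complete = true;

        if (!job->paused) {
            job_enter(job);
        }
        /* If paused, the job sees should_complete once it is resumed. */
    }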


So I can’t see (B) working, (A) is a bit of an ant’s nest that I don’t want to poke too much (and for the problem at hand, I don’t think it would be a better solution, because it would make us wait on every drained_end instead of only when there is a real conflict). (C) might be nice, but it has the potential of giving headaches.


I can’t see future changes posing a problem, but that’s kind of the problem with future changes.

Not necessarily a blocker, I think, and this does solve a real problem fairly inexpensively.


On good faith that you understand the synchronicity issues here better than I do:

I should let you know that faith is probably misplaced.

Reviewed-by: John Snow <jsnow@redhat.com>

Well, er, thanks?  I don’t know if I can take this now. O:)

Max



