issuing [block-]job-complete to jobs in STANDBY state

qemu-block


From:	John Snow
Subject:	issuing [block-]job-complete to jobs in STANDBY state
Date:	Thu, 1 Apr 2021 15:02:35 -0400
User-agent:	Mozilla/5.0 (X11; Linux x86_64; rv:78.0) Gecko/20100101 Thunderbird/78.8.1

Hi; downstream we've run into an issue where, on VMs under heavy load with many concurrent block jobs running, the jobs might occasionally flicker into the STANDBY state, during which time they are unable to accept job-complete commands. I assume this flicker is due to child_job_drained_begin().


BZ: https://bugzilla.redhat.com/show_bug.cgi?id=1945635

It's safe to just retry the operation, but it may be difficult to understand WHY the job is paused at the application level, since the flush event may be asynchronous and unpredictable.
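A minimal sketch of the client-side retry described above, in Python. The `execute` callable is a hypothetical transport that sends one QMP command and returns the decoded reply dict (`{"return": ...}` or `{"error": ...}`); it is not part of any real library. The sketch assumes `job-complete` takes an `id` argument and that `query-jobs` reports a `status` per job. It cannot tell a drain-induced STANDBY from a user pause, so the retry count is bounded.

```python
import time
from typing import Any, Callable, Dict, Optional

# Hypothetical transport: sends one QMP command, returns the decoded reply.
QmpExecute = Callable[[str, Dict[str, Any]], Dict[str, Any]]


def job_status(execute: QmpExecute, job_id: str) -> Optional[str]:
    """Return the status string of job_id from query-jobs, or None if absent."""
    reply = execute("query-jobs", {})
    for job in reply.get("return", []):
        if job.get("id") == job_id:
            return job.get("status")
    return None


def complete_with_retry(
    execute: QmpExecute,
    job_id: str,
    attempts: int = 5,
    delay: float = 0.1,
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Issue job-complete, retrying only while the job reports 'standby'.

    Returns True once the command is accepted, False if it is rejected for
    another reason or the job is still in standby after all attempts.
    """
    for _ in range(attempts):
        reply = execute("job-complete", {"id": job_id})
        if "error" not in reply:
            return True
        if job_status(execute, job_id) != "standby":
            # Rejected for some reason other than a (possibly transient) pause.
            return False
        sleep(delay)
    return False
```

If the job stays in standby after all attempts, the caller would still need to work out whether a user or error pause is in effect before trying again.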

We could define a transition to allow COMPLETE to be applied to STANDBY jobs, but is there any risk or drawback to doing so? On QMP's side, we do know the difference between a temporary pause and a user pause/error pause (both of the latter set the user_paused flag).

I imagine it's safe to continue rejecting COMPLETE commands if user_paused is set ("No, go fix this first!") and we could define a pathway for implicitly STANDBY jobs only. However, in this case, we don't really know how long STANDBY will last. Do we have the ability to easily accept an async "intent" to complete a job without tying up the monitor?
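To make the question concrete, here is a toy Python model of the policy floated above: reject COMPLETE when user_paused is set, and record a deferred intent when the job is only implicitly in STANDBY. This is not QEMU code; `ToyJob`, `complete_requested` and `completing` are hypothetical names, and the model ignores nested pauses and the actual pivot work that completion would involve.

```python
from dataclasses import dataclass
from enum import Enum


class Status(Enum):
    READY = "ready"
    STANDBY = "standby"


@dataclass
class ToyJob:
    status: Status = Status.READY
    user_paused: bool = False         # set for user and error pauses
    complete_requested: bool = False  # hypothetical deferred "intent"
    completing: bool = False          # stands in for the real completion work

    def pause(self, user: bool = False) -> None:
        """Move to STANDBY; user=True models a user or error pause."""
        self.status = Status.STANDBY
        self.user_paused = self.user_paused or user

    def resume(self) -> None:
        """Return to READY and act on any deferred completion intent."""
        self.status = Status.READY
        self.user_paused = False
        if self.complete_requested:
            self.complete_requested = False
            self.completing = True

    def request_complete(self) -> str:
        """Apply COMPLETE now, defer it, or reject it, per the policy above."""
        if self.status is Status.READY:
            self.completing = True
            return "started"
        if self.status is Status.STANDBY and not self.user_paused:
            # Implicit (e.g. drain-induced) pause: remember the intent and
            # return immediately, so the caller is not blocked.
            self.complete_requested = True
            return "deferred"
        raise RuntimeError("job is user-paused; resume it before completing")
```

In this model the caller gets an immediate answer in all three cases, which is the property the "intent" question is asking for; whether the real completion path can safely be deferred this way is the open question.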

ATM I think only mirror uses .complete, but it looks like it tries to actually set up the pivot a good deal before delegating to the bottom half, so I worry it's not safe to try to run this when we are in the middle of a drain.


Any thoughts?

--js

