Re: [PATCH v1 0/4] vfio: report NUMA nodes for device memory


From: Ankit Agrawal
Subject: Re: [PATCH v1 0/4] vfio: report NUMA nodes for device memory
Date: Tue, 26 Sep 2023 14:52:25 +0000

>>>> Good idea.  Fundamentally the device should not be creating NUMA
>>>> nodes, the VM should be configured with NUMA nodes and the device
>>>> memory associated with those nodes.
>>>
>>> +1. That would also make it fly with DIMMs and virtio-mem, where you
>>> would want NUMA-less nodes as well (imagine passing CXL memory to a VM
>>> using virtio-mem).
>>>
>>
>> We actually do not add the device memory on the host; instead we
>> map it into the Qemu VMA using remap_pfn_range(). Please check out the
>> mmap function in the vfio-pci variant driver code managing the device:
>> https://lore.kernel.org/all/20230915025415.6762-1-ankita@nvidia.com/
>> And I think a host memory backend would need memory that is added on the
>> host.
>>
>> Moreover, since we want to pass through the entire device memory, the
>> -object memory-backend-ram would have to be passed a size equal
>> to the device memory. I wonder if that would be too much trouble
>> for an admin (or libvirt) launching the Qemu process.
>>
>> Both of these issues are avoided by exposing the device memory as a BAR, as
>> in the current implementation (referenced above), since that lets Qemu
>> naturally discover the device memory region and mmap it.
>>
>
> Just to clarify: NUMA nodes for DIMMs/NVDIMMs/virtio-mem are configured
> on the device, not on the memory backend.
> 
> e.g., -device pc-dimm,node=3,memdev=mem1,...

Agreed, but we would still have the aforementioned issues, viz.
1. The backing memory for the memory device would need to be allocated
on the host. However, we do not add the device memory on the host in this
case. Instead, the Qemu VMA is mapped to the device memory physical
address using remap_pfn_range() (see the sketch after this list).
2. The memory device needs to be passed an allocation size such that all of
the device memory is mapped into the Qemu VMA. That size may not be readily
available to the admin/libvirt.
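
For reference, a minimal sketch of what such an mmap handler in a vfio-pci
variant driver looks like; the foo_* structure and field names below are
illustrative placeholders, not the ones used in the posted series:

    /* Illustrative variant-driver mmap: map device memory into the Qemu VMA. */
    static int foo_vfio_pci_mmap(struct vfio_device *core_vdev,
                                 struct vm_area_struct *vma)
    {
        struct foo_vfio_pci_core_device *fdev =
            container_of(core_vdev, struct foo_vfio_pci_core_device,
                         core_device.vdev);
        unsigned long req_len = vma->vm_end - vma->vm_start;

        if (req_len > fdev->devmem_size)
            return -EINVAL;

        /*
         * Map the device memory's physical range straight into the
         * caller's (Qemu's) VMA. Nothing is added to the host as
         * system RAM, so there is no host-side allocation that a
         * memory backend could be pointed at.
         */
        return remap_pfn_range(vma, vma->vm_start,
                               fdev->devmem_physaddr >> PAGE_SHIFT,
                               req_len, vma->vm_page_prot);
    }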

Based on the suggestions here, can we consider something like the
following?
1. Introduce a new -numa subparam 'devnode', which tells Qemu to mark
the node with MEM_AFFINITY_HOTPLUGGABLE in the SRAT's memory affinity
structure to make it hotpluggable (a rough sketch follows this list).
2. Create several NUMA nodes with 'devnode' set, which are to be
associated with the vfio-pci device.
3. Pass the NUMA node start and count to the device so that the created
nodes can be associated with it.
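
On the Qemu side, item 1 could reuse the existing build_srat_memory()
helper when building the SRAT. The loop below is only a rough sketch of
that idea; the has_devnode field on NodeInfo is the hypothetical new bit
the 'devnode' subparam would set, and whether the entry should be
zero-sized or cover a placeholder range is left open here:

    /* Inside the machine's SRAT builder: emit a hotpluggable memory
     * affinity entry for every node the user marked with devnode=on,
     * so the guest instantiates the node and allows memory to be
     * added to it later. */
    for (i = 0; i < ms->numa_state->num_nodes; i++) {
        if (ms->numa_state->nodes[i].has_devnode) {
            build_srat_memory(table_data, 0, 0, i,
                              MEM_AFFINITY_HOTPLUGGABLE |
                              MEM_AFFINITY_ENABLED);
        }
    }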

So, the command would look something like the following.
...
        -numa node,nodeid=2,devnode=on \
        -numa node,nodeid=3,devnode=on \
        -numa node,nodeid=4,devnode=on \
        -numa node,nodeid=5,devnode=on \
        -numa node,nodeid=6,devnode=on \
        -numa node,nodeid=7,devnode=on \
        -numa node,nodeid=8,devnode=on \
        -numa node,nodeid=9,devnode=on \
        -device vfio-pci-nohotplug,host=0009:01:00.0,bus=pcie.0,addr=04.0,rombar=0,numa-node-start=2,numa-node-count=8 \

Thoughts?

