|
|
7711c0 |
From e145e366df21d291ba3cbcf2b4982598637bcc01 Mon Sep 17 00:00:00 2001
|
|
|
7711c0 |
From: Laurent Vivier <lvivier@redhat.com>
|
|
|
7711c0 |
Date: Wed, 2 Jan 2019 11:29:47 +0100
|
|
|
7711c0 |
Subject: [PATCH 1/8] spapr: Fix ibm, max-associativity-domains property number
|
|
|
7711c0 |
of nodes
|
|
|
7711c0 |
|
|
|
7711c0 |
RH-Author: Laurent Vivier <lvivier@redhat.com>
|
|
|
7711c0 |
Message-id: <20190102112948.18536-2-lvivier@redhat.com>
|
|
|
7711c0 |
Patchwork-id: 83820
|
|
|
7711c0 |
O-Subject: [RHEL-7.6 qemu-kvm-rhev PATCH 1/2] spapr: Fix ibm, max-associativity-domains property number of nodes
|
|
|
7711c0 |
Bugzilla: 1626347
|
|
|
7711c0 |
RH-Acked-by: Thomas Huth <thuth@redhat.com>
|
|
|
7711c0 |
RH-Acked-by: Serhii Popovych <spopovyc@redhat.com>
|
|
|
7711c0 |
RH-Acked-by: David Gibson <dgibson@redhat.com>
|
|
|
7711c0 |
|
|
|
7711c0 |
From: Serhii Popovych <spopovyc@redhat.com>
|
|
|
7711c0 |
|
|
|
7711c0 |
BZ: https://bugzilla.redhat.com/show_bug.cgi?id=1626347
|
|
|
7711c0 |
|
|
|
7711c0 |
Laurent Vivier reported off by one with maximum number of NUMA nodes
|
|
|
7711c0 |
provided by qemu-kvm being less by one than required according to
|
|
|
7711c0 |
description of "ibm,max-associativity-domains" property in LoPAPR.
|
|
|
7711c0 |
|
|
|
7711c0 |
It appears that I incorrectly treated LoPAPR description of this
|
|
|
7711c0 |
property assuming it provides last valid domain (NUMA node here)
|
|
|
7711c0 |
instead of maximum number of domains.
|
|
|
7711c0 |
|
|
|
7711c0 |
### Before hot-add
|
|
|
7711c0 |
|
|
|
7711c0 |
(qemu) info numa
|
|
|
7711c0 |
3 nodes
|
|
|
7711c0 |
node 0 cpus: 0
|
|
|
7711c0 |
node 0 size: 0 MB
|
|
|
7711c0 |
node 0 plugged: 0 MB
|
|
|
7711c0 |
node 1 cpus:
|
|
|
7711c0 |
node 1 size: 1024 MB
|
|
|
7711c0 |
node 1 plugged: 0 MB
|
|
|
7711c0 |
node 2 cpus:
|
|
|
7711c0 |
node 2 size: 0 MB
|
|
|
7711c0 |
node 2 plugged: 0 MB
|
|
|
7711c0 |
|
|
|
7711c0 |
$ numactl -H
|
|
|
7711c0 |
available: 2 nodes (0-1)
|
|
|
7711c0 |
node 0 cpus: 0
|
|
|
7711c0 |
node 0 size: 0 MB
|
|
|
7711c0 |
node 0 free: 0 MB
|
|
|
7711c0 |
node 1 cpus:
|
|
|
7711c0 |
node 1 size: 999 MB
|
|
|
7711c0 |
node 1 free: 658 MB
|
|
|
7711c0 |
node distances:
|
|
|
7711c0 |
node 0 1
|
|
|
7711c0 |
0: 10 40
|
|
|
7711c0 |
1: 40 10
|
|
|
7711c0 |
|
|
|
7711c0 |
### Hot-add
|
|
|
7711c0 |
|
|
|
7711c0 |
(qemu) object_add memory-backend-ram,id=mem0,size=1G
|
|
|
7711c0 |
(qemu) device_add pc-dimm,id=dimm1,memdev=mem0,node=2
|
|
|
7711c0 |
(qemu) [ 87.704898] pseries-hotplug-mem: Attempting to hot-add 4 ...
|
|
|
7711c0 |
<there is no "Initmem setup node 2 [mem 0xHEX-0xHEX]">
|
|
|
7711c0 |
[ 87.705128] lpar: Attempting to resize HPT to shift 21
|
|
|
7711c0 |
... <HPT resize messages>
|
|
|
7711c0 |
|
|
|
7711c0 |
### After hot-add
|
|
|
7711c0 |
|
|
|
7711c0 |
(qemu) info numa
|
|
|
7711c0 |
3 nodes
|
|
|
7711c0 |
node 0 cpus: 0
|
|
|
7711c0 |
node 0 size: 0 MB
|
|
|
7711c0 |
node 0 plugged: 0 MB
|
|
|
7711c0 |
node 1 cpus:
|
|
|
7711c0 |
node 1 size: 1024 MB
|
|
|
7711c0 |
node 1 plugged: 0 MB
|
|
|
7711c0 |
node 2 cpus:
|
|
|
7711c0 |
node 2 size: 1024 MB
|
|
|
7711c0 |
node 2 plugged: 1024 MB
|
|
|
7711c0 |
|
|
|
7711c0 |
$ numactl -H
|
|
|
7711c0 |
available: 2 nodes (0-1)
|
|
|
7711c0 |
^^^^^^^^^^^^^^^^^^^^^^^^
|
|
|
7711c0 |
Still only two nodes (and memory hot-added to node 0 below)
|
|
|
7711c0 |
node 0 cpus: 0
|
|
|
7711c0 |
node 0 size: 1024 MB
|
|
|
7711c0 |
node 0 free: 1021 MB
|
|
|
7711c0 |
node 1 cpus:
|
|
|
7711c0 |
node 1 size: 999 MB
|
|
|
7711c0 |
node 1 free: 658 MB
|
|
|
7711c0 |
node distances:
|
|
|
7711c0 |
node 0 1
|
|
|
7711c0 |
0: 10 40
|
|
|
7711c0 |
1: 40 10
|
|
|
7711c0 |
|
|
|
7711c0 |
After fix applied numactl(8) reports 3 nodes available and memory
|
|
|
7711c0 |
plugged into node 2 as expected.
|
|
|
7711c0 |
|
|
|
7711c0 |
>From David Gibson:
|
|
|
7711c0 |
------------------
|
|
|
7711c0 |
Qemu makes a distinction between "non NUMA" (nb_numa_nodes == 0) and
|
|
|
7711c0 |
"NUMA with one node" (nb_numa_nodes == 1). But from a PAPR guests's
|
|
|
7711c0 |
point of view these are equivalent. I don't want to present two
|
|
|
7711c0 |
different cases to the guest when we don't need to, so even though the
|
|
|
7711c0 |
guest can handle it, I'd prefer we put a '1' here for both the
|
|
|
7711c0 |
nb_numa_nodes == 0 and nb_numa_nodes == 1 case.
|
|
|
7711c0 |
|
|
|
7711c0 |
This consolidates everything discussed previously on mailing list.
|
|
|
7711c0 |
|
|
|
7711c0 |
Fixes: da9f80fbad21 ("spapr: Add ibm,max-associativity-domains property")
|
|
|
7711c0 |
Reported-by: Laurent Vivier <lvivier@redhat.com>
|
|
|
7711c0 |
Signed-off-by: Serhii Popovych <spopovyc@redhat.com>
|
|
|
7711c0 |
|
|
|
7711c0 |
Signed-off-by: David Gibson <david@gibson.dropbear.id.au>
|
|
|
7711c0 |
Reviewed-by: Greg Kurz <groug@kaod.org>
|
|
|
7711c0 |
Reviewed-by: Laurent Vivier <lvivier@redhat.com>
|
|
|
7711c0 |
(cherry picked from commit 3908a24fcb83913079d315de0ca6d598e8616dbb)
|
|
|
7711c0 |
Signed-off-by: Laurent Vivier <lvivier@redhat.com>
|
|
|
7711c0 |
Signed-off-by: Miroslav Rezanina <mrezanin@redhat.com>
|
|
|
7711c0 |
---
|
|
|
7711c0 |
hw/ppc/spapr.c | 2 +-
|
|
|
7711c0 |
1 file changed, 1 insertion(+), 1 deletion(-)
|
|
|
7711c0 |
|
|
|
7711c0 |
diff --git a/hw/ppc/spapr.c b/hw/ppc/spapr.c
|
|
|
7711c0 |
index 5f26aea..b49f377 100644
|
|
|
7711c0 |
--- a/hw/ppc/spapr.c
|
|
|
7711c0 |
+++ b/hw/ppc/spapr.c
|
|
|
7711c0 |
@@ -915,7 +915,7 @@ static void spapr_dt_rtas(sPAPRMachineState *spapr, void *fdt)
|
|
|
7711c0 |
cpu_to_be32(0),
|
|
|
7711c0 |
cpu_to_be32(0),
|
|
|
7711c0 |
cpu_to_be32(0),
|
|
|
7711c0 |
- cpu_to_be32(nb_numa_nodes ? nb_numa_nodes - 1 : 0),
|
|
|
7711c0 |
+ cpu_to_be32(nb_numa_nodes ? nb_numa_nodes : 1),
|
|
|
7711c0 |
};
|
|
|
7711c0 |
|
|
|
7711c0 |
_FDT(rtas = fdt_add_subnode(fdt, 0, "rtas"));
|
|
|
7711c0 |
--
|
|
|
7711c0 |
1.8.3.1
|
|
|
7711c0 |
|