fbe740
From a7ad591f6a6b86b24b1ed030cc9b1ca5b3bf4346 Mon Sep 17 00:00:00 2001
fbe740
Message-Id: <a7ad591f6a6b86b24b1ed030cc9b1ca5b3bf4346@dist-git>
fbe740
From: Laine Stump <laine@redhat.com>
fbe740
Date: Thu, 30 Jan 2020 14:12:44 -0500
fbe740
Subject: [PATCH] docs: document <interface> subelement <teaming>
fbe740
MIME-Version: 1.0
fbe740
Content-Type: text/plain; charset=UTF-8
fbe740
Content-Transfer-Encoding: 8bit
fbe740
fbe740
Signed-off-by: Laine Stump <laine@redhat.com>
fbe740
Reviewed-by: Daniel P. Berrangé <berrange@redhat.com>
fbe740
(cherry picked from commit f0f34056ab26eaa9f903a51cd1fa155088fd640f)
fbe740
fbe740
Conflicts:
fbe740
   docs/news.xml - feature is in release 6.1.0 upstream, but
fbe740
      that release doesn't exist downstream.
fbe740
fbe740
https://bugzilla.redhat.com/1693587
fbe740
Signed-off-by: Laine Stump <laine@redhat.com>
fbe740
Message-Id: <20200130191244.24174-7-laine@redhat.com>
fbe740
Reviewed-by: Jiri Denemark <jdenemar@redhat.com>
fbe740
---
fbe740
 docs/formatdomain.html.in | 101 ++++++++++++++++++++++++++++++++++++++
fbe740
 docs/news.xml             |  28 +++++++++++
fbe740
 2 files changed, 129 insertions(+)
fbe740
fbe740
diff --git a/docs/formatdomain.html.in b/docs/formatdomain.html.in
fbe740
index 4db9c292b7..98a811bd09 100644
fbe740
--- a/docs/formatdomain.html.in
fbe740
+++ b/docs/formatdomain.html.in
fbe740
@@ -5873,6 +5873,107 @@
fbe740
 </devices>
fbe740
 ...
fbe740
 
fbe740
+    
Teaming a virtio/hostdev NIC pair
fbe740
+
fbe740
+    

fbe740
+      Since 6.1.0 (QEMU and KVM only, requires
fbe740
+        QEMU 4.2.0 or newer axnd a guest virtio-net driver supporting
fbe740
+        the "failover" feature, such as the one included in Linux
fbe740
+        kernel 4.18 and newer)
fbe740
+      
fbe740
+      The <teaming> element of two interfaces can
fbe740
+      be used to connect them as a team/bond device in the guest
fbe740
+      (assuming proper support in the hypervisor and the guest
fbe740
+      network driver).
fbe740
+    

fbe740
+
fbe740
+
fbe740
+...
fbe740
+<devices>
fbe740
+  <interface type='network'>
fbe740
+    <source network='mybridge'/>
fbe740
+    <mac address='00:11:22:33:44:55'/>
fbe740
+    <model type='virtio'/>
fbe740
+    <teaming type='persistent'/>
fbe740
+    <alias name='ua-backup0'/>
fbe740
+  </interface>
fbe740
+  <interface type='network'>
fbe740
+    <source network='hostdev-pool'/>
fbe740
+    <mac address='00:11:22:33:44:55'/>
fbe740
+    <model type='virtio'/>
fbe740
+    <teaming type='transient' persistent='ua-backup0'/>
fbe740
+  </interface>
fbe740
+</devices>
fbe740
+...
fbe740
+
fbe740
+    

fbe740
+      The <teaming> element required
fbe740
+      attribute type will be set to
fbe740
+      either "persistent" to indicate a device that
fbe740
+      should always be present in the domain,
fbe740
+      or "transient" to indicate a device that may
fbe740
+      periodically be removed, then later re-added to the domain. When
fbe740
+      type="transient", there should be a second attribute
fbe740
+      to <teaming> called "persistent"
fbe740
+      - this attribute should be set to the alias name of the other
fbe740
+      device in the pair (the one that has <teaming
fbe740
+      type="persistent'/>).
fbe740
+    

fbe740
+    

fbe740
+      In the particular case of QEMU,
fbe740
+      libvirt's <teaming> element is used to setup
fbe740
+      a virtio-net "failover" device pair. For this setup, the
fbe740
+      persistent device must be an interface with <model
fbe740
+      type="virtio"/>, and the transient device must
fbe740
+      be <interface type='hostdev'/>
fbe740
+      (or <interface type='network'/> where the
fbe740
+      referenced network defines a pool of SRIOV VFs). The guest will
fbe740
+      then have a simple network team/bond device made of the virtio
fbe740
+      NIC + hostdev NIC pair. In this configuration, the
fbe740
+      higher-performing hostdev NIC will normally be preferred for all
fbe740
+      network traffic, but when the domain is migrated, QEMU will
fbe740
+      automatically unplug the VF from the guest, and then hotplug a
fbe740
+      similar device once migration is completed; while migration is
fbe740
+      taking place, network traffic will use the virtio NIC. (Of
fbe740
+      course the emulated virtio NIC and the hostdev NIC must be
fbe740
+      connected to the same subnet for bonding to work properly).
fbe740
+    

fbe740
+    

fbe740
+      NB1: Since you must know the alias name of the virtio NIC when
fbe740
+      configuring the hostdev NIC, it will need to be manually set in
fbe740
+      the virtio NIC's configuration (as with all other manually set
fbe740
+      alias names, this means it must start with "ua-").
fbe740
+    

fbe740
+    

fbe740
+      NB2: Currently the only implementation of the guest OS
fbe740
+      virtio-net driver supporting virtio-net failover requires that
fbe740
+      the MAC addresses of the virtio and hostdev NIC must
fbe740
+      match. Since that may not always be a requirement in the future,
fbe740
+      libvirt doesn't enforce this limitation - it is up to the
fbe740
+      person/management application that is creating the configuration
fbe740
+      to assure the MAC addresses of the two devices match.
fbe740
+    

fbe740
+    

fbe740
+      NB3: Since the PCI addresses of the SRIOV VFs on the hosts that
fbe740
+      are the source and destination of the migration will almost
fbe740
+      certainly be different, either higher level management software
fbe740
+      will need to modify the <source> of the
fbe740
+      hostdev NIC (<interface type='hostdev'>) at
fbe740
+      the start of migration, or (a simpler solution) the
fbe740
+      configuration will need to use a libvirt "hostdev" virtual
fbe740
+      network that maintains a pool of such devices, as is implied in
fbe740
+      the example's use of the libvirt network named "hostdev-pool" -
fbe740
+      as long as the hostdev network pools on both hosts have the same
fbe740
+      name, libvirt itself will take care of allocating an appropriate
fbe740
+      device on both ends of the migration. Similarly the XML for the
fbe740
+      virtio interface must also either work correctly unmodified on
fbe740
+      both the source and destination of the migration (e.g. by
fbe740
+      connecting to the same bridge device on both hosts, or by using
fbe740
+      the same virtual network), or the management software must
fbe740
+      properly modify the interface XML during migration so that the
fbe740
+      virtio device remains connected to the same network segment
fbe740
+      before and after migration.
fbe740
+    

fbe740
 
fbe740
     
Multicast tunnel
fbe740
 
fbe740
diff --git a/docs/news.xml b/docs/news.xml
fbe740
index 731f010297..408ffc8518 100644
fbe740
--- a/docs/news.xml
fbe740
+++ b/docs/news.xml
fbe740
@@ -65,6 +65,34 @@
fbe740
       </change>
fbe740
     </section>
fbe740
     <section title="New features">
fbe740
+      <change>
fbe740
+        <summary>
fbe740
+          support for virtio+hostdev NIC <teaming>
fbe740
+        </summary>
fbe740
+        <description>
fbe740
+          QEMU 4.2.0 and later, combined with a sufficiently recent
fbe740
+          guest virtio-net driver (e.g. the driver included in Linux
fbe740
+          kernel 4.18 and later), supports setting up a simple network
fbe740
+          bond device comprised of one virtio emulated NIC and one
fbe740
+          hostdev NIC (which must be an SRIOV VF). (in QEMU, this is
fbe740
+          known as the "virtio failover" feature). The allure of this
fbe740
+          setup is that the bond will always favor the hostdev device,
fbe740
+          providing better performance, until the guest is migrated -
fbe740
+          at that time QEMU will automatically unplug the hostdev NIC
fbe740
+          and the bond will send all traffic via the virtio NIC until
fbe740
+          migration is completed, then QEMU on the destination side
fbe740
+          will hotplug a new hostdev NIC and the bond will switch back
fbe740
+          to using the hostdev for network traffic. The result is that
fbe740
+          guests desiring the extra performance of a hostdev NIC are
fbe740
+          now migratable without network downtime (performance is just
fbe740
+          degraded during migration) and without requiring a
fbe740
+          complicated bonding configuration in the guest OS network
fbe740
+          config and complicated unplug/replug logic in the management
fbe740
+          application on the host - it can instead all be accomplished
fbe740
+          in libvirt with the interface <teaming> subelement
fbe740
+          "type" and "persistent" attributes.
fbe740
+        </description>
fbe740
+      </change>
fbe740
       <change>
fbe740
         <summary>
fbe740
           new PCI hostdev address type: unassigned
fbe740
-- 
fbe740
2.25.0
fbe740