Blame SOURCES/0044-tests-Add-broken-files-for-all-broken-tests.patch

b33395
From daa86d6634761796ada1f535c13e47fdd3cc95eb Mon Sep 17 00:00:00 2001
b33395
From: Logan Gunthorpe <logang@deltatee.com>
b33395
Date: Wed, 22 Jun 2022 14:25:19 -0600
2ad819
Subject: [PATCH 44/83] tests: Add broken files for all broken tests
b33395
b33395
Each broken file contains the rough frequency of brokeness as well
b33395
as a brief explanation of what happens when it breaks. Estimates
b33395
of failure rates are not statistically significant and can vary
b33395
run to run.
b33395
b33395
This is really just a view from my window. Tests were done on a
b33395
small VM with the default loop devices, not real hardware. We've
b33395
seen different kernel configurations can cause bugs to appear as well
b33395
(ie. different block schedulers). It may also be that different race
b33395
conditions will be seen on machines with different performance
b33395
characteristics.
b33395
b33395
These annotations were done with the kernel currently in md/md-next:
b33395
b33395
 facef3b96c5b ("md: Notify sysfs sync_completed in md_reap_sync_thread()")
b33395
b33395
Signed-off-by: Logan Gunthorpe <logang@deltatee.com>
b33395
Signed-off-by: Jes Sorensen <jes@trained-monkey.org>
b33395
---
b33395
 tests/01r5integ.broken                     |  7 ++++
b33395
 tests/01raid6integ.broken                  |  7 ++++
b33395
 tests/04r5swap.broken                      |  7 ++++
b33395
 tests/07autoassemble.broken                |  8 ++++
b33395
 tests/07autodetect.broken                  |  5 +++
b33395
 tests/07changelevelintr.broken             |  9 +++++
b33395
 tests/07changelevels.broken                |  9 +++++
b33395
 tests/07reshape5intr.broken                | 45 ++++++++++++++++++++++
b33395
 tests/07revert-grow.broken                 | 31 +++++++++++++++
b33395
 tests/07revert-shrink.broken               |  9 +++++
b33395
 tests/07testreshape5.broken                | 12 ++++++
b33395
 tests/09imsm-assemble.broken               |  6 +++
b33395
 tests/09imsm-create-fail-rebuild.broken    |  5 +++
b33395
 tests/09imsm-overlap.broken                |  7 ++++
b33395
 tests/10ddf-assemble-missing.broken        |  6 +++
b33395
 tests/10ddf-fail-create-race.broken        |  7 ++++
b33395
 tests/10ddf-fail-two-spares.broken         |  5 +++
b33395
 tests/10ddf-incremental-wrong-order.broken |  9 +++++
b33395
 tests/14imsm-r1_2d-grow-r1_3d.broken       |  5 +++
b33395
 tests/14imsm-r1_2d-takeover-r0_2d.broken   |  6 +++
b33395
 tests/18imsm-r10_4d-takeover-r0_2d.broken  |  5 +++
b33395
 tests/18imsm-r1_2d-takeover-r0_1d.broken   |  6 +++
b33395
 tests/19raid6auto-repair.broken            |  5 +++
b33395
 tests/19raid6repair.broken                 |  5 +++
b33395
 24 files changed, 226 insertions(+)
b33395
 create mode 100644 tests/01r5integ.broken
b33395
 create mode 100644 tests/01raid6integ.broken
b33395
 create mode 100644 tests/04r5swap.broken
b33395
 create mode 100644 tests/07autoassemble.broken
b33395
 create mode 100644 tests/07autodetect.broken
b33395
 create mode 100644 tests/07changelevelintr.broken
b33395
 create mode 100644 tests/07changelevels.broken
b33395
 create mode 100644 tests/07reshape5intr.broken
b33395
 create mode 100644 tests/07revert-grow.broken
b33395
 create mode 100644 tests/07revert-shrink.broken
b33395
 create mode 100644 tests/07testreshape5.broken
b33395
 create mode 100644 tests/09imsm-assemble.broken
b33395
 create mode 100644 tests/09imsm-create-fail-rebuild.broken
b33395
 create mode 100644 tests/09imsm-overlap.broken
b33395
 create mode 100644 tests/10ddf-assemble-missing.broken
b33395
 create mode 100644 tests/10ddf-fail-create-race.broken
b33395
 create mode 100644 tests/10ddf-fail-two-spares.broken
b33395
 create mode 100644 tests/10ddf-incremental-wrong-order.broken
b33395
 create mode 100644 tests/14imsm-r1_2d-grow-r1_3d.broken
b33395
 create mode 100644 tests/14imsm-r1_2d-takeover-r0_2d.broken
b33395
 create mode 100644 tests/18imsm-r10_4d-takeover-r0_2d.broken
b33395
 create mode 100644 tests/18imsm-r1_2d-takeover-r0_1d.broken
b33395
 create mode 100644 tests/19raid6auto-repair.broken
b33395
 create mode 100644 tests/19raid6repair.broken
b33395
b33395
diff --git a/tests/01r5integ.broken b/tests/01r5integ.broken
b33395
new file mode 100644
b33395
index 00000000..20737637
b33395
--- /dev/null
b33395
+++ b/tests/01r5integ.broken
b33395
@@ -0,0 +1,7 @@
b33395
+fails rarely
b33395
+
b33395
+Fails about 1 in every 30 runs with a sha mismatch error:
b33395
+
b33395
+    c49ab26e1b01def7874af9b8a6d6d0c29fdfafe6 /dev/md0 does not match
b33395
+    15dc2f73262f811ada53c65e505ceec9cf025cb9 /dev/md0 with /dev/loop3
b33395
+    missing
b33395
diff --git a/tests/01raid6integ.broken b/tests/01raid6integ.broken
b33395
new file mode 100644
b33395
index 00000000..1df735f0
b33395
--- /dev/null
b33395
+++ b/tests/01raid6integ.broken
b33395
@@ -0,0 +1,7 @@
b33395
+fails infrequently
b33395
+
b33395
+Fails about 1 in 5 with a sha mismatch:
b33395
+
b33395
+    8286c2bc045ae2cfe9f8b7ae3a898fa25db6926f /dev/md0 does not match
b33395
+    a083a0738b58caab37fd568b91b177035ded37df /dev/md0 with /dev/loop2 and
b33395
+    /dev/loop3 missing
b33395
diff --git a/tests/04r5swap.broken b/tests/04r5swap.broken
b33395
new file mode 100644
b33395
index 00000000..e38987db
b33395
--- /dev/null
b33395
+++ b/tests/04r5swap.broken
b33395
@@ -0,0 +1,7 @@
b33395
+always fails
b33395
+
b33395
+Fails with errors:
b33395
+
b33395
+  mdadm: /dev/loop0 has no superblock - assembly aborted
b33395
+
b33395
+   ERROR: no recovery happening
b33395
diff --git a/tests/07autoassemble.broken b/tests/07autoassemble.broken
b33395
new file mode 100644
b33395
index 00000000..8be09407
b33395
--- /dev/null
b33395
+++ b/tests/07autoassemble.broken
b33395
@@ -0,0 +1,8 @@
b33395
+always fails
b33395
+
b33395
+Prints lots of messages, but the array doesn't assemble. Error
b33395
+possibly related to:
b33395
+
b33395
+  mdadm: /dev/md/1 is busy - skipping
b33395
+  mdadm: no recogniseable superblock on /dev/md/testing:0
b33395
+  mdadm: /dev/md/2 is busy - skipping
b33395
diff --git a/tests/07autodetect.broken b/tests/07autodetect.broken
b33395
new file mode 100644
b33395
index 00000000..294954a1
b33395
--- /dev/null
b33395
+++ b/tests/07autodetect.broken
b33395
@@ -0,0 +1,5 @@
b33395
+always fails
b33395
+
b33395
+Fails with error:
b33395
+
b33395
+    ERROR: no resync happening
b33395
diff --git a/tests/07changelevelintr.broken b/tests/07changelevelintr.broken
b33395
new file mode 100644
b33395
index 00000000..284b4906
b33395
--- /dev/null
b33395
+++ b/tests/07changelevelintr.broken
b33395
@@ -0,0 +1,9 @@
b33395
+always fails
b33395
+
b33395
+Fails with errors:
b33395
+
b33395
+  mdadm: this change will reduce the size of the array.
b33395
+         use --grow --array-size first to truncate array.
b33395
+         e.g. mdadm --grow /dev/md0 --array-size 56832
b33395
+
b33395
+  ERROR: no reshape happening
b33395
diff --git a/tests/07changelevels.broken b/tests/07changelevels.broken
b33395
new file mode 100644
b33395
index 00000000..9b930d93
b33395
--- /dev/null
b33395
+++ b/tests/07changelevels.broken
b33395
@@ -0,0 +1,9 @@
b33395
+always fails
b33395
+
b33395
+Fails with errors:
b33395
+
b33395
+    mdadm: /dev/loop0 is smaller than given size. 18976K < 19968K + metadata
b33395
+    mdadm: /dev/loop1 is smaller than given size. 18976K < 19968K + metadata
b33395
+    mdadm: /dev/loop2 is smaller than given size. 18976K < 19968K + metadata
b33395
+
b33395
+    ERROR: /dev/md0 isn't a block device.
b33395
diff --git a/tests/07reshape5intr.broken b/tests/07reshape5intr.broken
b33395
new file mode 100644
b33395
index 00000000..efe52a66
b33395
--- /dev/null
b33395
+++ b/tests/07reshape5intr.broken
b33395
@@ -0,0 +1,45 @@
b33395
+always fails
b33395
+
b33395
+This patch, recently added to md-next causes the test to always fail:
b33395
+
b33395
+7e6ba434cc60 ("md: don't unregister sync_thread with reconfig_mutex
b33395
+held")
b33395
+
b33395
+The new error is simply:
b33395
+
b33395
+   ERROR: no reshape happening
b33395
+
b33395
+Before the patch, the error seen is below.
b33395
+
b33395
+--
b33395
+
b33395
+fails infrequently
b33395
+
b33395
+Fails roughly 1 in 4 runs with errors:
b33395
+
b33395
+    mdadm: Merging with already-assembled /dev/md/0
b33395
+    mdadm: cannot re-read metadata from /dev/loop6 - aborting
b33395
+
b33395
+    ERROR: no reshape happening
b33395
+
b33395
+Also have seen a random deadlock:
b33395
+
b33395
+     INFO: task mdadm:109702 blocked for more than 30 seconds.
b33395
+           Not tainted 5.18.0-rc3-eid-vmlocalyes-dbg-00095-g3c2b5427979d #2040
b33395
+     "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
b33395
+     task:mdadm           state:D stack:    0 pid:109702 ppid:     1 flags:0x00004000
b33395
+     Call Trace:
b33395
+      <TASK>
b33395
+      __schedule+0x67e/0x13b0
b33395
+      schedule+0x82/0x110
b33395
+      mddev_suspend+0x2e1/0x330
b33395
+      suspend_lo_store+0xbd/0x140
b33395
+      md_attr_store+0xcb/0x130
b33395
+      sysfs_kf_write+0x89/0xb0
b33395
+      kernfs_fop_write_iter+0x202/0x2c0
b33395
+      new_sync_write+0x222/0x330
b33395
+      vfs_write+0x3bc/0x4d0
b33395
+      ksys_write+0xd9/0x180
b33395
+      __x64_sys_write+0x43/0x50
b33395
+      do_syscall_64+0x3b/0x90
b33395
+      entry_SYSCALL_64_after_hwframe+0x44/0xae
b33395
diff --git a/tests/07revert-grow.broken b/tests/07revert-grow.broken
b33395
new file mode 100644
b33395
index 00000000..9b6db86f
b33395
--- /dev/null
b33395
+++ b/tests/07revert-grow.broken
b33395
@@ -0,0 +1,31 @@
b33395
+always fails
b33395
+
b33395
+This patch, recently added to md-next causes the test to always fail:
b33395
+
b33395
+7e6ba434cc60 ("md: don't unregister sync_thread with reconfig_mutex held")
b33395
+
b33395
+The errors are:
b33395
+
b33395
+    mdadm: No active reshape to revert on /dev/loop0
b33395
+    ERROR: active raid5 not found
b33395
+
b33395
+Before the patch, the error seen is below.
b33395
+
b33395
+--
b33395
+
b33395
+fails rarely
b33395
+
b33395
+Fails about 1 in every 30 runs with errors:
b33395
+
b33395
+    mdadm: Merging with already-assembled /dev/md/0
b33395
+    mdadm: backup file /tmp/md-backup inaccessible: No such file or directory
b33395
+    mdadm: failed to add /dev/loop1 to /dev/md/0: Invalid argument
b33395
+    mdadm: failed to add /dev/loop2 to /dev/md/0: Invalid argument
b33395
+    mdadm: failed to add /dev/loop3 to /dev/md/0: Invalid argument
b33395
+    mdadm: failed to add /dev/loop0 to /dev/md/0: Invalid argument
b33395
+    mdadm: /dev/md/0 assembled from 1 drive - need all 5 to start it
b33395
+            (use --run to insist).
b33395
+
b33395
+    grep: /sys/block/md*/md/sync_action: No such file or directory
b33395
+
b33395
+    ERROR: active raid5 not found
b33395
diff --git a/tests/07revert-shrink.broken b/tests/07revert-shrink.broken
b33395
new file mode 100644
b33395
index 00000000..c33c39ec
b33395
--- /dev/null
b33395
+++ b/tests/07revert-shrink.broken
b33395
@@ -0,0 +1,9 @@
b33395
+always fails
b33395
+
b33395
+Fails with errors:
b33395
+
b33395
+    mdadm: this change will reduce the size of the array.
b33395
+           use --grow --array-size first to truncate array.
b33395
+           e.g. mdadm --grow /dev/md0 --array-size 53760
b33395
+
b33395
+    ERROR: active raid5 not found
b33395
diff --git a/tests/07testreshape5.broken b/tests/07testreshape5.broken
b33395
new file mode 100644
b33395
index 00000000..a8ce03e4
b33395
--- /dev/null
b33395
+++ b/tests/07testreshape5.broken
b33395
@@ -0,0 +1,12 @@
b33395
+always fails
b33395
+
b33395
+Test seems to run 'test_stripe' at $dir directory, but $dir is never
b33395
+set. If $dir is adjusted to $PWD, the test still fails with:
b33395
+
b33395
+    mdadm: /dev/loop2 is not suitable for this array.
b33395
+    mdadm: create aborted
b33395
+    ++ return 1
b33395
+    ++ cmp -s -n 8192 /dev/md0 /tmp/RandFile
b33395
+    ++ echo cmp failed
b33395
+    cmp failed
b33395
+    ++ exit 2
b33395
diff --git a/tests/09imsm-assemble.broken b/tests/09imsm-assemble.broken
b33395
new file mode 100644
b33395
index 00000000..a6d4d5cf
b33395
--- /dev/null
b33395
+++ b/tests/09imsm-assemble.broken
b33395
@@ -0,0 +1,6 @@
b33395
+fails infrequently
b33395
+
b33395
+Fails roughly 1 in 10 runs with errors:
b33395
+
b33395
+    mdadm: /dev/loop2 is still in use, cannot remove.
b33395
+    /dev/loop2 removal from /dev/md/container should have succeeded
b33395
diff --git a/tests/09imsm-create-fail-rebuild.broken b/tests/09imsm-create-fail-rebuild.broken
b33395
new file mode 100644
b33395
index 00000000..40c4b294
b33395
--- /dev/null
b33395
+++ b/tests/09imsm-create-fail-rebuild.broken
b33395
@@ -0,0 +1,5 @@
b33395
+always fails
b33395
+
b33395
+Fails with error:
b33395
+
b33395
+    **Error**: Array size mismatch - expected 3072, actual 16384
b33395
diff --git a/tests/09imsm-overlap.broken b/tests/09imsm-overlap.broken
b33395
new file mode 100644
b33395
index 00000000..e7ccab76
b33395
--- /dev/null
b33395
+++ b/tests/09imsm-overlap.broken
b33395
@@ -0,0 +1,7 @@
b33395
+always fails
b33395
+
b33395
+Fails with errors:
b33395
+
b33395
+    **Error**: Offset mismatch - expected 15360, actual 0
b33395
+    **Error**: Offset mismatch - expected 15360, actual 0
b33395
+    /dev/md/vol3 failed check
b33395
diff --git a/tests/10ddf-assemble-missing.broken b/tests/10ddf-assemble-missing.broken
b33395
new file mode 100644
b33395
index 00000000..bfd8d103
b33395
--- /dev/null
b33395
+++ b/tests/10ddf-assemble-missing.broken
b33395
@@ -0,0 +1,6 @@
b33395
+always fails
b33395
+
b33395
+Fails with errors:
b33395
+
b33395
+    ERROR: /dev/md/vol0 has unexpected state on /dev/loop10
b33395
+    ERROR: unexpected number of online disks on /dev/loop10
b33395
diff --git a/tests/10ddf-fail-create-race.broken b/tests/10ddf-fail-create-race.broken
b33395
new file mode 100644
b33395
index 00000000..6c0df023
b33395
--- /dev/null
b33395
+++ b/tests/10ddf-fail-create-race.broken
b33395
@@ -0,0 +1,7 @@
b33395
+usually fails
b33395
+
b33395
+Fails about 9 out of 10 times with many errors:
b33395
+
b33395
+    mdadm: cannot open MISSING: No such file or directory
b33395
+    ERROR: non-degraded array found
b33395
+    ERROR: disk 0 not marked as failed in meta data
b33395
diff --git a/tests/10ddf-fail-two-spares.broken b/tests/10ddf-fail-two-spares.broken
b33395
new file mode 100644
b33395
index 00000000..eeea56d9
b33395
--- /dev/null
b33395
+++ b/tests/10ddf-fail-two-spares.broken
b33395
@@ -0,0 +1,5 @@
b33395
+fails infrequently
b33395
+
b33395
+Fails roughly 1 in 3 with error:
b33395
+
b33395
+   ERROR: /dev/md/vol1 should be optimal in meta data
b33395
diff --git a/tests/10ddf-incremental-wrong-order.broken b/tests/10ddf-incremental-wrong-order.broken
b33395
new file mode 100644
b33395
index 00000000..a5af3bab
b33395
--- /dev/null
b33395
+++ b/tests/10ddf-incremental-wrong-order.broken
b33395
@@ -0,0 +1,9 @@
b33395
+always fails
b33395
+
b33395
+Fails with errors:
b33395
+    ERROR: sha1sum of /dev/md/vol0 has changed
b33395
+    ERROR: /dev/md/vol0 has unexpected state on /dev/loop10
b33395
+    ERROR: unexpected number of online disks on /dev/loop10
b33395
+    ERROR: /dev/md/vol0 has unexpected state on /dev/loop8
b33395
+    ERROR: unexpected number of online disks on /dev/loop8
b33395
+    ERROR: sha1sum of /dev/md/vol0 has changed
b33395
diff --git a/tests/14imsm-r1_2d-grow-r1_3d.broken b/tests/14imsm-r1_2d-grow-r1_3d.broken
b33395
new file mode 100644
b33395
index 00000000..4ef1d406
b33395
--- /dev/null
b33395
+++ b/tests/14imsm-r1_2d-grow-r1_3d.broken
b33395
@@ -0,0 +1,5 @@
b33395
+always fails
b33395
+
b33395
+Fails with error:
b33395
+
b33395
+    mdadm/tests/func.sh: line 325: dvsize/chunk: division by 0 (error token is "chunk")
b33395
diff --git a/tests/14imsm-r1_2d-takeover-r0_2d.broken b/tests/14imsm-r1_2d-takeover-r0_2d.broken
b33395
new file mode 100644
b33395
index 00000000..89cd4e57
b33395
--- /dev/null
b33395
+++ b/tests/14imsm-r1_2d-takeover-r0_2d.broken
b33395
@@ -0,0 +1,6 @@
b33395
+always fails
b33395
+
b33395
+Fails with error:
b33395
+
b33395
+    tests/func.sh: line 325: dvsize/chunk: division by 0 (error token
b33395
+		is "chunk")
b33395
diff --git a/tests/18imsm-r10_4d-takeover-r0_2d.broken b/tests/18imsm-r10_4d-takeover-r0_2d.broken
b33395
new file mode 100644
b33395
index 00000000..a27399f5
b33395
--- /dev/null
b33395
+++ b/tests/18imsm-r10_4d-takeover-r0_2d.broken
b33395
@@ -0,0 +1,5 @@
b33395
+fails rarely
b33395
+
b33395
+Fails about 1 run in 100 with message:
b33395
+
b33395
+   ERROR:  size is wrong for /dev/md/vol0: 2 * 5120 (chunk=128) = 20480, not 0
b33395
diff --git a/tests/18imsm-r1_2d-takeover-r0_1d.broken b/tests/18imsm-r1_2d-takeover-r0_1d.broken
b33395
new file mode 100644
b33395
index 00000000..aa1982e6
b33395
--- /dev/null
b33395
+++ b/tests/18imsm-r1_2d-takeover-r0_1d.broken
b33395
@@ -0,0 +1,6 @@
b33395
+always fails
b33395
+
b33395
+Fails with error:
b33395
+
b33395
+    tests/func.sh: line 325: dvsize/chunk: division by 0 (error token
b33395
+			is "chunk")
b33395
diff --git a/tests/19raid6auto-repair.broken b/tests/19raid6auto-repair.broken
b33395
new file mode 100644
b33395
index 00000000..e91a1425
b33395
--- /dev/null
b33395
+++ b/tests/19raid6auto-repair.broken
b33395
@@ -0,0 +1,5 @@
b33395
+always fails
b33395
+
b33395
+Fails with:
b33395
+
b33395
+    "should detect errors"
b33395
diff --git a/tests/19raid6repair.broken b/tests/19raid6repair.broken
b33395
new file mode 100644
b33395
index 00000000..e91a1425
b33395
--- /dev/null
b33395
+++ b/tests/19raid6repair.broken
b33395
@@ -0,0 +1,5 @@
b33395
+always fails
b33395
+
b33395
+Fails with:
b33395
+
b33395
+    "should detect errors"
b33395
-- 
2ad819
2.38.1
b33395