Blame SOURCES/CVE-2021-3347.patch

1bd474
From 517d5c245c9805b56f73c7fa0e23e8853fe22da6 Mon Sep 17 00:00:00 2001
1bd474
From: Artem Savkov <asavkov@redhat.com>
1bd474
Date: Fri, 21 May 2021 14:20:32 +0200
1bd474
Subject: [RHEL7.9 KPATCH] CVE-2021-3347 Use after free via PI futex state
1bd474
1bd474
Kernels:
1bd474
3.10.0-1160.el7
1bd474
3.10.0-1160.2.1.el7
1bd474
3.10.0-1160.2.2.el7
1bd474
3.10.0-1160.6.1.el7
1bd474
3.10.0-1160.11.1.el7
1bd474
3.10.0-1160.15.2.el7
1bd474
3.10.0-1160.21.1.el7
1bd474
3.10.0-1160.24.1.el7
1bd474
3.10.0-1160.25.1.el7
1bd474
1bd474
Changes since last build:
1bd474
[x86_64]:
1bd474
futex.o: changed function: do_futex
1bd474
futex.o: changed function: fixup_owner
1bd474
futex.o: changed function: fixup_pi_state_owner.isra.16
1bd474
futex.o: changed function: free_pi_state
1bd474
futex.o: changed function: futex_lock_pi.isra.20
1bd474
futex.o: changed function: futex_wait_requeue_pi.constprop.22
1bd474
futex.o: new function: pi_state_update_owner
1bd474
1bd474
[ppc64le]:
1bd474
futex.o: changed function: do_futex
1bd474
futex.o: changed function: fixup_owner
1bd474
futex.o: changed function: fixup_pi_state_owner.isra.9
1bd474
futex.o: changed function: free_pi_state
1bd474
futex.o: changed function: futex_lock_pi.isra.16
1bd474
futex.o: changed function: futex_wait_requeue_pi.constprop.17
1bd474
futex.o: changed function: unqueue_me_pi
1bd474
futex.o: new function: pi_state_update_owner
1bd474
1bd474
---------------------------
1bd474
1bd474
Modifications: added -fno-optimize-sibling-calls to fixup_owner()
1bd474
1bd474
commit d2fb2a9cf682bdba4b66103fb079c13a04039430
1bd474
Author: Donghai Qiao <dqiao@redhat.com>
1bd474
Date:   Thu May 20 16:35:49 2021 -0400
1bd474
1bd474
    futex: Handle faults correctly for PI futexes
1bd474
1bd474
    Bugzilla: https://bugzilla.redhat.com/show_bug.cgi?id=1935108
1bd474
    Upstream status: 34b1a1ce1458f50ef27c54e28eb9b1947012907a
1bd474
    CVE: CVE-2021-3347
1bd474
1bd474
    Conflicts:
1bd474
    The original patch is intent to make the state of rtmutex and pi_state consistent
1bd474
    if the kernel is unable to update the user space futex word, rather than unlocking
1bd474
    the rtmutex and leaving pi_state out of synched. As a result, this original fix
1bd474
    removed part of the code which was introduced by 16ffa12d7 ("futex: Pull
1bd474
    rt_mutex_futex_unlock() out from under hb->lock") to the functions futex_lock_pi()
1bd474
    and futex_wait_requeue_pi() to avoid the inconsistency. So the conflicts are related
1bd474
    to the following two commits, though git blame displayed a much longer list which
1bd474
    shows the chain of dependency in the history.
1bd474
1bd474
    16ffa12d7425 ("futex: Pull rt_mutex_futex_unlock() out from under hb->lock")
1bd474
    c236c8e95a3d ("futex: Fix potential use-after-free in FUTEX_REQUEUE_PI")
1bd474
1bd474
    commit 34b1a1ce1458f50ef27c54e28eb9b1947012907a
1bd474
    Author: Thomas Gleixner <tglx@linutronix.de>
1bd474
    Date:   Mon, 18 Jan 2021 19:01:21 +0100
1bd474
1bd474
        futex: Handle faults correctly for PI futexes
1bd474
1bd474
        fixup_pi_state_owner() tries to ensure that the state of the rtmutex,
1bd474
        pi_state and the user space value related to the PI futex are consistent
1bd474
        before returning to user space. In case that the user space value update
1bd474
        faults and the fault cannot be resolved by faulting the page in via
1bd474
        fault_in_user_writeable() the function returns with -EFAULT and leaves
1bd474
        the rtmutex and pi_state owner state inconsistent.
1bd474
1bd474
        A subsequent futex_unlock_pi() operates on the inconsistent pi_state and
1bd474
        releases the rtmutex despite not owning it which can corrupt the RB tree of
1bd474
        the rtmutex and cause a subsequent kernel stack use after free.
1bd474
1bd474
        It was suggested to loop forever in fixup_pi_state_owner() if the fault
1bd474
        cannot be resolved, but that results in runaway tasks which is especially
1bd474
        undesired when the problem happens due to a programming error and not due
1bd474
        to malice.
1bd474
1bd474
        As the user space value cannot be fixed up, the proper solution is to make
1bd474
        the rtmutex and the pi_state consistent so both have the same owner. This
1bd474
        leaves the user space value out of sync. Any subsequent operation on the
1bd474
        futex will fail because the 10th rule of PI futexes (pi_state owner and
1bd474
        user space value are consistent) has been violated.
1bd474
1bd474
        As a consequence this removes the inept attempts of 'fixing' the situation
1bd474
        in case that the current task owns the rtmutex when returning with an
1bd474
        unresolvable fault by unlocking the rtmutex which left pi_state::owner and
1bd474
        rtmutex::owner out of sync in a different and only slightly less dangerous
1bd474
        way.
1bd474
1bd474
        Fixes: 1b7558e457ed ("futexes: fix fault handling in futex_lock_pi")
1bd474
        Reported-by: gzobqq@gmail.com
1bd474
        Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
1bd474
        Acked-by: Peter Zijlstra (Intel) <peterz@infradead.org>
1bd474
        Cc: stable@vger.kernel.org
1bd474
1bd474
    Signed-off-by: Donghai Qiao <dqiao@redhat.com>
1bd474
1bd474
commit 25077b49b47c1cdf224b54c837172ff820e8be88
1bd474
Author: Donghai Qiao <dqiao@redhat.com>
1bd474
Date:   Thu May 20 16:30:16 2021 -0400
1bd474
1bd474
    futex: Provide and use pi_state_update_owner()
1bd474
1bd474
    Bugzilla: https://bugzilla.redhat.com/show_bug.cgi?id=1935108
1bd474
    Upstream status: c5cade200ab9a2a3be9e7f32a752c8d86b502ec7
1bd474
    CVE: CVE-2021-3347
1bd474
1bd474
    Conflicts:
1bd474
    Updating the owner of pi_state requires that we remove the pi_state structure from
1bd474
    the old owner's pi_state_list then add it to the new owner's pi_state_list. Because
1bd474
    this action takes place in multiple occassions in the current upstream futex.c, so
1bd474
    the similar code is duplicated in all these places. The purpose of this patch is to
1bd474
    eliminate these code duplications with a new routine pi_state_update_owner().
1bd474
1bd474
    The conflicts in 7.9.z are caused by the differences in places where updating owner
1bd474
    takes place. After sorting out the details, the relevant commit IDs as below :
1bd474
1bd474
    734009e96d19 ("futex: Change locking rules")
1bd474
    b4abf91047cf ("rtmutex: Make wait_lock irq safe")
1bd474
1bd474
    commit c5cade200ab9a2a3be9e7f32a752c8d86b502ec7
1bd474
    Author: Thomas Gleixner <tglx@linutronix.de>
1bd474
    Date:   Tue, 19 Jan 2021 15:21:35 +0100
1bd474
1bd474
        futex: Provide and use pi_state_update_owner()
1bd474
1bd474
        Updating pi_state::owner is done at several places with the same
1bd474
        code. Provide a function for it and use that at the obvious places.
1bd474
1bd474
        This is also a preparation for a bug fix to avoid yet another copy of the
1bd474
        same code or alternatively introducing a completely unpenetratable mess of
1bd474
        gotos.
1bd474
1bd474
        Originally-by: Peter Zijlstra <peterz@infradead.org>
1bd474
        Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
1bd474
        Acked-by: Peter Zijlstra (Intel) <peterz@infradead.org>
1bd474
        Cc: stable@vger.kernel.org
1bd474
1bd474
    Signed-off-by: Donghai Qiao <dqiao@redhat.com>
1bd474
1bd474
commit 69414a50f8bad2063b89981110fb374733209d9d
1bd474
Author: Donghai Qiao <dqiao@redhat.com>
1bd474
Date:   Wed May 19 14:24:04 2021 -0400
1bd474
1bd474
    futex: Replace pointless printk in fixup_owner()
1bd474
1bd474
    Bugzilla: https://bugzilla.redhat.com/show_bug.cgi?id=1935108
1bd474
    Upstream status: 04b79c55201f02ffd675e1231d731365e335c307
1bd474
    CVE: CVE-2021-3347
1bd474
1bd474
    commit 04b79c55201f02ffd675e1231d731365e335c307
1bd474
    Author: Thomas Gleixner <tglx@linutronix.de>
1bd474
    Date:   Tue, 19 Jan 2021 16:06:10 +0100
1bd474
1bd474
        futex: Replace pointless printk in fixup_owner()
1bd474
1bd474
        If that unexpected case of inconsistent arguments ever happens then the
1bd474
        futex state is left completely inconsistent and the printk is not really
1bd474
        helpful. Replace it with a warning and make the state consistent.
1bd474
1bd474
        Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
1bd474
        Acked-by: Peter Zijlstra (Intel) <peterz@infradead.org>
1bd474
        Cc: stable@vger.kernel.org
1bd474
1bd474
    Signed-off-by: Donghai Qiao <dqiao@redhat.com>
1bd474
1bd474
commit 7e96fb06469c95628039ead2591f82e88af5da10
1bd474
Author: Donghai Qiao <dqiao@redhat.com>
1bd474
Date:   Wed May 19 14:19:05 2021 -0400
1bd474
1bd474
    futex: Ensure the correct return value from futex_lock_pi()
1bd474
1bd474
    Bugzilla: https://bugzilla.redhat.com/show_bug.cgi?id=1935108
1bd474
    Upstream status: 12bb3f7f1b03d5913b3f9d4236a488aa7774dfe9
1bd474
    CVE: CVE-2021-3347
1bd474
1bd474
    Conflicts:
1bd474
    This original upstream patch relies heavily on c1e2f0eaf015 ("futex: Avoid
1bd474
    violating the 10th rule of futex") which is one of the upstream commits listed
1bd474
    below. But the backport for c1e2f0eaf015 requires we resolve very complex chain
1bd474
    of dependencies across multiple critical kernel source files therefore the risk
1bd474
    is considered too high for 7.9.z.
1bd474
1bd474
    Instead of pulling together tons of the relevant commits in to 7.9.z, we just
1bd474
    want to take a light risk approach by digesting the fix 12bb3f7f1b03 ("futex:
1bd474
    Ensure the correct return value from futex_lock_pi()") for 7.9.z. All we need
1bd474
    to do is to make the changed functions fixup_owner() and fixup_pi_state_owner()
1bd474
    of 7.9.z return the required values as this upstream fix suggests in every
1bd474
    circumstance. This way, we can cleanly cut this CVE patch set with merely four
1bd474
    patches, without having to backport tons of patches in the chain of dependency.
1bd474
1bd474
    Besides, an extra change made to fixup_owner() (see HUNK -2063,13 +2062,11 in
1bd474
    this backport patch) is to eliminate a mistake made by upstream, where the
1bd474
    specification of a local variable "ret" was removed from that function, but
1bd474
    there was still a dereference to "ret" as shown by that HUNK.
1bd474
1bd474
    16ffa12d7425 ("futex: Pull rt_mutex_futex_unlock() out from under hb->lock")
1bd474
    c1e2f0eaf015 ("futex: Avoid violating the 10th rule of futex")
1bd474
    734009e96d19 ("futex: Change locking rules")
1bd474
    d7c5ed73b19c ("futex: Remove needless goto's")
1bd474
    6b4f4bc9cb22 ("locking/futex: Allow low-level atomic operations to return -EAGAIN")
1bd474
1bd474
    commit 12bb3f7f1b03d5913b3f9d4236a488aa7774dfe9
1bd474
    Author: Thomas Gleixner <tglx@linutronix.de>
1bd474
    Date:   Wed, 20 Jan 2021 16:00:24 +0100
1bd474
1bd474
        futex: Ensure the correct return value from futex_lock_pi()
1bd474
1bd474
        In case that futex_lock_pi() was aborted by a signal or a timeout and the
1bd474
        task returned without acquiring the rtmutex, but is the designated owner of
1bd474
        the futex due to a concurrent futex_unlock_pi() fixup_owner() is invoked to
1bd474
        establish consistent state. In that case it invokes fixup_pi_state_owner()
1bd474
        which in turn tries to acquire the rtmutex again. If that succeeds then it
1bd474
        does not propagate this success to fixup_owner() and futex_lock_pi()
1bd474
        returns -EINTR or -ETIMEOUT despite having the futex locked.
1bd474
1bd474
        Return success from fixup_pi_state_owner() in all cases where the current
1bd474
        task owns the rtmutex and therefore the futex and propagate it correctly
1bd474
        through fixup_owner(). Fixup the other callsite which does not expect a
1bd474
        positive return value.
1bd474
1bd474
        Fixes: c1e2f0eaf015 ("futex: Avoid violating the 10th rule of futex")
1bd474
        Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
1bd474
        Acked-by: Peter Zijlstra (Intel) <peterz@infradead.org>
1bd474
        Cc: stable@vger.kernel.org
1bd474
1bd474
    Signed-off-by: Donghai Qiao <dqiao@redhat.com>
1bd474
1bd474
Signed-off-by: Artem Savkov <asavkov@redhat.com>
1bd474
Acked-by: Joe Lawrence <joe.lawrence@redhat.com>
1bd474
Acked-by: Yannick Cote <ycote@redhat.com>
1bd474
---
1bd474
 kernel/futex.c | 123 +++++++++++++++++++++++++------------------------
1bd474
 1 file changed, 63 insertions(+), 60 deletions(-)
1bd474
1bd474
diff --git a/kernel/futex.c b/kernel/futex.c
1bd474
index 877831775d7aa..8ec57c357ca58 100644
1bd474
--- a/kernel/futex.c
1bd474
+++ b/kernel/futex.c
1bd474
@@ -640,6 +640,29 @@ static struct futex_pi_state * alloc_pi_state(void)
1bd474
 	return pi_state;
1bd474
 }
1bd474
 
1bd474
+static void pi_state_update_owner(struct futex_pi_state *pi_state,
1bd474
+				  struct task_struct *new_owner)
1bd474
+{
1bd474
+	struct task_struct *old_owner = pi_state->owner;
1bd474
+
1bd474
+	lockdep_assert_held(&pi_state->pi_mutex.wait_lock);
1bd474
+
1bd474
+	if (old_owner) {
1bd474
+		raw_spin_lock_irq(&old_owner->pi_lock);
1bd474
+		WARN_ON(list_empty(&pi_state->list));
1bd474
+		list_del_init(&pi_state->list);
1bd474
+		raw_spin_unlock_irq(&old_owner->pi_lock);
1bd474
+	}
1bd474
+
1bd474
+	if (new_owner) {
1bd474
+		raw_spin_lock_irq(&new_owner->pi_lock);
1bd474
+		WARN_ON(!list_empty(&pi_state->list));
1bd474
+		list_add(&pi_state->list, &new_owner->pi_state_list);
1bd474
+		pi_state->owner = new_owner;
1bd474
+		raw_spin_unlock_irq(&new_owner->pi_lock);
1bd474
+	}
1bd474
+}
1bd474
+
1bd474
 static void free_pi_state(struct futex_pi_state *pi_state)
1bd474
 {
1bd474
 	if (!atomic_dec_and_test(&pi_state->refcount))
1bd474
@@ -650,10 +673,7 @@ static void free_pi_state(struct futex_pi_state *pi_state)
1bd474
 	 * and has cleaned up the pi_state already
1bd474
 	 */
1bd474
 	if (pi_state->owner) {
1bd474
-		raw_spin_lock_irq(&pi_state->owner->pi_lock);
1bd474
-		list_del_init(&pi_state->list);
1bd474
-		raw_spin_unlock_irq(&pi_state->owner->pi_lock);
1bd474
-
1bd474
+		pi_state_update_owner(pi_state, NULL);
1bd474
 		rt_mutex_proxy_unlock(&pi_state->pi_mutex, pi_state->owner);
1bd474
 	}
1bd474
 
1bd474
@@ -791,7 +811,8 @@ void exit_pi_state_list(struct task_struct *curr)
1bd474
  *	FUTEX_OWNER_DIED bit. See [4]
1bd474
  *
1bd474
  * [10] There is no transient state which leaves owner and user space
1bd474
- *	TID out of sync.
1bd474
+ *	TID out of sync. Except one error case where the kernel is denied
1bd474
+ *	write access to the user address, see fixup_pi_state_owner().
1bd474
  */
1bd474
 static int
1bd474
 lookup_pi_state(u32 uval, struct futex_hash_bucket *hb,
1bd474
@@ -1168,16 +1189,7 @@ static int wake_futex_pi(u32 __user *uaddr, u32 uval, struct futex_q *this)
1bd474
 		return ret;
1bd474
 	}
1bd474
 
1bd474
-	raw_spin_lock_irq(&pi_state->owner->pi_lock);
1bd474
-	WARN_ON(list_empty(&pi_state->list));
1bd474
-	list_del_init(&pi_state->list);
1bd474
-	raw_spin_unlock_irq(&pi_state->owner->pi_lock);
1bd474
-
1bd474
-	raw_spin_lock_irq(&new_owner->pi_lock);
1bd474
-	WARN_ON(!list_empty(&pi_state->list));
1bd474
-	list_add(&pi_state->list, &new_owner->pi_state_list);
1bd474
-	pi_state->owner = new_owner;
1bd474
-	raw_spin_unlock_irq(&new_owner->pi_lock);
1bd474
+	pi_state_update_owner(pi_state, new_owner);
1bd474
 
1bd474
 	raw_spin_unlock(&pi_state->pi_mutex.wait_lock);
1bd474
 	rt_mutex_unlock(&pi_state->pi_mutex);
1bd474
@@ -1953,20 +1965,9 @@ retry:
1bd474
 	 * We fixed up user space. Now we need to fix the pi_state
1bd474
 	 * itself.
1bd474
 	 */
1bd474
-	if (pi_state->owner != NULL) {
1bd474
-		raw_spin_lock_irq(&pi_state->owner->pi_lock);
1bd474
-		WARN_ON(list_empty(&pi_state->list));
1bd474
-		list_del_init(&pi_state->list);
1bd474
-		raw_spin_unlock_irq(&pi_state->owner->pi_lock);
1bd474
-	}
1bd474
+	pi_state_update_owner(pi_state, newowner);
1bd474
 
1bd474
-	pi_state->owner = newowner;
1bd474
-
1bd474
-	raw_spin_lock_irq(&newowner->pi_lock);
1bd474
-	WARN_ON(!list_empty(&pi_state->list));
1bd474
-	list_add(&pi_state->list, &newowner->pi_state_list);
1bd474
-	raw_spin_unlock_irq(&newowner->pi_lock);
1bd474
-	return 0;
1bd474
+	return newowner == current;
1bd474
 
1bd474
 	/*
1bd474
 	 * To handle the page fault we need to drop the hash bucket
1bd474
@@ -1989,10 +1990,26 @@ handle_fault:
1bd474
 	 * Check if someone else fixed it for us:
1bd474
 	 */
1bd474
 	if (pi_state->owner != oldowner)
1bd474
-		return 0;
1bd474
+		return newowner == current;
1bd474
+
1bd474
+	if (ret) {
1bd474
+		/*
1bd474
+		 * fault_in_user_writeable() failed so user state is immutable. At
1bd474
+		 * best we can make the kernel state consistent but user state will
1bd474
+		 * be most likely hosed and any subsequent unlock operation will be
1bd474
+		 * rejected due to PI futex rule [10].
1bd474
+		 *
1bd474
+		 * Ensure that the rtmutex owner is also the pi_state owner despite
1bd474
+		 * the user space value claiming something different. There is no
1bd474
+		 * point in unlocking the rtmutex if current is the owner as it
1bd474
+		 * would need to wait until the next waiter has taken the rtmutex
1bd474
+		 * to guarantee consistent state. Keep it simple. Userspace asked
1bd474
+		 * for this wreckaged state.
1bd474
+		 */
1bd474
+		pi_state_update_owner(pi_state, rt_mutex_owner(&pi_state->pi_mutex));
1bd474
 
1bd474
-	if (ret)
1bd474
 		return ret;
1bd474
+	}
1bd474
 
1bd474
 	goto retry;
1bd474
 }
1bd474
@@ -2014,10 +2031,10 @@ static long futex_wait_restart(struct restart_block *restart);
1bd474
  *  0 - success, lock not taken;
1bd474
  * <0 - on error (-EFAULT)
1bd474
  */
1bd474
+__attribute__((optimize("-fno-optimize-sibling-calls")))
1bd474
 static int fixup_owner(u32 __user *uaddr, struct futex_q *q, int locked)
1bd474
 {
1bd474
 	struct task_struct *owner;
1bd474
-	int ret = 0;
1bd474
 
1bd474
 	if (locked) {
1bd474
 		/*
1bd474
@@ -2025,8 +2042,8 @@ static int fixup_owner(u32 __user *uaddr, struct futex_q *q, int locked)
1bd474
 		 * did a lock-steal - fix up the PI-state in that case:
1bd474
 		 */
1bd474
 		if (q->pi_state->owner != current)
1bd474
-			ret = fixup_pi_state_owner(uaddr, q, current);
1bd474
-		goto out;
1bd474
+			return fixup_pi_state_owner(uaddr, q, current);
1bd474
+		return 1;
1bd474
 	}
1bd474
 
1bd474
 	/*
1bd474
@@ -2040,8 +2057,7 @@ static int fixup_owner(u32 __user *uaddr, struct futex_q *q, int locked)
1bd474
 		 * rt_mutex waiters list.
1bd474
 		 */
1bd474
 		if (rt_mutex_trylock(&q->pi_state->pi_mutex)) {
1bd474
-			locked = 1;
1bd474
-			goto out;
1bd474
+			return 1;
1bd474
 		}
1bd474
 
1bd474
 		/*
1bd474
@@ -2054,22 +2070,18 @@ static int fixup_owner(u32 __user *uaddr, struct futex_q *q, int locked)
1bd474
 		if (!owner)
1bd474
 			owner = rt_mutex_next_owner(&q->pi_state->pi_mutex);
1bd474
 		raw_spin_unlock(&q->pi_state->pi_mutex.wait_lock);
1bd474
-		ret = fixup_pi_state_owner(uaddr, q, owner);
1bd474
-		goto out;
1bd474
+
1bd474
+		return fixup_pi_state_owner(uaddr, q, owner);
1bd474
 	}
1bd474
 
1bd474
 	/*
1bd474
 	 * Paranoia check. If we did not take the lock, then we should not be
1bd474
-	 * the owner of the rt_mutex.
1bd474
+	 * the owner of the rt_mutex. Warn and establish consistent state.
1bd474
 	 */
1bd474
-	if (rt_mutex_owner(&q->pi_state->pi_mutex) == current)
1bd474
-		printk(KERN_ERR "fixup_owner: ret = %d pi-mutex: %p "
1bd474
-				"pi-state %p\n", ret,
1bd474
-				q->pi_state->pi_mutex.owner,
1bd474
-				q->pi_state->owner);
1bd474
+	if (WARN_ON_ONCE(rt_mutex_owner(&q->pi_state->pi_mutex) == current))
1bd474
+		return fixup_pi_state_owner(uaddr, q, current);
1bd474
 
1bd474
-out:
1bd474
-	return ret ? ret : locked;
1bd474
+	return 0;
1bd474
 }
1bd474
 
1bd474
 /**
1bd474
@@ -2363,13 +2375,6 @@ retry_private:
1bd474
 	if (res)
1bd474
 		ret = (res < 0) ? res : 0;
1bd474
 
1bd474
-	/*
1bd474
-	 * If fixup_owner() faulted and was unable to handle the fault, unlock
1bd474
-	 * it and return the fault to userspace.
1bd474
-	 */
1bd474
-	if (ret && (rt_mutex_owner(&q.pi_state->pi_mutex) == current))
1bd474
-		rt_mutex_unlock(&q.pi_state->pi_mutex);
1bd474
-
1bd474
 	/* Unqueue and drop the lock */
1bd474
 	unqueue_me_pi(&q);
1bd474
 
1bd474
@@ -2666,6 +2671,11 @@ static int futex_wait_requeue_pi(u32 __user *uaddr, unsigned int flags,
1bd474
 			spin_lock(q.lock_ptr);
1bd474
 			ret = fixup_pi_state_owner(uaddr2, &q, current);
1bd474
 			spin_unlock(q.lock_ptr);
1bd474
+			/*
1bd474
+			 * Adjust the return value. It's either -EFAULT or
1bd474
+			 * success (1) but the caller expects 0 for success.
1bd474
+			 */
1bd474
+			ret = ret < 0 ? ret : 0;
1bd474
 		}
1bd474
 	} else {
1bd474
 		/*
1bd474
@@ -2695,14 +2705,7 @@ static int futex_wait_requeue_pi(u32 __user *uaddr, unsigned int flags,
1bd474
 		unqueue_me_pi(&q);
1bd474
 	}
1bd474
 
1bd474
-	/*
1bd474
-	 * If fixup_pi_state_owner() faulted and was unable to handle the
1bd474
-	 * fault, unlock the rt_mutex and return the fault to userspace.
1bd474
-	 */
1bd474
-	if (ret == -EFAULT) {
1bd474
-		if (pi_mutex && rt_mutex_owner(pi_mutex) == current)
1bd474
-			rt_mutex_unlock(pi_mutex);
1bd474
-	} else if (ret == -EINTR) {
1bd474
+	if (ret == -EINTR) {
1bd474
 		/*
1bd474
 		 * We've already been requeued, but cannot restart by calling
1bd474
 		 * futex_lock_pi() directly. We could restart this syscall, but
1bd474
-- 
1bd474
2.26.3
1bd474