IB/ipath: NMI cpu lockup if local loopback used
authorRalph Campbell <[email protected]>
Thu, 15 Mar 2007 21:44:50 +0000 (14:44 -0700)
committerRoland Dreier <[email protected]>
Thu, 19 Apr 2007 03:20:54 +0000 (20:20 -0700)
If a post send is done in loopback and there is no receive queue
entry, the sending QP is put on a timeout list for a while so the
receiver has a chance to post a receive buffer. If the another post
send is done, the code incorrectly tried to put the QP on the timeout
list again an corrupted the timeout list. This eventually leads to a
spin lock deadlock NMI due to the timer function looping forever with
the lock held.

Signed-off-by: Bryan O'Sullivan <[email protected]>
Signed-off-by: Roland Dreier <[email protected]>
drivers/infiniband/hw/ipath/ipath_ruc.c

index 146db8516997d78d59162cbb0c5e9d352937f536..cda84933bb4395b8c76e9c4c3bdbefcb30b6d80b 100644 (file)
@@ -265,7 +265,8 @@ static void ipath_ruc_loopback(struct ipath_qp *sqp)
 again:
        spin_lock_irqsave(&sqp->s_lock, flags);
 
-       if (!(ib_ipath_state_ops[sqp->state] & IPATH_PROCESS_SEND_OK)) {
+       if (!(ib_ipath_state_ops[sqp->state] & IPATH_PROCESS_SEND_OK) ||
+           qp->s_rnr_timeout) {
                spin_unlock_irqrestore(&sqp->s_lock, flags);
                goto done;
        }