git.openwrt.org Git - openwrt/staging/blogic.git/commit

author	Peter Zijlstra <[email protected]>
	Thu, 14 Jul 2016 18:08:46 +0000 (20:08 +0200)
committer	Ingo Molnar <[email protected]>
	Wed, 10 Aug 2016 12:34:01 +0000 (14:34 +0200)
commit	80127a39681bd68c959f0953f84a830cbd7c3b1c
tree	223bcc2a5cbec5c0873f8fae85a98797f94e6c56	tree \| snapshot
parent	08be8f63c40c030b5cf95b4368e314e563a86301	commit \| diff

locking/percpu-rwsem: Optimize readers and reduce global impact

Currently the percpu-rwsem switches to (global) atomic ops while a
writer is waiting; which could be quite a while and slows down
releasing the readers.

This patch cures this problem by ordering the reader-state vs
reader-count (see the comments in __percpu_down_read() and
percpu_down_write()). This changes a global atomic op into a full
memory barrier, which doesn't have the global cacheline contention.

This also enables using the percpu-rwsem with rcu_sync disabled in order
to bias the implementation differently, reducing the writer latency by
adding some cost to readers.

Signed-off-by: Peter Zijlstra (Intel) <[email protected]>
Reviewed-by: Oleg Nesterov <[email protected]>
Cc: Andrew Morton <[email protected]>
Cc: Linus Torvalds <[email protected]>
Cc: Paul E. McKenney <[email protected]>
Cc: Paul McKenney <[email protected]>
Cc: Peter Zijlstra <[email protected]>
Cc: Thomas Gleixner <[email protected]>
Cc: [email protected]
[ Fixed modular build. ]
Signed-off-by: Ingo Molnar <[email protected]>

include/linux/percpu-rwsem.h		diff \| blob \| history
kernel/locking/percpu-rwsem.c		diff \| blob \| history
kernel/rcu/sync.c		diff \| blob \| history