<feed xmlns='http://www.w3.org/2005/Atom'>
<title>linux-toradex.git/arch/arm/include/asm/spinlock.h, branch v4.3-rc2</title>
<subtitle>Linux kernel for Apalis and Colibri modules</subtitle>
<link rel='alternate' type='text/html' href='https://git.toradex.cn/cgit/linux-toradex.git/'/>
<entry>
<title>arm/spinlock: Replace ACCESS_ONCE with READ_ONCE</title>
<updated>2014-12-18T08:54:40+00:00</updated>
<author>
<name>Christian Borntraeger</name>
<email>borntraeger@de.ibm.com</email>
</author>
<published>2014-11-25T10:44:26+00:00</published>
<link rel='alternate' type='text/html' href='https://git.toradex.cn/cgit/linux-toradex.git/commit/?id=488beef1440e845751365202faace2465840ea98'/>
<id>488beef1440e845751365202faace2465840ea98</id>
<content type='text'>
ACCESS_ONCE does not work reliably on non-scalar types. For
example, gcc 4.6 and 4.7 may remove the volatile qualifier from such
accesses during the SRA (scalar replacement of aggregates) step
(https://gcc.gnu.org/bugzilla/show_bug.cgi?id=58145).

Change the spinlock code to replace ACCESS_ONCE with READ_ONCE.
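
For illustration, simplified forms of the two macros (a sketch; the
kernel's actual READ_ONCE in include/linux/compiler.h is more general):

	/* Old: cast the lvalue to volatile in place. For non-scalar x,
	 * gcc 4.6/4.7 SRA may drop the qualifier (GCC PR 58145). */
	#define ACCESS_ONCE(x) (*(volatile typeof(x) *)&amp;(x))

	/* New, simplified to the scalar case used here: copy the value
	 * out through a volatile access. (The real READ_ONCE dispatches
	 * on sizeof(x) so that small aggregates also stay volatile.) */
	#define READ_ONCE(x) \
		({ typeof(x) __v = *(volatile typeof(x) *)&amp;(x); __v; })

	owner = READ_ONCE(lock-&gt;tickets.owner);	/* was ACCESS_ONCE */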

Signed-off-by: Christian Borntraeger &lt;borntraeger@de.ibm.com&gt;
Acked-by: Paul E. McKenney &lt;paulmck@linux.vnet.ibm.com&gt;
</content>
</entry>
<entry>
<title>ARM: 7955/1: spinlock: ensure we have a compiler barrier before sev</title>
<updated>2014-02-10T11:44:50+00:00</updated>
<author>
<name>Will Deacon</name>
<email>will.deacon@arm.com</email>
</author>
<published>2014-02-07T18:12:32+00:00</published>
<link rel='alternate' type='text/html' href='https://git.toradex.cn/cgit/linux-toradex.git/commit/?id=7c8746a9eb287642deaad0e7c2cdf482dce5e4be'/>
<id>7c8746a9eb287642deaad0e7c2cdf482dce5e4be</id>
<content type='text'>
When unlocking a spinlock, we require the following, strictly ordered
sequence of events:

	&lt;barrier&gt;	/* dmb */
	&lt;unlock&gt;
	&lt;barrier&gt;	/* dsb */
	&lt;sev&gt;

Whilst the code does indeed reflect this in terms of the architecture,
the final &lt;barrier&gt; + &lt;sev&gt; have been contracted into a single inline
asm without a "memory" clobber, therefore the compiler is at liberty to
reorder the unlock to the end of the above sequence. In such a case,
a waiting CPU may be woken up before the lock has been unlocked, leading
to extremely poor performance.

This patch reworks the dsb_sev() function to make use of the dsb()
macro and ensure ordering against the unlock.
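
As a sketch, the reworked helper (assuming dsb() expands to a barrier
asm carrying a "memory" clobber, as in asm/barrier.h):

	static inline void dsb_sev(void)
	{
		dsb(ishst);	/* "memory" clobber orders the unlock store */
		__asm__(SEV);	/* signal waiters only after the dsb */
	}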

Cc: &lt;stable@vger.kernel.org&gt;
Reported-by: Mark Rutland &lt;mark.rutland@arm.com&gt;
Signed-off-by: Will Deacon &lt;will.deacon@arm.com&gt;
Signed-off-by: Russell King &lt;rmk+kernel@arm.linux.org.uk&gt;
</content>
</entry>
<entry>
<title>Merge branch 'devel-stable' into for-next</title>
<updated>2013-11-12T10:58:59+00:00</updated>
<author>
<name>Russell King</name>
<email>rmk+kernel@arm.linux.org.uk</email>
</author>
<published>2013-11-12T10:58:59+00:00</published>
<link rel='alternate' type='text/html' href='https://git.toradex.cn/cgit/linux-toradex.git/commit/?id=df762eccbadf87850fbee444d729e0f1b1e946f1'/>
<id>df762eccbadf87850fbee444d729e0f1b1e946f1</id>
<content type='text'>
Conflicts:
	arch/arm/include/asm/atomic.h
	arch/arm/include/asm/hardirq.h
	arch/arm/kernel/smp.c
</content>
</entry>
<entry>
<title>ARM: 7854/1: lockref: add support for lockless lockrefs using cmpxchg64</title>
<updated>2013-10-29T11:06:11+00:00</updated>
<author>
<name>Will Deacon</name>
<email>will.deacon@arm.com</email>
</author>
<published>2013-10-09T16:19:22+00:00</published>
<link rel='alternate' type='text/html' href='https://git.toradex.cn/cgit/linux-toradex.git/commit/?id=0cbad9c9dfe0c38e8ec7385b39087c005a6dee3e'/>
<id>0cbad9c9dfe0c38e8ec7385b39087c005a6dee3e</id>
<content type='text'>
Our spinlocks are only 32-bit (2x16-bit tickets) and, on processors
with 64-bit atomic instructions, cmpxchg64 makes use of the double-word
exclusive accessors.

This patch wires up the cmpxchg-based lockless lockref implementation
for ARM.
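
The architecture hook this relies on is small; a sketch of the helper
added to spinlock.h (the Kconfig wiring is omitted here):

	static inline int arch_spin_value_unlocked(arch_spinlock_t lock)
	{
		/* A ticket lock is free when the owner has caught up
		 * with the next ticket; lockref reads lock and count
		 * together and retries via a 64-bit cmpxchg. */
		return lock.tickets.owner == lock.tickets.next;
	}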

Signed-off-by: Will Deacon &lt;will.deacon@arm.com&gt;
Signed-off-by: Russell King &lt;rmk+kernel@arm.linux.org.uk&gt;
</content>
</entry>
<entry>
<title>ARM: locks: prefetch the destination word for write prior to strex</title>
<updated>2013-09-30T15:42:55+00:00</updated>
<author>
<name>Will Deacon</name>
<email>will.deacon@arm.com</email>
</author>
<published>2013-07-02T13:54:33+00:00</published>
<link rel='alternate' type='text/html' href='https://git.toradex.cn/cgit/linux-toradex.git/commit/?id=9bb17be062de6f5a9c9643258951aa0935652ec3'/>
<id>9bb17be062de6f5a9c9643258951aa0935652ec3</id>
<content type='text'>
The cost of changing a cacheline from shared to exclusive state can be
significant, especially when this is triggered by an exclusive store,
since it may result in having to retry the transaction.

This patch prefixes our {spin,read,write}_[try]lock implementations with
pldw instructions (on CPUs which support them) to try and grab the line
in exclusive state from the start. arch_rwlock_t is changed to avoid
using a volatile member, since this generates compiler warnings when
falling back on the __builtin_prefetch intrinsic which expects a const
void * argument.
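
For example, the lock paths now start with a prefetch-for-write;
abridged from the patched arch_spin_lock (the ldrex/strex ticket loop
itself is unchanged):

	static inline void arch_spin_lock(arch_spinlock_t *lock)
	{
		prefetchw(&amp;lock-&gt;slock);	/* pldw on CPUs that have it */
		/* ... ldrex/strex ticket loop as before ... */
	}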

Acked-by: Nicolas Pitre &lt;nico@linaro.org&gt;
Signed-off-by: Will Deacon &lt;will.deacon@arm.com&gt;
</content>
</entry>
<entry>
<title>ARM: smp_on_up: move inline asm ALT_SMP patching macro out of spinlock.h</title>
<updated>2013-09-30T15:42:55+00:00</updated>
<author>
<name>Will Deacon</name>
<email>will.deacon@arm.com</email>
</author>
<published>2013-07-02T11:10:42+00:00</published>
<link rel='alternate' type='text/html' href='https://git.toradex.cn/cgit/linux-toradex.git/commit/?id=27a84793e42084392181ef2ef51a954f1cf0c519'/>
<id>27a84793e42084392181ef2ef51a954f1cf0c519</id>
<content type='text'>
Patching UP/SMP alternatives inside inline assembly blocks is useful
outside of the spinlock implementation, where it is used for sev and wfe.

This patch lifts the macro into processor.h and gives it a scarier name
to (a) avoid conflicts in the global namespace and (b) to try and deter
its usage unless you "know what you're doing". The W macro for generating
wide instructions when targeting Thumb-2 is also made available under
the name WASM, to reduce the potential for conflicts with other headers.
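
A sketch of the lifted macro and its use for sev (abridged, as best
reconstructed from processor.h and spinlock.h after this change):

	#define __ALT_SMP_ASM(smp, up)					\
		"9998:	" smp "\n"					\
		"	.pushsection \".alt.smp.init\", \"a\"\n"	\
		"	.long	9998b\n"				\
		"	" up "\n"					\
		"	.popsection\n"

	/* Patched to a (wide) nop when the kernel runs UP: */
	#define SEV	__ALT_SMP_ASM(WASM(sev), WASM(nop))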

Acked-by: Nicolas Pitre &lt;nico@linaro.org&gt;
Signed-off-by: Will Deacon &lt;will.deacon@arm.com&gt;
</content>
</entry>
<entry>
<title>Merge branches 'debug-choice', 'devel-stable' and 'misc' into for-linus</title>
<updated>2013-09-05T09:34:15+00:00</updated>
<author>
<name>Russell King</name>
<email>rmk+kernel@arm.linux.org.uk</email>
</author>
<published>2013-09-05T09:34:15+00:00</published>
<link rel='alternate' type='text/html' href='https://git.toradex.cn/cgit/linux-toradex.git/commit/?id=141b97433d77e39ac3ac111a7b3852192035259c'/>
<id>141b97433d77e39ac3ac111a7b3852192035259c</id>
<content type='text'>
</content>
</entry>
<entry>
<title>ARM: 7812/1: rwlocks: retry trylock operation if strex fails on free lock</title>
<updated>2013-08-13T19:22:44+00:00</updated>
<author>
<name>Will Deacon</name>
<email>will.deacon@arm.com</email>
</author>
<published>2013-08-12T17:04:05+00:00</published>
<link rel='alternate' type='text/html' href='https://git.toradex.cn/cgit/linux-toradex.git/commit/?id=00efaa0250939dc148e2d3104fb3c18395d24a2d'/>
<id>00efaa0250939dc148e2d3104fb3c18395d24a2d</id>
<content type='text'>
Commit 15e7e5c1ebf5 ("ARM: 7749/1: spinlock: retry trylock operation if
strex fails on free lock") modified our arch_spin_trylock to retry the
acquisition if the lock appeared uncontended but the strex failed.

This patch does the same for rwlocks, which were missed by the original
patch.
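
Roughly, the write-trylock now loops while the strex fails on an
apparently free lock (abridged from the patched arch_write_trylock):

	unsigned long contended, res;

	do {
		__asm__ __volatile__(
		"	ldrex	%0, [%2]\n"
		"	mov	%1, #0\n"
		"	teq	%0, #0\n"
		"	strexeq	%1, %3, [%2]"
		: "=&amp;r" (contended), "=&amp;r" (res)
		: "r" (&amp;rw-&gt;lock), "r" (0x80000000)
		: "cc");
	} while (res);	/* retry if strex failed, not if contended */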

Signed-off-by: Will Deacon &lt;will.deacon@arm.com&gt;
Signed-off-by: Russell King &lt;rmk+kernel@arm.linux.org.uk&gt;
</content>
</entry>
<entry>
<title>ARM: 7811/1: locks: use early clobber in arch_spin_trylock</title>
<updated>2013-08-13T19:22:43+00:00</updated>
<author>
<name>Will Deacon</name>
<email>will.deacon@arm.com</email>
</author>
<published>2013-08-12T17:03:26+00:00</published>
<link rel='alternate' type='text/html' href='https://git.toradex.cn/cgit/linux-toradex.git/commit/?id=afa31d8eb86fc2f25083e675d57ac8173a98f999'/>
<id>afa31d8eb86fc2f25083e675d57ac8173a98f999</id>
<content type='text'>
The res variable is written before we've finished with the input
operands (namely the lock address), so ensure that we mark it as `early
clobber' to avoid unintended register sharing.
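
In constraint terms, the fix marks res with an early clobber ("=&amp;r"
instead of "=r"), as in this abridged output list from
arch_spin_trylock:

	: "=&amp;r" (slock), "=&amp;r" (contended), "=&amp;r" (res)
	: "r" (&amp;lock-&gt;slock), "I" (1 &lt;&lt; TICKET_SHIFT)
	: "cc");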

Signed-off-by: Will Deacon &lt;will.deacon@arm.com&gt;
Signed-off-by: Russell King &lt;rmk+kernel@arm.linux.org.uk&gt;
</content>
</entry>
<entry>
<title>ARM: spinlock: use inner-shareable dsb variant prior to sev instruction</title>
<updated>2013-08-12T11:25:45+00:00</updated>
<author>
<name>Will Deacon</name>
<email>will.deacon@arm.com</email>
</author>
<published>2013-05-13T10:39:50+00:00</published>
<link rel='alternate' type='text/html' href='https://git.toradex.cn/cgit/linux-toradex.git/commit/?id=73a6fdc48bf52e93c26874dc8c0f0f8d5585a809'/>
<id>73a6fdc48bf52e93c26874dc8c0f0f8d5585a809</id>
<content type='text'>
When unlocking a spinlock, we use the sev instruction to signal other
CPUs waiting on the lock. Since sev is not a memory access instruction,
we require a dsb in order to ensure that the sev is not issued ahead
of the store placing the lock in an unlocked state.

However, as sev is only concerned with other processors in a
multiprocessor system, we can restrict the scope of the preceding dsb
to the inner-shareable domain. Furthermore, we can restrict the scope to
consider only stores, since there are no independent loads on the unlock
path.

A side-effect of this change is that a spin_unlock operation no longer
forces completion of pending TLB invalidation, something which we rely
on when unlocking runqueues to ensure that CPU migration during TLB
maintenance routines doesn't cause us to continue before the operation
has completed.

This patch adds the -ishst suffix to the ARMv7 definition of dsb_sev()
and adds an inner-shareable dsb to the context-switch path when running
a preemptible, SMP, v7 kernel.
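
On an ARMv7 kernel the helper then reads, roughly:

	static inline void dsb_sev(void)
	{
		__asm__ __volatile__ (
			"dsb ishst\n"	/* stores only, inner-shareable domain */
			SEV
		);
	}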

Reviewed-by: Catalin Marinas &lt;catalin.marinas@arm.com&gt;
Signed-off-by: Will Deacon &lt;will.deacon@arm.com&gt;
</content>
</entry>
</feed>
