Commit 66ce3a4d authored by Paul E. McKenney's avatar Paul E. McKenney

doc: Update memory-barriers.txt for read-to-write dependencies

The memory-barriers.txt document contains an obsolete passage stating that
smp_read_barrier_depends() is required to force ordering for read-to-write
dependencies.  We now know that this is not required, even for DEC Alpha.
This commit therefore updates this passage to state that read-to-write
dependencies are respected even without smp_read_barrier_depends().
Reported-by: default avatarLance Roy <ldr709@gmail.com>
Signed-off-by: default avatarPaul E. McKenney <paulmck@linux.vnet.ibm.com>
Cc: David Howells <dhowells@redhat.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Jonathan Corbet <corbet@lwn.net>
Cc: Alan Stern <stern@rowland.harvard.edu>
Cc: Andrea Parri <parri.andrea@gmail.com>
Cc: Jade Alglave <j.alglave@ucl.ac.uk>
Cc: Luc Maranget <luc.maranget@inria.fr>
[ paulmck: Reference control-dependencies sections and use WRITE_ONCE()
  per Will Deacon.  Correctly place split-cache paragraph while there. ]
Acked-by: default avatarWill Deacon <will.deacon@arm.com>
parent 4de5f89e
...@@ -594,7 +594,24 @@ between the address load and the data load: ...@@ -594,7 +594,24 @@ between the address load and the data load:
This enforces the occurrence of one of the two implications, and prevents the This enforces the occurrence of one of the two implications, and prevents the
third possibility from arising. third possibility from arising.
A data-dependency barrier must also order against dependent writes:
[!] Note that this extremely counterintuitive situation arises most easily on
machines with split caches, so that, for example, one cache bank processes
even-numbered cache lines and the other bank processes odd-numbered cache
lines. The pointer P might be stored in an odd-numbered cache line, and the
variable B might be stored in an even-numbered cache line. Then, if the
even-numbered bank of the reading CPU's cache is extremely busy while the
odd-numbered bank is idle, one can see the new value of the pointer P (&B),
but the old value of the variable B (2).
A data-dependency barrier is not required to order dependent writes
because the CPUs that the Linux kernel supports don't do writes
until they are certain (1) that the write will actually happen, (2)
of the location of the write, and (3) of the value to be written.
But please carefully read the "CONTROL DEPENDENCIES" section and the
Documentation/RCU/rcu_dereference.txt file: The compiler can and does
break dependencies in a great many highly creative ways.
CPU 1 CPU 2 CPU 1 CPU 2
=============== =============== =============== ===============
...@@ -603,29 +620,19 @@ A data-dependency barrier must also order against dependent writes: ...@@ -603,29 +620,19 @@ A data-dependency barrier must also order against dependent writes:
<write barrier> <write barrier>
WRITE_ONCE(P, &B); WRITE_ONCE(P, &B);
Q = READ_ONCE(P); Q = READ_ONCE(P);
<data dependency barrier> WRITE_ONCE(*Q, 5);
*Q = 5;
The data-dependency barrier must order the read into Q with the store Therefore, no data-dependency barrier is required to order the read into
into *Q. This prohibits this outcome: Q with the store into *Q. In other words, this outcome is prohibited,
even without a data-dependency barrier:
(Q == &B) && (B == 4) (Q == &B) && (B == 4)
Please note that this pattern should be rare. After all, the whole point Please note that this pattern should be rare. After all, the whole point
of dependency ordering is to -prevent- writes to the data structure, along of dependency ordering is to -prevent- writes to the data structure, along
with the expensive cache misses associated with those writes. This pattern with the expensive cache misses associated with those writes. This pattern
can be used to record rare error conditions and the like, and the ordering can be used to record rare error conditions and the like, and the CPUs'
prevents such records from being lost. naturally occurring ordering prevents such records from being lost.
[!] Note that this extremely counterintuitive situation arises most easily on
machines with split caches, so that, for example, one cache bank processes
even-numbered cache lines and the other bank processes odd-numbered cache
lines. The pointer P might be stored in an odd-numbered cache line, and the
variable B might be stored in an even-numbered cache line. Then, if the
even-numbered bank of the reading CPU's cache is extremely busy while the
odd-numbered bank is idle, one can see the new value of the pointer P (&B),
but the old value of the variable B (2).
The data dependency barrier is very important to the RCU system, The data dependency barrier is very important to the RCU system,
......
Markdown is supported
0%
or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment