Commits · d8cd289dbe69ce9b8115d6f200ceff657e5dafa0 · Kirill Smelkov / linux

08 Nov, 2012 40 commits

drbd: Remove left-over unused define · d8cd289d

Andreas Gruenbacher authored May 03, 2011

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

d8cd289d

drbd: fix schedule in atomic · 009ba89d

Lars Ellenberg authored May 02, 2011

An administrative detach used to request a state change directly to D_DISKLESS,
first suspending IO to avoid the last put_ldev() occuring from an endio handler,
potentially in irq context.

This is not enough on the receiving side (typically secondary), we may miss
some peer_req on the way to local disk, which then may do the last put_ldev()
from their drbd_peer_request_endio().

This patch makes the detach always go through the intermediate D_FAILED state.
We may consider to rename it D_DETACHING.

Alternative approach would be to create yet an other work item to be scheduled
on the worker, do the destructor work from there, and get the timing right.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

009ba89d

drbd: fix thread stop deadlock · 992d6e91

Lars Ellenberg authored May 02, 2011

There are races where the receiver may be exiting,
but still need the worker to process some stuff.

Do not wait for the receiver to die from an exiting worker.
The receiver must already be dead in case the worker decides to exit.
If the receiver was still alive, it may still want to queue work, and do
drbd_flush_workqueue() from it's disconnect cleanup code,
which would no longer be processed by an exiting worker.

This also would deadlock,
if the worker was to synchornously wait for the receiver to die.

Do not implicitly stop the worker.
The worker will only be stopped from configuration context, from
conn_reconfig_done(), drbd_adm_down() or drbd_adm_delete_connection(),
after making sure the receiver is already stopped.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

992d6e91

drbd: fix race when forcefully disconnecting · f3dfa40a

Lars Ellenberg authored May 02, 2011

If a forced disconnect hits a restarting receiver right after it passed
its final "if (C_DISCONNECTING)" test in drbdd_init(), but before it was
actually restarted by drbd_thread_setup, we could be left with a
connection stuck in C_DISCONNECTING, never reaching C_STANDALONE,
which would be necessary to take it down or reconfigure it.

Move the last cleanup into w_after_conn_state_ch(), and do an additional
state change request in conn_try_disconnect(), just in case.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

f3dfa40a

drbd: Allow to change data-integrity-alg on the fly · 88104ca4

Andreas Gruenbacher authored Apr 28, 2011

The main purpose of this is to allow to turn data integrity checking on
and off on demand without causing interruptions.

Implemented by allocating tconn->peer_integrity_tfm only when receiving
a P_PROTOCOL message. l accesses to tconn->peer_integrity_tf happen in
worker context, and no further synchronization is necessary.

On the sender side, tconn->integrity_tfm is modified under
tconn->data.mutex, and a P_PROTOCOL message is sent whenever. All
accesses to tconn->integrity_tfm already happen under this mutex.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

88104ca4

drbd: Introduce a "lockless" variant of drbd_send_protocoll() · a7eb7bdf

Andreas Gruenbacher authored Apr 29, 2011

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

a7eb7bdf

drbd: Remove obsolete drbd_crypto_is_hash() · 4b6ad6d4

Andreas Gruenbacher authored Apr 29, 2011

We allocate hash transformations with crypto_alloc_hash() which will
only return hash algorithms.  It is not necessary to reconfirm that we
actually got a hash algorithm.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

4b6ad6d4

drbd: Rename integrity_r_tfm -> peer_integrity_tfm · 5b614abe

Andreas Gruenbacher authored Apr 27, 2011

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

5b614abe

drbd: Rename integrity_w_tfm -> integrity_tfm · 8d412fc6

Andreas Gruenbacher authored Apr 27, 2011

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

8d412fc6

drbd: Wrong use of RCU in receive_protocol() · 86db0618

Andreas Gruenbacher authored Apr 28, 2011

It is not enough to grab net_conf->integrity_alg under rcu_read_lock()
and access it outside of it; the entire net_conf object may be gone by
then.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

86db0618

drbd: fix copy/paste error in comment · acb104c3

Lars Ellenberg authored Apr 28, 2011

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

acb104c3

drbd: rename variable sc to res_opts · b57a1e27

Lars Ellenberg authored Apr 27, 2011

sc was short for syncer conf, which does not exist anymore anyways.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

b57a1e27

drbd: rename variable ndc to new_disk_conf · 5ecc72c3

Lars Ellenberg authored Apr 27, 2011

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

5ecc72c3

drbd: on reconfiguration requests, mind the SET_DEFAULTS flag · 5979e361

Lars Ellenberg authored Apr 27, 2011

The DRBD_GENL_F_SET_DEFAULTS flag was ignored
for drbd_adm_disk_opts() and drbd_adm_net_opts().

Factor out drbd_set_*_defaults() helper functions,
and call them appropriately.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

5979e361

drbd: Consider all crypto options in connect and in net-options · 0fd0ea06

Philipp Reisner authored Apr 27, 2011

So for this was simply not considered after the options have been
re-arranged.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

0fd0ea06

drbd: fix various disconnecting races · d9cc6e23

Lars Ellenberg authored Apr 27, 2011

If an admin requests disconnect at a time when the state handling
already disconnects/reconnects, there have been some races.

Make sure to always really stop the network threads before
returning success for disconnect. Do not pretend successfull
forced disconnect, if the state handling returned an error.

Return success from drbd_adm_down() only after all threads are finished.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

d9cc6e23

drbd: remove useless kobject_uevent from drbd_adm_connect · 5ee743e9

Lars Ellenberg authored Apr 26, 2011

Calling kobject_uevent, which may sleep, from within rcu_read_lock()
protected regions is not possible.
This particular kobject_uevent also is also wrong. It was supposed to
trigger a udev run, just in case something relevant to udev symlink
magic has changed, when adjusting runtime re-configurable settings while
we still had the "syncer conf". It was improperly placed in connect
when we dropped the "syncer conf". The right thing to do is probably to
call "udevadm trigger" directly in those cases where drbdadm thinks
there was a need to trigger extra udev runs.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

5ee743e9

drbd: Removed the OBJECT_DYING and the CONFIG_PENDING bits · a18e9d1e

Philipp Reisner authored Apr 24, 2011

superseded by refcounting
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

a18e9d1e

drbd: Take a reference on tconn when finding a tconn by name · 0ace9dfa

Philipp Reisner authored Apr 24, 2011

Rule #3 of kref.txt
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

0ace9dfa

drbd: Basic refcounting for drbd_tconn · 9dc9fbb3

Philipp Reisner authored Apr 22, 2011

References hold by:
 * Each (running) drbd thread has a reference on tconn
 * Each mdev has a referenc on tconn
 * Beeing in the all_tconn list counts for one reference
 * Each after_conn_state_chg_work has a reference to tconn
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

9dc9fbb3

drbd: Eliminated drbd_free_resoruces() it is superseeded by conn_free_crypto() · 1d041225
Philipp Reisner authored Apr 22, 2011
```
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>
```
1d041225

drbd: move comment about stopping the receiver thread to where it belongs · f5e2b8b3

Lars Ellenberg authored Apr 24, 2011

When the last volume of a replication group is unconfigured,
the worker thread exits. To not interfere with cleanup
of other threads, before the the last cleanups run,
we need to make sure the receiver has already exited.

The commend explaining that clearly belongs above
drbd_thread_stop(&tconn->receiver), not in the cleanup loop below.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

f5e2b8b3

drbd: cmdname() enum to string convertion was missing a few constants · ae25b336

Lars Ellenberg authored Apr 24, 2011

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

ae25b336

drbd: fix setsockopt for user mode linux · ed439848

Lars Ellenberg authored Apr 23, 2011

We use our own copy of kernel_setsockopt, and did not mess around with
get_fs/set_fs, since we thought we knew we would always be KERNEL_DS
anyways. Apparently not so for at least user mode linux, so put the
set_fs(KERNEL_DS) in there.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

ed439848

drbd: allow status dump request all volumes of a specific resource · 71932efc

Lars Ellenberg authored Apr 18, 2011

We had drbd_adm_get_status (one single volume),
and drbd_adm_get_status_all (dump of all volumes of all resources).

This enhances the latter to be able to dump all volumes
of just one specific resource.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

71932efc

drbd: Considering that the two_primaries config flag can change · 302bdeae

Philipp Reisner authored Apr 21, 2011

Now since it is possible to change the two_primaries config
flag while the connection is up, make sure we treat a peer_req
in a consistent way if the config flag changes while the peer_req
is under IO.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

302bdeae

drbd: Proper locking for updates to net_conf under RCU · 91fd4dad

Philipp Reisner authored Apr 20, 2011

Removing the get_net_conf()/put_net_conf() functions
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

91fd4dad

drbd: rcu_read_lock() and rcu_dereference() for tconn->net_conf · 44ed167d

Philipp Reisner authored Apr 19, 2011

Removing the get_net_conf()/put_net_conf() calls
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

44ed167d

drbd: Allow online change of replication protocol only with agreed_pv >= 100 · b032b6fa
Philipp Reisner authored Apr 13, 2011
```
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>
```
b032b6fa

drbd: Check consistency of net options when the get changed online · cd64397c

Philipp Reisner authored Apr 13, 2011

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

cd64397c

drbd: Runtime changeable wire protocol · 303d1448

Philipp Reisner authored Apr 13, 2011

The wire protocol is no longer a property that is negotiated
between the two peers. It is now expressed with two bits
(DP_SEND_WRITE_ACK and DP_SEND_RECEIVE_ACK) in each data
packet. Therefore the primary node is free to change the
wire protocol at any time without disconnect/reconnect.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

303d1448

drbd: protect all idr accesses that might sleep with drbd_cfg_rwsem · d3fcb490

Philipp Reisner authored Apr 13, 2011

With this commit the locking for all accesses to IDRs is complete:

 * Non sleeping read accesses are protected by RCU
 * sleeping read accesses are protocted by a read lock on drbd_cfg_rwsem
 * accesses that add anything are protected by a write lock
 * accesses that remove an object are protoected by a write lock
   and a call to synchronize_rcu() after it is removed from the IDR
   and before the object is actually free()ed.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

d3fcb490

drbd: Converted drbd_cfg_mutex into drbd_cfg_rwsem · ef356262

Philipp Reisner authored Apr 13, 2011

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

ef356262

drbd: rcu_read_[un]lock() for all idr accesses that do not sleep · 695d08fa

Philipp Reisner authored Apr 11, 2011

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

695d08fa

drbd: Inlined drbd_free_mdev(); it got called only from one place · cd1d9950

Philipp Reisner authored Apr 11, 2011

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

cd1d9950

drbd: drbd_delete_device() takes a struct drbd_conf * now · ff370e5a

Philipp Reisner authored Apr 11, 2011

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

ff370e5a

drbd: Rename drbd_pp_free() to drbd_free_pages() · 5cc287e0

Andreas Gruenbacher authored Apr 07, 2011

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

5cc287e0

drbd: Rename drbd_pp_alloc() to drbd_alloc_pages() and make it non-static · c37c8ecf
Andreas Gruenbacher authored Apr 07, 2011
```
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>
```
c37c8ecf
drbd: Rename drbd_pp_first_pages_or_try_alloc() to __drbd_alloc_pages() · 18c2d522
Andreas Gruenbacher authored Apr 07, 2011
```
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>
```
18c2d522
drbd: Make drbd_wait_ee_list_empty() and _drbd_wait_ee_list_empty() static · d4da1537
Andreas Gruenbacher authored Apr 07, 2011
```
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>
```
d4da1537