- 31 Mar, 2017 11 commits
-
Julien Muchembled authored
Commit ad43dcd3 should have bumped it as well.
-
Julien Muchembled authored
Unused but it is likely to be useful in the future.
-
Julien Muchembled authored
The bug could lead to data corruption (if a partition is wrongly marked as UP_TO_DATE) or crashes (assertion failure on either the storage or the master).

The protocol is extended to handle the following scenario:

     S                                     M
                                           partition 0 outdated
     <-- UnfinishedTransactions ------>
     replication of partition 0 ...
                                           partition 1 outdated
     --- UnfinishedTransactions ...
     ... replication finished
     --- ReplicationDone ...
                                           tweak
     <-- partition 1 discarded --------
                                           tweak
     <-- partition 1 outdated ---------
     ... UnfinishedTransactions -->
     ... ReplicationDone --------->

The master can't simply mark all outdated cells as being updatable when it receives an UnfinishedTransactions packet.
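The idea behind the extended protocol can be sketched as follows. This is a hypothetical model, not NEO's real API: the master tags each cell with the partition-table id (ptid) at which it became outdated, and a delayed UnfinishedTransactions packet may only make updatable the cells that were already outdated when the storage sent it.

```python
# Hypothetical sketch (invented names, not NEO's actual classes): the master
# must remember *when* each cell became outdated, so that a delayed
# UnfinishedTransactions packet cannot affect cells outdated by a later tweak.

class Master:
    def __init__(self):
        self.ptid = 0              # partition table id, bumped by each change
        self.outdated_since = {}   # partition -> ptid at which it became outdated

    def outdate(self, partition):
        self.ptid += 1
        self.outdated_since[partition] = self.ptid

    def discard(self, partition):
        self.ptid += 1
        self.outdated_since.pop(partition, None)

    def updatable_on_unfinished(self, sent_at_ptid):
        # The buggy behaviour would be `return set(self.outdated_since)`,
        # i.e. mark *all* outdated cells updatable.  Instead, only cells
        # already outdated when the packet was sent qualify.
        return {p for p, since in self.outdated_since.items()
                if since <= sent_at_ptid}

m = Master()
m.outdate(0)                 # partition 0 outdated
m.outdate(1)                 # partition 1 outdated
delayed = m.ptid             # storage sends UnfinishedTransactions (delayed)
m.discard(1)                 # tweak: partition 1 discarded
m.outdate(1)                 # tweak: partition 1 outdated again
# The delayed packet finally arrives: only partition 0 may become updatable.
assert m.updatable_on_unfinished(delayed) == {0}
```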
-
Julien Muchembled authored
-
Julien Muchembled authored
After an attempt to read from a non-readable cell, which happens when a client has a newer or older partition table than the storage's, the client now retries the read. This bugfix covers all kinds of read access except undoLog, which can still report incomplete results.
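The retry behaviour described above can be sketched like this. All names here are illustrative, not NEO's actual client code: on a "non-readable cell" error, the client resynchronizes its partition table and retries the load.

```python
# Hedged sketch (invented names): retry a load when the storage answers that
# the requested cell is not readable, i.e. the client's partition table is
# older or newer than the storage's.

class NonReadableCell(Exception):
    pass

def load_with_retry(storage, oid, max_retries=5):
    for _ in range(max_retries):
        try:
            return storage.load(oid)
        except NonReadableCell:
            # Resync the partition table with the master before retrying.
            storage.refresh_partition_table()
    raise NonReadableCell(oid)

class FakeStorage:
    """Fails once with a mismatched partition table, then succeeds."""
    def __init__(self):
        self.synced = False
    def load(self, oid):
        if not self.synced:
            raise NonReadableCell(oid)
        return b'data-%d' % oid
    def refresh_partition_table(self):
        self.synced = True

assert load_with_retry(FakeStorage(), 7) == b'data-7'
```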
-
Julien Muchembled authored
-
Julien Muchembled authored
-
Julien Muchembled authored
This reverts commit bddc1802, to fix the following storage crash:

Traceback (most recent call last):
  ...
  File "neo/lib/handler.py", line 72, in dispatch
    method(conn, *args, **kw)
  File "neo/storage/handlers/master.py", line 44, in notifyPartitionChanges
    app.pt.update(ptid, cell_list, app.nm)
  File "neo/lib/pt.py", line 231, in update
    assert node is not None, 'No node found for uuid ' + uuid_str(uuid)
AssertionError: No node found for uuid S3

Partition table updates must also be processed with InitializationHandler when nodes remain in the PENDING state, because such nodes are not added to the cluster.
-
Julien Muchembled authored
-
Julien Muchembled authored
-
Julien Muchembled authored
-
- 30 Mar, 2017 1 commit
-
Julien Muchembled authored
-
- 23 Mar, 2017 9 commits
-
Julien Muchembled authored
In the worst case, with many clients trying to lock the same oids, the cluster could enter an infinite cascade of deadlocks. Here is an overview with 3 storage nodes and 3 transactions:

 S1     S2     S3     order of locking tids   # abbreviations:
 l1     l1     l2     123                     # l: lock
 q23    q23    d1q3   231                     # d: deadlock triggered
 r1:l3  r1:l2  (r1)         # for S3, we still have l2
                            # q: queued
 d2q1   q13    q13    312                     # r: rebase

Above, we show what happens when a random transaction gets a lock just after another one is rebased. Here, the result is that the last 2 lines are a permutation of the first 2, and this can repeat indefinitely with bad luck.

This commit reduces the probability of deadlock by processing delayed stores/checks in the order of their locking tid. In the above example, S1 would give the lock to 2 when 1 is rebased, and 2 would vote successfully.
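The ordering idea behind the fix can be sketched with a priority queue. This is an illustration of the principle, not the actual NEO code: delayed stores/checks are replayed in increasing order of locking tid, so the oldest transaction gets the lock first instead of a random one.

```python
# Sketch of the fix's ordering idea: replay delayed operations in
# increasing order of locking tid, using a heap.

import heapq

class DelayedOperations:
    def __init__(self):
        self._heap = []
        self._counter = 0      # tie-breaker, keeps insertion order stable

    def delay(self, locking_tid, operation):
        heapq.heappush(self._heap, (locking_tid, self._counter, operation))
        self._counter += 1

    def replay(self):
        # Pop operations in locking-tid order, oldest transaction first.
        while self._heap:
            yield heapq.heappop(self._heap)[2]

ops = DelayedOperations()
ops.delay(3, 'store by t3')
ops.delay(1, 'store by t1')
ops.delay(2, 'check by t2')
assert list(ops.replay()) == ['store by t1', 'check by t2', 'store by t3']
```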
-
Julien Muchembled authored
-
Julien Muchembled authored
This fixes a bug that could lead to data corruption or crashes.
-
Julien Muchembled authored
It becomes possible to answer with several packets:
- the last one is the usual associated answer packet
- all other (previously sent) packets are notifications

Connection.send does not return the packet id anymore. This is not useful enough, and the caller can inspect the sent packet (getId).
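A simplified model of this answering scheme, with invented classes (not NEO's real Connection/Packet API): several packets may be sent for one request, only the final one carries the request's id, and send() returns None while callers inspect the packet via getId().

```python
# Illustrative model (invented names): multi-packet answers where only the
# last packet is the real answer, and send() no longer returns the id.

class Packet:
    def __init__(self, name, pid=None):
        self.name = name
        self._id = pid
    def setId(self, pid):
        self._id = pid
    def getId(self):
        return self._id

class Connection:
    def __init__(self):
        self.sent = []
    def send(self, packet):
        # Returns None: callers that need the id call packet.getId().
        self.sent.append(packet)
    def answer(self, packet, request_id):
        # Only the final packet of the exchange carries the request id.
        packet.setId(request_id)
        self.send(packet)

conn = Connection()
conn.send(Packet('NotifyProgress'))            # notification, no request id
conn.answer(Packet('AnswerFetch'), request_id=42)
assert [p.getId() for p in conn.sent] == [None, 42]
assert conn.sent[-1].getId() == 42             # caller can still get the id
```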
-
Julien Muchembled authored
-
Julien Muchembled authored
-
Julien Muchembled authored
-
Julien Muchembled authored
-
Julien Muchembled authored
-
- 22 Mar, 2017 1 commit
-
Julien Muchembled authored
In reality, this was tested with taskset 1 neotestrunner ...
-
- 21 Mar, 2017 1 commit
-
Julien Muchembled authored
-
- 20 Mar, 2017 2 commits
-
Julien Muchembled authored
-
Julien Muchembled authored
-
- 18 Mar, 2017 1 commit
-
Julien Muchembled authored
Traceback (most recent call last):
  ...
  File "neo/lib/handler.py", line 72, in dispatch
    method(conn, *args, **kw)
  File "neo/master/handlers/client.py", line 70, in askFinishTransaction
    conn.getPeerId(),
  File "neo/master/transactions.py", line 387, in prepare
    assert node_list, (ready, failed)
AssertionError: (set([]), frozenset([]))

Master log leading to the crash:

  PACKET #0x0009 StartOperation          > S1
  PACKET #0x0004 BeginTransaction        < C1
  DEBUG   Begin <...>
  PACKET #0x0004 AnswerBeginTransaction  > C1
  PACKET #0x0001 NotifyReady             < S1

It was wrong to process BeginTransaction before receiving NotifyReady.

The changes in the storage are cosmetic: the 'ready' attribute has become redundant with 'operational'.
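The ordering constraint can be sketched as follows. This is a hypothetical model, not the master's real handler code: BeginTransaction requests are delayed until every storage node has sent NotifyReady, otherwise prepare() later finds no ready node and the assertion above fires.

```python
# Minimal sketch (invented API): the master must not process
# BeginTransaction before all storages have notified they are ready.

class Master:
    def __init__(self, storages):
        self.pending = set(storages)   # storages that have not sent NotifyReady
        self.queued = []               # delayed BeginTransaction requests

    def notifyReady(self, storage):
        self.pending.discard(storage)
        if not self.pending:
            # Safe to begin the delayed transactions now.
            started, self.queued = self.queued, []
            return started
        return []

    def askBeginTransaction(self, client):
        if self.pending:
            # The bug was to process this immediately instead of delaying.
            self.queued.append(client)
            return None
        return 'ttid-for-%s' % client   # placeholder for a real ttid

m = Master(['S1'])
assert m.askBeginTransaction('C1') is None   # delayed, not processed
assert m.notifyReady('S1') == ['C1']         # processed once S1 is ready
assert m.askBeginTransaction('C2') == 'ttid-for-C2'
```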
-
- 17 Mar, 2017 3 commits
-
Julien Muchembled authored
-
Julien Muchembled authored
Due to a bug in MariaDB Connector/C 2.3.2, some tests like testBasicStore and test_max_allowed_packet were retrying the same failing query indefinitely.
-
Julien Muchembled authored
-
- 14 Mar, 2017 4 commits
-
Julien Muchembled authored
On clusters with many deadlock avoidances, this flooded logs. Hopefully, this commit reduces the size of logs without losing information.
-
Julien Muchembled authored
An issue that happened for the first time on a storage node didn't always cause other nodes to flush their logs, which made debugging difficult.
-
Julien Muchembled authored
-
Julien Muchembled authored
-
- 07 Mar, 2017 1 commit
-
Julien Muchembled authored
-
- 03 Mar, 2017 1 commit
-
Julien Muchembled authored
Generators are not thread-safe:

Exception in thread T2:
Traceback (most recent call last):
  ...
  File "ZODB/tests/StorageTestBase.py", line 157, in _dostore
    r2 = self._storage.tpc_vote(t)
  File "neo/client/Storage.py", line 95, in tpc_vote
    return self.app.tpc_vote(transaction)
  File "neo/client/app.py", line 507, in tpc_vote
    self.waitStoreResponses(txn_context)
  File "neo/client/app.py", line 500, in waitStoreResponses
    _waitAnyTransactionMessage(txn_context)
  File "neo/client/app.py", line 145, in _waitAnyTransactionMessage
    self._waitAnyMessage(queue, block=block)
  File "neo/client/app.py", line 128, in _waitAnyMessage
    conn, packet, kw = get(block)
  File "neo/lib/locking.py", line 203, in get
    self._lock()
  File "neo/tests/threaded/__init__.py", line 590, in _lock
    for i in TIC_LOOP:
ValueError: generator already executing

======================================================================
FAIL: check_checkCurrentSerialInTransaction (neo.tests.zodb.testBasic.BasicTests)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "neo/tests/zodb/testBasic.py", line 33, in check_checkCurrentSerialInTransaction
    super(BasicTests, self).check_checkCurrentSerialInTransaction()
  File "ZODB/tests/BasicStorage.py", line 294, in check_checkCurrentSerialInTransaction
    utils.load_current(self._storage, b'\0\0\0\0\0\0\0\xf4')[1])
failureException: False is not true
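The failure mode itself is easy to reproduce without threads: CPython refuses to resume a generator frame that is already executing, which is exactly what happened when two threads iterated the shared TIC_LOOP generator. Re-entering a generator from inside its own frame triggers the same error:

```python
# Self-contained demonstration of "ValueError: generator already executing".
# Resuming a generator while it is already running is forbidden by CPython;
# two racing threads hit the same check as this single-threaded re-entry.

def demo():
    def reenter():
        try:
            next(gen)          # resume the generator from inside itself
        except ValueError as e:
            yield str(e)       # capture the interpreter's error message
    gen = reenter()            # closure: reenter sees `gen` at call time
    return next(gen)

assert demo() == 'generator already executing'
```

This is why the fix must protect the shared generator (or avoid sharing it) rather than relying on generators behaving like thread-safe iterators.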
-
- 02 Mar, 2017 2 commits
-
Julien Muchembled authored
This is done by moving self.replicator.populate() after the switch to MasterOperationHandler, so that the latter is not delayed.

This change comes with some refactoring of the main loop, to clean up app.checker and app.replicator properly (like app.tm).

Another option could have been to process notifications with the last handler, instead of the first one. But if possible, cleaning up the whole code so that handlers are no longer delayed looks like the best option.
-
Julien Muchembled authored
-
- 27 Feb, 2017 3 commits
-
Julien Muchembled authored
This happened in 2 cases:
- Commit a4c06242 ("Review aborting of transactions") introduced a race condition causing oids to remain write-locked forever after the transaction modifying them is aborted.
- An unfinished transaction is not locked/unlocked during tpc_finish: oids must be unlocked when being notified that the transaction is finished.
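The invariant being restored can be sketched as follows. This is a hedged illustration with invented names, not NEO's lock manager: every path that ends a transaction, abort as well as the finished notification, must release that transaction's write locks, or the oids stay locked forever.

```python
# Sketch (invented structure): a write-lock table where both abort and the
# "transaction finished" notification release the transaction's locks.

class LockTable:
    def __init__(self):
        self.write_locks = {}          # oid -> ttid holding the write lock

    def lock(self, oid, ttid):
        # A second locker would be queued in the real code; here we just
        # assert the oid is free or already ours.
        assert self.write_locks.setdefault(oid, ttid) == ttid

    def abort(self, ttid):
        self._release(ttid)            # the buggy race skipped this path

    def notify_finished(self, ttid):
        self._release(ttid)            # unfinished transactions end here

    def _release(self, ttid):
        for oid in [o for o, t in self.write_locks.items() if t == ttid]:
            del self.write_locks[oid]

lt = LockTable()
lt.lock(1, 'ttid-A')
lt.abort('ttid-A')
assert not lt.write_locks              # nothing stays write-locked forever
```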
-
Julien Muchembled authored
This was found by the first assertion of answerRebaseObject (client), because a storage node missed a few transactions and reported a conflict with an older serial than the one being stored: this must never happen, and this commit adds a more generic assertion on the storage side.

The above case occurs when the "first phase" of replication of a partition (all history up to the tid before unfinished transactions) ends after the unfinished transactions are finished: this was a corruption bug, where UP_TO_DATE cells could miss data. Otherwise, if the "first phase" ended before, the partition merely remained stuck in the OUT_OF_DATE state, and restarting the storage node was enough to recover.
-
Julien Muchembled authored
Traceback (most recent call last):
  ...
  File "neo/client/app.py", line 507, in tpc_vote
    self.waitStoreResponses(txn_context)
  File "neo/client/app.py", line 500, in waitStoreResponses
    _waitAnyTransactionMessage(txn_context)
  File "neo/client/app.py", line 150, in _waitAnyTransactionMessage
    self._handleConflicts(txn_context)
  File "neo/client/app.py", line 474, in _handleConflicts
    self._store(txn_context, oid, conflict_serial, data)
  File "neo/client/app.py", line 410, in _store
    self._waitAnyTransactionMessage(txn_context, False)
  File "neo/client/app.py", line 145, in _waitAnyTransactionMessage
    self._waitAnyMessage(queue, block=block)
  File "neo/client/app.py", line 133, in _waitAnyMessage
    _handlePacket(conn, packet, kw)
  File "neo/lib/threaded_app.py", line 133, in _handlePacket
    handler.dispatch(conn, packet, kw)
  File "neo/lib/handler.py", line 72, in dispatch
    method(conn, *args, **kw)
  File "neo/client/handlers/storage.py", line 122, in answerRebaseObject
    assert txn_context.conflict_dict[oid] == (serial, conflict)
AssertionError

Scenario:
0. unanswered rebase from S2
1. conflict resolved between t1 and t2 -> S1 & S2
2. S1 reports a new conflict
3. S2 answers to the rebase: returned serial (t1) is smaller than in conflict_dict (t2)
4. S2 reports the same conflict as in 2
-