Commits · a19ab67318760f8f155ef7f4f821dfc738542c67 · nexedi / MariaDB

05 Nov, 2021 3 commits

Merge branch '10.3' into 10.4 · a19ab673
Oleksandr Byelkin authored Nov 05, 2021

a19ab673
Merge branch '10.2' into 10.3 · a2f147af
Oleksandr Byelkin authored Nov 05, 2021

a2f147af

MDEV-26833 Missed statement rollback in case transaction drops or create temporary table · 561b6c7e

Andrei Elkin authored Oct 25, 2021

When transaction creates or drops temporary tables and afterward its statement
faces an error even the transactional table statement's cached ROW
format events get involved into binlog and are visible after the transaction's commit.

Fixed with proper analysis of whether the errored-out statement needs
to be rolled back in binlog.
For instance a fact of already cached CREATE or DROP for temporary
tables by previous statements alone
does not cause to retain the being errored-out statement events in the
cache.
Conversely, if the statement creates or drops a temporary table
itself it can't be rolled back - this rule remains.

561b6c7e

03 Nov, 2021 2 commits
- Merge branch '10.3' into 10.4 · 3021b929
  Oleksandr Byelkin authored Nov 03, 2021
  
  3021b929
- Merge branch '10.2' into 10.3 · 69c70c18
  Oleksandr Byelkin authored Nov 03, 2021
  
  69c70c18
02 Nov, 2021 8 commits
- post merge result fix · eb2c3d38
  Oleksandr Byelkin authored Nov 02, 2021
  
  eb2c3d38
- Merge branch '10.3' into 10.4 · ef968c9e
  Oleksandr Byelkin authored Nov 02, 2021
  
  ef968c9e
- Fix mutex order according to a new sequence. · bb46b79c
  Oleksandr Byelkin authored Nov 02, 2021
  
  bb46b79c
- Merge branch '10.2' into 10.3 · f0b9194d
  Oleksandr Byelkin authored Nov 02, 2021
  
  f0b9194d
- move "bad" test in seperate file with valgrind prohibited (different size of allocated memory) · d7c179e6
  Oleksandr Byelkin authored Nov 02, 2021
  
  d7c179e6
- MDEV-23328 Server hang due to Galera lock conflict resolution · eab7f5d8
  Jan Lindström authored Nov 01, 2021
```
* Fix error handling NULL-pointer reference
* Add mtr-suppression on galera_ssl_upgrade
```
  eab7f5d8
- MDEV-23328 Server hang due to Galera lock conflict resolution · db649244
  Jan Lindström authored Nov 01, 2021
```
* Fix error handling NULL-pointer reference
* Add mtr-suppression on galera_ssl_upgrade
```
  db649244
- MDEV-23328 Server hang due to Galera lock conflict resolution · e571eaae
  Jan Lindström authored Nov 02, 2021
```
Use better error message when KILL fails even in case TOI
fails.
```
  e571eaae
01 Nov, 2021 1 commit
- MDEV-23328 Server hang due to Galera lock conflict resolution · ea239034
  Jan Lindström authored Nov 01, 2021
```
* Fix error handling NULL-pointer reference
* Add mtr-suppression on galera_ssl_upgrade
```
  ea239034
29 Oct, 2021 8 commits

Merge branch '10.3' into 10.4 · 5900f3a7
Oleksandr Byelkin authored Oct 29, 2021

5900f3a7
Merge branch '10.2' into 10.3 · 6953af36
Oleksandr Byelkin authored Oct 29, 2021

6953af36

MDEV-23328 Server hang due to Galera lock conflict resolution · 157b3a63

sjaakola authored Oct 21, 2021

Mutex order violation when wsrep bf thread kills a conflicting trx,
the stack is

          wsrep_thd_LOCK()
          wsrep_kill_victim()
          lock_rec_other_has_conflicting()
          lock_clust_rec_read_check_and_lock()
          row_search_mvcc()
          ha_innobase::index_read()
          ha_innobase::rnd_pos()
          handler::ha_rnd_pos()
          handler::rnd_pos_by_record()
          handler::ha_rnd_pos_by_record()
          Rows_log_event::find_row()
          Update_rows_log_event::do_exec_row()
          Rows_log_event::do_apply_event()
          Log_event::apply_event()
          wsrep_apply_events()

and mutexes are taken in the order

          lock_sys->mutex -> victim_trx->mutex -> victim_thread->LOCK_thd_data

When a normal KILL statement is executed, the stack is

          innobase_kill_query()
          kill_handlerton()
          plugin_foreach_with_mask()
          ha_kill_query()
          THD::awake()
          kill_one_thread()

        and mutexes are

          victim_thread->LOCK_thd_data -> lock_sys->mutex -> victim_trx->mutex

This patch is the plan D variant for fixing potetial mutex locking
order exercised by BF aborting and KILL command execution.

In this approach, KILL command is replicated as TOI operation.
This guarantees total isolation for the KILL command execution
in the first node: there is no concurrent replication applying
and no concurrent DDL executing. Therefore there is no risk of
BF aborting to happen in parallel with KILL command execution
either. Potential mutex deadlocks between the different mutex
access paths with KILL command execution and BF aborting cannot
therefore happen.

TOI replication is used, in this approach,  purely as means
to provide isolated KILL command execution in the first node.
KILL command should not (and must not) be applied in secondary
nodes. In this patch, we make this sure by skipping KILL
execution in secondary nodes, in applying phase, where we
bail out if applier thread is trying to execute KILL command.
This is effective, but skipping the applying of KILL command
could happen much earlier as well.

This also fixed unprotected calls to wsrep_thd_abort
that will use wsrep_abort_transaction. This is fixed
by holding THD::LOCK_thd_data while we abort transaction.
Reviewed-by: Jan Lindström <jan.lindstrom@mariadb.com>

157b3a63

MDEV-25114: Crash: WSREP: invalid state ROLLED_BACK (FATAL) · 30337add
Jan Lindström authored Oct 21, 2021
```
Revert "MDEV-23328 Server hang due to Galera lock conflict resolution"

This reverts commit 29bbcac0.
```
30337add

MDEV-23328 Server hang due to Galera lock conflict resolution · 5c230b21

sjaakola authored Oct 21, 2021

Mutex order violation when wsrep bf thread kills a conflicting trx,
the stack is

          wsrep_thd_LOCK()
          wsrep_kill_victim()
          lock_rec_other_has_conflicting()
          lock_clust_rec_read_check_and_lock()
          row_search_mvcc()
          ha_innobase::index_read()
          ha_innobase::rnd_pos()
          handler::ha_rnd_pos()
          handler::rnd_pos_by_record()
          handler::ha_rnd_pos_by_record()
          Rows_log_event::find_row()
          Update_rows_log_event::do_exec_row()
          Rows_log_event::do_apply_event()
          Log_event::apply_event()
          wsrep_apply_events()

and mutexes are taken in the order

          lock_sys->mutex -> victim_trx->mutex -> victim_thread->LOCK_thd_data

When a normal KILL statement is executed, the stack is

          innobase_kill_query()
          kill_handlerton()
          plugin_foreach_with_mask()
          ha_kill_query()
          THD::awake()
          kill_one_thread()

        and mutexes are

          victim_thread->LOCK_thd_data -> lock_sys->mutex -> victim_trx->mutex

This patch is the plan D variant for fixing potetial mutex locking
order exercised by BF aborting and KILL command execution.

In this approach, KILL command is replicated as TOI operation.
This guarantees total isolation for the KILL command execution
in the first node: there is no concurrent replication applying
and no concurrent DDL executing. Therefore there is no risk of
BF aborting to happen in parallel with KILL command execution
either. Potential mutex deadlocks between the different mutex
access paths with KILL command execution and BF aborting cannot
therefore happen.

TOI replication is used, in this approach,  purely as means
to provide isolated KILL command execution in the first node.
KILL command should not (and must not) be applied in secondary
nodes. In this patch, we make this sure by skipping KILL
execution in secondary nodes, in applying phase, where we
bail out if applier thread is trying to execute KILL command.
This is effective, but skipping the applying of KILL command
could happen much earlier as well.

This also fixed unprotected calls to wsrep_thd_abort
that will use wsrep_abort_transaction. This is fixed
by holding THD::LOCK_thd_data while we abort transaction.
Reviewed-by: Jan Lindström <jan.lindstrom@mariadb.com>

5c230b21

MDEV-25114: Crash: WSREP: invalid state ROLLED_BACK (FATAL) · aa7ca987
Jan Lindström authored Oct 22, 2021
```
Revert "MDEV-23328 Server hang due to Galera lock conflict resolution"

This reverts commit eac8341d.
```
aa7ca987

MDEV-23328 Server hang due to Galera lock conflict resolution · db50ea3a

sjaakola authored Oct 21, 2021

Mutex order violation when wsrep bf thread kills a conflicting trx,
the stack is

          wsrep_thd_LOCK()
          wsrep_kill_victim()
          lock_rec_other_has_conflicting()
          lock_clust_rec_read_check_and_lock()
          row_search_mvcc()
          ha_innobase::index_read()
          ha_innobase::rnd_pos()
          handler::ha_rnd_pos()
          handler::rnd_pos_by_record()
          handler::ha_rnd_pos_by_record()
          Rows_log_event::find_row()
          Update_rows_log_event::do_exec_row()
          Rows_log_event::do_apply_event()
          Log_event::apply_event()
          wsrep_apply_events()

and mutexes are taken in the order

          lock_sys->mutex -> victim_trx->mutex -> victim_thread->LOCK_thd_data

When a normal KILL statement is executed, the stack is

          innobase_kill_query()
          kill_handlerton()
          plugin_foreach_with_mask()
          ha_kill_query()
          THD::awake()
          kill_one_thread()

        and mutexes are

          victim_thread->LOCK_thd_data -> lock_sys->mutex -> victim_trx->mutex

This patch is the plan D variant for fixing potetial mutex locking
order exercised by BF aborting and KILL command execution.

In this approach, KILL command is replicated as TOI operation.
This guarantees total isolation for the KILL command execution
in the first node: there is no concurrent replication applying
and no concurrent DDL executing. Therefore there is no risk of
BF aborting to happen in parallel with KILL command execution
either. Potential mutex deadlocks between the different mutex
access paths with KILL command execution and BF aborting cannot
therefore happen.

TOI replication is used, in this approach,  purely as means
to provide isolated KILL command execution in the first node.
KILL command should not (and must not) be applied in secondary
nodes. In this patch, we make this sure by skipping KILL
execution in secondary nodes, in applying phase, where we
bail out if applier thread is trying to execute KILL command.
This is effective, but skipping the applying of KILL command
could happen much earlier as well.

This also fixed unprotected calls to wsrep_thd_abort
that will use wsrep_abort_transaction. This is fixed
by holding THD::LOCK_thd_data while we abort transaction.
Reviewed-by: Jan Lindström <jan.lindstrom@mariadb.com>

db50ea3a

MDEV-25114: Crash: WSREP: invalid state ROLLED_BACK (FATAL) · c8b39f7e
Jan Lindström authored Oct 21, 2021
```
Revert "MDEV-23328 Server hang due to Galera lock conflict resolution"

This reverts commit 29bbcac0.
```
c8b39f7e

28 Oct, 2021 12 commits

wolfssl v4.8.1-stable · e1083826
Oleksandr Byelkin authored Oct 28, 2021

e1083826
Merge branch '10.3' into 10.4 · 89f69c62
Oleksandr Byelkin authored Oct 28, 2021

89f69c62
Merge branch '10.2' into 10.3 · 2ddea602
Oleksandr Byelkin authored Oct 28, 2021

2ddea602
fix depricated pthread_yield() for tokudb · b3cdf416
Oleksandr Byelkin authored Oct 28, 2021

b3cdf416

compilation fixes for sys-devel/gcc-11.2.0:11 · 1203b658

Sergei Golubchik authored Oct 28, 2021

for example:

sql/sql_prepare.cc:5714:63: error: 'static void Ed_result_set::operator delete(void*, MEM_ROOT*)' called on pointer returned from a mismatched allocation function [-Werror=mismatched-new-delete]

1203b658

Merge remote-tracking branch 'connect/10.2' into 10.2 · 99c89358
Oleksandr Byelkin authored Oct 28, 2021

99c89358
Fix message severity for "thread pool blocked" messages. · ff3274dd
Vladislav Vaintroub authored Oct 27, 2021
```
Those messages don't indicate errors, they should be normal warnings.
```
ff3274dd
Merge 10.3 into 10.4 · 3a79e5fd
Marko Mäkelä authored Oct 28, 2021

3a79e5fd
Merge 10.2 into 10.3 · 657bcf92
Marko Mäkelä authored Oct 28, 2021

657bcf92

MDEV-26867: Update the InnoDB version number to 5.7.36 · 563daec1

Marko Mäkelä authored Oct 28, 2021

The InnoDB changes in MySQL 5.7.36 that were applicable to MariaDB
were covered by MDEV-26864, MDEV-26865, MDEV-26866.

563daec1

MDEV-26866 FOREIGN KEY…SET NULL corrupts an index on a virtual column · 1f5ca66e

Nikita Malyavin authored Oct 27, 2021

The initial test case for MySQL Bug #33053297 is based on
mysql/mysql-server@27130e25078864b010d81266f9613d389d4a229b.

innobase_get_field_from_update_vector is not a suitable function to fetch
updated row info, as well as parent table's update vector is not always
suitable. For instance, in case of DELETE it contains undefined data.

castade->update vector seems to be good enough to fetch all base columns
update data, and besides faster, and less error-prone.

1f5ca66e

MDEV-26914: Unreleased mutex in the exec_relay_log_event() function · 7948a1dc

Julius Goryavsky authored Oct 27, 2021

In the replication-related code, in the exec_relay_log_event() (slave.cc)
function, where the "data_lock" mutex is captured, this mutex is then not
released on one of the early return branches within a specific insert for
WSREP, namely under the branch: "if (wsrep_before_statement(thd))". As a
result, the mutex remains captured, resulting in errors or hangs.

This commit fixes this issue, which is now showing up as intermittent
failures in mtr tests for galera and galera_sr suites.

7948a1dc

27 Oct, 2021 6 commits

MDEV-18543 fixup: Fix 32-bit builds · 772d6d34
Marko Mäkelä authored Oct 27, 2021

772d6d34

Fix compile warning: · 3a9967d7

Sergei Petrunia authored Oct 27, 2021

ha_rocksdb.h:459:15: warning: 'table_type' overrides a member
function but is not marked 'override' [-Winconsistent-missing-override]

3a9967d7

MDEV-25402 Assertion `!str || str != Ptr' failed in String::copy · 2ed148c8

Alexander Barkov authored Oct 27, 2021

The assert inside String::copy() prevents copying from from "str"
if its own String::Ptr also points to the same memory.

The idea of the assert is that copy() performs memory reallocation,
and this reallocation can free (and thus invalidate) the memory pointed by Ptr,
which can lead to further copying from a freed memory.

The assert was incomplete: copy() can free the memory pointed by its Ptr
only if String::alloced is true!

If the String is not alloced, it is still safe to copy even from
the location pointed by Ptr.

This scenario demonstrates a safe copy():
  const char *tmp= "123";
  String str1(tmp, 3);
  String str2(tmp, 3);
  // This statement is safe:
  str2.copy(str1->ptr(), str1->length(), str1->charset(), cs_to, &errors);

Inside the copy() the parameter "str" is equal to String::Ptr in this example.
But it's still ok to reallocate the memory for str2, because str2
was a constant before the copy() call. Thus reallocation does not
make the memory pointed by str1->ptr() invalid.

Adjusting the assert condition to allow copying for constant strings.

2ed148c8

Fix tests for PLUGIN_PARTITION=NO · 4b8340d8
Marko Mäkelä authored Oct 27, 2021

4b8340d8
MDEV-22380 Assertion `name.length == strlen(name.str)' failed .. w/optimizer_trace enabled · 05a0eae3
Alexander Barkov authored Oct 27, 2021
```
Adding 10.4 specific tests.
```
05a0eae3
Merge remote-tracking branch 'origin/10.3' into 10.4 · 7b752429
Alexander Barkov authored Oct 27, 2021

7b752429