Commits · 8d7196cdf165843b7b1271e2616bd57be359e1da · nexedi / MariaDB

04 Nov, 2021 1 commit

Sergei Golubchik authored Nov 05, 2021

old ftp.pcre.org is apparently down,
www.pcre.org says to use github as the primary download location

8d7196cd

03 Nov, 2021 3 commits
- Merge branch '10.4' into 10.5 · bc82c622
  Oleksandr Byelkin authored Nov 03, 2021
  
  bc82c622
- Merge branch '10.3' into 10.4 · 3021b929
  Oleksandr Byelkin authored Nov 03, 2021
  
  3021b929
- Merge branch '10.2' into 10.3 · 69c70c18
  Oleksandr Byelkin authored Nov 03, 2021
  
  69c70c18
02 Nov, 2021 10 commits
- Merge branch '10.4' into 10.5 · e26b30d1
  Oleksandr Byelkin authored Nov 02, 2021
  
  e26b30d1
- post merge result fix · eb2c3d38
  Oleksandr Byelkin authored Nov 02, 2021
  
  eb2c3d38
- Merge branch '10.3' into 10.4 · ef968c9e
  Oleksandr Byelkin authored Nov 02, 2021
  
  ef968c9e
- Fix mutex order according to a new sequence. · bb46b79c
  Oleksandr Byelkin authored Nov 02, 2021
  
  bb46b79c
- Merge branch '10.2' into 10.3 · f0b9194d
  Oleksandr Byelkin authored Nov 02, 2021
  
  f0b9194d
- move "bad" test into separate file with valgrind prohibited (different size of allocated memory) · d7c179e6
  Oleksandr Byelkin authored Nov 02, 2021
  
  d7c179e6
- MDEV-23328 Server hang due to Galera lock conflict resolution · 7846c56f
  Jan Lindström authored Nov 01, 2021
```
* Fix error handling NULL-pointer reference
* Add mtr-suppression on galera_ssl_upgrade
```
  7846c56f
- MDEV-23328 Server hang due to Galera lock conflict resolution · eab7f5d8
  Jan Lindström authored Nov 01, 2021
```
* Fix error handling NULL-pointer reference
* Add mtr-suppression on galera_ssl_upgrade
```
  eab7f5d8
- MDEV-23328 Server hang due to Galera lock conflict resolution · db649244
  Jan Lindström authored Nov 01, 2021
```
* Fix error handling NULL-pointer reference
* Add mtr-suppression on galera_ssl_upgrade
```
  db649244
- MDEV-23328 Server hang due to Galera lock conflict resolution · e571eaae
  Jan Lindström authored Nov 02, 2021
```
Use better error message when KILL fails even in case TOI
fails.
```
  e571eaae
01 Nov, 2021 1 commit
- MDEV-23328 Server hang due to Galera lock conflict resolution · ea239034
  Jan Lindström authored Nov 01, 2021
```
* Fix error handling NULL-pointer reference
* Add mtr-suppression on galera_ssl_upgrade
```
  ea239034
30 Oct, 2021 1 commit
- Merge branch '10.4' into 10.5 · cb5b3230
  Oleksandr Byelkin authored Oct 30, 2021
  
  cb5b3230
29 Oct, 2021 12 commits

MDEV-23328 Server hang due to Galera lock conflict resolution · ef2dbb8d

sjaakola authored Oct 21, 2021

Mutex order violation when wsrep bf thread kills a conflicting trx,
the stack is

          wsrep_thd_LOCK()
          wsrep_kill_victim()
          lock_rec_other_has_conflicting()
          lock_clust_rec_read_check_and_lock()
          row_search_mvcc()
          ha_innobase::index_read()
          ha_innobase::rnd_pos()
          handler::ha_rnd_pos()
          handler::rnd_pos_by_record()
          handler::ha_rnd_pos_by_record()
          Rows_log_event::find_row()
          Update_rows_log_event::do_exec_row()
          Rows_log_event::do_apply_event()
          Log_event::apply_event()
          wsrep_apply_events()

and mutexes are taken in the order

          lock_sys->mutex -> victim_trx->mutex -> victim_thread->LOCK_thd_data

When a normal KILL statement is executed, the stack is

          innobase_kill_query()
          kill_handlerton()
          plugin_foreach_with_mask()
          ha_kill_query()
          THD::awake()
          kill_one_thread()

and mutexes are

          victim_thread->LOCK_thd_data -> lock_sys->mutex -> victim_trx->mutex

This patch is the plan D variant for fixing the potential mutex locking
order violation exercised by BF aborting and KILL command execution.

In this approach, KILL command is replicated as TOI operation.
This guarantees total isolation for the KILL command execution
in the first node: there is no concurrent replication applying
and no concurrent DDL executing. Therefore there is no risk of
BF aborting to happen in parallel with KILL command execution
either. Potential mutex deadlocks between the different mutex
access paths with KILL command execution and BF aborting cannot
therefore happen.

TOI replication is used, in this approach, purely as a means
to provide isolated KILL command execution in the first node.
KILL command should not (and must not) be applied in secondary
nodes. In this patch, we ensure this by skipping KILL
execution in secondary nodes, in the applying phase, where we
bail out if an applier thread is trying to execute a KILL command.
This is effective, but skipping the applying of KILL command
could happen much earlier as well.

This also fixed unprotected calls to wsrep_thd_abort
that will use wsrep_abort_transaction. This is fixed
by holding THD::LOCK_thd_data while we abort transaction.
Reviewed-by: Jan Lindström <jan.lindstrom@mariadb.com>

ef2dbb8d
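
The lock-order inversion described in the commit message above can be illustrated with a small standalone sketch. This is not MariaDB code: the three `std::mutex` objects, the `toi_gate`, and the functions `bf_abort_path`, `kill_path`, and `run_demo` are hypothetical stand-ins chosen for illustration. A `std::shared_mutex` (C++17) loosely models the isolation that TOI is described as providing: applier-like threads hold the gate in shared mode, the KILL-like thread holds it exclusively, so the two opposite acquisition orders can never interleave.

```cpp
#include <atomic>
#include <cassert>
#include <mutex>
#include <shared_mutex>
#include <thread>
#include <vector>

// Hypothetical stand-ins for the three mutexes named in the commit message.
static std::mutex lock_sys_mutex;    // models lock_sys->mutex
static std::mutex victim_trx_mutex;  // models victim_trx->mutex
static std::mutex lock_thd_data;     // models victim_thread->LOCK_thd_data

// Hypothetical gate modelling TOI isolation (an assumption of this sketch):
// appliers take it shared, a KILL takes it exclusively.
static std::shared_mutex toi_gate;

static std::atomic<int> appliers_inside{0};  // appliers currently past the gate
static std::atomic<int> overlaps{0};         // times KILL saw an active applier

// BF abort order: lock_sys -> victim_trx -> LOCK_thd_data.
void bf_abort_path() {
    std::shared_lock<std::shared_mutex> gate(toi_gate);
    ++appliers_inside;
    {
        std::lock_guard<std::mutex> a(lock_sys_mutex);
        std::lock_guard<std::mutex> b(victim_trx_mutex);
        std::lock_guard<std::mutex> c(lock_thd_data);
    }
    --appliers_inside;
}

// KILL order: LOCK_thd_data -> lock_sys -> victim_trx (the opposite order).
void kill_path() {
    std::unique_lock<std::shared_mutex> gate(toi_gate);
    if (appliers_inside.load() != 0) ++overlaps;  // would indicate broken isolation
    std::lock_guard<std::mutex> c(lock_thd_data);
    std::lock_guard<std::mutex> a(lock_sys_mutex);
    std::lock_guard<std::mutex> b(victim_trx_mutex);
}

// Runs four applier-like threads and one KILL-like thread for `rounds`
// iterations each; returns how often the two paths overlapped.
int run_demo(int rounds) {
    overlaps = 0;
    std::vector<std::thread> threads;
    for (int t = 0; t < 4; ++t)
        threads.emplace_back([rounds] {
            for (int i = 0; i < rounds; ++i) bf_abort_path();
        });
    threads.emplace_back([rounds] {
        for (int i = 0; i < rounds; ++i) kill_path();
    });
    for (auto& th : threads) th.join();
    assert(appliers_inside.load() == 0);
    return overlaps.load();
}
```

With the gate in place, `run_demo(1000)` should return 0 and terminate. If the gate is removed from both functions, the two paths acquire the same mutexes in opposite orders and the program may hang, which is the situation the commit message describes.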

MDEV-25114: Crash: WSREP: invalid state ROLLED_BACK (FATAL) · d5bc0579
Jan Lindström authored Oct 28, 2021
```
Revert "MDEV-23328 Server hang due to Galera lock conflict resolution"

This reverts commit eac8341d.
```
d5bc0579

Merge branch '10.3' into 10.4 · 5900f3a7
Oleksandr Byelkin authored Oct 29, 2021

5900f3a7

Merge branch '10.2' into 10.3 · 6953af36
Oleksandr Byelkin authored Oct 29, 2021

6953af36

columnstore-5.6.3-2 · 1974df01
Oleksandr Byelkin authored Oct 29, 2021

1974df01

Merge branch '10.4' into 10.5 · 1c1396f0
Oleksandr Byelkin authored Oct 29, 2021

1c1396f0

MDEV-23328 Server hang due to Galera lock conflict resolution · 157b3a63

sjaakola authored Oct 21, 2021

Mutex order violation when wsrep bf thread kills a conflicting trx,
the stack is

          wsrep_thd_LOCK()
          wsrep_kill_victim()
          lock_rec_other_has_conflicting()
          lock_clust_rec_read_check_and_lock()
          row_search_mvcc()
          ha_innobase::index_read()
          ha_innobase::rnd_pos()
          handler::ha_rnd_pos()
          handler::rnd_pos_by_record()
          handler::ha_rnd_pos_by_record()
          Rows_log_event::find_row()
          Update_rows_log_event::do_exec_row()
          Rows_log_event::do_apply_event()
          Log_event::apply_event()
          wsrep_apply_events()

and mutexes are taken in the order

          lock_sys->mutex -> victim_trx->mutex -> victim_thread->LOCK_thd_data

When a normal KILL statement is executed, the stack is

          innobase_kill_query()
          kill_handlerton()
          plugin_foreach_with_mask()
          ha_kill_query()
          THD::awake()
          kill_one_thread()

and mutexes are

          victim_thread->LOCK_thd_data -> lock_sys->mutex -> victim_trx->mutex

This patch is the plan D variant for fixing the potential mutex locking
order violation exercised by BF aborting and KILL command execution.

In this approach, KILL command is replicated as TOI operation.
This guarantees total isolation for the KILL command execution
in the first node: there is no concurrent replication applying
and no concurrent DDL executing. Therefore there is no risk of
BF aborting to happen in parallel with KILL command execution
either. Potential mutex deadlocks between the different mutex
access paths with KILL command execution and BF aborting cannot
therefore happen.

TOI replication is used, in this approach, purely as a means
to provide isolated KILL command execution in the first node.
KILL command should not (and must not) be applied in secondary
nodes. In this patch, we ensure this by skipping KILL
execution in secondary nodes, in the applying phase, where we
bail out if an applier thread is trying to execute a KILL command.
This is effective, but skipping the applying of KILL command
could happen much earlier as well.

This also fixed unprotected calls to wsrep_thd_abort
that will use wsrep_abort_transaction. This is fixed
by holding THD::LOCK_thd_data while we abort transaction.
Reviewed-by: Jan Lindström <jan.lindstrom@mariadb.com>

157b3a63

MDEV-25114: Crash: WSREP: invalid state ROLLED_BACK (FATAL) · 30337add
Jan Lindström authored Oct 21, 2021
```
Revert "MDEV-23328 Server hang due to Galera lock conflict resolution"

This reverts commit 29bbcac0.
```
30337add

MDEV-23328 Server hang due to Galera lock conflict resolution · 5c230b21

sjaakola authored Oct 21, 2021

Mutex order violation when wsrep bf thread kills a conflicting trx,
the stack is

          wsrep_thd_LOCK()
          wsrep_kill_victim()
          lock_rec_other_has_conflicting()
          lock_clust_rec_read_check_and_lock()
          row_search_mvcc()
          ha_innobase::index_read()
          ha_innobase::rnd_pos()
          handler::ha_rnd_pos()
          handler::rnd_pos_by_record()
          handler::ha_rnd_pos_by_record()
          Rows_log_event::find_row()
          Update_rows_log_event::do_exec_row()
          Rows_log_event::do_apply_event()
          Log_event::apply_event()
          wsrep_apply_events()

and mutexes are taken in the order

          lock_sys->mutex -> victim_trx->mutex -> victim_thread->LOCK_thd_data

When a normal KILL statement is executed, the stack is

          innobase_kill_query()
          kill_handlerton()
          plugin_foreach_with_mask()
          ha_kill_query()
          THD::awake()
          kill_one_thread()

and mutexes are

          victim_thread->LOCK_thd_data -> lock_sys->mutex -> victim_trx->mutex

This patch is the plan D variant for fixing the potential mutex locking
order violation exercised by BF aborting and KILL command execution.

In this approach, KILL command is replicated as TOI operation.
This guarantees total isolation for the KILL command execution
in the first node: there is no concurrent replication applying
and no concurrent DDL executing. Therefore there is no risk of
BF aborting to happen in parallel with KILL command execution
either. Potential mutex deadlocks between the different mutex
access paths with KILL command execution and BF aborting cannot
therefore happen.

TOI replication is used, in this approach, purely as a means
to provide isolated KILL command execution in the first node.
KILL command should not (and must not) be applied in secondary
nodes. In this patch, we ensure this by skipping KILL
execution in secondary nodes, in the applying phase, where we
bail out if an applier thread is trying to execute a KILL command.
This is effective, but skipping the applying of KILL command
could happen much earlier as well.

This also fixed unprotected calls to wsrep_thd_abort
that will use wsrep_abort_transaction. This is fixed
by holding THD::LOCK_thd_data while we abort transaction.
Reviewed-by: Jan Lindström <jan.lindstrom@mariadb.com>

5c230b21

MDEV-25114: Crash: WSREP: invalid state ROLLED_BACK (FATAL) · aa7ca987
Jan Lindström authored Oct 22, 2021
```
Revert "MDEV-23328 Server hang due to Galera lock conflict resolution"

This reverts commit eac8341d.
```
aa7ca987

MDEV-23328 Server hang due to Galera lock conflict resolution · db50ea3a

sjaakola authored Oct 21, 2021

Mutex order violation when wsrep bf thread kills a conflicting trx,
the stack is

          wsrep_thd_LOCK()
          wsrep_kill_victim()
          lock_rec_other_has_conflicting()
          lock_clust_rec_read_check_and_lock()
          row_search_mvcc()
          ha_innobase::index_read()
          ha_innobase::rnd_pos()
          handler::ha_rnd_pos()
          handler::rnd_pos_by_record()
          handler::ha_rnd_pos_by_record()
          Rows_log_event::find_row()
          Update_rows_log_event::do_exec_row()
          Rows_log_event::do_apply_event()
          Log_event::apply_event()
          wsrep_apply_events()

and mutexes are taken in the order

          lock_sys->mutex -> victim_trx->mutex -> victim_thread->LOCK_thd_data

When a normal KILL statement is executed, the stack is

          innobase_kill_query()
          kill_handlerton()
          plugin_foreach_with_mask()
          ha_kill_query()
          THD::awake()
          kill_one_thread()

and mutexes are

          victim_thread->LOCK_thd_data -> lock_sys->mutex -> victim_trx->mutex

This patch is the plan D variant for fixing the potential mutex locking
order violation exercised by BF aborting and KILL command execution.

In this approach, KILL command is replicated as TOI operation.
This guarantees total isolation for the KILL command execution
in the first node: there is no concurrent replication applying
and no concurrent DDL executing. Therefore there is no risk of
BF aborting to happen in parallel with KILL command execution
either. Potential mutex deadlocks between the different mutex
access paths with KILL command execution and BF aborting cannot
therefore happen.

TOI replication is used, in this approach, purely as a means
to provide isolated KILL command execution in the first node.
KILL command should not (and must not) be applied in secondary
nodes. In this patch, we ensure this by skipping KILL
execution in secondary nodes, in the applying phase, where we
bail out if an applier thread is trying to execute a KILL command.
This is effective, but skipping the applying of KILL command
could happen much earlier as well.

This also fixed unprotected calls to wsrep_thd_abort
that will use wsrep_abort_transaction. This is fixed
by holding THD::LOCK_thd_data while we abort transaction.
Reviewed-by: Jan Lindström <jan.lindstrom@mariadb.com>

db50ea3a

MDEV-25114: Crash: WSREP: invalid state ROLLED_BACK (FATAL) · c8b39f7e
Jan Lindström authored Oct 21, 2021
```
Revert "MDEV-23328 Server hang due to Galera lock conflict resolution"

This reverts commit 29bbcac0.
```
c8b39f7e

28 Oct, 2021 12 commits
- wolfssl v4.8.1-stable · e1083826
  Oleksandr Byelkin authored Oct 28, 2021
  
  e1083826
- Merge branch '10.3' into 10.4 · 89f69c62
  Oleksandr Byelkin authored Oct 28, 2021
  
  89f69c62
- Merge branch '10.2' into 10.3 · 2ddea602
  Oleksandr Byelkin authored Oct 28, 2021
  
  2ddea602
- fix deprecated pthread_yield() for tokudb · b3cdf416
  Oleksandr Byelkin authored Oct 28, 2021
  
  b3cdf416
- compilation fixes for sys-devel/gcc-11.2.0:11 · 1203b658
  Sergei Golubchik authored Oct 28, 2021
```
for example:

sql/sql_prepare.cc:5714:63: error: 'static void Ed_result_set::operator delete(void*, MEM_ROOT*)' called on pointer returned from a mismatched allocation function [-Werror=mismatched-new-delete]
```
  1203b658
- Merge remote-tracking branch 'connect/10.2' into 10.2 · 99c89358
  Oleksandr Byelkin authored Oct 28, 2021
  
  99c89358
- Fix message severity for "thread pool blocked" messages. · ff3274dd
  Vladislav Vaintroub authored Oct 27, 2021
```
Those messages don't indicate errors; they should be normal warnings.
```
  ff3274dd
- Merge 10.4 into 10.5 · a8ded395
  Marko Mäkelä authored Oct 28, 2021
  
  a8ded395
- Merge 10.3 into 10.4 · 3a79e5fd
  Marko Mäkelä authored Oct 28, 2021
  
  3a79e5fd
- Merge 10.2 into 10.3 · 657bcf92
  Marko Mäkelä authored Oct 28, 2021
  
  657bcf92
- MDEV-26867: Update the InnoDB version number to 5.7.36 · 563daec1
  Marko Mäkelä authored Oct 28, 2021
```
The InnoDB changes in MySQL 5.7.36 that were applicable to MariaDB
were covered by MDEV-26864, MDEV-26865, MDEV-26866.
```
  563daec1
- MDEV-26866 FOREIGN KEY…SET NULL corrupts an index on a virtual column · 1f5ca66e
  Nikita Malyavin authored Oct 27, 2021
```
The initial test case for MySQL Bug #33053297 is based on
mysql/mysql-server@27130e25078864b010d81266f9613d389d4a229b.

innobase_get_field_from_update_vector is not a suitable function to fetch
updated row info, and the parent table's update vector is not always
suitable either. For instance, in the case of DELETE it contains undefined data.

The cascade->update vector seems to be good enough to fetch update data for
all base columns, and it is also faster and less error-prone.
```
  1f5ca66e