Commits · ffc384e044d8e312cad13b67b9ae27878804336c · nexedi / MariaDB

12 Jan, 2021 1 commit

MDEV-23804: Server crashes in st_select_lex::collect_grouping_fields_for_derived · ffc384e0

Varun Gupta authored Jan 12, 2021

The issue here was we were trying to push an extracted condition for a view into the
underlying table value constructor inside the view.
The fix would be to not push conditions into table value constructors.

ffc384e0

11 Jan, 2021 2 commits
- Merge 10.3 into 10.4 · fd5e103a
  Marko Mäkelä authored Jan 11, 2021
  
  fd5e103a
- Merge 10.2 into 10.3 (except MDEV-17556) · 5a1a7141
  Marko Mäkelä authored Jan 11, 2021
```
The fix of MDEV-17556 (commit e25623e7
and commit 61a362c9) has been
omitted due to conflicts and will have to be applied separately later.
```
  5a1a7141
09 Jan, 2021 2 commits

MDEV-24554 Do not use verisign server for authenticode timestamping · 3b548d3b
Vladislav Vaintroub authored Jan 09, 2021

3b548d3b

Fix MTR test galera_as_slave_replay · a131b976

Daniele Sciascia authored Jan 08, 2021

- Galera cluster must be in node 1 and 2, and acts as slave for node 3
- Sync point commit_monitor_enter_sync was renamed to
  commit_monitor_master_enter_sync in 4.x
Reviewed-by: Jan Lindström <jan.lindstrom@mariadb.com>

a131b976

08 Jan, 2021 6 commits

MDEV-23536 : Race condition between KILL and transaction commit · 775fccea

Jan Lindström authored Dec 17, 2020

A race condition may occur between the execution of transaction commit,
and an execution of a KILL statement that would attempt to abort that
transaction.

MDEV-17092 worked around this race condition by modifying InnoDB code.
After that issue was closed, Sergey Vojtovich pointed out that this
race condition would better be fixed above the storage engine layer:

If you look carefully into the above, you can conclude that
thd->free_connection() can be called concurrently with
KILL/thd->awake(). Which is the bug. And it is partially fixed in
THD::~THD(), that is destructor waits for KILL completion:

Fix: Add necessary mutex operations to THD::free_connection()
and move WSREP specific code also there. This ensures that no
one is using THD while we do free_connection(). These mutexes
will also ensures that there can't be concurrent KILL/THD::awake().

innobase_kill_query
  We can now remove usage of trx_sys_mutex introduced on MDEV-17092.

trx_t::free()
  Poison trx->state and trx->mysql_thd

This patch is validated with an RQG run similar to the one that
reproduced MDEV-17092.

775fccea

Cleanup: Remove unused symbol QUE_THR_PROCEDURE_WAIT · 18254c18
Marko Mäkelä authored Jan 08, 2021

18254c18
fixup MDEV-17556: fix mroonga · 61a362c9
Nikita Malyavin authored Jan 08, 2021

61a362c9
Update wsrep-lib · 26d913a7
Jan Lindström authored Jan 08, 2021

26d913a7
MDEV-19838 fixup: clang -Wunused-const-variable · cd1e5d65
Marko Mäkelä authored Jan 08, 2021

cd1e5d65

MDEV-17556 Assertion `bitmap_is_set_all(&table->s->all_set)' failed · e25623e7

Nikita Malyavin authored Dec 29, 2020

The assertion failed in handler::ha_reset upon SELECT under
READ UNCOMMITTED from table with index on virtual column.

This was the debug-only failure, though the problem is mush wider:
* MY_BITMAP is a structure containing my_bitmap_map, the latter is a raw
 bitmap.
* read_set, write_set and vcol_set of TABLE are the pointers to MY_BITMAP
* The rest of MY_BITMAPs are stored in TABLE and TABLE_SHARE
* The pointers to the stored MY_BITMAPs, like orig_read_set etc, and
 sometimes all_set and tmp_set, are assigned to the pointers.
* Sometimes tmp_use_all_columns is used to substitute the raw bitmap
 directly with all_set.bitmap
* Sometimes even bitmaps are directly modified, like in
TABLE::update_virtual_field(): bitmap_clear_all(&tmp_set) is called.

The last three bullets in the list, when used together (which is mostly
always) make the program flow cumbersome and impossible to follow,
notwithstanding the errors they cause, like this MDEV-17556, where tmp_set
pointer was assigned to read_set, write_set and vcol_set, then its bitmap
was substituted with all_set.bitmap by dbug_tmp_use_all_columns() call,
and then bitmap_clear_all(&tmp_set) was applied to all this.

To untangle this knot, the rule should be applied:
* Never substitute bitmaps! This patch is about this.
 orig_*, all_set bitmaps are never substituted already.

This patch changes the following function prototypes:
* tmp_use_all_columns, dbug_tmp_use_all_columns
 to accept MY_BITMAP** and to return MY_BITMAP * instead of my_bitmap_map*
* tmp_restore_column_map, dbug_tmp_restore_column_maps to accept
 MY_BITMAP* instead of my_bitmap_map*

These functions now will substitute read_set/write_set/vcol_set directly,
and won't touch underlying bitmaps.

e25623e7

07 Jan, 2021 4 commits

MDEV-16272 rpl.rpl_semisync_ali_issues failed in buildbot, SHOW variable was... · df1eefb2

Alice Sherepa authored Jan 07, 2021

MDEV-16272 rpl.rpl_semisync_ali_issues failed in buildbot, SHOW variable was done instead of waiting for the value of that variable

df1eefb2

Update wsrep-lib (new logger interface) · 033f8d13

Alexey Yurchenko authored Dec 30, 2020

Ensure consistent use of logging macros in wsrep-related code
Reviewed-by: Jan Lindström <jan.lindstrom@mariadb.com>

033f8d13

Urgent fix of MDEV-23446 fix: · 188b3283

Oleksandr Byelkin authored Jan 06, 2021

Use the same variable in both scopes (from where we have "goto error" and target of the goto)

188b3283

MDEV-17891 Assertion failure upon attempt to replace into a full table · d846b55d

Nikita Malyavin authored Mar 05, 2019

Problem: Assertion `transactional_table || !changed ||
thd->transaction.stmt.modified_non_trans_table' failed due REPLACE into a
versioned table.

It is not specific to system versioning/pertitioning/heap, but this
combination makes it much easier to reproduce.

The thing is to make first ha_update_row call succeed to make
info->deleted != 0. And then make REPLACE fail by any reason.

In this scenario we overflow versioned partition, so next ha_update_row
succeeds, but corresponding ha_write_row fails to insert history record.

Fix: modified_non_trans_table is set in one missed place

d846b55d

06 Jan, 2021 1 commit
- MDEV-19442 add-on · f319c426
  Andrei Elkin authored Jan 06, 2021
```
fixing windows build.
```
  f319c426
05 Jan, 2021 3 commits

MDEV-23632 ALTER TABLE...ADD KEY creates corrupted index on virtual column · 9a645dae

Nikita Malyavin authored Dec 21, 2020

mysql_col_offset was not updated after the new column has been added by an
INSTANT ALTER TABLE -- table data dictionary had been remaining the same.

When the virtual column is added or removed, table was usually evicted and
then reopened, which triggered vcol info rebuild on the next open.

However this also should be done when the usual column is added or removed:
mariadb always stores virtual field at the end of maria record,
so the shift should always happen.

Fix:
expand the eviction condition to the case when usual fields are
added/removed

Note:
this should happen only in the case of !new_clustered:
* When new_clustered is true, a new data dictionary is created, and vcol
metadata is rebuilt in `alter_rebuild_apply_log()`
* We can't do it in `new_clustered` case, because the old table is not yet
subctituted correctly

9a645dae

ut_ad(err != DB_DUPLICATE_KEY) in row_rename_table_for_mysql · f0baa864
Nikita Malyavin authored Dec 23, 2020

f0baa864
handler0alter.cc: extract cache eviction and stats drop to functions · a81fbbc6
Nikita Malyavin authored Dec 23, 2020

a81fbbc6

04 Jan, 2021 8 commits

MDEV-24500: Added wait condition to make sure table t1 is replicated to node_2. · 7edbd272
Stepan Patryshev authored Jan 04, 2021

7edbd272
MDEV-24482: Added wait condition to make sure table t1 is replicated to node_2. · 51b7438d
Stepan Patryshev authored Dec 30, 2020

51b7438d
MDEV-24465: Added wait condition to make sure table t1 is replicated to node_2. · 06644f70
Stepan Patryshev authored Dec 30, 2020

06644f70
MDEV-24464: Added wait condition to make sure table t1 is replicated to node_2. · 1284e6c3
Stepan Patryshev authored Dec 30, 2020

1284e6c3
MDEV-24447: Added wait condition to make sure table t1 is replicated to node_2. · 9de9e0c7
Stepan Patryshev authored Dec 30, 2020

9de9e0c7
MDEV-24462: Added wait condition to make sure table t1 is replicated to node_2. · cd529ae8
Stepan Patryshev authored Dec 30, 2020

cd529ae8

MDEV-23033: All slaves crash once in ~24 hours and loop restart with signal 11 · 608b0ee5

Sujatha authored Dec 31, 2020

Problem:
=======
Upon deleting or updating a row in a parent table (with primary key), if
the child table has virtual column and an associated key with ON UPDATE
CASCADE/ON DELETE CASCADE, it will result in slave crash.

Analysis:
========
Tables which are related through foreign key require prelocking similar to
triggers. i.e If a table has triggers/foreign keys we should add all tables
and routines used by them to the prelocking set.  This prelocking happens
during 'open_and_lock_tables' call.  Each table being opened is checked for
foreign key references. If foreign key reference exists then the child
table is opened and it is linked to the table_list. Upon any modification
to  parent table its corresponding child tables are retried from table_list
and they are updated accordingly. This prelocking work fine on master.

On slave  prelocking works for following cases.
 - Statement/mixed based replication
 - In row based replication when trigger execution is enabled through
   'slave_run_triggers_for_rbr=YES/LOGGING/ENFORCE'

Otherwise it results in an assert/crash, as the parent table will not find
the corresponding child table and it will be NULL. Dereferencing NULL
pointer leads to slave server exit.

Fix:
===
Introduce a new 'slave_fk_event_map' flag similar to 'trg_event_map'. This
flag will ensure that when foreign key is enabled in row based replication
all the parent and child tables are prelocked, so that parent is able to
locate the child table.

Note: This issue is specific to slave, hence only slave needs to be
      upgraded.

608b0ee5

MDEV-23875 is failing to build on windows. · 25db9ffa
Rucha Deodhar authored Jan 04, 2021

25db9ffa

02 Jan, 2021 3 commits

MDEV-24512 Assertion failed in rec_is_metadata() in btr_discard_only_page_on_level() · d67e17bb

Marko Mäkelä authored Jan 02, 2021

btr_discard_only_page_on_level(): Attempt to read the MDEV-15562 metadata
record from the leaf page, not the root page. In the root, the leftmost
(in this case, the only) node pointer would look like a metadata record.

This corruption bug was introduced in
commit 0e5a4ac2 (MDEV-15562).
The scenario is rare: a column was dropped instantly or the order of
columns was changed instantly, and then the table became empty in such
a way that in the last step, the root page had one child page.
Normally, a non-leaf B-tree page would always contain at least 2 children.

d67e17bb

MDEV-20386: Allow RDRAND, RDSEED WITH_MSAN · 734c587f

Marko Mäkelä authored Jan 01, 2021

Let us use Intel intrinsic functions in WolfSSL whenever possible.
This allows such code to be compiled WITH_MSAN.

734c587f

WolfSSL v4.6.0-stable · c1a7a82b
Marko Mäkelä authored Jan 01, 2021

c1a7a82b

31 Dec, 2020 1 commit

MDEV-23875: select into outfile not respect UMASK and UMASK_DIR · 4f5d5a78

Rucha Deodhar authored Dec 28, 2020

Analysis: select into outfile creates files everytime with 666 permission,
regardsless if umask environment variables and umask settings on OS level.
It seems hardcoded.
Fix: change 0666 to 0644 which will let anybody consume the file but not
change it.

4f5d5a78

29 Dec, 2020 2 commits
- Merge branch '10.3' into 10.4 · 3454b5cf
  Oleksandr Byelkin authored Dec 29, 2020
  
  3454b5cf
- Correction of the merge 10.2 into 10.3 for MDEV-23619 · e59c1cef
  Igor Babaev authored Dec 28, 2020
```
(correction for commit 6fed6de9)
```
  e59c1cef
28 Dec, 2020 6 commits

MDEV-24255 MTR test galera_bf_abort fails with --ps-protocol · 4601e6e5

Teemu Ollakka authored Oct 29, 2020

Under ps-protocol, commandsl like COM_STMT_FETCH, COM_STMT_CLOSE and
COM_STMT_SEND_LONG_DATA are not supposed to return errors. Therefore,
if a transaction is BF aborted and the client is processing one of
those commands, then we should not return a deadlock error
immediately. Instead wait for the a subsequent client interaction
which permits errors to be returned. To handle this,
wsrep_before_command() now accepts parameter keep_command_error. If
set true, keep_command_error will cause wsrep-lib side to skip result
handling, and to keep the current error for the next interaction with
the client.
Reviewed-by: Jan Lindström <jan.lindstrom@mariadb.com>

4601e6e5

Merge 10.3 into 10.4 · a64cb6d2
Marko Mäkelä authored Dec 28, 2020

a64cb6d2
Merge 10.2 into 10.3 · 7f037b8c
Marko Mäkelä authored Dec 28, 2020

7f037b8c

MDEV-19442 server_audit plugin doesn't consider proxy users in... · 78292047

Alexey Botchkov authored Dec 28, 2020

MDEV-19442 server_audit plugin doesn't consider proxy users in server_audit_excl_users/server_audit_incl_users.

Check the proxy user just as the connection user against the
incl_users_list and excl_users_list.

78292047

MDEV-24449 Corruption of system tablespace or last recovered page · 5b9ee8d8

Marko Mäkelä authored Dec 28, 2020

This corresponds to 10.5 commit 39378e13.

With a patched version of the test innodb.ibuf_not_empty (so that
it would trigger crash recovery after using the change buffer),
and patched code that would modify the os_thread_sleep() in
recv_apply_hashed_log_recs() to be 1ms as well as add a sleep of
the same duration to the end of recv_recover_page() when
recv_sys->n_addrs=0, we can demonstrate a race condition.

After disabling some debug checks in buf_all_freed_instance(),
buf_pool_invalidate_instance() and buf_validate(), we managed to
trigger an assertion failure in fseg_free_step(), on the XDES_FREE_BIT.
In other words, an trx_undo_seg_free() call during
trx_rollback_resurrected() was attempting a double-free of a page.
This was repeated about once in 400 to 500 test runs. With the fix
applied, the test passed 2,000 runs.

recv_apply_hashed_log_recs(): Do not only wait for recv_sys->n_addrs
to reach 0, but also wait for buf_get_n_pending_read_ios() to reach 0,
to guarantee that buf_page_io_complete() will not be executing
ibuf_merge_or_delete_for_page().

5b9ee8d8

MDEV-23851 MDEV-24229 BF-BF conflict issues · 8e3e87d2

sjaakola authored Dec 08, 2020

Issues MDEV-23851 and MDEV-24229 are probably duplicates and are caused by the new self-asserting function lock0lock.cc:wsrep_assert_no_bf_bf_wait().
The criteria for asserting is too strict and does not take in consideration scenarios of "false positive" lock conflicts, which are resolved by replaying the local transaction.
As a fix, this PR is relaxing the assert criteria by two conditions, which skip assert if high priority transactions are locking in correct order or if conflicting high priority lock holder is aborting and has just not yet released the lock.

Alternative fix would be to remove wsrep_assert_no_bf_bf_wait() altogether, or remove the assert in this function and let it only print warnings in error log.
But in my high conflict rate multi-master test scenario, this relaxed asserting appears to be safe.

This PR also removes two wsrep_report_bf_lock_wait() calls in innodb lock manager, which cause mutex access assert in debug builds.

Foreign key appending missed handling of data types of float and double in INSERT execution. This is not directly related to the actual issue here but is fixed in this PR nevertheless. Missing these foreign keys values in certification could cause problems in some multi-master load scenarios.

Finally, some problem reports suggest that some of the issues reported in MDEV-23851 might relate to false positive lock conflicts over unique secondary index gaps. There is separate work for relaxing UK index gap locking of replication appliers, and separate PR will be submitted for it, with a related mtr test as well.

8e3e87d2

25 Dec, 2020 1 commit
- Merge branch '10.3' into 10.4 · 478b8303
  Oleksandr Byelkin authored Dec 25, 2020
  
  478b8303