Commits · 20e9e804c131c6522bc7c469e4863e8d1eaa3ee0 · nexedi / MariaDB

14 Feb, 2022 7 commits

MDEV-20605 Awaken transaction can miss inserted by other transaction records... · 20e9e804

Vlad Lesin authored Nov 30, 2021

MDEV-20605 Awaken transaction can miss inserted by other transaction records due to wrong persistent cursor restoration

sel_restore_position_for_mysql() moves forward persistent cursor
position after btr_pcur_restore_position() call if cursor relative position
is BTR_PCUR_ON and the cursor points to the record with NOT the same field
values as in a stored record(and some other not important for this case
conditions).

It was done because btr_pcur_restore_position() sets
page_cur_mode_t mode to PAGE_CUR_LE for cursor->rel_pos == BTR_PCUR_ON
before opening cursor. So we are searching for the record less or equal
to stored one. And if the found record is not equal to stored one, then
it is less and we need to move cursor forward.

But there can be a situation when the stored record was purged, but the
new one with the same key but different value was inserted while
row_search_mvcc() was suspended. In this case, when the thread is
awaken, it will invoke sel_restore_position_for_mysql(), which, in turns,
invoke btr_pcur_restore_position(), which will return false because found
record don't match stored record, and
sel_restore_position_for_mysql() will move forward cursor position.

The above can lead to the case when awaken row_search_mvcc() do not see
records inserted by other transactions while it slept. The mtr test case
shows the example how it can be.

The fix is to return special value from persistent cursor restoring
function which would notify its caller that uniq fields of restored
record and stored record are the same, and in this case
sel_restore_position_for_mysql() don't move cursor forward.

Delete-marked records are correctly processed in row_search_mvcc().
Non-unique secondary indexes are "uniquified" by adding the PK, the
index->n_uniq should then be index->n_fields. So there is no need in
additional checks in the fix.

If transaction's readview can't see the changes made in secondary index
record, it requests clustered index record in row_search_mvcc() to check
its transaction id and get the correspondent record version. After this
row_search_mvcc() commits mtr to preserve clustered index latching
order, and starts mtr. Between those mtr commit and start secondary
index pages are unlatched, and purge has the ability to remove stored in
the cursor record, what causes rows duplication in result set for
non-locking reads, as cursor position is restored to the previously
visited record.

To solve this the changes are just switched off for non-locking reads,
it's quite simple solution, besides the changes don't make sense for
non-locking reads.

The more complex and effective from performance perspective solution is
to create mtr savepoint before clustered record requesting and rolling
back to that savepoint after that. See MDEV-27557.

One more solution is to have per-record transaction id for secondary
indexes. See MDEV-17598.

If any of those is implemented, just remove select_lock_type argument in
sel_restore_position_for_mysql().

20e9e804

Merge 10.4 into 10.5 · 52b32c60
Marko Mäkelä authored Feb 14, 2022

52b32c60
Merge mariadb-10.5.15 into 10.5 · 6405ed63
Marko Mäkelä authored Feb 14, 2022

6405ed63
Merge 10.3 into 10.4 · c9bc10e6
Marko Mäkelä authored Feb 14, 2022

c9bc10e6
Merge mariadb-10.4.24 into 10.4 · 4964f181
Marko Mäkelä authored Feb 14, 2022

4964f181
Merge 10.2 into 10.3 · e928fdbf
Marko Mäkelä authored Feb 14, 2022

e928fdbf
Merge mariadb-10.3.34 into 10.3 · a6ef239b
Marko Mäkelä authored Feb 14, 2022

a6ef239b

12 Feb, 2022 4 commits
- bump the VERSION · e777645d
  Daniel Bartholomew authored Feb 12, 2022
  
  e777645d
- bump the VERSION · e50421be
  Daniel Bartholomew authored Feb 12, 2022
  
  e50421be
- bump the VERSION · b55b808b
  Daniel Bartholomew authored Feb 12, 2022
  
  b55b808b
- bump the VERSION · 1557204b
  Daniel Bartholomew authored Feb 12, 2022
  
  1557204b
11 Feb, 2022 4 commits

MDEV-27813 Windows, compiling : RelWithDebInfo should use /Ob2 · 91d9e9bd

Vladislav Vaintroub authored Feb 11, 2022

Fixed inlining flags. Remove /Ob1 added by CMake for RelWithDebInfo.
(the actual compiler default is /Ob2 if optimizations are enabled)

Allow to define custom /Ob flag with new variable MSVC_INLINE, if desired

91d9e9bd

Disable innodb_gis.rtree_compress2 · 1a7573d5
Marko Mäkelä authored Feb 11, 2022

1a7573d5

MDEV-27746 Wrong comparision of BLOB's empty preffix with non-preffixed BLOB... · 3b10e8f8

Vlad Lesin authored Feb 09, 2022

MDEV-27746 Wrong comparision of BLOB's empty preffix with non-preffixed BLOB causes rows count mismatch for clustered and secondary indexes during non-locking read

row_sel_sec_rec_is_for_clust_rec() treats empty BLOB prefix field in
secondary index as a field equal to any external BLOB field in clustered
index. Row_sel_get_clust_rec_for_mysql::operator() doesn't zerro out
clustered record pointer in row_search_mvcc(), and row_search_mvcc()
thinks that delete-marked secondary index record has visible for
"CHECK TABLE"'s read view old-versioned clustered index record, and
row_scan_index_for_mysql() counts it as a row.

The fix is to execute row_sel_sec_rec_is_for_blob() in
row_sel_sec_rec_is_for_clust_rec() if clustered field contains BLOB's
reference.

3b10e8f8

MDEV-27804 Fails to build - perf schema - thread id of type uintptr_t requires header · 7c6ec0a5

Samuel Thibault authored Feb 08, 2022

While building on GNU/Hurd and kfreebsd.

On the C++ standard uintptr_t can be defined in <cstdint>
ref: https://www.cplusplus.com/reference/cstdint/

Fixes: 0d44792a

7c6ec0a5

10 Feb, 2022 10 commits

Merge branch '10.4' into 10.5 · 9aa3564e
Sergei Golubchik authored Feb 10, 2022

9aa3564e
Merge branch '10.3' into 10.4 · b4477ae7
Sergei Golubchik authored Feb 10, 2022

b4477ae7
Merge branch '10.2' into 10.3 · a36fc80a
Sergei Golubchik authored Feb 10, 2022

a36fc80a

MDEV-25636: Bug report: abortion in sql/sql_parse.cc:6294 · 3a525694

Sergei Petrunia authored Feb 10, 2022

The asserion failure was caused by this query

  select /*id=1*/ from t1
  where
   col= ( select /*id=2*/ from ... where corr_cond1
          union
          select /*id=4*/ from ... where corr_cond2)

Here,
- select with id=2 was correlated due to corr_cond1.
- select with id=4 was initially correlated due to corr_cond2, but then
  the optimizer optimized away the correlation, making the select with id=4
  uncorrelated.

However, since select with id=2 remained correlated, the execution had to
re-compute the whole UNION. When it tried to execute select with id=4, it
hit an assertion  (join buffer already free'd).

This is because select with id=4 has freed its execution structures after
it has been executed once. The select is uncorrelated, so it did not expect
it would need to be executed for the second time.

Fixed this by adding this logic in
st_select_lex::optimize_unflattened_subqueries():

  If a member of a UNION is correlated, mark all its members as
  correlated, so that they are prepared to be executed multiple times.

3a525694

MDEV-27796 Windows - starting server with huge innodb-log-buffer-size may fail · 012e724d

Vladislav Vaintroub authored Feb 10, 2022

Fixed tpool::pread() and tpool::pwrite() to return SSIZE_T on Windows,
so that huge numbers are not converted to negatives.

Also, make sure to never attempt reading/writing more bytes than
DWORD can accomodate (4G)

012e724d

MDEV-26351 segfault - (MARIA_HA *) 0x0 in ha_maria::extra · 9e2c26b0
Sergei Golubchik authored Feb 10, 2022
```
don't let Aria create a table that it cannot open
```
9e2c26b0

MDEV-26351 segfault - (MARIA_HA *) 0x0 in ha_maria::extra · 1b8bb441

Sergei Golubchik authored Feb 10, 2022

use the correct check. before invoking handler methods we
need to know that the table was opened, not only created.

1b8bb441

MDEV-25766 Unused CTE lead to a crash in find_field_in_tables/find_order_in_list · 0168d1ed
Oleksandr Byelkin authored Sep 15, 2021
```
Do not assume that subquery Item always present.
```
0168d1ed
MDEV-25787 Bug report: crash on SELECT DISTINCT thousands_blob_fields · 9e39d0ae
Sergei Golubchik authored Feb 10, 2022
```
fix a debug assert to account for not opened temp tables
```
9e39d0ae

MDEV-27789 mysql_upgrade / mariadb-upgrade in 10.6.6 is putting password in host argument · ad1fb069

Monty authored Feb 10, 2022

Removed all dependencies of command line arguments based on positions in
an array (this kind of code should never have been written).
Instead use option names, which are stable.

Reviewer: Sergei Golubchik

ad1fb069

09 Feb, 2022 6 commits

MDEV-27716 mtr_t::commit() acquires log_sys.mutex when writing no log · fd101daa

Marko Mäkelä authored Feb 09, 2022

mtr_t::is_block_dirtied(), mtr_t::memo_push(): Never set m_made_dirty
for pages of the temporary tablespace. Ever since
commit 5eb53955
we never add those pages to buf_pool.flush_list.

mtr_t::commit(): Implement part of mtr_t::prepare_write() here,
and avoid acquiring log_sys.mutex if no log is written.
During IMPORT TABLESPACE fixup, we do not write log, but we must
add pages to buf_pool.flush_list and for that, be prepared
to acquire log_sys.flush_order_mutex.

mtr_t::do_write(): Replaces mtr_t::prepare_write().

fd101daa

Merge branch '10.5' into bb-10.5-release · 34c50196
Oleksandr Byelkin authored Feb 09, 2022

34c50196
Merge branch '10.4' into bb-10.4-release · 8a7776a8
Oleksandr Byelkin authored Feb 09, 2022

8a7776a8
Merge branch '10.3' into bb-10.3-release · e3524445
Oleksandr Byelkin authored Feb 09, 2022

e3524445
Merge branch '10.2' into bb-10.2-release · 941bc705
Oleksandr Byelkin authored Feb 09, 2022

941bc705

MDEV-27734 Set innodb_change_buffering=none by default · 5c46751f

Marko Mäkelä authored Feb 09, 2022

The aim of the InnoDB change buffer is to avoid delays when a leaf page
of a secondary index is not present in the buffer pool, and a record needs
to be inserted, delete-marked, or purged. Instead of reading the page into
the buffer pool for making such a modification, we may insert a record to
the change buffer (a special index tree in the InnoDB system tablespace).
The buffered changes are guaranteed to be merged if the index page
actually needs to be read later.

The change buffer could be useful when the database is stored on a
rotational medium (hard disk) where random seeks are slower than
sequential reads or writes.

Obviously, the change buffer will cause write amplification, due to
potentially large amount of metadata that is being written to the
change buffer. We will have to write redo log records for modifying
the change buffer tree as well as the user tablespace. Furthermore,
in the user tablespace, we must maintain a change buffer bitmap page
that uses 2 bits for estimating the amount of free space in pages,
and 1 bit to specify whether buffered changes exist. This bitmap needs
to be updated on every operation, which could reduce performance.

Even if the change buffer were free of bugs such as MDEV-24449
(potentially causing the corruption of any page in the system tablespace)
or MDEV-26977 (corruption of secondary indexes due to a currently
unknown reason), it will make diagnosis of other data corruption harder.

Because of all this, it is best to disable the change buffer by default.

5c46751f

08 Feb, 2022 9 commits

bump the VERSION · f7704d74
Daniel Bartholomew authored Feb 08, 2022

f7704d74
bump the VERSION · 2f07b21c
Daniel Bartholomew authored Feb 08, 2022

2f07b21c
bump the VERSION · 30cc63fa
Daniel Bartholomew authored Feb 08, 2022

30cc63fa
bump the VERSION · c0a44ff7
Daniel Bartholomew authored Feb 08, 2022

c0a44ff7

MDEV-26585 Wrong query results when `using index for group-by` · 38058c04

Monty authored Feb 02, 2022

The problem was that "group_min_max optimization" does not work if
some aggregate functions, like COUNT(*), is used.
The function get_best_group_min_max() is using the join->sum_funcs
array to check which aggregate functions are used.
The bug was that aggregates in HAVING where not yet added to
join->sum_funcs at the time get_best_group_min_max() was called.

Fixed by populate join->sum_funcs already in prepare, which means that
all sum functions will be in join->sum_funcs in get_best_group_min_max().
A benefit of this approach is that we can remove several calls to
make_sum_func_list() from the code and simplify the function.

I removed some wrong setting of 'sort_and_group'.
This variable is set when alloc_group_fields() is called, as part
of allocating the cache needed by end_send_group() and does not need
to be set by other functions.

One problematic thing was that Spider is using *join->sum_funcs to detect
at which stage the optimizer is and do internal calculations of aggregate
functions. Updating join->sum_funcs early caused Spider to fail when trying
to find min/max values in opt_sum_query().
Fixed by temporarily resetting sum_funcs during opt_sum_query().

Reviewer: Sergei Petrunia

38058c04

MDEV-27442 Wrong result upon query with DISTINCT and EXISTS subquery · d314bd26

Monty authored Feb 02, 2022

The problem was that get_best_group_min_max() did not check if fields used
by the "group_min_max optimization" where used in sub queries.
Because of this, it did not detect that a key (b,a) was used in the WHERE
clause for the statement:
SELECT DISTINCT b FROM t1 WHERE EXISTS ( SELECT 1 FROM DUAL WHERE a > 1 ).

Fixed by also traversing the sub queries when checking if a field is used.
This disables group_min_max_optimization for the above query.

Reviewer: Sergei Petrunia

d314bd26

MENT-328 Retry BACKUP STAGE BLOCK DDL in case of deadlocks · a1c23807

Monty authored Feb 06, 2022

MENT-328 wrongly assumed that the backup failed because of warnings from
mariabackup about not found files. This is normal (and the error message
should be deleted).

randgen failed because mariabackup didn't retry BACKUP STAGE BLOCK DDL
if it failed with a deadlock.

To simplify things, I implemented the retry loop in the server as
this particular deadlock should be quickly resolved.

a1c23807

Don't run innodb_defgragment under valgrind (too slow) · 0ec27d7b
Monty authored Feb 02, 2022

0ec27d7b
Fixes some compiler issues on AIX ( · 88fb89ac
Monty authored Feb 02, 2022

88fb89ac