Commits · 1fde785315ec6d575d0cd5c3e33d53a5d83e3e00 · nexedi / MariaDB

28 Aug, 2023 1 commit

MDEV-31890: Compilation failing on MacOS (unknown warning option -Wno-unused-but-set-variable) · 1fde7853

Dmitry Shulga authored Aug 11, 2023

For clang compiler the compiler's flag -Wno-unused-but-set-variable
was set based on compiler version. This approach could result in
false positive detection for presence of compiler option since
only first three groups of digits in compiler version taken into account
and it could lead to inaccuracy in determining of supported compiler's
features.

Correct way to detect options supported by a compiler is to use
the macros  MY_CHECK_CXX_COMPILER_FLAG and to check the result of
variable with prefix have_CXX__
So, to check whether compiler does support the option
 -Wno-unused-but-set-variable
the macros
 MY_CHECK_CXX_COMPILER_FLAG(-Wno-unused-but-set-variable)
should be called and the result variable
 have_CXX__Wno_unused_but_set_variable
be tested for assigned value.

1fde7853

24 Aug, 2023 1 commit

MDEV-31813 SET GLOBAL innodb_max_purge_lag_wait hangs if innodb_read_only · 02878f12

Marko Mäkelä authored Aug 24, 2023

innodb_max_purge_lag_wait_update(): Return immediately if we are
in high_level_read_only mode.

srv_wake_purge_thread_if_not_active(): Relax a debug assertion.
If srv_read_only_mode holds, purge_sys.enabled() will not hold
and this function will do nothing.

trx_t::commit_in_memory(): Remove a redundant condition before
invoking srv_wake_purge_thread_if_not_active().

02878f12

23 Aug, 2023 1 commit

MDEV-31117 Fix spider connection info parsing · e9f3ca61

Yuchen Pei authored Jul 05, 2023

Spider connection string is a comma-separated parameter definitions,
where each definition is of the form "<param_title> <param_value>",
where <param_value> is quote delimited on both ends, with backslashes
acting as an escaping prefix.

Despite the simple syntax, the existing spider connection string
parser was poorly-written, complex, hard to reason and error-prone,
causing issues like the one described in MDEV-31117. For example it
treated param title the same way as param value when assigning, and
have nonsensical fields like delim_title_len and delim_title.

Thus as part of the bugfix, we clean up the spider comment connection
string parsing, including:

- Factoring out some code from the parsing function
- Simplify the struct `st_spider_param_string_parse`
- And any necessary changes caused by the above changes

e9f3ca61

22 Aug, 2023 1 commit

MDEV-20194 test adjustment for s390x · ff682ead

Marko Mäkelä authored Aug 22, 2023

The test innodb.row_size_error_log_warnings_3 that was added in
commit 372b0e63 (MDEV-20194)
failed to take into account the earlier adjustment in
commit cf574cf5 (MDEV-27634)
that is specific to many GNU/Linux distributions for the s390x.

ff682ead

21 Aug, 2023 1 commit
- Make vgdb call more universal. · c062b351
  Oleksandr Byelkin authored Aug 21, 2023
  
  c062b351
17 Aug, 2023 3 commits

MDEV-31928 Assertion xid ... < 128 failed in trx_undo_write_xid() · 5a8a8fc9
Marko Mäkelä authored Aug 17, 2023
```
trx_undo_write_xid(): Correct an off-by-one error in a debug assertion.
```
5a8a8fc9

MDEV-31254 InnoDB: Trying to read doublewrite buffer page · 518fe519

Marko Mäkelä authored Aug 17, 2023

buf_read_page_low(): Remove an error message that could be triggered
by buf_read_ahead_linear() or buf_read_ahead_random().

This is a backport of commit c9eff1a1
from MariaDB Server 10.5.

518fe519

MDEV-31875 ROW_FORMAT=COMPRESSED table: InnoDB: ... Only 0 bytes read · 44df6f35

Marko Mäkelä authored Aug 17, 2023

buf_read_ahead_random(), buf_read_ahead_linear(): Avoid read-ahead
of the last page(s) of ROW_FORMAT=COMPRESSED tablespaces that use
a page size of 1024 or 2048 bytes. We invoke os_file_set_size() on
integer multiples of 4096 bytes in order to be compatible with
the requirements of innodb_flush_method=O_DIRECT regardless of the
physical block size of the underlying storage.

This change must be null-merged to MariaDB Server 10.5 and later.
There, out-of-bounds read-ahead should be handled gracefully
by simply discarding the buffer page that had been allocated.

Tested by: Matthias Leich

44df6f35

16 Aug, 2023 1 commit

MDEV-29974: Missed kill waiting for worker queues to drain · 34e85854

Kristian Nielsen authored Aug 16, 2023

When the SQL driver thread goes to wait for room in the parallel slave
worker queue, there was a race where a kill at the right moment could
be ignored and the wait proceed uninterrupted by the kill.

Fix by moving the THD::check_killed() to occur _after_ doing ENTER_COND().

This bug was seen as sporadic failure of the testcase rpl.rpl_parallel
(rpl.rpl_parallel_gco_wait_kill since 10.5), with "Slave stopped with
wrong error code".
Signed-off-by: Kristian Nielsen <knielsen@knielsen-hq.org>

34e85854

15 Aug, 2023 6 commits

MDEV-31655: Parallel replication deadlock victim preference code errorneously removed · 900c4d69

Kristian Nielsen authored Jul 11, 2023

Restore code to make InnoDB choose the second transaction as a deadlock
victim if two transactions deadlock that need to commit in-order for
parallel replication. This code was erroneously removed when VATS was
implemented in InnoDB.

Also add a test case for InnoDB choosing the right deadlock victim.
Also fixes this bug, with testcase that reliably reproduces:

MDEV-28776: rpl.rpl_mark_optimize_tbl_ddl fails with timeout on sync_with_master

Note: This should be null-merged to 10.6, as a different fix is needed
there due to InnoDB locking code changes.
Signed-off-by: Kristian Nielsen <knielsen@knielsen-hq.org>

900c4d69

MDEV-31482: Lock wait timeout with INSERT-SELECT, autoinc, and statement-based replication · 920789e9

Kristian Nielsen authored Jul 09, 2023

Remove the exception that InnoDB does not report auto-increment locks waits
to the parallel replication.

There was an assumption that these waits could not cause conflicts with
in-order parallel replication and thus need not be reported. However, this
assumption is wrong and it is possible to get conflicts that lead to hangs
for the duration of --innodb-lock-wait-timeout. This can be seen with three
transactions:

1. T1 is waiting for T3 on an autoinc lock
2. T2 is waiting for T1 to commit
3. T3 is waiting on a normal row lock held by T2

Here, T3 needs to be deadlock killed on the wait by T1.

Note: This should be null-merged to 10.6, as a different fix is needed
there due to InnoDB lock code changes.
Signed-off-by: Kristian Nielsen <knielsen@knielsen-hq.org>

920789e9

Remove the often-hanging test innodb.alter_rename_files · b4ace139

Marko Mäkelä authored Aug 15, 2023

The test innodb.alter_rename_files rather frequently hangs in
checkpoint_set_now. The test was removed in MariaDB Server 10.5
commit 37e7bde1 when the code that
it aimed to cover was simplified. Starting with MariaDB Server 10.5
the page flushing and log checkpointing is much simpler, handled
by the single buf_flush_page_cleaner() thread.

Let us remove the test to avoid occasional failures. We are not going
to fix the cause of the failure in MariaDB Server 10.4.

b4ace139

Merge mariadb-10.4.31 into 10.4 · 6fdc6846
Marko Mäkelä authored Aug 15, 2023

6fdc6846
MDEV-24797 Column Compression - ERROR 1265 (01000): Data truncated for column · 9c8ae6dc
Alexander Barkov authored Aug 15, 2023
```
Fix issue was earlier fixed by MDEV-31724. Only adding MTR tests.
```
9c8ae6dc

MDEV-31724 Compressed varchar values lost on joins when sorting on columns from joined table(s) · 1fa7c9a3

Alexander Barkov authored Jul 21, 2023

Field_varstring::get_copy_func() did not take into account
that functions do_varstring1[_mb], do_varstring2[_mb] do not support
compressed data.

Changing the return value of Field_varstring::get_copy_func()
to `do_field_string` if there is a compresion and truncation
at the same time. This fixes the problem, so now it works as follows:
- val_str() uncompresses the data
- The prefix is then calculated on the uncompressed data

Additionally, introducing two new copying functions
- do_varstring1_no_truncation()
- do_varstring2_no_truncation()

Using new copying functions in cases when:
- a Field_varstring with length_bytes==1 is changing to a longer
    Field_varstring with length_bytes==1
- a Field_varstring with length_bytes==2 is changing to a longer
    Field_varstring with length_bytes==2

In these cases we don't care neither of compression nor
of multi-byte prefixes: the entire data gets fully copied
from the source column to the target column as is.

This is a kind of new optimization, but this also was needed
to preserve existing MTR test results.

1fa7c9a3

14 Aug, 2023 1 commit
- bump the VERSION · dd19ba18
  Daniel Bartholomew authored Aug 14, 2023
  
  dd19ba18
11 Aug, 2023 1 commit
- galera: wsrep-lib submodule update · 646eb7be
  Julius Goryavsky authored Aug 11, 2023
  
  646eb7be
10 Aug, 2023 4 commits

MDEV-31893 Valgrind reports issues in main.join_cache_notasan · 2aea9387

Monty authored Aug 10, 2023

This is also related to
MDEV-31348 Assertion `last_key_entry >= end_pos' failed in virtual bool
           JOIN_CACHE_HASHED::put_record()

Valgrind exposed a problem with the join_cache for hash joins:
=25636== Conditional jump or move depends on uninitialised value(s)
==25636== at 0xA8FF4E: JOIN_CACHE_HASHED::init_hash_table()
          (sql_join_cache.cc:2901)

The reason for this was that avg_record_length contained a random value
if one had used SET optimizer_switch='optimize_join_buffer_size=off'.

This causes either 'random size' memory to be allocated (up to
join_buffer_size) which can increase memory usage or, if avg_record_length
is less than the row size, memory overwrites in thd->mem_root, which is
bad.

Fixed by setting avg_record_length in JOIN_CACHE_HASHED::init()
before it's used.

There is no test case for MDEV-31893 as valgrind of join_cache_notasan
checks that.
I added a test case for MDEV-31348.

2aea9387

MDEV-23021: rpl.rpl_parallel_optimistic_until fails in Buildbot · b2e312b0

Kristian Nielsen authored Aug 08, 2023

The test case accessed slave-relay-bin.000003 without waiting for the IO
thread to write it first. If the IO thread was slow, this could fail.
Signed-off-by: Kristian Nielsen <knielsen@knielsen-hq.org>

b2e312b0

MDEV-381: fdatasync() does not correctly flush growing binlog file · 5055490c

Kristian Nielsen authored Aug 03, 2023

Revert the old work-around for buggy fdatasync() on Linux ext3. This bug was
fixed in Linux > 10 years ago back to kernel version at least 3.0.
Reviewed-by: Marko Mäkelä <marko.makela@mariadb.com>
Signed-off-by: Kristian Nielsen <knielsen@knielsen-hq.org>

5055490c

MDEV-31893 Valgrind reports issues in main.join_cache_notasan · e9333ff0

Monty authored Aug 10, 2023

This is also related to
MDEV-31348 Assertion `last_key_entry >= end_pos' failed in virtual bool
           JOIN_CACHE_HASHED::put_record()

Valgrind exposed a problem with the join_cache for hash joins:
=25636== Conditional jump or move depends on uninitialised value(s)
==25636== at 0xA8FF4E: JOIN_CACHE_HASHED::init_hash_table()
          (sql_join_cache.cc:2901)

The reason for this was that avg_record_length contained a random value
if one had used SET optimizer_switch='optimize_join_buffer_size=off'.

This causes either 'random size' memory to be allocated (up to
join_buffer_size) which can increase memory usage or, if avg_record_length
is less than the row size, memory overwrites in thd->mem_root, which is
bad.

Fixed by setting avg_record_length in JOIN_CACHE_HASHED::init()
before it's used.

There is no test case for MDEV-31893 as valgrind of join_cache_notasan
checks that.
I added a test case for MDEV-31348.

e9333ff0

08 Aug, 2023 4 commits

Revert "use environment file in systemd units for _WSREP_START_POSITION" · 161ce045
Andrew Hutchings authored Aug 08, 2023
```
This reverts commit 6c405904.
```
161ce045
Revert "update galera_new_cluster to use environment file" · 48e6918c
Andrew Hutchings authored Aug 08, 2023
```
This reverts commit b54e4bf0.
```
48e6918c

MDEV-31869 Server aborts when table does drop column · 0ede90dd

Thirunarayanan Balathandayuthapani authored Aug 08, 2023

- InnoDB aborts when table is dropping the column. This is
caused by 5f09b53b (MDEV-31086).
While iterating the altered table fields, we fail to consider
the dropped columns.

0ede90dd

MDEV-31413 : Node has been dropped from the cluster on Startup / Shutdown with async replica · 277968aa

Jan Lindström authored Jun 20, 2023

There was two related problems:

(1) Galera node that is defined as a slave to async MariaDB
master at restart might do SST (state stransfer) and
part of that it will copy mysql.gtid_slave_pos table.
Problem is that updates on that table are not replicated
on a cluster. Therefore, table from donor that is not
slave is copied and joiner looses gtid position it was
and start executing events from wrong position of the binlog.
This incorrect position could break replication and
causes node to be dropped and requiring user action.

(2) Slave sql thread might start executing events before
galera is ready (wsrep_ready=ON) and that could also
cause node to be dropped from the cluster.

In this fix we enable replication of mysql.gtid_slave_pos
table on a cluster. In this way all nodes in a cluster
will know gtid slave position and even after SST joiner
knows correct gtid position to start.

Furthermore, we wait galera to be ready before slave
sql thread executes any events to prevent too early
execution.
Signed-off-by: Julius Goryavsky <julius.goryavsky@mariadb.com>

277968aa

02 Aug, 2023 2 commits

update galera_new_cluster to use environment file · b54e4bf0

Christian Hesse authored Jan 29, 2019

Now that the systemd unit files use an environment file to pass
_WSREP_START_POSITION we have to update galera_new_cluster as well.

b54e4bf0

use environment file in systemd units for _WSREP_START_POSITION · 6c405904

Christian Hesse authored Jan 28, 2019

We used to run `systemctl set-environment` to pass
_WSREP_START_POSITION. This is bad because:

* it clutter systemd's environment (yes, pid 1)
* it requires root privileges
* options (like LimitNOFILE=) are not applied

Let's just create an environment file in ExecStartPre=, that is read
before ExecStart= kicks in. We have _WSREP_START_POSITION around for the
main process without any downsides.

6c405904

31 Jul, 2023 5 commits

MDEV-31092 mysqldump --force doesn't ignore error as it should · ab10a675
Sergei Golubchik authored Jul 31, 2023
```
failed SHOW CREATE FUNCTION means we don't dump this function,
but should still try to dump all other functions
```
ab10a675

MDEV-31800 Problem with open ranges on prefix blobs keys · 4dd38f9f

Sergei Golubchik authored Jul 31, 2023

don't construct open ranges from prefix blob keys for < (less than)
just as it's already done for > (greater than)

because prefix KEY_PART doesn't create prefix Field for blobs
(see open_table_from_share() near "Create a new field for the key part"),
so stored_field_cmp_to_item() will compare the original field to the
value not taking the prefix length into account.

4dd38f9f

Fix double definition of CRYPTO_cleanup_all_ex_data · 4da80a41
Sergei Golubchik authored Jul 31, 2023

4da80a41
Revert "MDEV-30528 Assertion in dtype_get_at_most_n_mbchars" · 69b118a3
Aleksey Midenkov authored Jul 31, 2023
```
This reverts commit add0c01b

Duplicates must be avoided in FTS_DOC_ID_INDEX
```
69b118a3
MDEV-30159 fixup: Plug a memory leak in the test · f182de2e
Marko Mäkelä authored Jul 31, 2023

f182de2e

30 Jul, 2023 2 commits

Fix rpl.rpl_rotate_logs to work with --repeat · a4b9e9b9

Kristian Nielsen authored Jul 24, 2023

(It's not using include/rpl_init.inc, so it needs to reset the GTID position
explicitly).
Signed-off-by: Kristian Nielsen <knielsen@knielsen-hq.org>

a4b9e9b9

MDEV-31723: Crash on SET SESSION gtid_seq_no= DEFAULT · d632c85b

Kristian Nielsen authored Jul 17, 2023

A simple "SET SESSION gtid_seq_no= DEFAULT" did not work, it would straight
up crash the server! Also, explicitly setting gtid_seq_no to 0 gave an error
in --gtid-strict-mode=1.

Setting to DEFAULT or 0 should disable any prior setting of
gtid_seq_no, so that the next transaction is allocated the next GTID
in sequence, as normal.
Reviewed-by: Monty <monty@mariadb.org>
Signed-off-by: Kristian Nielsen <knielsen@knielsen-hq.org>

d632c85b

26 Jul, 2023 3 commits

MDEV-31003: Second execution for ps-protocol · 9854fb6f

Lena Startseva authored May 31, 2023

This patch adds for "--ps-protocol" second execution
of queries "SELECT".
Also in this patch it is added ability to disable/enable
(--disable_ps2_protocol/--enable_ps2_protocol) second
execution for "--ps-prototocol" in testcases.

9854fb6f

MDEV-18374: Add SELinux policy to cracklib_password_check packages · 23dae617
Geoff Montee authored Jan 24, 2019

23dae617

MDEV-31407: Add aliases in opt_trace.test for long column name for removing... · 515ba857

Lena Startseva authored Jun 06, 2023

MDEV-31407: Add aliases in opt_trace.test for long column name for removing "--disable-view-protocol"

Change tests:
	opt_trace.test
	opt_trace_index_merge.test
	opt_trace_ucs2.test

515ba857

25 Jul, 2023 3 commits

new WolfSSL v5.6.3-stable · 2a46b358
Oleksandr Byelkin authored Jul 25, 2023

2a46b358

MDEV-30619: Parallel Slave SQL Thread Can Update Seconds_Behind_Master with Active Workers · 063f4ac2

Brandon Nesterenko authored Jun 28, 2023

MDEV-31749 sporadic assert in MDEV-30619 new test

If the workers of a parallel replica are busy (potentially with long
queues), but the SQL thread has no events left to distribute (so it
goes idle), then the next event that comes from the primary will
update mi->last_master_timestamp with its timestamp, even if the
workers have not yet finished.

This patch changes the parallel replica logic which updates
last_master_timestamp after idling from using solely sql_thread_caught_up
(added in MDEV-29639) to using the latter with rli queued/dequeued
event counters.
That is, if  the queued count is equal to the dequeued count, it
means all events have been processed and the replica is considered
idle when the driver thread has also distributed all events.

Low level details of the commit include
- to make a more generalized test for Seconds_Behind_Master on
  the parallel replica, rpl_delayed_parallel_slave_sbm.test
  is renamed to rpl_parallel_sbm.test for this purpose.
- pause_sql_thread_on_next_event usage was removed
  with the MDEV-30619 fixes. Rather than remove it, we adapt it
  to the needs of this test case
- added test case to cover SBM spike of relay log read and LMT
  update that was fixed by MDEV-29639
- rpl_seconds_behind_master_spike.test is made to use
  the negate_clock_diff_with_master debug eval.

Reviewed By:
============
Andrei Elkin <andrei.elkin@mariadb.com>

063f4ac2

MDEV-31400 Simple plugin dependency resolution · 734583b0

Yuchen Pei authored Jul 25, 2023

We introduce simple plugin dependency. A plugin init function may
return HA_ERR_RETRY_INIT. If this happens during server startup when
the server is trying to initialise all plugins, the failed plugins
will be retried, until no more plugins succeed in initialisation or
want to be retried.

This will fix spider init bugs which is caused in part by its
dependency on Aria for initialisation.

The reason we need a new return code, instead of treating every
failure as a request for retry, is that it may be impossible to clean
up after a failed plugin initialisation. Take InnoDB for example, it
has a global variable `buf_page_cleaner_is_active`, which may not
satisfy an assertion during a second initialisation try, probably
because InnoDB does not expect the initialisation to be called
twice.

734583b0