Commits · c1e3fc0e0dcbc8275b46916fb5247e9e7635d072 · nexedi / MariaDB

29 Jun, 2022 1 commit

MDEV-28977: mariabackup.huge_lsn,strict_full_crc32 fails in 10.8 · c1e3fc0e

Marko Mäkelä authored Jun 29, 2022

recv_sys_t::recover_deferred(): Hold the exclusive page latch until
the tablespace has been set up. Otherwise, the write of the page
may be lost due to non-existent tablespace. This race only affects
the recovery of the first page in a newly created tablespace.

This race condition was introduced in MDEV-24626.

c1e3fc0e

28 Jun, 2022 3 commits

Fix a sporadic failure of main.backup_locks · 2fa3ada0

Marko Mäkelä authored Jun 28, 2022

Ever since commit 9608773f
the InnoDB persistent statistics are enabled on all InnoDB tables
by default. We must filter out any output that indicates that the
statistics tables are being internally accessed by InnoDB.

2fa3ada0

MDEV-28897 Wrong table.get_ref_count() upon concurrent truncate and backup stage operation · 5e40934d

Monty authored Jun 28, 2022

The issue was that flush_tables() didn't take a MDL lock on cached
TABLE_SHARE before calling open_table() to do a HA_EXTRA_FLUSH call.
Most engines seem to have no issue with it, but apparently this conflicts
with InnoDB in 10.6 when using TRUNCATE.

Fixed by taking a MDL lock before trying to open the table in
flush_tables().

There is no test case as it is hard to repeat the scheduling that causes
the error. I did run the test case in MDEV-28897 to verify
that the bug is fixed.

5e40934d

MDEV-18976 fixup: encryption.innodb-redo-nokeys · 02a313dc

Marko Mäkelä authored Jun 28, 2022

This test failure is similar to encryption.innodb-redo-badkey,
which was fixed in commit 0f0a45b2.

02a313dc

27 Jun, 2022 15 commits

MDEV-26979 heap-use-after-free or SIGSEGV when accessing INNODB_SYS_TABLESTATS during DDL · 1ae81607

Marko Mäkelä authored Jun 27, 2022

i_s_dict_fill_sys_tablestats(): Read all fields of dict_table_t
while holding dict_sys.latch.

dict_sys_t::allow_eviction(): Remove.

1ae81607

Merge 10.5 into 10.6 · 20cf63fe

Marko Mäkelä authored Jun 27, 2022

20cf63fe

Merge 10.4 into 10.5 · 773f1dad

Marko Mäkelä authored Jun 27, 2022

773f1dad

Merge 10.3 into 10.4 · b922ae5f

Marko Mäkelä authored Jun 27, 2022

b922ae5f

MDEV-26577 InnoDB: Failing assertion: dict_tf2_is_valid(flags, flags2) during ADD COLUMN · f339ef3f

Marko Mäkelä authored Jun 27, 2022

prepare_inplace_alter_table_dict(): If the table will not be rebuilt,
preserve all of the original ROW_FORMAT, including the compressed
page size flags related to ROW_FORMAT=COMPRESSED.

f339ef3f

MDEV-28389 fixup: Fix compiler warnings · a75ad735

Marko Mäkelä authored Jun 27, 2022

hex_to_ascii(): Add #if around the definition to avoid
clang -Wunused-function. Avoid GCC 5 -Wconversion with a cast.

a75ad735

MDEV-28950 Assertion `*err == DB_SUCCESS' failed in btr_page_split_and_insert · 39f45f6f

Marko Mäkelä authored Jun 27, 2022

btr_root_raise_and_insert(), btr_lift_page_up(),
rtr_page_split_and_insert(): Reset DB_FAIL from a failure to
copy records on a ROW_FORMAT=COMPRESSED page to DB_SUCCESS
before retrying.

This fixes a regression that was introduced by
commit 0b47c126 (MDEV-13542).

btr_root_raise_and_insert(): Remove a redundant condition.
btr_page_split_and_insert() will invoke btr_page_split_and_insert()
if needed.

39f45f6f

Suppress a message that may be emitted on slow systems · 7d92c9d2

Marko Mäkelä authored Jun 27, 2022

On FreeBSD, tests run on persistent storage, and no asynchronous I/O
has been implemented. Warnings about 205-second waits on dict_sys.latch
may occur.

7d92c9d2

Merge 10.5 into 10.6 · 87bd79b1

Marko Mäkelä authored Jun 27, 2022

87bd79b1

Merge 10.4 into 10.5 · ea847cbe

Marko Mäkelä authored Jun 27, 2022

ea847cbe

Fix GCC -Og -Wmaybe-uninitialized · 03174cab

Marko Mäkelä authored Jun 27, 2022

03174cab

MDEV-28854 after-merge fix: Remove a test for MDEV-26583 · dd7e9fb3

Marko Mäkelä authored Jun 27, 2022

dd7e9fb3

Merge 10.3 into 10.4 · 01d75703

Marko Mäkelä authored Jun 27, 2022

01d75703

MDEV-28389: Simplify the InnoDB corrupted page output · c86b1389

Marko Mäkelä authored Jun 27, 2022

buf_page_print(): Dump the buffer page 32 bytes (64 hexadecimal digits)
per line. In this way, the limitation in mtr
("Data too long for column 'line'") will not be triggered.

Also, do not bother decoding the page contents, because everything
is present in the hexadecimal output.
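
The output format described above can be illustrated with a minimal sketch. The helper name `hex_dump_lines()` and its signature are hypothetical and are not taken from the InnoDB source; the sketch only shows how a buffer could be split into lines of 32 bytes (64 hexadecimal digits) each.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical helper, not the actual buf_page_print() implementation.
// Formats `size` bytes as lines of hexadecimal digits, with at most
// `bytes_per_line` bytes (2 * bytes_per_line digits) on each line.
std::vector<std::string> hex_dump_lines(const unsigned char *data,
                                        std::size_t size,
                                        std::size_t bytes_per_line = 32)
{
  assert(bytes_per_line > 0);
  static const char digits[] = "0123456789abcdef";
  std::vector<std::string> lines;
  for (std::size_t offset = 0; offset < size; offset += bytes_per_line)
  {
    // The last line may be shorter than bytes_per_line.
    const std::size_t end =
      offset + bytes_per_line < size ? offset + bytes_per_line : size;
    std::string line;
    line.reserve(2 * (end - offset));
    for (std::size_t i = offset; i < end; i++)
    {
      line.push_back(digits[data[i] >> 4]);  // high nibble
      line.push_back(digits[data[i] & 15]);  // low nibble
    }
    lines.push_back(line);
  }
  return lines;
}
```

Keeping every line at a fixed, short width is what keeps the output below the column length limit mentioned in the commit message.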

dict_index_find_on_id_low(): Merge to dict_index_get_if_in_cache_low().
The direct call in buf_page_print() was prone to crashing, in case the
table definition was concurrently evicted or dropped from the
data dictionary cache.

c86b1389

MDEV-28854 Disallow INSERT DELAYED on Spider table · 2c1aaa66

Hirokazu Hata authored Jun 27, 2022

Spider supports (or at least allows) INSERT DELAYED, but the
documentation does not list Spider as a storage engine that supports
"INSERT DELAYED".
Also, although not mentioned in the documentation, "INSERT DELAYED" is
not intended to be executed inside a transaction, as can be seen from
the list of supported storage engines.
The current implementation allows executing a delayed insert on a
remote transactional table and this breaks the consistency ensured by
the transaction.

We also remove "internal_delayed", one of the Spider table parameters.
The documentation says,

> Whether to transmit existence of delay to remote servers when
> executing an INSERT DELAYED statement on local server.

This table parameter is only used for "INSERT DELAYED".

Reviewed by: Nayuta Yanagisawa

2c1aaa66

24 Jun, 2022 1 commit

MDEV-22590 SIGSEGV in flush_all_key_blocks when changing key_buffer_size /... · 5feb60ce

Oleksandr Byelkin authored Jun 23, 2022

MDEV-22590 SIGSEGV in flush_all_key_blocks when changing key_buffer_size / ASAN: heap-use-after-free in flush_all_key_blocks

Take into account that in preparation of a simple key cache for resizing no disk blocks might be assigned to it.

Reviewer: Igor Babaev <igor@mariadb.com>

5feb60ce

23 Jun, 2022 4 commits

MDEV-28935 crash in io_slots::release · d96436c9

Vladislav Vaintroub authored Jun 23, 2022

Revert "TSAN: data race on vptr (ctor/dtor vs virtual call)"

This reverts commit 78084fa7.

This commit was done to please TSAN, which falsely reported an error
where there was none.
Yet as a consequence, it could cause a real error, a crash in os_aio_free on
shutdown.

d96436c9

MDEV-28923 atomic.rename_table occasionally fails · f2f18e20

Marko Mäkelä authored Jun 23, 2022

fil_name_process(): If the recovery of a tablespace was deferred,
do invoke fil_ibd_load() even though the name in recv_spaces is
not changing. This allows us to recover from a situation where
there are many FILE_RENAME records, renaming a tablespace back
and forth, and a FILE_MODIFY record that had been written by
fil_names_clear().

Co-developed with: Thirunarayanan Balathandayuthapani

f2f18e20

Merge remote-tracking branch 'origin/10.5' into 10.6 · eb7f46ca
Vladislav Vaintroub authored Jun 23, 2022

eb7f46ca

MDEV-28920 Rescheduling of innodb_stats_func() missing · 35f2cdcb

Vladislav Vaintroub authored Jun 23, 2022


Fixed tpool timer implementation on POSIX.
Prior to this patch, under some specific rare circumstances (concurrency
related), timer callback execution might be skipped.

35f2cdcb

22 Jun, 2022 3 commits

MDEV-18976 fixup: encryption.innodb-redo-badkey · 0f0a45b2

Marko Mäkelä authored Jun 22, 2022

When attempting to recover a database with an incorrect encryption key,
the unencrypted page contents should be expected to differ from what
was written before recovery. Let us suppress some more messages.
This caused intermittent failures, depending on when the latest
log checkpoint was triggered.

0f0a45b2

MDEV-22388 Corrupted undo log record leads to server crash · 6f4d0659

Marko Mäkelä authored Jun 22, 2022

trx_undo_rec_copy(): Return nullptr if the undo record is corrupted.

trx_undo_rec_get_undo_no(): Define inline with the declaration.

trx_purge_dummy_rec: Replaced with a -1 pointer.

row_undo_rec_get(), UndorecApplier::apply_undo_rec(): Check
if trx_undo_rec_copy() returned nullptr.

trx_purge_get_next_rec(): Return nullptr upon encountering any
corruption, to signal the end of purge.
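
The "return nullptr on corruption, and check it in every caller" pattern described above can be sketched as follows. The record layout (a 2-byte big-endian total length followed by the payload) and the names `copy_record()` and `apply_record()` are assumptions made for illustration; they do not reflect the real InnoDB undo log format or API.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <memory>

// Hypothetical record layout, for illustration only: a 2-byte big-endian
// total length (including the 2 length bytes) followed by the payload.
// Returns a copy of the record, or nullptr if the length is implausible.
std::unique_ptr<unsigned char[]> copy_record(const unsigned char *rec,
                                             std::size_t bytes_available)
{
  assert(rec != nullptr);
  if (bytes_available < 2)
    return nullptr;
  const std::size_t len = (std::size_t{rec[0]} << 8) | rec[1];
  if (len < 2 || len > bytes_available)
    return nullptr;  // corrupted: the length points outside the buffer
  std::unique_ptr<unsigned char[]> copy(new unsigned char[len]);
  std::memcpy(copy.get(), rec, len);
  return copy;
}

// Caller pattern: treat nullptr as corruption and stop, instead of
// dereferencing a bad pointer and crashing.
bool apply_record(const unsigned char *rec, std::size_t bytes_available)
{
  const auto copy = copy_record(rec, bytes_available);
  if (!copy)
    return false;  // signal corruption to the caller
  // ... a real caller would process copy.get() here ...
  return true;
}
```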

6f4d0659

MDEV-28836 fixup · 0fa19fde

Marko Mäkelä authored Jun 22, 2022

On GNU/Linux, even though the C11 aligned_alloc() appeared in
GNU libc early on, some custom memory allocators did not
implement it until recently. For example, before
gperftools/gperftools@d406f2285390c402e824dd28e6992f7f890dcdf9
the free() in tcmalloc would fail to free memory that was
returned by aligned_alloc(), because the latter would map to the
built-in allocator of libc. The Linux specific memalign() has a
similar interface and is safer to use, because it has been
available for a longer time. For AddressSanitizer, we will use
aligned_alloc() so that the constraint on size can be enforced.

buf_tmp_reserve_compression_buf(): When HAVE_ALIGNED_ALLOC holds,
round up the size to be an integer multiple of the alignment.

pfs_malloc(): In the unit test stub, round up the size to be an
integer multiple of the alignment.

0fa19fde

21 Jun, 2022 6 commits

MDEV-28836: Memory alignment cleanup · 37946731

Marko Mäkelä authored Jun 21, 2022

Table_cache_instance: Define the structure aligned at
the CPU cache line, and remove a pad[] data member.
Krunal Bauskar reported this to improve performance on ARMv8.

aligned_malloc(): Wrapper for the Microsoft _aligned_malloc()
and the ISO/IEC 9899:2011 <stdlib.h> aligned_alloc().
Note: The parameters are in the Microsoft order (size, alignment),
opposite of aligned_alloc(alignment, size).
Note: The standard defines that size must be an integer multiple
of alignment. It is enforced by AddressSanitizer but not by GNU libc
on Linux.

aligned_free(): Wrapper for the Microsoft _aligned_free() and
the standard free().

HAVE_ALIGNED_ALLOC: A new test. Unfortunately, support for
aligned_alloc() may still be missing on some platforms.
We will fall back to posix_memalign() for those cases.

HAVE_MEMALIGN: Remove, along with any use of the nonstandard memalign().

PFS_ALIGNEMENT (sic): Removed; we will use CPU_LEVEL1_DCACHE_LINESIZE.

PFS_ALIGNED: Defined using the C++11 keyword alignas.

buf_pool_t::page_hash_table::create(),
lock_sys_t::hash_table::create(),
lock_sys_t::hash_table::resize(): Pad the allocation size to an
integer multiple of the alignment.
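
A minimal sketch of such wrappers, assuming a C++17 toolchain that provides `std::aligned_alloc()` on non-Windows platforms. The names `round_up_to_alignment()`, `aligned_malloc_sketch()`, `aligned_free_sketch()` and `is_aligned()` are hypothetical. Unlike the commit, which pads the size in the callers, this sketch rounds the size up inside the wrapper for brevity, and it omits the posix_memalign() fallback.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstdlib>
#ifdef _WIN32
# include <malloc.h>
#endif

// Round size up to the next integer multiple of alignment.
// The alignment is assumed to be a power of 2.
inline std::size_t round_up_to_alignment(std::size_t size,
                                         std::size_t alignment)
{
  assert(alignment != 0 && (alignment & (alignment - 1)) == 0);
  return (size + alignment - 1) & ~(alignment - 1);
}

// Wrapper with the Microsoft parameter order (size, alignment).
inline void *aligned_malloc_sketch(std::size_t size, std::size_t alignment)
{
#ifdef _WIN32
  return _aligned_malloc(size, alignment);
#else
  // The standard function takes (alignment, size), and the size must be
  // an integer multiple of the alignment.
  return std::aligned_alloc(alignment,
                            round_up_to_alignment(size, alignment));
#endif
}

// Memory from _aligned_malloc() must be released with _aligned_free();
// memory from aligned_alloc() is released with the standard free().
inline void aligned_free_sketch(void *ptr)
{
#ifdef _WIN32
  _aligned_free(ptr);
#else
  std::free(ptr);
#endif
}

// Check that a pointer satisfies the requested alignment.
inline bool is_aligned(const void *ptr, std::size_t alignment)
{
  return reinterpret_cast<std::uintptr_t>(ptr) % alignment == 0;
}
```

For example, a request for 100 bytes at a 64-byte alignment would be padded to 128 bytes before it reaches the standard allocator.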

Reviewed by: Vladislav Vaintroub

37946731

MDEV-28870 InnoDB: Missing FILE_CREATE, FILE_DELETE or FILE_MODIFY before FILE_CHECKPOINT · 2e43af69

Marko Mäkelä authored Jun 21, 2022

There was a race condition between log_checkpoint_low() and
deleting or renaming data files. The scenario is as follows:

1. The buffer pool does not contain dirty pages.
2. A FILE_DELETE or FILE_RENAME record is written.
3. The checkpoint LSN will be moved ahead of the write of the record.
4. The server is killed before the file is actually renamed or deleted.

We will prevent this race condition by ensuring that a log checkpoint
cannot occur between the durable write and the file system operation:

1. Durably write the FILE_DELETE or FILE_RENAME record.
2. Perform the file system operation.
3. Allow any log checkpoint to proceed.
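
The ordering above can be modelled with a single mutex that a checkpoint must also acquire. This is only a toy model: the class `FileOpLog`, its members, and the use of one `std::mutex` are assumptions made for illustration and are not how InnoDB implements the logic in mtr_t::commit_file().

```cpp
#include <cassert>
#include <functional>
#include <mutex>
#include <string>
#include <vector>

// Hypothetical model; the names are illustrative and are not InnoDB APIs.
class FileOpLog
{
  std::mutex checkpoint_mutex;      // checkpoints are blocked while held
  std::vector<std::string> events;  // trace of what happened, in order
public:
  // Performs steps 1 to 3 as one unit with respect to checkpoints.
  // file_system_op must not call checkpoint() on the same thread,
  // because std::mutex is not recursive.
  void commit_file_op(const std::string &record,
                      const std::function<void()> &file_system_op)
  {
    std::lock_guard<std::mutex> guard(checkpoint_mutex);
    assert(!record.empty());
    events.push_back("durable write: " + record);  // step 1
    file_system_op();                              // step 2
    events.push_back("file operation done");
  }  // step 3: releasing the mutex lets a checkpoint proceed

  // A checkpoint can only run before step 1 or after step 2.
  void checkpoint()
  {
    std::lock_guard<std::mutex> guard(checkpoint_mutex);
    events.push_back("checkpoint");
  }

  std::vector<std::string> trace()
  {
    std::lock_guard<std::mutex> guard(checkpoint_mutex);
    return events;
  }
};
```

Because the checkpoint takes the same mutex, it can never advance the checkpoint LSN past a FILE_DELETE or FILE_RENAME record whose file system operation has not been performed yet.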

mtr_t::commit_file(): Implement the DELETE or RENAME logic.

fil_delete_tablespace(): Delegate some of the logic to
mtr_t::commit_file().

fil_space_t::rename(): Delegate some logic to mtr_t::commit_file().
Remove the debug injection point fil_rename_tablespace_failure_2
because we do test RENAME failures without any debug injection.

fil_name_write_rename_low(), fil_name_write_rename(): Remove.

Tested by Matthias Leich

2e43af69

MDEV-26562: galera-sst-mariabackup is failing due to missing xtrabackup_checkpoints · 3e09c619

Julius Goryavsky authored Jun 21, 2022

This commit contains a workaround for a bug known as 'Red Hat issue 1870279'
(a "connection reset by peer" issue in socat versions 1.7.3.3 to 1.7.4.0), which
further causes crashes during SST using mariabackup (when openssl is used).

Also fixed the broken logic of automatic generation of the Diffie-Hellman parameters
for socat versions earlier than 1.7.3 (which default to 512-bit values instead of
2048-bit ones).

3e09c619

MDEV-28845 fixup: Prevent an infinite loop · 55f02c24

Marko Mäkelä authored Jun 21, 2022

buf_page_create_low(): Before retrying, release the exclusive page latch
in order to prevent an infinite loop in buf_pool_t::corrupted_evict().

55f02c24

MDEV-28782 fixup: ./mtr --embedded · 3b662c6e

Marko Mäkelä authored Jun 21, 2022

3b662c6e

MDEV-28583 postfix: fixing .result files after automatic merge · af929146

Julius Goryavsky authored Jun 17, 2022

af929146

20 Jun, 2022 1 commit

MDEV-28819 Statically compiled encryption plugins do not work in mariadb-backup · 01c0345d

Vladislav Vaintroub authored Jun 20, 2022

Disable static build for encryption plugin file_key_management

01c0345d

18 Jun, 2022 2 commits

MDEV-28884: include kernel information in crashing signal handler · d4539426

Daniel Black authored Jun 18, 2022

Recent adventures in liburing and btrfs have shown up some kernel
version dependent bugs. Having an accurate kernel version in a bug
report can help to correlate these errors sooner.

On Linux, /proc/version contains the kernel version.

FreeBSD has kern.version (per man 8 sysctl), so include that too.

Example output:

Max nice priority 0 0
Max realtime priority 0 0
Max realtime timeout unlimited unlimited us
Core pattern: |/usr/lib/systemd/systemd-coredump %P %u %g %s %t %c %h

Kernel version: Linux version 5.19.0-0.rc2.21.fc37.x86_64 (mockbuild@bkernel01.iad2.fedoraproject.org) (gcc (GCC) 12.1.1 20220507 (Red Hat 12.1.1-1), GNU ld version 2.38-14.fc37) #1 SMP PREEMPT_DYNAMIC Mon Jun 13 15:27:24 UTC 2022

Segmentation fault (core dumped)
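
A minimal sketch of how the "Kernel version:" line in the example output could be produced on Linux. The function names `read_first_line()` and `kernel_version_line()` are hypothetical and the "unknown" fallback is an assumption. A real crash signal handler would have to restrict itself to async-signal-safe calls such as open() and read(); this sketch ignores that constraint and uses `std::ifstream` for brevity.

```cpp
#include <cassert>
#include <fstream>
#include <string>

// Returns the first line of the file at `path`, or an empty string
// if the file cannot be opened or read.
std::string read_first_line(const std::string &path)
{
  assert(!path.empty());
  std::ifstream in(path);
  std::string line;
  if (in)
    std::getline(in, line);
  return line;
}

// On Linux, /proc/version holds a one-line description of the kernel.
// On other systems the file is absent and "unknown" is reported.
std::string kernel_version_line()
{
  const std::string version = read_first_line("/proc/version");
  return "Kernel version: " +
    (version.empty() ? std::string("unknown") : version);
}
```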

d4539426

remove invalid test · f299351e

Sergei Golubchik authored Jun 18, 2022

it starts an EXPLAIN of a multi-table join and tries to KILL it.
no sync points.
depending on how fast the hardware is and optimizer development
it might kill EXPLAIN at some random point in time (generally unrelated
to the Bug#28598 it was supposed to test) or EXPLAIN might finish
before the KILL and the test will fail.

f299351e

17 Jun, 2022 3 commits

MDEV-28782 mariadb-tzinfo-to-sql to work in bootstrap mode · 0e4cf497

Daniel Black authored Jun 09, 2022

Work around MDEV-28718 for now, but also optimize the iteration
of information_schema.SYSTEM_VARIABLES.

Add a test case to show that loading tzinfo data in bootstrap mode is
desired functionality.

Bug report thanks to Dan Lenski of AWS.

0e4cf497

MDEV-17390: re-enable rpl_semi_sync_after_sync test · 0565dfe4

Daniel Black authored Jun 17, 2022

The reasons cited for disabling this test in MDEV-16172 were
disputed.

0565dfe4

Fix intermittent failures of innodb.stats_persistent · be99d0dd

Marko Mäkelä authored Jun 17, 2022

We do not really care about the exact result; we only care that the
statistics will be accessed. The result could change depending on
when some statistics were updated in the background or when some
committed delete-marked rows were purged from other tables on
which persistent statistics are enabled.

be99d0dd

16 Jun, 2022 1 commit

Merge 10.5 into 10.6 · 5bb90cb2

Marko Mäkelä authored Jun 16, 2022

5bb90cb2