Commits · 764ca7e6e75885de3276743c07ab15430d82ec5c · nexedi / MariaDB

19 Jan, 2022 2 commits

MDEV-27499 fixup: Add a wait to buf_flush_sync() · 764ca7e6

Marko Mäkelä authored Jan 19, 2022

The test innodb.log_file_size would occasionally fail with
an assertion failure !buf_pool.any_io_pending(). Let us wait
for the page cleaner thread to become idle already in
srv_prepare_to_delete_redo_log_file(), like we used to.

764ca7e6

MDEV-27025: Null merge 10.5 into 10.6 · 965c0d22
Marko Mäkelä authored Jan 19, 2022

965c0d22

18 Jan, 2022 4 commits

MDEV-27025 insert-intention lock conflicts with waiting ORDINARY lock · be811386

Vlad Lesin authored Jan 11, 2022

The code was backported from 10.6 bd03c0e5
commit. See that commit message for details.

Apart from the above commit trx_lock_t::wait_trx was also backported from
MDEV-24738. trx_lock_t::wait_trx is protected with lock_sys.wait_mutex
in 10.6, but that mutex was implemented only in MDEV-24789. As there is no
need to backport MDEV-24789 for MDEV-27025,
trx_lock_t::wait_trx is protected with the same mutexes as
trx_lock_t::wait_lock.

This fix should not break innodb-lock-schedule-algorithm=VATS. This
algorithm uses an Eldest-Transaction-First (ETF) heuristic, which prefers
older transactions over new ones. In this fix we just insert granted lock
just before the last granted lock of the same transaction, what does not
change transactions execution order.

The changes in lock_rec_create_low() should not break Galera Cluster,
there is a big "if" branch for WSREP. This branch is necessary to provide
the correct transactions execution order, and should not be changed for
the current bug fix.

be811386

MDEV-27025 insert-intention lock conflicts with waiting ORDINARY lock · bd03c0e5

Vlad Lesin authored Dec 02, 2021

When lock is checked for conflict, ignore other locks on the record if
they wait for the requesting transaction.

lock_rec_has_to_wait_in_queue() iterates not all locks for
the page, but only the locks located before the waiting lock in the
queue. So there is some invariant - any lock in the queue can wait only
lock which is located before the waiting lock in the queue.

In the case when conflicting lock waits for the transaction of
requesting lock, we need to place the requesting lock before the waiting
lock in the queue to preserve the invariant. That is why we are looking
for the first waiting for requesting transation lock and place the new
lock just after the last granted requesting transaction lock before the
first waiting for requesting transaction lock.

Example:

trx1 waiting lock, trx1 granted lock, ..., trx2 lock - waiting for trx1
place new lock here -----------------^

There are also implicit locks which are lazily converted to explicit
ones, and we need to place the newly created explicit lock to the correct
place in a queue. All explicit locks converted from implicit ones are
placed just after the last non-waiting lock of the same transaction before
the first waiting for the transaction lock.

Code review and cleanup was made by Marko Mäkelä.

bd03c0e5

Merge 10.5 into 10.6 · 1abc476f
Marko Mäkelä authored Jan 18, 2022

1abc476f

MDEV-27499 Performance regression in log_checkpoint_margin() · e44439ab

Marko Mäkelä authored Jan 17, 2022

In commit 4c3ad244 (MDEV-27416)
an unnecessarily strict wait condition was introduced in the
function buf_flush_wait(). Most callers actually only care that
the pages have been flushed, not that a checkpoint has completed.

Only in the buf_flush_sync() call for log resizing, we might care
about the log checkpoint. But, in fact,
srv_prepare_to_delete_redo_log_file() is explicitly disabling
checkpoints. So, we can simply remove the unnecessary wait loop.

Thanks to Krunal Bauskar for reporting this performance regression
that we failed to repeat in our testing.

e44439ab

17 Jan, 2022 4 commits

MDEV-26230 mysql_upgrade fails to load type_mysql_json due to insufficient maturity level · 745aa8be
Sergei Golubchik authored Dec 29, 2021
```
bump maturity to beta
```
745aa8be

MDEV-25373 DROP TABLE doesn't raise error while dropping non-existing table in... · 5af6a137

Sergei Golubchik authored Dec 29, 2021

MDEV-25373 DROP TABLE doesn't raise error while dropping non-existing table in MariaDB 10.5.9 when OQGraph SE is loaded to the server

don't auto-succeed every DROP TABLE

5af6a137

MDEV-27461: Buffer pool resize fails to wake up the page cleaner · f18e2564

Marko Mäkelä authored Jan 17, 2022

buf_pool_t::realloc(): Invoke page_cleaner_wakeup()
if buf_LRU_get_free_only() returns a null pointer.

Ever since commit 7b1252c0 (MDEV-24278)
the page cleaner would remain in untimed sleep, expecting explicit
calls to buf_pool_t::page_cleaner_wakeup() when the ratio of dirty pages
could change.

Failure to wake up the page cleaner will cause all page writes to be
initiated by buf_flush_LRU_list_batch(). That might work too,
provided that the buffer pool size is at least BUF_LRU_MIN_LEN (256)
pages, but it would not advance the log checkpoint.

f18e2564

MDEV-27469: Assertion failure in defragment due to tx_read_only · 343f695c

Marko Mäkelä authored Jan 17, 2022

In commit c5fd9aa5 (MDEV-25919)
we prevented the function dict_stats_save_index_stat()
from being called in read-only mode in dict_stats_save(),
but not elsewhere.

dict_stats_save_defrag_summary(), dict_stats_save_defrag_stats():
If the transaction is in read-only mode, return DB_READ_ONLY
and do not attempt to lock or modify anything.

343f695c

15 Jan, 2022 3 commits

MDEV-27240 fixup: remove dead code · b7e4dc12
Nayuta Yanagisawa authored Jan 15, 2022

b7e4dc12

MDEV-27240 fixup: remove #ifdef in macro call · 64f844b6

Nayuta Yanagisawa authored Jan 15, 2022

Windows builds failed due to the following error:
'#': invalid character: possibly the result of a macro expansion

64f844b6

MDEV-27240 SIGSEGV in ha_spider::store_lock on LOCK TABLE · 2ecd39c9

Nayuta Yanagisawa authored Jan 11, 2022

The commit e954d9de gave different lifetime to wide_share and
partition_handler_share. This introduced the possibility that
partition_handler_share could be accessed even after it was freed.

We stop sharing partitoiin_handler_share and make it belong to
a single wide_handler to fix the problem.

2ecd39c9

14 Jan, 2022 3 commits

Remove FIXME comments that refer to an early MDEV-14425 plan · 8535c260

Marko Mäkelä authored Jan 14, 2022

In MDEV-14425, an early plan was to introduce a separate log file
for file-level records and checkpoint information. The reasoning was
that fil_system.mutex contention would be reduced by not having to
maintain fil_system.named_spaces. The mutex contention was actually
fixed in MDEV-23855 by making some data fields in fil_space_t and
fil_node_t use std::atomic.

Using a single circular log file simplifies recovery and backup.

8535c260

Merge 10.5 into 10.6 · 16b87f98
Marko Mäkelä authored Jan 14, 2022

16b87f98

MDEV-27500 buf_page_free() fails to drop the adaptive hash index · c104a01b

Marko Mäkelä authored Jan 14, 2022

The function buf_page_free() that was introduced
in commit a35b4ae8 (MDEV-15528)
failed to remove any adaptive hash index entries for the page
before freeing the page.

This caused an assertion failure on shutdown of 10.6 server of
in the function buf_pool_t::clear_hash_index() with the expression:
(s >= buf_page_t::UNFIXED || s == buf_page_t::REMOVE_HASH).
The assertion would fail for a block that is in the freed state.

The failing assertion was added in
commit aaef2e1d
in the 10.6 branch.

Thanks to Matthias Leich for finding the bug and testing the fix.

c104a01b

13 Jan, 2022 1 commit

MDEV-27058 fixup: Bogus assertion !block->page.is_io_fixed() · e6a06113

Marko Mäkelä authored Jan 13, 2022

buf_page_get_gen(): After recv_sys_t::recover_low() returned,
the page must not be read-fixed, but it may be write-fixed,
because the io-fix state is protected by block->page.lock,
which we are not holding yet.

Also, let us copy the block descriptor state to a local variable
for examination, so that in case an assertion would fail again,
we will have the sampled state in the core dump. In a core dump of
the assertion failure, we had block->page.fix() == buf_page_t::UNFIXED,
that is, the assertion expression was holding again.

e6a06113

12 Jan, 2022 4 commits

MDEV-26824 Can't add foreign key with empty referenced columns list · 6831b3f2

Aleksey Midenkov authored Jan 12, 2022

create_table_info_t::create_foreign_keys() expects equal number of
iterations through fk->columns and fk->ref_columns. If fk->ref_columns
is empty copy it from fk->columns.

6831b3f2

MDEV-27476 heap-use-after-free in buf_pool_t::is_block_field() · ba5ef63a

Marko Mäkelä authored Jan 12, 2022

This follows up commit 017d1b86.
In commit aaef2e1d (MDEV-27058)
some more problematic debug assertions were added.

btr_search_update_block_hash_info(), trx_purge_truncate_history():
Use simpler assertions to check that an uncompressed page is present.

ba5ef63a

Merge 10.5 into 10.6 · 0261eac5
Marko Mäkelä authored Jan 12, 2022

0261eac5

MDEV-27476 heap-use-after-free in buf_pool_t::is_block_field() · 017d1b86

Marko Mäkelä authored Jan 12, 2022

mtr_t::modify(): Remove a debug assertion that had been added
in commit 05fa4558 (MDEV-22110).
The function buf_pool_t::is_uncompressed() is only safe to invoke
while holding a buf_pool.page_hash latch so that buf_pool_t::resize()
cannot concurrently invoke free() on any chunks.

017d1b86

11 Jan, 2022 1 commit

MDEV-27022 Buffer pool is being flushed during recovery · f443cd11

Eugene Kosov authored Nov 11, 2021

The problem was introduced by the removal of buf_pool.flush_rbt
in commit 46b1f500 (MDEV-23399)

recv_sys_t::apply(): don't write to disc and fsync() the last batch.
Insead, sort it by oldest_modification for MariaDB server and some
mariabackup operations.

log_sort_flush_list(): a thread-safe function which sorts buf_pool::flush_list

f443cd11

10 Jan, 2022 3 commits

MDEV-27640 trx_has_lock_x() gives wrong result if the table has pending table lock · 428b057e
Thirunarayanan Balathandayuthapani authored Jan 10, 2022
```
trx_has_lock_x() fails to find whether the trx has X-lock on the table
when other transactions are waiting for an X or S lock on the table.
```
428b057e

Cleanup: Remove unused log_cmdq_key · fcbd3989

Marko Mäkelä authored Jan 10, 2022

There was an intention to add a CommandQueue in
mysql/mysql-server@eca5b0fc17a5bd6d4833d35a0d08c8549dd3b5ec
but it never appeared in any release (not even MySQL 5.7.3
where that commit appeared).

fcbd3989

MDEV-23836: Assertion `! is_set() || m_can_overwrite_status' in · 81e00485

Rucha Deodhar authored Oct 16, 2020

Diagnostics_area::set_error_status (interrupted ALTER TABLE under LOCK)

Analysis: KILL_QUERY is not ignored when local memory used exceeds maximum
session memory. Hence the query proceeds, OK is sent and we end up
reopening tables that are marked for reopen. During this, kill status is
eventually checked and assertion failure happens during trying to send error
message because OK has already been sent.
Fix: Ok is already sent so statement has already executed. It is too
late to give error. So ignore kill.

81e00485

09 Jan, 2022 2 commits

Silence CMake warning from exteral cmake project (pcre2) · c62bb9c3

Vladislav Vaintroub authored Jan 09, 2022


The warning reads:

CMake Deprecation Warning at CMakeLists.txt:101 (CMAKE_MINIMUM_REQUIRED):
Compatibility with CMake < 2.8.12 will be removed from a future version of
CMake.

c62bb9c3

MDEV-26879: Detach innodb_evict_tables_on_commit_debug from SAFE_MUTEX · 75d4c530

Marko Mäkelä authored Jan 09, 2022

In commit 18535a40 (MDEV-24811)
the implementation of innodb_evict_tables_on_commit_debug
depended on dict_sys.mutex and SAFE_MUTEX.
That is no longer the case.

SAFE_MUTEX is not available on Microsoft Windows.

75d4c530

05 Jan, 2022 5 commits

MDEV-27017 Assertion failure 'table->get_ref_count() == 0' on DDL that involves FULLTEXT INDEX · e8d1bb04

Thirunarayanan Balathandayuthapani authored Dec 30, 2021

 purge_sys.stop_FTS() does not wait for purge operation
on FTS tables to finish. InnoDB DDL does purge_sys.stop_FTS()
and lock all fts tables. It eventually fails due to
n_ref_count value.

fts_stop_purge(): Stops the purge thread to process new FTS tables,
check n_ref_count of all fts index auxiliary, common tables.
This should make sure that consecutive fts_lock_tables()
is always successful.

e8d1bb04

MDEV-26879 innodb_evict_tables_on_commit_debug=on makes table creation hang · 59e8a126

Marko Mäkelä authored Jan 05, 2022

In commit c5fd9aa5 (MDEV-25919)
an incorrect change to lock_release() was applied.

The setting innodb_evict_tables_on_commit_debug=on should only be
applied to normal transactions, not DDL transactions in the likes of
CREATE TABLE, nor transactions that are holding dict_sys.latch,
such as dict_stats_save().

59e8a126

MDEV-27335 Windows, MSI - Bring the datadir location into the instance config UI · 2a6e0869
Vladislav Vaintroub authored Dec 21, 2021

2a6e0869
Windows installer - fix UI of the "Uninstall" Dialog. · 3712808a
Vladislav Vaintroub authored Dec 18, 2021
```
Give RemoveDatadirText field extra 10 pixels in height, to avoid
truncated display of directory path
```
3712808a

Update errmsg-utf8.txt (spa) part 2 · b7734168

Jesús Marín authored Jan 05, 2022

Further changes to 3e030488

New changes in translation:

* Converted to LATAM countries treatment: tú for vd. This way it serves good for Spain and all LATAM countries.
* Minor changes

b7734168

04 Jan, 2022 3 commits

Work around MDEV-27421 ./mtr --ps-protocol main.opt_trace · cd751f02
Marko Mäkelä authored Jan 04, 2022

cd751f02
Merge 10.5 into 10.6 · 3f572676
Marko Mäkelä authored Jan 04, 2022

3f572676

MDEV-27416 InnoDB hang in buf_flush_wait_flushed(), on log checkpoint · 4c3ad244

Marko Mäkelä authored Jan 04, 2022

InnoDB could sometimes hang when triggering a log checkpoint. This is
due to commit 7b1252c0 (MDEV-24278),
which introduced an untimed wait to buf_flush_page_cleaner().

The hang was noticed by occasional failures of IMPORT TABLESPACE tests,
such as innodb.innodb-wl5522, which would (unnecessarily) invoke
log_make_checkpoint() from row_import_cleanup().

The reason of the hang was that buf_flush_page_cleaner() would enter
untimed sleep despite buf_flush_sync_lsn being set. The exact failure
scenario is unclear, because buf_flush_sync_lsn should actually be
protected by buf_pool.flush_list_mutex. We prevent the hang by
invoking buf_pool.page_cleaner_set_idle(false) whenever we are
setting buf_flush_sync_lsn and signaling buf_pool.do_flush_list.

The bulk of these changes was originally developed as a preparation
for MDEV-26827, to invoke buf_flush_list() from fewer threads,
and tested on 10.6 by Matthias Leich.

This fix was tested by running 100 repetitions of 100 concurrent instances
of the test innodb.innodb-wl5522 on a RelWithDebInfo build, using ext4fs
and innodb_flush_method=O_DIRECT on a SATA SSD with 4096-byte block size.
During the test, the call to log_make_checkpoint() in row_import_cleanup()
was present.

buf_flush_list(): Make static.

buf_flush_wait(): Wait for buf_pool.get_oldest_modification()
to reach a target, by work done in the buf_flush_page_cleaner.
If buf_flush_sync_lsn is going to be set, we will invoke
buf_pool.page_cleaner_set_idle(false).

buf_flush_ahead(): If buf_flush_sync_lsn or buf_flush_async_lsn
is going to be set and the page cleaner woken up, we will invoke
buf_pool.page_cleaner_set_idle(false).

buf_flush_wait_flushed(): Invoke buf_flush_wait().

buf_flush_sync(): Invoke recv_sys.apply() at the start in case
crash recovery is active. Invoke buf_flush_wait().

buf_flush_sync_batch(): A lower-level variant of buf_flush_sync()
that is only called by recv_sys_t::apply().

buf_flush_sync_for_checkpoint(): Do not trigger log apply
or checkpoint during recovery.

buf_dblwr_t::create(): Only initiate a buffer pool flush, not
a checkpoint.

row_import_cleanup(): Do not unnecessarily invoke log_make_checkpoint().
Invoking buf_flush_list_space() before starting to generate redo log
for the imported tablespace should suffice.

srv_prepare_to_delete_redo_log_file():
Set recv_sys.recovery_on in order to prevent
buf_flush_sync_for_checkpoint() from initiating a checkpoint
while the log is inaccessible. Remove a wait loop that is already
part of buf_flush_sync().
Do not invoke fil_names_clear() if the log is being upgraded,
because the FILE_MODIFY record is specific to the latest format.

create_log_file(): Clear recv_sys.recovery_on only after calling
log_make_checkpoint(), to prevent buf_flush_page_cleaner from
invoking a checkpoint.

innodb_shutdown(): Simplify the logic in mariadb-backup --prepare.

os_aio_wait_until_no_pending_writes(): Update the function comment.
Apart from row_quiesce_table_start() during FLUSH TABLES...FOR EXPORT,
this is being called by buf_flush_list_space(), which is invoked
by ALTER TABLE...IMPORT TABLESPACE as well as some encryption operations.

4c3ad244

03 Jan, 2022 5 commits

Deb: Adapt custom build steps to be compatible with latest Salsa-CI · eab89f14

Otto Kekäläinen authored Dec 31, 2021

Upstream Salsa-CI refactored the build process in
https://salsa.debian.org/salsa-ci-team/pipeline/-/commit/58880fcef5b742cb9c661121a8c8707bf392b3b5

This broke our custom direct invocation of install-build-deps.sh as the
Salsa-CI images no longer contain them. Adapt the .build-script
equivalent to follow new Salsa-CI method so builds work again.

eab89f14

MDEV-27414 Server may hang when innodb_undo_log_truncate=ON · c410f7aa

Marko Mäkelä authored Jan 03, 2022

trx_purge_truncate_history(): Avoid a deadlock with
buf_pool_t::release_freed_page(). Page latches are not supposed
to be waited for while holding a mutex like buf_pool.mutex or
buf_pool.flush_list_mutex.

This regression was caused by
commit aaef2e1d (MDEV-27058).
Before that, trx_purge_truncate_history() would buffer-fix the block,
release buf_pool.flush_list_mutex, and then wait for the
exclusive page latch.

This bug led to occasional failures of the test
innodb.undo_truncate_recover.

c410f7aa

MDEV-27039 Trying to lock mutex ... when the mutex was already locked · 30b917d3

Andrei authored Dec 27, 2021

The reason of the double lock was an extraneous ha_flush_logs().
Unlike the upstream it is unnecessary in Mariadb that exploits a binlog
checkpoint mechanism for not letting PURGE or RESET-MASTER to trouble
transaction recovery. That is in case should a trx
be prepared but its binlog file gone, the trx then is committed on disk too.
Those facts have been always verified by existing tests of

  binlog.binlog_{checkpoint,xa_recover}.test.

A regression test for the bug is included though.

30b917d3

Merge 10.4 into 10.5 · c9db50b5
Marko Mäkelä authored Jan 03, 2022

c9db50b5

Correct some copyright messages · 1df05a08

Marko Mäkelä authored Jan 03, 2022

Most of the Facebook contribution
mysql/mysql-server@72d656acdf082d5ead1cc1be84f2fd68ab6a65a9
was removed in
commit 5bea43f5 (MDEV-12353).
Mainly the configuration parameter innodb_compression_level remains.
It had been renamed to page_zip_level in
mysql/mysql-server@5b38f2a712a7077c994c00787b891a7d4ee328df.

1df05a08