Commits · 5caff20216f47fc10540f7de14548cc80cd1c369 · nexedi / MariaDB

22 Oct, 2021 1 commit

MDEV-26883 InnoDB hang due to table lock conflict · 5caff202

Marko Mäkelä authored Oct 22, 2021

In a stress test campaign of a 10.6-based branch by Matthias Leich,
a deadlock between two InnoDB threads occurred, involving
lock_sys.wait_mutex and a dict_table_t::lock_mutex.

The cause of the hang is a latching order violation in
lock_sys_t::cancel(). That function and the latching order
violation were originally introduced in
commit 8d16da14 (MDEV-24789).

lock_sys_t::cancel(): Invoke table->lock_mutex_trylock() in order
to avoid a deadlock. If that fails, release lock_sys.wait_mutex,
and acquire both latches. In that way, we will be obeying the
latching order and no hangs will occur.

This hang should mostly affect DDL operations. DML operations will
acquire only IX or IS table locks, which are compatible with each other.

5caff202

21 Oct, 2021 15 commits

Remove trailing space · 059a5f11
Vladislav Vaintroub authored Oct 21, 2021

059a5f11
Merge 10.5 into 10.6 · 73f5cbd0
Marko Mäkelä authored Oct 21, 2021

73f5cbd0
Fix GCC 11.2.0 -m32 (IA-32) warnings · a0fda162
Marko Mäkelä authored Oct 21, 2021
```
page_create_low(): Fix -Warray-bounds

log_buffer_extend(): Fix -Wstringop-overflow
```
a0fda162
Merge 10.4 into 10.5 · 5f8561a6
Marko Mäkelä authored Oct 21, 2021

5f8561a6
Merge 10.3 into 10.4 · 489ef007
Marko Mäkelä authored Oct 21, 2021

489ef007
Merge 10.2 into 10.3 · d5bcccda
Marko Mäkelä authored Oct 21, 2021

d5bcccda
MDEV-19522 fixup: Integer type mismatch in unit test · fbb1e92e
Marko Mäkelä authored Oct 21, 2021

fbb1e92e
Merge 10.2 into 10.3 · e4a7c15d
Marko Mäkelä authored Oct 21, 2021

e4a7c15d

MDEV-26865: Add test case and instrumentation · 1a2308d3

Marko Mäkelä authored Oct 21, 2021

Based on mysql/mysql-server@bc9c46bf2894673d0df17cd0ee872d0d99663121
but without sleeps.

The test was verified to hit the debug assertion if the change to
fts_add_doc_by_id() in commit 2d98b967
was reverted.

1a2308d3

MDEV-26865 fts_optimize_thread cannot keep up with workload · 2d98b967

Marko Mäkelä authored Oct 21, 2021

fts_cache_t::total_size_at_sync: New field, to sample total_size.

fts_add_doc_by_id(): Invoke sync if total_size has grown too much
since the previous sync request. (Maintain cache->total_size_at_sync.)

ib_wqueue_t::length: Caches ib_list_len(*items).

ib_wqueue_len(): Removed. We will refer to fts_optimize_wq->length
directly.

Based on mysql/mysql-server@bc9c46bf2894673d0df17cd0ee872d0d99663121

2d98b967

MDEV-26864 Race condition between transaction commit and undo log truncation · c484a358

Marko Mäkelä authored Oct 21, 2021

trx_commit_in_memory(): Do not release the rseg reference before
trx_undo_commit_cleanup() has been invoked and the current transaction
is truly done with the rollback segment. The purpose of the reference
count is to prevent data races with trx_purge_truncate_history().

This is based on
mysql/mysql-server@ac79aa1522f33e6eb912133a81fa2614db764c9c.

c484a358

MDEV-19522 InnoDB commit fails when FTS_DOC_ID value is greater than 4294967295 · 8ce8c269

Thirunarayanan Balathandayuthapani authored Oct 06, 2021

InnoDB commit fails when consecutive FTS_DOC_ID value
is greater than 4294967295.
Fix is that InnoDB should remove the delta FTS_DOC_ID
value limitations and fts should encode 8 byte value,
remove FTS_DOC_ID_MAX_STEP variable. Replaced the
fts0vlc.ic file with fts0vlc.h

fts_encode_int(): Should be able to encode 10 bytes value

fts_get_encoded_len(): Should get the length of the value
which has 10 bytes

fts_decode_vlc(): Add debug assertion to verify the maximum
length allowed is 10.

mach_read_uint64_little_endian(): Reads 64 bit stored in
little endian format

Added a unit test case which check for minimum and maximum
value to do the fts encoding

8ce8c269

MDEV-22627 fixup: Add a type cast for 32-bit platforms · 6b4fad94
Marko Mäkelä authored Oct 21, 2021

6b4fad94

MDEV-26262 fixup: Remove a bogus assertion · d3426c4c

Marko Mäkelä authored Oct 21, 2021

In commit 1811fd51 the assertion
should have said error_reported instead of !error_reported.
But, that revised assertion would still fail in main.defaults
where ER_BAD_DATA is reported during CREATE TABLE.

d3426c4c

MDEV-19129: Xcode compatibility update: mysql-test-run.pl · 2e844a08
Sergei Krivonos authored Oct 21, 2021

2e844a08

20 Oct, 2021 9 commits

MDEV-22627 fixup: Cover also ALTER TABLE...ALGORITHM=INPLACE · 05c3dced
Marko Mäkelä authored Oct 20, 2021

05c3dced

MDEV-20131 Assertion `!pk->has_virtual()' failed · d10c42b4

Nikita Malyavin authored Oct 07, 2021

Assertion `!pk->has_virtual()' failed in dict_index_build_internal_clust
while creating PRIMARY key longer than possible to store in the page.

This happened because the key was wrongly deduced as Long UNIQUE supported,
however PRIMARY KEY cannot be of that type. The main reason is that
only 8 bytes are used to store the hash, see HA_HASH_FIELD_LENGTH.

This is also why HA_NOSAME flag is removed (and caused the assertion in
turn) in open_table_from_share:
      if (key_info->algorithm == HA_KEY_ALG_LONG_HASH)
      {
        key_part_end++;
        key_info->flags&= ~HA_NOSAME;
      }

To make it unique, the additional check is done by
check_duplicate_long_entries call from ha_write_row, and similar one from
ha_update_row.

PRIMARY key is already forbidden, which is checked by the first test in
main.long_unique, however is_hash_field_needed was wrongly deduced to true
in mysql_prepare_create_table in this particular case.

FIX:

* Improve the check for Key::PRIMARY type
* Simplify is_hash_field_needed deduction for a more neat reading

d10c42b4

Update libmariadb · 69b3de83
Marko Mäkelä authored Oct 20, 2021

69b3de83

MDEV-22627 Failing assertion: dict_tf2_is_valid(flags, flags2) · b06e8167

Marko Mäkelä authored Oct 20, 2021

create_table_info_t::innobase_table_flags(): Refuse to create
a PAGE_COMPRESSED table with PAGE_COMPRESSION_LEVEL=0 if also
innodb_compression_level=0.

The parameter value innodb_compression_level=0 was only somewhat
meaningful for testing or debugging ROW_FORMAT=COMPRESSED tables.
For the page_compressed format, it never made any sense, and the
check in dict_tf_is_valid_not_redundant() that was added in
72378a25 (MDEV-12873) would cause
the server to crash.

b06e8167

MDEV-22445 Crash on HANDLER READ NEXT after XA PREPARE · caebe151

Nikita Malyavin authored Oct 07, 2021

The assertion is absolutely correct since no data access is possible after
XA PREPARE.

The check is added in mysql_ha_read.

caebe151

MDEV-26262 frm is corrupted after ER_EXPRESSION_REFERS_TO_UNINIT_FIELD · 1811fd51

Nikita Malyavin authored Aug 02, 2021

This is a duplicate of MDEV-18278 89936f11, but I will add an
additional assertion

Description:

The frm corruption should not be reported during CREATE TABLE. Normally
it doesn't, and the data to fill TABLE is taken by open_table_from_share
call. However, the vcol data is stored as SQL string in
table->s->vcol_defs.str and is anyway parsed on each table open.
It is impossible [or hard] to avoid, because it's hard to clone the
expression tree in general (it's easier to parse).

Normally parse_vcol_defs should only fail on semantic errors. If so,
error_reported is set to true. Any other failure is not expected during
table creation. There is either unhandled/unacknowledged error, or
something went really wrong, like memory reject. This all should be
asserted anyway.

Solution:
* Set *error_reported=true for the forward references check;
* Assert for every unacknowledged error during table creation.

1811fd51

restore default.test, default.result after MDEV-23597 c47e4aab commit · a8401ad5
Nikita Malyavin authored Jul 21, 2021

a8401ad5
MDEV-26554: Stabilize the test · 78dec1f1
Marko Mäkelä authored Oct 20, 2021

78dec1f1

MDEV-26363 Passwords incorrectly expiring after MySQL5.7 -> MariaDB10.3 -> 10.4+ upgrades · 4590f8b4

Daniel Black authored Aug 16, 2021

MySQL-5.7 mysql.user tables have a last_password_changed field.

Because before MariaDB-10.4 remained oblivious to this, the act of creating
users or otherwise changing a users row left the last_password_field with 0.

Running a MariaDB-10.4 instance on this would work correctly, until mysql_upgrade
is run, when this 0 value immediately translates to password expired
state.

MySQL-5.7 relied on the password_expired enum to indicate password
expiry so we aren't going to activate password that were expired in
MySQL-5.7.

Thanks Hans Borresen for the bug report and review of the fix.

4590f8b4

19 Oct, 2021 10 commits

After-merge fix: Remove unused variable · d6a3f425
Marko Mäkelä authored Oct 19, 2021

d6a3f425

MDEV-26772 InnoDB DDL fails with DUPLICATE KEY error · 6e390a62

Marko Mäkelä authored Oct 19, 2021

ha_innobase::delete_table(): When the table that is being dropped
has a name starting with #sql, temporarily set
innodb_lock_wait_timeout=0 while attempting to lock the
persistent statistics tables. If the statistics tables cannot be locked,
pretend that statistics did not exist and carry on with dropping
the table. The SQL layer is not really prepared for failures of
this operation. This is what fixes the test case.

ha_innobase::rename_table(): When renaming a table from a name
that starts with #sql, try to lock the statistics tables with an
immediate timeout, and ignore the statistics if the locks were
not available. In fact, during any rename from a #sql name,
dict_stats_rename_table() should have no effect, because already
when an earlier rename to a #sql name took place we should have
deleted the statistics for the table using the non-#sql name.
This change is just analogous to the ha_innobase::delete_table().

6e390a62

Fix Groonga crash on MIPS: Correctly link to libatomic · 1388845e

Vicențiu Ciorbaru authored Oct 19, 2021

MIPS (and possibly other) platforms require linking against libatomic to
support 64-bit atomic integers. Groonga was failing to do so and all related
tests were failing with an atomics relocation error on MIPS.

Contributors:
James Cowgill <jcowgill@debian.org>

1388845e

MDEV-19129: Xcode compatibility update: update libmariadb submodule · 3c2ab896
Sergei Krivonos authored Oct 18, 2021

3c2ab896

Fix MIPS build failure: Handle unaligned buffers in connect's TYPBLK class · a33c1082

Vicențiu Ciorbaru authored Oct 15, 2021

On MIPS platforms (and probably others) unaligned memory access results in a
bus error. In the connect storage engine, block data for some data formats is
stored packed in memory and the TYPBLK class is used to read values from it.
Since TYPBLK does not have special handling for this packed memory, it can
quite easily result in unaligned memory accesses.

The simple way to fix this is to perform all accesses to the main buffer
through memcpy. With GCC and optimizations turned on, this call to memcpy is
completely optimized away on architectures where unaligned accesses are ok
(like x86).

Contributors:
James Cowgill <jcowgill@debian.org>

a33c1082

Link with libatomic to enable C11 atomics support · f502ccbc

Vicențiu Ciorbaru authored Oct 15, 2021

Some architectures (mips) require libatomic to support proper
atomic operations. Check first if support is available without
linking, otherwise use the library.

Contributors:
James Cowgill <jcowgill@debian.org>
Jessica Clarke <jrtc27@debian.org>
Vicențiu Ciorbaru <vicentiu@mariadb.org>

f502ccbc

MDEV-26158 SIGSEGV in spider_free_mem from ha_spider::open on INSERT · e7208bd9

Nayuta Yanagisawa authored Sep 21, 2021

The server crashes due to passing NULL to spider_free().

In some cases, this == pt_handler_share_handlers[0] at the label
error_get_share in ha_spider::open().

In such cases, to nullify pt_handler_share_handlers[0]->wide_handler
is nothing but to nullify this->wide_handler. We should not do this
before freeing this->wide_handler.

e7208bd9

MDEV-24585 Assertion `je->s.cs == nice_js->charset()' failed in json_nice. · 1a54cf62
Alexey Botchkov authored Oct 11, 2021
```
We should set the charset in
Item_func_json_format::fix_length_and_dec().
```
1a54cf62

MDEV-26855: Enable spinning for log_sys_mutex and log_flush_order_mutex · f7684f0c

Krunal Bauskar authored Oct 19, 2021

As part of MDEV-26779 we first discovered the effect of enabling spinning for
some critical mutex. MDEV-26779 tried enabling it for lock_sys.wait_mutex and
observed a good gain in performance.

In yet another discussion, Mark Callaghan pointed a reference to pthread based
mutex spin using PTHREAD_MUTEX_ADAPTIVE_NP (MDEV-26769 Intel RTM).

Given the strong references, Marko Makela as part of his comment in #1923
pointed an idea to enable spinning for other mutexes. Based on perf profiling
we decided to explore spinning for log_sys_mutex and log_flush_order_mutex as
they are occupying the top slots in the contented mutex list.

The evaluation showed promising results for ARM64 but not for x86.
So a patch is here-by proposed to enable the spinning of the mutex for
ARM64-based platform.

f7684f0c

MDEV-14804 innodb.update_time failed in buildbot with wrong result · 53167031

Marko Mäkelä authored Oct 19, 2021

Let us use a minimal-size buffer pool to ensure that page flushing
will be slow enough so that LRU eviction cannot be avoided.

53167031

18 Oct, 2021 5 commits

MDEV-26299: Some views force server (and mysqldump) to generate invalid SQL for their definitions · 27bf57fd

Oleksandr Byelkin authored Oct 01, 2021

Do not print illegal table field names for non-top-level SELECT list,
they will not be refered in any case but create problem for parsing
of printed result.

27bf57fd

MDEV-25284: Assertion `info->type == READ_CACHE || info->type == WRITE_CACHE' failed · 2291f8ef

Brandon Nesterenko authored Oct 13, 2021

Problem:
========
This patch addresses two issues.

First, if a CHANGE MASTER command is issued and an error happens
while locating the replica’s relay logs, the logs can be put into an
invalid state where future updates fail and future CHANGE MASTER
calls crash the server. More specifically, right before a replica
purges the relay logs (part of the `CHANGE MASTER TO` logic), the
relay log is temporarily closed with state LOG_TO_BE_OPENED. If the
server errors in-between the temporary log closure and purge, i.e.
during the function find_log_pos, the log should be closed.
MDEV-25284 reveals the log is not properly closed.

Second, upon issuing a RESET SLAVE ALL command, a slave’s GTID
filters are not cleared (DO_DOMAIN_IDS, IGNORE_DOMIAN_IDS,
IGNORE_SERVER_IDS). MySQL had a similar bug report, Bug #18816897,
which fixed this issue to clear IGNORE_SERVER_IDS after issuing
RESET SLAVE ALL in version 5.7.

Solution:
=========

To fix the first problem, the CHANGE MASTER error handling logic was
extended to transition the relay log state to LOG_CLOSED from
LOG_TO_BE_OPENED.

To fix the second problem, the RESET SLAVE ALL logic is extended to
clear the domain_id filter and ignore_server_ids.

Reviewed By:
============
Andrei Elkin <andrei.elkin@mariadb.com>

2291f8ef

MDEV-26554: Races between INSERT on child and DDL on parent table · c3c53926

Marko Mäkelä authored Oct 18, 2021

The SQL layer never acquires metadata locks (MDL) on the tables
that the tables that DML statement accesses is modifying.

However, the storage engine must access the parent table in order to
ensure that the child table will not refer to a non-existing record
in the parent table.

During certain DDL operations, the InnoDB table metadata (dict_table_t)
may be be freed and rebuilt. This would cause a race condition with
a concurrent INSERT that is attempting to report a FOREIGN KEY violation.

We work around the insufficient MDL during DML by acquiring exclusive
InnoDB table locks on all child tables during DDL. To avoid deadlocks,
we will follow the following order of acquisition:

1. tables whose REFERENCES clauses point to the current table
2. the current table that is being subjected to DDL
3. mysql.innodb_table_stats
4. mysql.innodb_index_stats
5. the InnoDB dictionary tables (SYS_TABLES and so on)
6. exclusive dict_sys.latch

c3c53926

Merge 10.5 into 10.6 · 59fe6a8a
Marko Mäkelä authored Oct 18, 2021

59fe6a8a

MDEV-26582 SIGSEGV in spider_db_bulk_insert and spider_db_connect and... · edde9084

Nayuta Yanagisawa authored Sep 24, 2021

MDEV-26582 SIGSEGV in spider_db_bulk_insert and spider_db_connect and spider_db_before_query, and hang in "End of update loop" / "Reset for next command" query states

Spider accesses a freed connection in ha_spider::end_bulk_insert()
and results in SIGSEGV.

The cause of the bug is that ha_spider::is_bulk_insert_exec_period()
wrongly returns TRUE when the bulk insertion has not yet started.

Spider decides whether it is during the bulk insertion or not by
the value of insert_pos, but the variable is not reset in a case,
and this result in the bug.

edde9084