Commits · 47036387756d465c68ac4360eb031e78f48d9e58 · nexedi / MariaDB

14 Mar, 2023 1 commit

MDEV-30805 SIGSEGV in my_convert and UBSAN: member access within null pointer... · 47036387

Alexander Barkov authored Mar 14, 2023

MDEV-30805 SIGSEGV in my_convert and UBSAN: member access within null pointer of type 'const struct MY_CHARSET_HANDLER' in my_convert

Type_handler::partition_field_append_value() erroneously
passed the address of my_collation_contextually_typed_binary
to conversion functions copy_and_convert() and my_convert().

This happened because generate_partition_syntax_for_frm()
was called from mysql_create_frm_image() in the stage when
the fields in List<Create_field> can still contain unresolved
contextual collations, like "binary" in the reported crash scenario:

  ALTER TABLE t CHANGE COLUMN a a CHAR BINARY;

Fix:

1. Splitting mysql_prepare_create_table() into two parts:
   - mysql_prepare_create_table_stage1() interates through
     List<Create_field> and calls Create_field::prepare_stage1(),
     which performs basic attribute initialization, including
     context collation resolution.
   - mysql_prepare_create_table_finalize() - the rest of the
     old mysql_prepare_create_table() code.

2. Changing mysql_create_frm_image():
   It now calls:
   - mysql_prepare_create_table_stage1() in the very
     beginning, before the partition related code.
   - mysql_prepare_create_table_finalize() in the end,
    instead of the old mysql_prepare_create_table() call

3. Adding mysql_prepare_create_table() as a wrapper
   for two calls:
     mysql_prepare_create_table_stage1() ||
     mysql_prepare_create_table_finalize()
   so the code stays unchanged in the other places
   where mysql_prepare_create_table() was used.

4. Changing prototype for Type_handler::Column_definition_prepare_stage1()
   Removing arguments:
   - handler *file
   - ulonglong table_flags
   Adding a new argument instead:
   - column_definition_type_t type
   This allows to call Column_definition_prepare_stage1() and
   therefore to call mysql_prepare_create_table_stage1()
   before instantiation of a handler.
   This simplifies the code, because in case of a partitioned table,
   mysql_create_frm_image() creates a handler of the underlying
   partition first, the frees it and created a ha_partition
   instance instead.
   mysql_prepare_create_table() before the fix was called with the final
   (ha_partition) handler.

5. Moving parts of Column_definition_prepare_stage1() which
   need a pointer to handler and table_flags to
   Column_definition_prepare_stage2().

47036387

06 Mar, 2023 4 commits
- Merge 10.8 into 10.9 · 46a76038
  Marko Mäkelä authored Mar 06, 2023
  
  46a76038
- Merge 10.6 into 10.8 · 669a0c6e
  Marko Mäkelä authored Mar 06, 2023
  
  669a0c6e
- Merge 10.5 into 10.6 · 25c04806
  Marko Mäkelä authored Mar 06, 2023
  
  25c04806
- Fix GCC 5.3.1 -Wsign-compare · 948fb3c2
  Marko Mäkelä authored Mar 06, 2023
```
This fixes up commit 57c526ff
```
  948fb3c2
02 Mar, 2023 2 commits

MDEV-30341 Reset check_foreigns, check_unique_secondary variables · 49e2b50d

Thirunarayanan Balathandayuthapani authored Feb 21, 2023

- InnoDB fails to reset the check_foreigns and check_unique_secondary
in trx_t::free(), trx_t::commit_cleanup(). This lead to bulk insert
in internal innodb fts table operation.

49e2b50d

MDEV-30752 Assertion `!index->is_ibuf()' failed around · 550b8d76
Thirunarayanan Balathandayuthapani authored Feb 28, 2023
```
			cmp_dtuple_rec_with_match_bytes

- InnoDB shouldn't use the adaptive hash index for change buffer indexes.
```
550b8d76

28 Feb, 2023 5 commits

CONC-637 Build fails when specifying -DPLUGIN_AUTH_GSSAPI_CLIENT=OFF · 6d923362
Sergei Golubchik authored Feb 28, 2023

6d923362
Merge 10.5 into 10.6 · 085d0ac2
Marko Mäkelä authored Feb 28, 2023

085d0ac2

MDEV-30753 Possible corruption due to trx_purge_free_segment() · c14a3943

Marko Mäkelä authored Feb 28, 2023

Starting with commit 0de3be8c (MDEV-30671),
the field TRX_UNDO_NEEDS_PURGE lost its previous meaning.
The following scenario is possible:

(1) InnoDB is killed at a point of time corresponding to the durable
execution of some fseg_free_step_not_header() but not
trx_purge_remove_log_hdr().
(2) After restart, the affected pages are allocated for something else.
(3) Purge will attempt to access the newly reallocated pages when looking
for some old undo log records.

trx_purge_free_segment(): Invoke trx_purge_remove_log_hdr() as the first
thing, to be safe. If the server is killed, some pages will never be
freed. That is the lesser evil. Also, before each mtr.start(), invoke
log_free_check() to prevent ib_logfile0 overrun.

c14a3943

Merge 10.8 into 10.9 · c3246e4b
Marko Mäkelä authored Feb 28, 2023

c3246e4b
Merge 10.6 into 10.8 · 6ac44ac3
Marko Mäkelä authored Feb 28, 2023

6ac44ac3

27 Feb, 2023 2 commits

Added detection of memory overwrite with multi_malloc · 57c526ff

Monty authored Feb 26, 2023

This patch also fixes some bugs detected by valgrind after this
patch:

- Not enough copy_func elements was allocated by Create_tmp_table() which
  causes an memory overwrite in Create_tmp_table::add_fields()
  I added an ASSERT() to be able to detect this also without valgrind.
  The bug was that TMP_TABLE_PARAM::copy_fields was not correctly set
  when calling create_tmp_table().
- Aria::empty_bits is not allocated if there is no varchar/char/blob
  fields in the table.  Fixed code to take this into account.
  This cannot cause any issues as this is just a memory access
  into other Aria memory and the content of the memory would not be used.
- Aria::last_key_buff was not allocated big enough. This may have caused
  issues with rtrees and ma_extra(HA_EXTRA_REMEMBER_POS) as they
  would use the same memory area.
- Aria and MyISAM didn't take extended key parts into account, which
  caused problems when copying rec_per_key from engine to sql level.
- Mark asan builds with 'asan' in version strihng to detect these in
  not_valgrind_build.inc.
  This is needed to not have main.sp-no-valgrind fail with asan.

57c526ff

Merge 10.5 into 10.6 · 3e2ad0e9
Marko Mäkelä authored Feb 27, 2023

3e2ad0e9

24 Feb, 2023 1 commit

MDEV-30671 InnoDB undo log truncation fails to wait for purge of history · 0de3be8c

Marko Mäkelä authored Feb 24, 2023

It is not safe to invoke trx_purge_free_segment() or execute
innodb_undo_log_truncate=ON before all undo log records in
the rollback segment has been processed.

A prominent failure that would occur due to premature freeing of
undo log pages is that trx_undo_get_undo_rec() would crash when
trying to copy an undo log record to fetch the previous version
of a record.

If trx_undo_get_undo_rec() was not invoked in the unlucky time frame,
then the symptom would be that some committed transaction history is
never removed. This would be detected by CHECK TABLE...EXTENDED that
was impleented in commit ab019010.
Such a garbage collection leak should be possible even when using
innodb_undo_log_truncate=OFF, just involving trx_purge_free_segment().

trx_rseg_t::needs_purge: Change the type from Boolean to a transaction
identifier, noting the most recent non-purged transaction, or 0 if
everything has been purged. On transaction start, we initialize this
to 1 more than the transaction start ID. On recovery, the field may be
adjusted to the transaction end ID (TRX_UNDO_TRX_NO) if it is larger.

The field TRX_UNDO_NEEDS_PURGE becomes write-only; only some debug
assertions that would validate the value. The field reflects the old
inaccurate Boolean field trx_rseg_t::needs_purge.

trx_undo_mem_create_at_db_start(), trx_undo_lists_init(),
trx_rseg_mem_restore(): Remove the parameter max_trx_id.
Instead, store the maximum in trx_rseg_t::needs_purge,
where trx_rseg_array_init() will find it.

trx_purge_free_segment(): Contiguously hold a lock on
trx_rseg_t to prevent any concurrent allocation of undo log.

trx_purge_truncate_rseg_history(): Only invoke trx_purge_free_segment()
if the rollback segment is empty and there are no pending transactions
associated with it.

trx_purge_truncate_history(): Only proceed with innodb_undo_log_truncate=ON
if trx_rseg_t::needs_purge indicates that all history has been purged.

Tested by: Matthias Leich

0de3be8c

23 Feb, 2023 1 commit
- MDEV-30716 Wrong casefolding in xxx_unicode_520_ci for U+0700..U+07FF · b62123e0
  Alexander Barkov authored Feb 23, 2023
```
The array my_unicase_pages_unicode520[7] erroneously mapped to plane06
instead of plane07.
```
  b62123e0
22 Feb, 2023 1 commit

MDEV-25984 Assertion `max_doc_id > 0' failed in fts_init_doc_id() · db245e11

Thirunarayanan Balathandayuthapani authored Feb 22, 2023

- rollback_inplace_alter_table() locks the fts internal tables.
At the time, insert tries to fetch the doc id from config table,
fails to lock the config table and returns doc id as 0.

fts_cmp_set_sync_doc_id(): Retry to fetch the doc id again if
it encounter DB_LOCK_WAIT_TIMEOUT error

db245e11

21 Feb, 2023 1 commit
- MDEV-29871 innodb_fts.fulltext_misc unexpectedly reports a result · df9f9ba1
  Thirunarayanan Balathandayuthapani authored Feb 21, 2023
```
- match()+0 returns the floating result and converts into integer value
and it leads to sporadic failure.
```
  df9f9ba1
20 Feb, 2023 1 commit

MDEV-27701 Race on trx->lock.wait_lock between lock_rec_move() and lock_sys_t::cancel() · a474e327

Vlad Lesin authored Feb 09, 2023

The initial issue was in assertion failure, which checked the equality
of lock to cancel with trx->lock.wait_lock in lock_sys_t::cancel().

If we analyze lock_sys_t::cancel() code from the perspective of
trx->lock.wait_lock racing, we won't find the error there, except the
cases when we need to reload it after the corresponding latches
acquiring.

So the fix is just to remove the assertion and reload
trx->lock.wait_lock after acquiring necessary latches.

Reviewed by: Marko Mäkelä <marko.makela@mariadb.com>

a474e327

16 Feb, 2023 11 commits

Merge 10.8 into 10.9 · 0d55914d
Marko Mäkelä authored Feb 16, 2023

0d55914d
Merge 10.6 into 10.8 · b12cd88c
Marko Mäkelä authored Feb 16, 2023

b12cd88c
Merge 10.5 into 10.6 · 67a6ad0a
Marko Mäkelä authored Feb 16, 2023

67a6ad0a
MDEV-30552 fixup: Fix the test for non-debug · d3f35aa4
Marko Mäkelä authored Feb 16, 2023

d3f35aa4
Fix clang -Winconsistent-missing-override · 0c79ae94
Marko Mäkelä authored Feb 16, 2023

0c79ae94
MDEV-27774 fixup: Correct a comment · 34f0433c
Marko Mäkelä authored Feb 16, 2023

34f0433c
Merge 10.6 into 10.8 · 5abbe092
Marko Mäkelä authored Feb 16, 2023

5abbe092

MDEV-30638 Deadlock between INSERT and InnoDB non-persistent statistics update · 201cfc33

Marko Mäkelä authored Feb 16, 2023

This is a partial revert of
commit 8b6a308e (MDEV-29883)
and a follow-up to the
merge commit 394fc71f (MDEV-24569).

The latching order related to any operation that accesses the allocation
metadata of an InnoDB index tree is as follows:

1. Acquire dict_index_t::lock in non-shared mode.
2. Acquire the index root page latch in non-shared mode.
3. Possibly acquire further index page latches. Unless an exclusive
dict_index_t::lock is held, this must follow the root-to-leaf,
left-to-right order.
4. Acquire a *non-shared* fil_space_t::latch.
5. Acquire latches on the allocation metadata pages.
6. Possibly allocate and write some pages, or free some pages.

btr_get_size_and_reserved(), dict_stats_update_transient_for_index(),
dict_stats_analyze_index(): Acquire an exclusive fil_space_t::latch
in order to avoid a deadlock in fseg_n_reserved_pages() in case of
concurrent access to multiple indexes sharing the same "inode page".

fseg_page_is_allocated(): Acquire an exclusive fil_space_t::latch
in order to avoid deadlocks. All callers are holding latches
on a buffer pool page, or an index, or both.
Before commit edbde4a1 (MDEV-24167)
a third mode was available that would not conflict with the shared
fil_space_t::latch acquired by ha_innobase::info_low(),
i_s_sys_tablespaces_fill_table(),
or i_s_tablespaces_encryption_fill_table().
Because those calls should be rather rare, it makes sense to use
the simple rw_lock with only shared and exclusive modes.

fil_crypt_get_page_throttle(): Avoid invoking fseg_page_is_allocated()
on an allocation bitmap page (which can never be freed), to avoid
acquiring a shared latch on top of an exclusive one.

mtr_t::s_lock_space(), MTR_MEMO_SPACE_S_LOCK: Remove.

201cfc33

MDEV-30134 Assertion failed in buf_page_t::unfix() in buf_pool_t::watch_unset() · 54c0ac72

Marko Mäkelä authored Feb 16, 2023

buf_pool_t::watch_set(): Always buffer-fix a block if one was found,
no matter if it is a watch sentinel or a buffer page. The type of
the block descriptor will be rechecked in buf_page_t::watch_unset().
Do not expect the caller to acquire the page hash latch. Starting with
commit bd5a6403 it is safe to release
buf_pool.mutex before acquiring a buf_pool.page_hash latch.

buf_page_get_low(): Adjust to the changed buf_pool_t::watch_set().

This simplifies the logic and fixes a bug that was reproduced when
using debug builds and the setting innodb_change_buffering_debug=1.

54c0ac72

MDEV-30397: MariaDB crash due to DB_FAIL reported for a corrupted page · 9c157994

Marko Mäkelä authored Feb 16, 2023

buf_read_page_low(): Map the buf_page_t::read_complete() return
value DB_FAIL to DB_PAGE_CORRUPTED. The purpose of the DB_FAIL
return value is to avoid error log noise when read-ahead brings
in an unused page that is typically filled with NUL bytes.

If a synchronous read is bringing in a corrupted page where the
page frame does not contain the expected tablespace identifier and
page number, that must be treated as an attempt to read a corrupted
page. The correct error code for this is DB_PAGE_CORRUPTED.
The error code DB_FAIL is not handled by row_mysql_handle_errors().

This was missed in commit 0b47c126
(MDEV-13542).

9c157994

Merge 10.5 into 10.6 · cc27e5fd
Marko Mäkelä authored Feb 16, 2023

cc27e5fd

15 Feb, 2023 5 commits

MDEV-30657 InnoDB: Not applying UNDO_APPEND due to corruption · 5300c0fb

Marko Mäkelä authored Feb 15, 2023

This almost completely reverts
commit acd23da4 and
retains a safe optimization:

recv_sys_t::parse(): Remove any old redo log records for the
truncated tablespace, to free up memory earlier.
If recovery consists of multiple batches, then recv_sys_t::apply()
will must invoke recv_sys_t::trim() again to avoid wrongly
applying old log records to an already truncated undo tablespace.

5300c0fb

MDEV-30324: Wrong result upon SELECT DISTINCT ... WITH TIES · 4afa3b64

Vicențiu Ciorbaru authored Feb 15, 2023

WITH TIES would not take effect if SELECT DISTINCT was used in a
context where an INDEX is used to resolve the ORDER BY clause.

WITH TIES relies on the `JOIN::order` to contain the non-constant
fields to test the equality of ORDER BY fiels required for WITH TIES.

The cause of the problem was a premature removal of the `JOIN::order`
member during a DISTINCT optimization. This lead to WITH TIES code assuming
ORDER BY only contained "constant" elements.

Disable this optimization when WITH TIES is in effect.

(side-note: the order by removal does not impact any current tests, thus
it will be removed in a future version)

Reviewed by: monty@mariadb.org

4afa3b64

Whitespace fix · d2b773d9
Vicențiu Ciorbaru authored Feb 04, 2023

d2b773d9
Update comments to match new debug_sync implementation · e83ae66e
Vicențiu Ciorbaru authored Feb 04, 2023

e83ae66e

MDEV-30333 Wrong result with not_null_range_scan and LEFT JOIN with empty table · 192427e3

Monty authored Feb 15, 2023

There was a bug in JOIN::make_notnull_conds_for_range_scans() when
clearing TABLE->tmp_set, which was used to mark fields that could not be
null.

This function was only used if 'not_null_range_scan=on' is set.

The effect was that tmp_set contained a 'random value' and this caused
the optimizer to think that some fields could not be null.
FLUSH TABLES clears tmp_set and because of this things worked temporarily.

Fixed by clearing tmp_set properly.

192427e3

14 Feb, 2023 5 commits
- Fix S3 engine Coverity hits · 690fcfbd
  Andrew Hutchings authored Jan 18, 2023
```
Very minor hits found by Coverity for the S3 engine.
```
  690fcfbd
- Merge 10.5 into 10.6 · 96a3b11d
  Marko Mäkelä authored Feb 14, 2023
  
  96a3b11d
- MDEV-30426 Assertion !rec_offs_nth_extern(offsets2, n) during bulk insert · 951d81d9
  Thirunarayanan Balathandayuthapani authored Feb 14, 2023
```
- cmp_rec_rec_simple() fails to detect duplicate key error for
bulk insert operation
```
  951d81d9
- MDEV-30552 InnoDB recovery crashes when error handling scenario · 1a5c7552
  Thirunarayanan Balathandayuthapani authored Feb 14, 2023
```
- InnoDB fails to reset the after_apply variable before applying
the redo log in last batch during multi-batch recovery.
```
  1a5c7552
- MDEV-30551 InnoDB recovery hangs when buffer pool ran out of memory · 3eea2e8e
  Thirunarayanan Balathandayuthapani authored Feb 14, 2023
```
- During non-last batch of multi-batch recovery, InnoDB holds
log_sys.mutex and preallocates the block which may intiate
page flush, which may initiate log flush, which requires
log_sys.mutex to acquire again. This leads to assert failure.
So InnoDB recovery should release log_sys.mutex before
preallocating the block.
```
  3eea2e8e