1. 26 Aug, 2022 1 commit
    • Marko Mäkelä's avatar
      MDEV-29383 Assertion mysql_mutex_assert_owner(&log_sys.flush_order_mutex) failed in mtr_t::commit() · 0fbcb0a2
      Marko Mäkelä authored
      In commit 0b47c126 (MDEV-13542)
      a few calls to mtr_t::memo_push() were moved before a write latch
      on the page was acquired. This introduced a race condition:
      
      1. is_block_dirtied() returned false to mtr_t::memo_push()
      2. buf_page_t::write_complete() was executed, the block marked clean,
      and a page latch released
      3. The page latch was acquired by the caller of mtr_t::memo_push(),
      and mtr_t::m_made_dirty was not set even though the block is in
      a clean state.
      
      The impact of this race condition is that crash recovery and backups
      may fail.
      
      btr_cur_latch_leaves(), btr_store_big_rec_extern_fields(),
      btr_free_externally_stored_field(), trx_purge_free_segment():
      Acquire the page latch before invoking mtr_t::memo_push().
      This fixes the regression caused by MDEV-13542.
      
      Side note: It would suffice to set mtr_t::m_made_dirty at the time
      we set the MTR_MEMO_MODIFY flag for a block. Currently that flag is
      unnecessarily set if a mini-transaction acquires a page latch on
      a page that is in a clean state, and will not actually modify the block.
      This may cause unnecessary acquisitions of log_sys.flush_order_mutex
      on mtr_t::commit().
      
      mtr_t::free(): If the block had been exclusively latched in this
      mini-transaction, set the m_made_dirty flag so that the flush order mutex
      will be acquired during mtr_t::commit(). This should have been part of
      commit 4179f93d (MDEV-18976).
      It was necessary to change mtr_t::free() so that
      WriteOPT_PAGE_CHECKSUM::operator() would be able to avoid writing
      checksums for freed pages.
      0fbcb0a2
  2. 25 Aug, 2022 5 commits
  3. 24 Aug, 2022 6 commits
    • Vlad Lesin's avatar
      MDEV-29081 trx_t::lock.was_chosen_as_deadlock_victim race in lock_wait_end() · 8ff10969
      Vlad Lesin authored
      The issue is that trx_t::lock.was_chosen_as_deadlock_victim can be reset
      before the transaction check it and set trx_t::error_state.
      
      The fix is to reset trx_t::lock.was_chosen_as_deadlock_victim only in
      trx_t::commit_in_memory(), which is invoked on full rollback. There is
      also no need to have separate bit in
      trx_t::lock.was_chosen_as_deadlock_victim to flag transaction it was
      chosen as a victim of Galera conflict resolution, the same variable can be
      used for both cases except debug build. For debug build we need to
      distinguish deadlock and Galera's abort victims for debug checks. Also
      there is no need to check for deadlock in lock_table_enqueue_waiting() for
      Galera as the coresponding check presents in lock_wait().
      
      Local variable "error_state" in lock_wait() was replaced with
      trx->error_state, because before the replace
      lock_sys_t::cancel<false>(trx, lock) and lock_sys.deadlock_check() could
      change trx->error_state, which then could be overwritten with the local
      "error_state" variable value.
      
      The lock_wait_suspend_thread_enter DEBUG_SYNC point name is misleading,
      because lock_wait_suspend_thread was eliminated in e71e6133. It was renamed
      to lock_wait_start.
      
      Reviewed by: Marko Mäkelä, Jan Lindström.
      8ff10969
    • Marko Mäkelä's avatar
    • Vladislav Vaintroub's avatar
      MDEV-29367 Refactor tpool::cache · a3fd9e6b
      Vladislav Vaintroub authored
      Removed use std::vector's ba push_back(), pop_back() to  make it more
      obvious that memory in the vectors won't be reallocated.
      
      Also, "borrowed" elements can be debugged a little better now,
      they are put into the start of the m_cache vector.
      a3fd9e6b
    • Thirunarayanan Balathandayuthapani's avatar
      MDEV-29319 Assertion failure size_in_header >= space.free_limit in... · 61f456e7
      Thirunarayanan Balathandayuthapani authored
      MDEV-29319 Assertion failure size_in_header >= space.free_limit in fsp_get_available_space_in_free_extents()
      
      - Don't remove the constant parameter in fsp_get_available_space_in_free_extents()
      61f456e7
    • Thirunarayanan Balathandayuthapani's avatar
      MDEV-29291 Assertion `!table->fts' failed in dict_table_can_be_evicted on SHUTDOWN · dd737d07
      Thirunarayanan Balathandayuthapani authored
      - InnoDB fts table initially added to LRU table cache
      while creating the table. Later, table was marked
      as non-evicted when we add the table to fts optimizer
      list. Before marking the table as non-evicted, master
      thread can try to evict the fts table.
      dd737d07
    • Thirunarayanan Balathandayuthapani's avatar
      MDEV-29319 Assertion failure size_in_header >= space.free_limit in... · 0b805733
      Thirunarayanan Balathandayuthapani authored
      MDEV-29319 Assertion failure size_in_header >= space.free_limit in fsp_get_available_space_in_free_extents()
      
      - Race condition between fsp_get_available_space_in_free_extents()
      and fsp_try_extend_data_file() while accessing space.free_limit.
      Before calling fsp_get_available_space_in_free_extents(), take
      shared lock on space->latch.
      0b805733
  4. 23 Aug, 2022 9 commits
  5. 22 Aug, 2022 11 commits
    • Marko Mäkelä's avatar
      Merge 10.4 into 10.5 · 3b656ac8
      Marko Mäkelä authored
      3b656ac8
    • Brandon Nesterenko's avatar
      MDEV-28294: set default role bypasses Replicate_Wild_Ignore_Table: mysql.% · 8963d64e
      Brandon Nesterenko authored
      Problem:
      ========
      When replicating SET DEFAULT ROLE, the pre-update check (i.e. that
      in set_var_default_role::check()) tries to validate the existence of
      the given rules/user even when the targeted tables are ignored. When
      previously issued CREATE USER/ROLE commands are ignored by the
      replica because of the replication filtering rules, this results in
      an error because the targeted data does not exist.
      
      Solution:
      ========
      Before checking that the given roles/user exist of a SET DEFAULT
      ROLE command, first ensure that the mysql.user and
      mysql.roles_mapping tables are not excluded by replication filters.
      
      Reviewed By:
      ============
      Andrei Elkin <andrei.elkin@mariadb.com>
      Sergei Golubchik <serg@mariadb.com>
      8963d64e
    • Marko Mäkelä's avatar
      Merge 10.3 into 10.4 · b68ae6dc
      Marko Mäkelä authored
      b68ae6dc
    • Marko Mäkelä's avatar
      Merge 10.5 into 10.6 · d65a2b7b
      Marko Mäkelä authored
      d65a2b7b
    • Thirunarayanan Balathandayuthapani's avatar
      MDEV-27700 ASAN: Heap_use_after_free in btr_search_drop_page_hash_index() · c7f8cfc9
      Thirunarayanan Balathandayuthapani authored
      Reason:
      =======
      Race condition between btr_search_drop_hash_index() and
      btr_search_lazy_free(). One thread does resizing of buffer pool
      and clears the ahi on all pages in the buffer pool, frees the
      index and table while removing the last reference. At the same time,
      other thread access index->heap in btr_search_drop_hash_index().
      
      Solution:
      =========
      Acquire the respective ahi latch before checking index->freed()
      
      btr_search_drop_page_hash_index(): Added new parameter to indicate
      that drop ahi entries only if the index is marked as freed
      
      btr_search_check_marked_free_index(): Acquire all ahi latches and
      return true if the index was freed
      c7f8cfc9
    • Marko Mäkelä's avatar
      Merge 10.4 into 10.5 · 1d90d687
      Marko Mäkelä authored
      1d90d687
    • Alexander Barkov's avatar
      MDEV-27101 Subquery using the ALL keyword on TIMESTAMP columns produces a wrong result · 316847ea
      Alexander Barkov authored
      TIMESTAMP columns were compared as strings in ALL/ANY comparison,
      which did not work well near DST time change.
      
      Changing ALL/ANY comparison to use "Native" representation to compare
      TIMESTAMP columns, like simple comparison does.
      316847ea
    • Marko Mäkelä's avatar
      Merge 10.3 into 10.4 · 36d173e5
      Marko Mäkelä authored
      36d173e5
    • Marko Mäkelä's avatar
      MDEV-25257 follow-up: Fix a test · 6005f3c5
      Marko Mäkelä authored
      Ever since commit 09177ead
      the test innodb.row_format_redundant cannot work when the
      data file was created with innodb_checksum_algorithm=full_crc32.
      
      Backport of 10.5 commit a9d0bb12
      6005f3c5
    • Marko Mäkelä's avatar
      MDEV-13013 fixup: Adjust a test · fd0cd480
      Marko Mäkelä authored
      fd0cd480
    • Brad Smith's avatar
      Revert aligned_alloc() addition from MDEV-28836 · f02ca429
      Brad Smith authored
      As pointed out with MDEV-29308 there are issues with the code as is.
      MariaDB is built as C++11 / C99. aligned_alloc() is not guarenteed
      to be exposed when building with any mode other than C++17 / C11.
      The other *BSD's have their stdlib.h header to expose the function
      with C+11 anyway, but the issue exists in the C99 code too, the
      build just does not use -Werror. Linux globally defines _GNU_SOURCE
      hiding the issue as well.
      f02ca429
  6. 19 Aug, 2022 4 commits
    • Thirunarayanan Balathandayuthapani's avatar
      MDEV-29250 InnoDB: Failing assertion: table->get_ref_count() == 0 · 7624bf86
      Thirunarayanan Balathandayuthapani authored
      Reason:
      ======
       This issue is caused by race condition between fulltext DDL and
      purge thread. DDL sets the signal to stop the purge thread to
      process the new undo log records and wait for the ongoing
      processed FTS table undo log records to finish.
      
      But in dict_acquire_mdl_shared(),InnoDB release all innodb
      table related locks before acquiring the mdl. At the same time,
      DDL assumes that there are no purge threads working on fts table.
      
      There is a possiblity that purge thread can skip processing the
      valid undo log records if it checks purge_sys.must_wait_FTS() twice
      in different places.
      
      Solution:
      ==========
      
      Add the purge_sys.must_wait_FTS() check in dict_acquire_mdl_shared()
      to avoid the purge thread processing undo log records.
      
      dict_open_table_on_id(): return -1 if the purge thread has to
      wait
      
      dict_acquire_mdl_shared(): Added 1 new parameters to indicate that
      purge thread invoking the function, return -1  if the purge
      thread has to wait.
      7624bf86
    • Marko Mäkelä's avatar
      75c416d3
    • Marko Mäkelä's avatar
      c2df3d30
    • Marko Mäkelä's avatar
      MDEV-29043 mariabackup --compress hangs · a1055ab3
      Marko Mäkelä authored
      Even though commit b817afaa passed
      the test mariabackup.compress_qpress, that test turned out to be
      too small to reveal one more problem that had previously been prevented
      by the existence of ctrl_mutex. I did not realize that there can be
      multiple concurrent callers to compress_write(). One of them is the
      log copying thread; further callers are data file copying threads
      (default: --parallel=1).
      
      By default, there is only one compression worker thread
      (--compress-threads=1).
      
      compress_write(): Fix a race condition between threads that would
      use the same worker thread object. Make thd->data_avail contain the
      thread identifier of the submitter, and add thd->avail_cond to
      notify other compress_write() threads that are waiting for a slot.
      a1055ab3
  7. 18 Aug, 2022 1 commit
    • Marko Mäkelä's avatar
      MDEV-29321 Percona XtraDB 5.7 can't be upgrade to MariaDB 10.6 or above · ec379066
      Marko Mäkelä authored
      In MySQL 5.7, rollback segments 1 to 32 are used for temporary tables,
      which is an unnecessary file format change from MySQL 5.6.
      This format change was avoided in MariaDB Server by
      commit 124bae08 (MDEV-12289).
      
      An upgrade from MySQL 5.7 would crash due to dereferencing a null pointer,
      which is a regression due to
      commit 0b47c126 (MDEV-13542).
      
      trx_rseg_t::get(): Return nullptr if no tablespace exists. This is where
      the upgrade would crash.
      
      trx_rseg_mem_restore(): Return DB_TABLESPACE_NOT_FOUND if the
      undo tablespace does not exist. This is likely dead code.
      ec379066
  8. 17 Aug, 2022 1 commit
  9. 16 Aug, 2022 2 commits