1. 22 Oct, 2021 1 commit
    • Marko Mäkelä's avatar
      MDEV-26883 InnoDB hang due to table lock conflict · 5caff202
      Marko Mäkelä authored
      In a stress test campaign of a 10.6-based branch by Matthias Leich,
      a deadlock between two InnoDB threads occurred, involving
      lock_sys.wait_mutex and a dict_table_t::lock_mutex.
      
      The cause of the hang is a latching order violation in
      lock_sys_t::cancel(). That function and the latching order
      violation were originally introduced in
      commit 8d16da14 (MDEV-24789).
      
      lock_sys_t::cancel(): Invoke table->lock_mutex_trylock() in order
      to avoid a deadlock. If that fails, release lock_sys.wait_mutex,
      and acquire both latches. In that way, we will be obeying the
      latching order and no hangs will occur.
      
      This hang should mostly affect DDL operations. DML operations will
      acquire only IX or IS table locks, which are compatible with each other.
      5caff202
  2. 21 Oct, 2021 15 commits
  3. 20 Oct, 2021 9 commits
    • Marko Mäkelä's avatar
    • Nikita Malyavin's avatar
      MDEV-20131 Assertion `!pk->has_virtual()' failed · d10c42b4
      Nikita Malyavin authored
      Assertion `!pk->has_virtual()' failed in dict_index_build_internal_clust
      while creating PRIMARY key longer than possible to store in the page.
      
      This happened because the key was wrongly deduced as Long UNIQUE supported,
      however PRIMARY KEY cannot be of that type. The main reason is that
      only 8 bytes are used to store the hash, see HA_HASH_FIELD_LENGTH.
      
      This is also why HA_NOSAME flag is removed (and caused the assertion in
      turn) in open_table_from_share:
            if (key_info->algorithm == HA_KEY_ALG_LONG_HASH)
            {
              key_part_end++;
              key_info->flags&= ~HA_NOSAME;
            }
      
      To make it unique, the additional check is done by
      check_duplicate_long_entries call from ha_write_row, and similar one from
      ha_update_row.
      
      PRIMARY key is already forbidden, which is checked by the first test in
      main.long_unique, however is_hash_field_needed was wrongly deduced to true
      in mysql_prepare_create_table in this particular case.
      
      FIX:
      
      * Improve the check for Key::PRIMARY type
      * Simplify is_hash_field_needed deduction for a more neat reading
      d10c42b4
    • Marko Mäkelä's avatar
      Update libmariadb · 69b3de83
      Marko Mäkelä authored
      69b3de83
    • Marko Mäkelä's avatar
      MDEV-22627 Failing assertion: dict_tf2_is_valid(flags, flags2) · b06e8167
      Marko Mäkelä authored
      create_table_info_t::innobase_table_flags(): Refuse to create
      a PAGE_COMPRESSED table with PAGE_COMPRESSION_LEVEL=0 if also
      innodb_compression_level=0.
      
      The parameter value innodb_compression_level=0 was only somewhat
      meaningful for testing or debugging ROW_FORMAT=COMPRESSED tables.
      For the page_compressed format, it never made any sense, and the
      check in dict_tf_is_valid_not_redundant() that was added in
      72378a25 (MDEV-12873) would cause
      the server to crash.
      b06e8167
    • Nikita Malyavin's avatar
      MDEV-22445 Crash on HANDLER READ NEXT after XA PREPARE · caebe151
      Nikita Malyavin authored
      The assertion is absolutely correct since no data access is possible after
      XA PREPARE.
      
      The check is added in mysql_ha_read.
      caebe151
    • Nikita Malyavin's avatar
      MDEV-26262 frm is corrupted after ER_EXPRESSION_REFERS_TO_UNINIT_FIELD · 1811fd51
      Nikita Malyavin authored
      This is a duplicate of MDEV-18278 89936f11, but I will add an
      additional assertion
      
      Description:
      
      The frm corruption should not be reported during CREATE TABLE. Normally
      it doesn't, and the data to fill TABLE is taken by open_table_from_share
      call. However, the vcol data is stored as SQL string in
      table->s->vcol_defs.str and is anyway parsed on each table open.
      It is impossible [or hard] to avoid, because it's hard to clone the
      expression tree in general (it's easier to parse).
      
      Normally parse_vcol_defs should only fail on semantic errors. If so,
      error_reported is set to true. Any other failure is not expected during
      table creation. There is either unhandled/unacknowledged error, or
      something went really wrong, like memory reject. This all should be
      asserted anyway.
      
      Solution:
      * Set *error_reported=true for the forward references check;
      * Assert for every unacknowledged error during table creation.
      1811fd51
    • Nikita Malyavin's avatar
    • Marko Mäkelä's avatar
      MDEV-26554: Stabilize the test · 78dec1f1
      Marko Mäkelä authored
      78dec1f1
    • Daniel Black's avatar
      MDEV-26363 Passwords incorrectly expiring after MySQL5.7 -> MariaDB10.3 -> 10.4+ upgrades · 4590f8b4
      Daniel Black authored
      MySQL-5.7 mysql.user tables have a last_password_changed field.
      
      Because before MariaDB-10.4 remained oblivious to this, the act of creating
      users or otherwise changing a users row left the last_password_field with 0.
      
      Running a MariaDB-10.4 instance on this would work correctly, until mysql_upgrade
      is run, when this 0 value immediately translates to password expired
      state.
      
      MySQL-5.7 relied on the password_expired enum to indicate password
      expiry so we aren't going to activate password that were expired in
      MySQL-5.7.
      
      Thanks Hans Borresen for the bug report and review of the fix.
      4590f8b4
  4. 19 Oct, 2021 10 commits
    • Marko Mäkelä's avatar
      After-merge fix: Remove unused variable · d6a3f425
      Marko Mäkelä authored
      d6a3f425
    • Marko Mäkelä's avatar
      MDEV-26772 InnoDB DDL fails with DUPLICATE KEY error · 6e390a62
      Marko Mäkelä authored
      ha_innobase::delete_table(): When the table that is being dropped
      has a name starting with #sql, temporarily set
      innodb_lock_wait_timeout=0 while attempting to lock the
      persistent statistics tables. If the statistics tables cannot be locked,
      pretend that statistics did not exist and carry on with dropping
      the table. The SQL layer is not really prepared for failures of
      this operation. This is what fixes the test case.
      
      ha_innobase::rename_table(): When renaming a table from a name
      that starts with #sql, try to lock the statistics tables with an
      immediate timeout, and ignore the statistics if the locks were
      not available. In fact, during any rename from a #sql name,
      dict_stats_rename_table() should have no effect, because already
      when an earlier rename to a #sql name took place we should have
      deleted the statistics for the table using the non-#sql name.
      This change is just analogous to the ha_innobase::delete_table().
      6e390a62
    • Vicențiu Ciorbaru's avatar
      Fix Groonga crash on MIPS: Correctly link to libatomic · 1388845e
      Vicențiu Ciorbaru authored
      MIPS (and possibly other) platforms require linking against libatomic to
      support 64-bit atomic integers. Groonga was failing to do so and all related
      tests were failing with an atomics relocation error on MIPS.
      
      Contributors:
      James Cowgill <jcowgill@debian.org>
      1388845e
    • Sergei Krivonos's avatar
    • Vicențiu Ciorbaru's avatar
      Fix MIPS build failure: Handle unaligned buffers in connect's TYPBLK class · a33c1082
      Vicențiu Ciorbaru authored
      On MIPS platforms (and probably others) unaligned memory access results in a
      bus error. In the connect storage engine, block data for some data formats is
      stored packed in memory and the TYPBLK class is used to read values from it.
      Since TYPBLK does not have special handling for this packed memory, it can
      quite easily result in unaligned memory accesses.
      
      The simple way to fix this is to perform all accesses to the main buffer
      through memcpy. With GCC and optimizations turned on, this call to memcpy is
      completely optimized away on architectures where unaligned accesses are ok
      (like x86).
      
      Contributors:
      James Cowgill <jcowgill@debian.org>
      a33c1082
    • Vicențiu Ciorbaru's avatar
      Link with libatomic to enable C11 atomics support · f502ccbc
      Vicențiu Ciorbaru authored
      Some architectures (mips) require libatomic to support proper
      atomic operations. Check first if support is available without
      linking, otherwise use the library.
      
      Contributors:
      James Cowgill <jcowgill@debian.org>
      Jessica Clarke <jrtc27@debian.org>
      Vicențiu Ciorbaru <vicentiu@mariadb.org>
      f502ccbc
    • Nayuta Yanagisawa's avatar
      MDEV-26158 SIGSEGV in spider_free_mem from ha_spider::open on INSERT · e7208bd9
      Nayuta Yanagisawa authored
      The server crashes due to passing NULL to spider_free().
      
      In some cases, this == pt_handler_share_handlers[0] at the label
      error_get_share in ha_spider::open().
      
      In such cases, to nullify pt_handler_share_handlers[0]->wide_handler
      is nothing but to nullify this->wide_handler. We should not do this
      before freeing this->wide_handler.
      e7208bd9
    • Alexey Botchkov's avatar
      MDEV-24585 Assertion `je->s.cs == nice_js->charset()' failed in json_nice. · 1a54cf62
      Alexey Botchkov authored
      We should set the charset in
      Item_func_json_format::fix_length_and_dec().
      1a54cf62
    • Krunal Bauskar's avatar
      MDEV-26855: Enable spinning for log_sys_mutex and log_flush_order_mutex · f7684f0c
      Krunal Bauskar authored
      As part of MDEV-26779 we first discovered the effect of enabling spinning for
      some critical mutex. MDEV-26779 tried enabling it for lock_sys.wait_mutex and
      observed a good gain in performance.
      
      In yet another discussion, Mark Callaghan pointed a reference to pthread based
      mutex spin using PTHREAD_MUTEX_ADAPTIVE_NP (MDEV-26769 Intel RTM).
      
      Given the strong references, Marko Makela as part of his comment in #1923
      pointed an idea to enable spinning for other mutexes. Based on perf profiling
      we decided to explore spinning for log_sys_mutex and log_flush_order_mutex as
      they are occupying the top slots in the contented mutex list.
      
      The evaluation showed promising results for ARM64 but not for x86.
      So a patch is here-by proposed to enable the spinning of the mutex for
      ARM64-based platform.
      f7684f0c
    • Marko Mäkelä's avatar
      MDEV-14804 innodb.update_time failed in buildbot with wrong result · 53167031
      Marko Mäkelä authored
      Let us use a minimal-size buffer pool to ensure that page flushing
      will be slow enough so that LRU eviction cannot be avoided.
      53167031
  5. 18 Oct, 2021 5 commits
    • Oleksandr Byelkin's avatar
      MDEV-26299: Some views force server (and mysqldump) to generate invalid SQL for their definitions · 27bf57fd
      Oleksandr Byelkin authored
      Do not print illegal table field names for non-top-level SELECT list,
      they will not be refered in any case but create problem for parsing
      of printed result.
      27bf57fd
    • Brandon Nesterenko's avatar
      MDEV-25284: Assertion `info->type == READ_CACHE || info->type == WRITE_CACHE' failed · 2291f8ef
      Brandon Nesterenko authored
      Problem:
      ========
      This patch addresses two issues.
      
      First, if a CHANGE MASTER command is issued and an error happens
      while locating the replica’s relay logs, the logs can be put into an
      invalid state where future updates fail and future CHANGE MASTER
      calls crash the server. More specifically, right before a replica
      purges the relay logs (part of the `CHANGE MASTER TO` logic), the
      relay log is temporarily closed with state LOG_TO_BE_OPENED. If the
      server errors in-between the temporary log closure and purge, i.e.
      during the function find_log_pos, the log should be closed.
      MDEV-25284 reveals the log is not properly closed.
      
      Second, upon issuing a RESET SLAVE ALL command, a slave’s GTID
      filters are not cleared (DO_DOMAIN_IDS, IGNORE_DOMIAN_IDS,
      IGNORE_SERVER_IDS). MySQL had a similar bug report, Bug #18816897,
      which fixed this issue to clear IGNORE_SERVER_IDS after issuing
      RESET SLAVE ALL in version 5.7.
      
      Solution:
      =========
      
      To fix the first problem, the CHANGE MASTER error handling logic was
      extended to transition the relay log state to LOG_CLOSED from
      LOG_TO_BE_OPENED.
      
      To fix the second problem, the RESET SLAVE ALL logic is extended to
      clear the domain_id filter and ignore_server_ids.
      
      Reviewed By:
      ============
      Andrei Elkin <andrei.elkin@mariadb.com>
      2291f8ef
    • Marko Mäkelä's avatar
      MDEV-26554: Races between INSERT on child and DDL on parent table · c3c53926
      Marko Mäkelä authored
      The SQL layer never acquires metadata locks (MDL) on the tables
      that the tables that DML statement accesses is modifying.
      
      However, the storage engine must access the parent table in order to
      ensure that the child table will not refer to a non-existing record
      in the parent table.
      
      During certain DDL operations, the InnoDB table metadata (dict_table_t)
      may be be freed and rebuilt. This would cause a race condition with
      a concurrent INSERT that is attempting to report a FOREIGN KEY violation.
      
      We work around the insufficient MDL during DML by acquiring exclusive
      InnoDB table locks on all child tables during DDL. To avoid deadlocks,
      we will follow the following order of acquisition:
      
      1. tables whose REFERENCES clauses point to the current table
      2. the current table that is being subjected to DDL
      3. mysql.innodb_table_stats
      4. mysql.innodb_index_stats
      5. the InnoDB dictionary tables (SYS_TABLES and so on)
      6. exclusive dict_sys.latch
      c3c53926
    • Marko Mäkelä's avatar
      Merge 10.5 into 10.6 · 59fe6a8a
      Marko Mäkelä authored
      59fe6a8a
    • Nayuta Yanagisawa's avatar
      MDEV-26582 SIGSEGV in spider_db_bulk_insert and spider_db_connect and... · edde9084
      Nayuta Yanagisawa authored
      MDEV-26582 SIGSEGV in spider_db_bulk_insert and spider_db_connect and spider_db_before_query, and hang in "End of update loop" / "Reset for next command" query states
      
      Spider accesses a freed connection in ha_spider::end_bulk_insert()
      and results in SIGSEGV.
      
      The cause of the bug is that ha_spider::is_bulk_insert_exec_period()
      wrongly returns TRUE when the bulk insertion has not yet started.
      
      Spider decides whether it is during the bulk insertion or not by
      the value of insert_pos, but the variable is not reset in a case,
      and this result in the bug.
      edde9084