1. 29 Apr, 2024 6 commits
    • Rucha Deodhar's avatar
      MDEV-22141: JSON_REMOVE returns NULL on valid arguments · 9e6858a4
      Rucha Deodhar authored
      Analysis:
      When we scan json to get to a beginning according to the path, we end up
      scanning json even if we have exhausted it. When eventually returns error.
      
      Fix:
      Continue scanning json only if we have not exhausted it and return result
      accordingly.
      9e6858a4
    • Rucha Deodhar's avatar
      MDEV-32287: JSON_EXTRACT not returning multiple values for same path · 5ca64e65
      Rucha Deodhar authored
      Analysis:
      When scanning json and getting the exact path at each step, if a path
      is reached, we end up adding the item in the result and immediately get the
      next item which results in current path changing.
      Fix:
      Instead of immediately returning the item, count the occurences of the path
      in argument and append in the result as needed.
      5ca64e65
    • Rucha Deodhar's avatar
      MDEV-19487: JSON_TYPE doesnt detect the type of String Values · d7df63e1
      Rucha Deodhar authored
      (returns NULL) and for Date/DateTime returns "INTEGER"
      
      Analysis:
      When the first character of json is scanned it is number. Based on that
      integer is returned.
      Fix:
      Scan rest of the json before returning the final result to ensure json is
      valid in the first place in order to have a valid type.
      d7df63e1
    • Alexander Barkov's avatar
      MDEV-30646 View created via JSON_ARRAYAGG returns incorrect json object · c6e3fe29
      Alexander Barkov authored
      Backporting add782a1 from 10.6, this fixes the problem.
      c6e3fe29
    • Alexander Barkov's avatar
      MDEV-21058 CREATE TABLE with generated column and RLIKE results in sigabrt · dc25d600
      Alexander Barkov authored
      Regexp_processor_pcre::fix_owner() called Regexp_processor_pcre::compile(),
      which could fail on the regex syntax error in the pattern and put
      an error into the diagnostics area. However, the callers:
        - Item_func_regex::fix_length_and_dec()
        - Item_func_regexp_instr::fix_length_and_dec()
      still returned "false" in such cases, which made the code
      crash later inside Diagnostics_area::set_ok_status().
      
      Fix:
      
      - Change the return type of fix_onwer() from "void" to "bool"
        and return "true" whenever an error is put to the DA
        (e.g. on the syntax error in the pattern).
      - Fixing fix_length_and_dec() of the mentioned Item_func_xxx
        classes to return "true" if fix_onwer() returned "true".
      dc25d600
    • Yuchen Pei's avatar
      MDEV-30727 Check spider_hton_ptr in spider udfs · 267dd5a9
      Yuchen Pei authored
      We have to #undef my_error and find it from udfs when spider is not
      installed.
      267dd5a9
  2. 28 Apr, 2024 2 commits
  3. 27 Apr, 2024 1 commit
    • Alexander Barkov's avatar
      MDEV-33534 UBSAN: Negation of -X cannot be represented in type 'long long... · 3141a68b
      Alexander Barkov authored
      MDEV-33534 UBSAN: Negation of -X cannot be represented in type 'long long int'; cast to an unsigned type to negate this value to itself in my_double_round from sql/item_func.cc|
      
      The negation in this line:
        ulonglong abs_dec= dec_negative ? -dec : dec;
      did not take into account that 'dec' can be the smallest possible
      signed negative value -9223372036854775808. Its negation is
      an operation with an undefined behavior.
      
      Fixing the code to use Longlong_hybrid, which implements a safe
      method to get an absolute value.
      3141a68b
  4. 26 Apr, 2024 1 commit
    • Hugo Wen's avatar
      MDEV-33574 Improve mysqlbinlog error message · 3d417476
      Hugo Wen authored
      Previously, when running mysqlbinlog without providing a binlog file, it
      would print the entire help text, which was very verbose and made it
      difficult to identify the actual issue.
      
      Now change the behavior to print a more concise error message instead:
      
          "ERROR: Please provide the log file(s). Run with '--help' for usage instructions."
      
      This makes the error output more user-friendly and easier to understand,
      especially when running the tool in scripts or automated processes.
      
      All new code of the whole pull request, including one or several files
      that are either new files or modified ones, are contributed under the
      BSD-new license. I am contributing on behalf of my employer
      Amazon Web Services, Inc.
      3d417476
  5. 25 Apr, 2024 5 commits
    • Jan Lindström's avatar
      MDEV-33896 : Galera test failure on galera_3nodes.MDEV-29171 · b3e531a3
      Jan Lindström authored
      Based on logs we might start SST before donor has reached
      Primary state. Because this test shutdowns all nodes we
      need to make sure when we start nodes that previous nodes
      have reached Primary state and joined the cluster.
      Signed-off-by: default avatarJulius Goryavsky <julius.goryavsky@mariadb.com>
      b3e531a3
    • Marko Mäkelä's avatar
      MDEV-26450 fixup: Remove a bogus assertion · 10d251e0
      Marko Mäkelä authored
      mtr_t::commit_shrink(): Do not assert that some previously clean pages
      will be flagged as modified by this mini-transaction. It could be the
      case that there had been no recent write-back of any of the undo
      tablespace pages that we are modifying when truncating the tablespace.
      It suffices to assert that some pages were modified again:
      ut_ad(m_modifications).
      
      This fixes up commit f5fddae3
      10d251e0
    • Kristian Nielsen's avatar
      MDEV-33602: Sporadic test failure in rpl.rpl_gtid_stop_start · 553a4d62
      Kristian Nielsen authored
      The test could fail with a duplicate key error because switching to non-GTID
      mode could start at the wrong old-style position. The position could be
      wrong when the previous GTID connect was stopped before receiving the fake
      GTID list event which gives the old-style position corresponding to the GTID
      connected position.
      
      Work-around by injecting an extra event and syncing the slave before
      switching to non-GTID mode.
      Signed-off-by: default avatarKristian Nielsen <knielsen@knielsen-hq.org>
      553a4d62
    • Marko Mäkelä's avatar
      MDEV-33974 Enable GNU libstdc++ debugging · a1c1f502
      Marko Mäkelä authored
      Starting with GCC 10, let us enable _GLIBCXX_DEBUG as well as
      _GLIBCXX_ASSERTIONS which have an impact on the GNU libstdc++.
      On GCC 8, we observed a compilation failure related to some
      missing type conversion.
      
      Even though clang on GNU/Linux would default to using libstdc++
      and enabling the debugging seems to work with clang-18, we will
      not enable this on clang, in case it would lead to compilation
      errors.
      
      For the clang libc++ before clang-15 there was _LIBCPP_DEBUG,
      but according to
      llvm/llvm-project@f3966eaf869b7bdd9113ab9d5b78469eb0f5f028 and
      llvm/llvm-project@13ea1343231fa4ae12fe9fba4c789728465783d7 and
      llvm/llvm-project@ff573a42cd1f1d05508f165dc3e645a0ec17edb5 it
      looks like that for proper results, a specially built debug version
      of libc++ would have to be used in order to enable equivalent checks.
      
      This should help catch bugs like the one that
      commit 455a15fd fixed.
      
      Reviewed by: Sergei Golubchik
      a1c1f502
    • Marko Mäkelä's avatar
      MDEV-23974 fixup: Cover all debug builds · 72293842
      Marko Mäkelä authored
      While commit 75b7cd68 was a significant
      improvement, we occasionally got test failures of debug builds. One of
      the affected tests is innodb.innodb-64k-crash.
      72293842
  6. 24 Apr, 2024 3 commits
  7. 23 Apr, 2024 3 commits
  8. 22 Apr, 2024 2 commits
    • Alexander Barkov's avatar
      MDEV-21034 GREATEST() and LEAST() malfunction for NULL · 24abbb9b
      Alexander Barkov authored
      There is a convention that Item::val_int() and Item::val_real() return
      SQL NULL doing effectively what this code does:
        null_value= true;
        return 0; // Always return 0 for SQL NULL
      
      This is done to optimize boolean value evaluation:
      if Item::val_int() or Item::val_real() returned 1 -
      that always means TRUE and never can means SQL NULL.
      This convention helps to avoid unnecessary testing
      Item::null_value after getting a non-zero return value.
      
      Item_func_min_max did not follow this convention.
      It could return a non-zero value together with null_value==true.
      This made evaluate_join_record() erroneously misinterpret
      SQL NULL as TRUE in this call:
      
        select_cond_result= MY_TEST(select_cond->val_int());
      
      Fixing Item_func_min_max to follow the convention.
      24abbb9b
    • Markus Staab's avatar
      Remove unnecessary whitespace in mysqldump · 361b7903
      Markus Staab authored
      361b7903
  9. 20 Apr, 2024 2 commits
    • Kristian Nielsen's avatar
      MDEV-19415: use-after-free on charsets_dir from slave connect · 57f6a1ca
      Kristian Nielsen authored
      The slave IO thread sets MYSQL_SET_CHARSET_DIR. The code for this option
      however is not thread-safe in sql-common/client.c. The value set is
      temporarily written to mysys global variable `charsets-dir` and can be seen
      by other threads running in parallel, which can result in use-after-free
      error.
      
      Problem was visible as random failures of test cases in suite multi_source
      with Valgrind or MSAN.
      
      Work-around by not setting this option for slave connect, it is redundant
      anyway as it is just setting the default value.
      Signed-off-by: default avatarKristian Nielsen <knielsen@knielsen-hq.org>
      57f6a1ca
    • Kristian Nielsen's avatar
      MDEV-30232: rpl.rpl_gtid_crash fails sporadically in BB · 0c249ad7
      Kristian Nielsen authored
      The root cause of the failure is a bug in the Linux network stack:
      
        https://lore.kernel.org/netdev/87sf0ldk41.fsf@urd.knielsen-hq.org/T/#u
      
      If the slave does a connect(2) at the exact same time that kill -9 of the
      master process closes the listening socket, the FIN or RST packet is lost in
      the kernel, and the slave ends up timing out waiting for the initial
      communication from the server. This timeout defaults to
      --slave-net-timeout=120, which causes include/master_gtid_wait.inc to time
      out first and fail the test.
      
      Work-around this problem by reducing the --slave-net-timeout for this test
      case. If this problem turns up in other tests, we can consider reducing the
      default value for all tests.
      Signed-off-by: default avatarKristian Nielsen <knielsen@knielsen-hq.org>
      0c249ad7
  10. 19 Apr, 2024 3 commits
  11. 18 Apr, 2024 2 commits
    • Vladislav Vaintroub's avatar
      MDEV-16944 postfix. Fix a typo · 2e84560d
      Vladislav Vaintroub authored
      2e84560d
    • mariadb-DebarunBanerjee's avatar
      MDEV-32489 Change buffer index fails to delete the records · 5928e04d
      mariadb-DebarunBanerjee authored
      When the change buffer records for a page span across multiple change
      buffer leaf pages or the starting record is at the beginning of a page
      with a left sibling, ibuf_delete_recs deletes only the records in first
      page and fails to move to subsequent pages.
      
      Subsequently a slow shutdown hangs trying to delete those left over
      records.
      
      Fix-A: Position the cursor to an user record in B-tree and exit only
      when all records are exhausted.
      
      Fix-B: Make sure we call ibuf_delete_recs during slow shutdown for
      pages with IBUF entries to cleanup any previously left over records.
      5928e04d
  12. 17 Apr, 2024 10 commits
    • Brandon Nesterenko's avatar
      MDEV-27512: Assertion !thd->transaction_rollback_request failed in rows_event_stmt_cleanup · 0ad52e4d
      Brandon Nesterenko authored
      If replicating an event in ROW format, and InnoDB detects a deadlock
      while searching for a row, the row event will error and rollback in
      InnoDB and indicate that the binlog cache also needs to be cleared,
      i.e. by marking thd->transaction_rollback_request. In the normal
      case, this will trigger an error in Rows_log_event::do_apply_event()
      and cause a rollback. During the Rows_log_event::do_apply_event()
      cleanup of a successful event application, there is a DBUG_ASSERT in
      log_event_server.cc::rows_event_stmt_cleanup(), which sets the
      expectation that thd->transaction_rollback_request cannot be set
      because the general rollback (i.e. not the InnoDB rollback) should
      have happened already. However, if the replica is configured to skip
      deadlock errors, the rows event logic will clear the error and
      continue on, as if no error happened. This results in
      thd->transaction_rollback_request being set while in
      rows_event_stmt_cleanup(), thereby triggering the assertion.
      
      This patch fixes this in the following ways:
       1) The assertion is invalid, and thereby removed.
       2) The rollback case is forced in rows_event_stmt_cleanup() if
      transaction_rollback_request is set.
      
      Note the differing behavior between transactions which are skipped
      due to deadlock errors and other errors. When a transaction is
      skipped due to an ignored deadlock error, the entire transaction is
      rolled back and skipped (though note MDEV-33930 which allows
      statements in the same transaction after the deadlock-inducing one
      to commit). When a transaction is skipped due to ignoring a
      different error, only the erroring statements are rolled-back and
      skipped - the rest of the transaction will execute as normal. The
      effect of this can be seen in the test results. The added test case
      to rpl_skip_error.test shows that only statements which are ignored
      due to non-deadlock errors are ignored in larger transactions. A
      diff between rpl_temporary_error2_skip_all.result and
      rpl_temporary_error2.result shows that all statements in the errored
      transaction are rolled back (diff pasted below):
      
      : diff rpl_temporary_error2.result rpl_temporary_error2_skip_all.result
      49c49
      < 2	1
      ---
      > 2	NULL
      51c51
      < 4	1
      ---
      > 4	NULL
      53c53
      < * There will be two rows in t2 due to the retry.
      ---
      > * There will be one row in t2 because the ignored deadlock does not retry.
      57d56
      < 1
      59c58
      < 1
      ---
      > 0
      
      Reviewed By:
      ============
      Andrei Elkin <andrei.elkin@mariadb.com>
      0ad52e4d
    • Vladislav Vaintroub's avatar
      MDEV-16944 Fix file sharing issues on Windows in mysqltest · 061adae9
      Vladislav Vaintroub authored
      On Windows systems, occurrences of ERROR_SHARING_VIOLATION due to
      conflicting share modes between processes accessing the same file can
      result in CreateFile failures.
      
      mysys' my_open() already incorporates a workaround by implementing
      wait/retry logic on Windows.
      
      But this does not help if files are opened using shell redirection like
      mysqltest traditionally did it, i.e via
      
      --echo exec "some text" > output_file
      
      In such cases, it is cmd.exe, that opens the output_file, and it
      won't do any sharing-violation retries.
      
      This commit addresses the issue by introducing a new built-in command,
      'write_line', in mysqltest. This new command serves as a brief alternative
      to 'write_file', with a single line output, that also resolves variables
      like "exec" would.
      
      Internally, this command will use my_open(), and therefore retry-on-error
      logic.
      
      Hopefully this will eliminate the very sporadic "can't open file because
      it is used by another process" error on CI.
      061adae9
    • Vladislav Vaintroub's avatar
      Remove duplicate key "Language" from .clang-format · b48de973
      Vladislav Vaintroub authored
      Latest Visual Studio complains about invalid format, it breaks formatting
      in the IDE
      b48de973
    • Vladislav Vaintroub's avatar
      Do not run maria_recover_encrypted with embedded. · 173847b7
      Vladislav Vaintroub authored
      It uses shutdown/restart etc, features not compatible the embedded.
      
      also add have_debug.inc , since it uses debug_dbug variable
      173847b7
    • Vladislav Vaintroub's avatar
      Fix LTO (aka interprocedural optimization) build with MSVC · e87a175b
      Vladislav Vaintroub authored
      Also, disable MSVC LTO for static client libraries - they won't be usable
      for end-users.
      e87a175b
    • mariadb-DebarunBanerjee's avatar
      MDEV-33431 Latching order violation reported fil_system.sys_space.latch and... · 040069f4
      mariadb-DebarunBanerjee authored
      MDEV-33431 Latching order violation reported fil_system.sys_space.latch and ibuf_pessimistic_insert_mutex
      
      Issue:
      ------
      The actual order of acquisition of the IBUF pessimistic insert mutex
      (SYNC_IBUF_PESS_INSERT_MUTEX) and IBUF header page latch
      (SYNC_IBUF_HEADER) w.r.t space latch (SYNC_FSP) differs from the order
      defined in sync0types.h. It was not discovered earlier as the path to
      ibuf_remove_free_page was not covered by the mtr test. Ideal order and
      one defined in sync0types.h is as follows.
      SYNC_IBUF_HEADER -> SYNC_IBUF_PESS_INSERT_MUTEX -> SYNC_FSP
      
      In ibuf_remove_free_page, we acquire space latch earlier and we have
      the order as follows resulting in the assert with innodb_sync_debug=on.
      SYNC_FSP -> SYNC_IBUF_HEADER -> SYNC_IBUF_PESS_INSERT_MUTEX
      
      Fix:
      ---
      We do maintain this order in other places and there doesn't seem to be
      any real issue here. To reduce impact in GA versions, we avoid doing
      extensive changes in mutex ordering to match the current
      SYNC_IBUF_PESS_INSERT_MUTEX order. Instead we relax the ordering check
      for IBUF pessimistic insert mutex using SYNC_NO_ORDER_CHECK.
      040069f4
    • Vladislav Vaintroub's avatar
      MDEV-33840 tpool- switch to longer maintainence timer interval, if pool is idle · f6e9600f
      Vladislav Vaintroub authored
      Previous solution, that would entirely switch timer off, turned out
      to be deadlock prone.
      
      This patch fixed previous attempt to switch between long/short interval
      periods in MDEV-24295. Now, initial state of the timer is fixed (it is ON).
      Also, avoid switching timer to longer periods if there is any activity in
      the pool.
      f6e9600f
    • Vladislav Vaintroub's avatar
      2ba79aba
    • Marko Mäkelä's avatar
      Merge 10.4 into 10.5 · 3a3fe300
      Marko Mäkelä authored
      3a3fe300
    • Marko Mäkelä's avatar
      Tests: remove a duplicated check · 9164c2b8
      Marko Mäkelä authored
      This fixes up the merge commit 9b182756
      9164c2b8