- 06 Mar, 2020 1 commit
-
-
Marko Mäkelä authored
recv_log_recover_pre_10_2(): Merged from recv_find_max_checkpoint_0(), recv_log_format_0_recover().
-
- 05 Mar, 2020 7 commits
-
-
Marko Mäkelä authored
Some fields were protected by log_sys.mutex, which adds quite some overhead for readers. Some readers were submitting dirty reads. log_t::lsn: Declare private and atomic. Add wrappers get_lsn() and set_lsn() that will use relaxed memory access. Many accesses to log_sys.lsn are still protected by log_sys.mutex; we avoid the mutex for some readers. log_t::flushed_to_disk_lsn: Declare private and atomic, and move to the same cache line with log_t::lsn. log_t::buf_free: Declare as size_t, and move to the same cache line with log_t::lsn. log_t::check_flush_or_checkpoint_: Declare private and atomic, and move to the same cache line with log_t::lsn. log_get_lsn(): Define as an alias of log_sys.get_lsn(). log_get_lsn_nowait(), log_peek_lsn(): Remove. log_get_flush_lsn(): Define as an alias of log_sys.get_flush_lsn(). log_t::initiate_write(): Replaces log_buffer_sync_in_background().
-
Eugene Kosov authored
O_DSYNC is faster than O_SYNC because it syncs as little as needed (e.g. no timestamp changes) This change is similar to change fsync() -> fdatasync() in MDEV-21382
-
Otto Kekäläinen authored
Since this tool is already shipped in the system, don't provide it from MariaDB from 10.5 series onwards. It will potentially just create an installation failure due to conflict if users install rocksdb-tools. Package rocksdb-tools in Debian since Buster (10): https://packages.debian.org/search?suite=buster&arch=any&mode=path&searchon=contents&keywords=sst_dump Package rocksdb-tools in Ubuntu since Disco (19.04): https://packages.ubuntu.com/search?suite=disco&arch=any&mode=exactfilename&searchon=contents&keywords=sst_dump This change is adopted from Debian the change has been since Dec 18, 2018. https://salsa.debian.org/mariadb-team/mariadb-10.3/-/commit/d50e2eaf1fad46c494bd0f090b0e75c19ac3f41a
-
Otto Kekäläinen authored
The command is needed to install/upgrade databases. It should therefore reside in the server-core package, as that package is supposed to contain all necessary tools to start and run the database (although manually, there are no automation or service scripts included in server-core by design). Also add Breaks+Replaces in the control so upgrades will not complain about overwriting files in previously installed packages of different name. This was adopted from Debian where this change has been since Nov 4, 2018. https://salsa.debian.org/mariadb-team/mariadb-10.1/-/commit/3689350ad30916aa68fe2c1b35c290fb5095bc9b In addition, also move all *.sql files referenced mysql_install_db for the same reasons.
-
Marko Mäkelä authored
This amends commit 37e7bde1
-
Marko Mäkelä authored
The ut_crc32() function uses a hard-coded initial CRC-32C value of 0. Replace it with ut_crc32_low(), which allows to specify the initial checksum value, and provide an inlined compatibility wrapper ut_crc32(). Also, remove non-inlined wrapper functions on ARMv8 and POWER8, and remove dead code (the generic implementation) on POWER8. Note: The original AMD64 instruction set architecture in 2003 only included SSE2. The CRC-32C instructions are part of the SSE4.2 instruction set extension for IA-32 and AMD64, with first processors released in November 2007 (using the AMD Barcelona microarchitecture) and November 2008 (Intel Nehalem microarchiteture). It might be safe to assume that SSE4.2 is available on all currently used AMD64 based systems, but we are not taking that step yet.
-
Marko Mäkelä authored
-
- 04 Mar, 2020 5 commits
-
-
Marko Mäkelä authored
The configuration parameter innodb_scrub_log never really worked, as reported in MDEV-13019 and MDEV-18370. Because MDEV-14425 is changing the redo log format, the innodb_scrub_log feature would have to be adjusted for it. Due to the known problems, it is easier to remove the feature for now, and to ignore and deprecate the parameters. If old log contents should be kept secret, then enabling innodb_encrypt_log or setting a smaller innodb_log_file_size could help.
-
Ian Gilfillan authored
-
Marko Mäkelä authored
MONITOR_PENDING_CHECKPOINT_WRITE and MONITOR_LOG_IO track log_sys.n_pending_checkpoint_writes and log_sys.n_log_ios, respectively. The MONITOR_INC calls are redundant, because the values will be overwritten in srv_mon_process_existing_counter().
-
Marko Mäkelä authored
Compute MONITOR_LSN_CHECKPOINT_AGE on demand in srv_mon_process_existing_counter(). This allows us to remove the overhead of MONITOR_SET calls for the counter.
-
Marko Mäkelä authored
The function log_header_read() was only used during server startup, and it will mostly be used only for reading checkpoint information from pre-MDEV-14425 format redo log files. Let us replace the function with more direct calls, so that it is clearer what is going on. It is not strictly necessary to hold any mutex during this operation, and because there will be only a limited number of operations during early server startup, it is not necessary to increment any I/O counters.
-
- 03 Mar, 2020 9 commits
-
-
Marko Mäkelä authored
Simplify the logging of ALTER TABLE operations, by making use of the TRX_UNDO_RENAME_TABLE undo log record that was introduced in commit 0bc36758. commit_try_rebuild(): Invoke row_rename_table_for_mysql() and actually rename the files before committing the transaction. fil_mtr_rename_log(), commit_cache_rebuild(), log_append_on_checkpoint(), row_merge_rename_tables_dict(): Remove. mtr_buf_copy_t, log_t::append_on_checkpoint: Remove. row_rename_table_for_mysql(): If !use_fk, ignore missing foreign keys. Remove a call to dict_table_rename_in_cache(), because trx_rollback_to_savepoint() should invoke the function if needed.
-
Marko Mäkelä authored
group_commit_lock::release(): Ensure that prev will be initialized, simplify a comparison, and fix some white space.
-
Marko Mäkelä authored
In commit 138cbec5, we computed an incorrect parameter to page_dir_split_slot(), leading us to splitting the wrong directory slot, or an out-of-bounds access when splitting the supremum slot. This was once caught in the test innodb_gis.kill_server for inserting records to a clustered index root page. page_dir_split_slot(): Take the slot as a pointer, instead of a numeric index. page_apply_insert_redundant(), page_apply_insert_dynamic(): Rename slot to last_slot, and make owner_slot a pointer.
-
Marko Mäkelä authored
For undo log truncation, commit 055a3334 repurposed the MLOG_FILE_CREATE2 record with a nonzero page size to indicate that an undo tablespace will be shrunk in size. In commit 7ae21b18 the MLOG_FILE_CREATE2 record was replaced by a FILE_CREATE record. Now that the redo log encoding was changed, there is no actual need to write a file name in the log record; it suffices to write the page identifier of the first page that is not part of the file. This TRIM_PAGES record could allow us to shrink any data files in the future. For now, it will be limited to undo tablespaces. mtr_t::log_file_op(): Remove the parameter first_page_no, because it would always be 0 for file operations. mtr_t::trim_pages(): Replaces fil_truncate_log(). mtr_t::log_write(): Avoid same_page encoding if !bpage&&!m_last. fil_op_replay_rename(): Remove the constant parameter first_page_no=0.
-
Rahul Anand authored
-
Andy Fiddaman authored
MDEV-21476: auth_socket: add support for illumos with getpeerucred()
-
Aleksey Midenkov authored
-
Aleksey Midenkov authored
This patch adds support of RENAME INDEX operation to the ALTER TABLE statement. Code which determines if ALTER TABLE can be done in-place for "simple" storage engines like MyISAM, Heap and etc. was updated to handle ALTER TABLE ... RENAME INDEX as an in-place operation. Support for in-place ALTER TABLE ... RENAME INDEX for InnoDB was covered by MDEV-13301. Syntax changes ============== A new type of <alter_specification> is added: <rename index clause> ::= RENAME ( INDEX | KEY ) <oldname> TO <newname> Where <oldname> and <newname> are identifiers for old name and new name of the index. Semantic changes ================ The result of "ALTER TABLE t1 RENAME INDEX a TO b" is a table which contents and structure are identical to the old version of 't1' with the only exception index 'a' being called 'b'. Neither <oldname> nor <newname> can be "primary". The index being renamed should exist and its new name should not be occupied by another index on the same table. Related to: WL#6555, MDEV-13301
-
Aleksey Midenkov authored
The existing syntax for renaming a column uses "ALTER TABLE ... CHANGE" command. This requires full column specification to rename the column. This patch adds new syntax "ALTER TABLE ... RENAME COLUMN", which do not expect users to provide full column specification. It means that the new syntax would pick in-place or copy algorithm in the same way as that of existing "ALTER TABLE ... CHANGE" command. The existing syntax "ALTER TABLE ... CHANGE" will continue to work. Syntax changes ============== ALTER TABLE tbl_name [alter_specification [, alter_specification] ...] [partition_options] Following is a new <alter_specification> added: | RENAME COLUMN <oldname> TO <newname> Where <oldname> and <newname> are identifiers for old name and new name of the column. Related to: WL#10761
-
- 02 Mar, 2020 4 commits
-
-
Vladislav Vaintroub authored
MY_RELAX_CPU on this arch needs int32, defined in my_global.h
-
Marko Mäkelä authored
Most of the time, we can refer to recv_sys.recovered_lsn.
-
Marko Mäkelä authored
-
Vlad Lesin authored
exist lock_rec_lock() does not set record lock if table lock is stronger or equal to the acquired record lock.
-
- 01 Mar, 2020 2 commits
-
-
Vladislav Vaintroub authored
-
Vladislav Vaintroub authored
Introduce special synchronization primitive group_commit_lock for more efficient synchronization of redo log writing and flushing. The goal is to reduce CPU consumption on log_write_up_to, to reduce the spurious wakeups, and improve the throughput in write-intensive benchmarks.
-
- 28 Feb, 2020 3 commits
-
-
Sergey Vojtovich authored
XA specification doesn't permit empty gtrid. It is now enforced by this patch. This solution was agreed in favour of fixing InnoDB, which doesn't expect empty XID since early 10.5. Also fixed wrong assertion (and added a test cases) that didn't permit 64 bytes gtrid + 64 bytes bqual.
-
Alexander Barkov authored
-
Varun Gupta authored
It is useful to know whether sorting uses addon fields[packed|unpacked] or ROWID. Provide this information in ANALYZE FORMAT=JSON output.
-
- 27 Feb, 2020 9 commits
-
-
Marko Mäkelä authored
-
Marko Mäkelä authored
-
Marko Mäkelä authored
Inserting a record into an index page involves updating multiple fields in the page header as well as updating the next-record links and potentially updating fields related to the sparse page directory. Let us cover the insert operations by higher-level log records, to avoid 'redundant' logging about the writes. The code for applying the high-level log records will check the consistency of the page thoroughly, to avoid crashes during recovery. We will refuse to replay the inserts if any inconsistency is detected. With innodb_force_recovery=1, recovery will continue, but the affected pages may be more inconsistent if some changes were omitted. mrec_ext_t: Introduce the EXTENDED record subtypes INSERT_HEAP_REDUNDANT, INSERT_REUSE_REDUNDANT, INSERT_HEAP_DYNAMIC, INSERT_REUSE_DYNAMIC. The record will explicitly identify the page type and whether the space will be allocated from PAGE_HEAP_TOP or reused from the PAGE_FREE list. It will also tell how many bytes to copy from the preceding record header and payload, and how to initialize the rest of the record header and payload. mtr_t::page_insert(): Write the high-level log records. log_phys_t::apply(): Parse the high-level log records. page_apply_insert_redundant(), page_apply_insert_dynamic(): Apply the high-level log records. page_dir_split_slot(): Introduce a variant that does not write log nor deal with ROW_FORMAT=COMPRESSED pages. page_mem_alloc_heap(): Remove the mtr_t parameter page_cur_insert_rec_low(): Write log only via mtr_t::page_insert().
-
Marko Mäkelä authored
The function page_rec_get_base_extra_size() became dead code in commit 08ba3887.
-
Marko Mäkelä authored
This is a follow-up to commit 572d2075 where we introduced the EXTENDED log record subtypes DELETE_ROW_FORMAT_REDUNDANT and DELETE_ROW_FORMAT_DYNAMIC. log_phys_t::apply(): If corruption was noticed, stop applying the log unless innodb_force_recovery is set.
-
Marko Mäkelä authored
This is a follow-up to commit 84e3f9ce that introduced the EXTENDED log record of UNDO_APPEND subtype. mtr_t::undo_append(): Accurately enforce the mtr_buf_t::MAX_DATA_SIZE limit. Also, replace mtr_buf_t::push() with simpler code, to append 1 byte to the log. log_phys_t::undo_append(): Return whether the page was found to be in an inconsistent state. log_phys_t::apply(): If corruption was noticed, stop applying log unless innodb_force_recovery is set.
-
Sergey Vojtovich authored
-
Daniel-Solo authored
function to release all named locks
-
Sergei Golubchik authored
Revert "MDEV-17554 Auto-create new partition for system versioned tables with history partitioned by INTERVAL/LIMIT" This reverts commit 9894751a. This reverts commit f707c83f.
-