Commits · 8e797ae26753b9bd3f8fd107175d4751953d875b · nexedi / MariaDB

03 May, 2015 16 commits

MDEV-8014 MariaDB client can hang in an infinite loop · 8e797ae2

Sergei Golubchik authored May 03, 2015

On EOF, vio_read returns 0. This is not an error, so errno
is not reset. If the previous error was EINTR, the client
will loop forever. See also man recv.
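
The failure mode can be illustrated with plain POSIX calls. The following is a minimal sketch, not the actual vio or client code: `read_retrying_on_eintr` and `eof_with_stale_eintr` are hypothetical names, and a pipe with `read()` stands in for the socket and `recv()`. The helper looks at errno only when the call reports failure, so a stale EINTR cannot turn EOF into an endless retry.

```cpp
#include <cerrno>
#include <cstddef>
#include <unistd.h>

// Hypothetical helper, not MariaDB's vio_read(): read from fd and retry
// only when read() itself failed with EINTR.
ssize_t read_retrying_on_eintr(int fd, void *buf, size_t len)
{
  for (;;)
  {
    ssize_t n = read(fd, buf, len);
    if (n >= 0)
      return n;            // data or EOF (0): errno is not meaningful here
    if (errno != EINTR)
      return -1;           // real error, report it to the caller
    // n < 0 and errno == EINTR: interrupted by a signal, try again
  }
}

// Demonstration: EOF on a pipe while errno still holds a stale EINTR.
// Returns the helper's result, or -1 if the pipe could not be created.
ssize_t eof_with_stale_eintr()
{
  int fds[2];
  if (pipe(fds) != 0)
    return -1;
  close(fds[1]);           // no writer left: the next read() reports EOF
  errno = EINTR;           // simulate the leftover error from an earlier call
  char buf[16];
  ssize_t n = read_retrying_on_eintr(fds[0], buf, sizeof buf);
  close(fds[0]);
  return n;
}
```

A loop that tested `errno == EINTR` without first checking the return value would spin on the same EOF.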

8e797ae2

MDEV-7781 cannot install/uninstall plugins during bootstrap · aa509562

Sergei Golubchik authored May 03, 2015

Merge branch 'openquery:mdev-7781-allow-install-uninstall-plugins-during-bootstrap' into 10.0
Undo MySQL fix for bug#46261

aa509562

clarify the test case · dbe97bcc
Sergei Golubchik authored May 03, 2015

dbe97bcc
MDEV-7390 alter online table xxxx (no options) should be possible · c8c51cee
Sergei Golubchik authored May 03, 2015
```
Merge branch 'openquery:mdev-7390-alter-online-table-xx-possible-10.0' into 10.0
```
c8c51cee
more tests, moving code around · 532de702
Sergei Golubchik authored May 03, 2015

532de702
Fix connection thread handling to address MDEV-6282 MDEV-6345 and MDEV-6784 · a2297506
Sergei Golubchik authored May 03, 2015
```
Merge branch 'pastcomputer:10.0-oqgraph-6282-6345-6784-test' into 10.0
```
a2297506
SSL: Verbosely report SSL initialization errors · ef1eb9c6
Sergei Golubchik authored May 02, 2015
```
And don't ignore SSL_CTX_set_tmp_dh() failures
```
ef1eb9c6
MDEV-7794 MariaDB - mysql-test - fips: some ssl tests with cipher are failing · 601dcd49
Sergei Golubchik authored May 02, 2015
```
change openssl_1 test not to use non-FIPS ciphers
```
601dcd49

MDEV-7695 MariaDB - ssl - fips: can not connect with... · 7e7dd8e8

Sergei Golubchik authored May 02, 2015

MDEV-7695 MariaDB - ssl - fips: can not connect with --ssl-cipher=DHE-RSA-AES256-SHA - handshake failure

Change 512bit DH key to 1024bit to meet FIPS requirements

7e7dd8e8

remove unused file and unnecessary #include · e1e1f94f
Sergei Golubchik authored May 01, 2015

e1e1f94f

MDEV-7788 my_md5 crashes with openssl in fips mode · 93c563d3

Sergei Golubchik authored May 01, 2015

Tell OpenSSL to use MD5 even if FIPS prohibits it.
This is fine as long as we do not use MD5 for cryptographical
purposes (md5 is used internally for P_S message digests and for view
checksums)

93c563d3

MDEV-7697 Client reports ERROR 2006 (MySQL server has gone away) or ERROR 2013... · cc12a35c

Sergei Golubchik authored May 01, 2015

MDEV-7697 Client reports ERROR 2006 (MySQL server has gone away) or ERROR 2013 (Lost connection to MySQL server during query) while executing AES* functions under SSL

Clear OpenSSL error queue after an error in AES_ENCRYPT/AES_DECRYPT.
Otherwise it might affect current ssl-encrypted connection.

cc12a35c

MDEV-5114 seconds_behind_master flips to 0 & spikes back, when running show slaves status · f875c9f2

Sergei Golubchik authored Apr 30, 2015

1. After a period of wait (where last_master_timestamp=0)
   do NOT restore the last_master_timestamp to the timestamp
   of the last executed event (which would mean we've just
   executed it, and we're that much behind the master).

2. Update last_master_timestamp before executing the event,
   not after.

Take the approach from this commit (but with a different test
case that actually makes sense):

commit 0c75ab453fb8c5439576af8fe5add7a1b89f1569
Author: Luis Soares <luis.soares@sun.com>
Date:   Thu Apr 15 17:39:31 2010 +0100

    BUG#52166: Seconds_Behind_Master spikes after long idle period
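
The two points above can be sketched as follows. This is a simplified illustration under stated assumptions, not the server code: `slave_state` and the three functions are hypothetical, and the lag computation ignores the clock difference between master and slave.

```cpp
#include <algorithm>

// Hypothetical, simplified stand-in for the slave's replication state.
struct slave_state
{
  long long last_master_timestamp = 0;  // 0 while waiting for new events
};

// Point 1: when the SQL thread starts waiting, the timestamp is zeroed,
// and it is not restored to the last executed event when the wait ends.
void on_wait_for_events(slave_state &s)
{
  s.last_master_timestamp = 0;
}

// Point 2: the timestamp is updated before the event is executed, so the
// lag is already visible while the event is being applied.
void before_execute_event(slave_state &s, long long event_timestamp)
{
  s.last_master_timestamp = event_timestamp;
}

// Simplified lag computation: 0 while idle, otherwise the age of the
// event currently being applied (never negative).
long long seconds_behind_master(const slave_state &s, long long now)
{
  if (s.last_master_timestamp == 0)
    return 0;
  return std::max(0LL, now - s.last_master_timestamp);
}
```

In this model an idle slave keeps reporting 0 until the next event arrives, instead of jumping to the age of the last event it executed.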

f875c9f2

init_status_vars() was not invoked for embedded · e6d918ca
Sergei Golubchik authored May 03, 2015
```
which failed main.features test in embedded, because
status variables were not sorted
```
e6d918ca
reformat long strings · 91f8931e
Sergei Golubchik authored May 03, 2015
```
(to help 'git diff' show the correct function for hunks)
```
91f8931e

MDEV-7774: Crash when dropping user within rebuild_role_grants · 6c55e52b

Vicențiu Ciorbaru authored Mar 13, 2015

The issue comes from not considering all the ways an entry within
the roles_mapping HASH can match when updating the data structure.

6c55e52b

02 May, 2015 1 commit

MDEV-7038 Assertion `status_var.memory_used == 0' failed in THD::~THD() on... · acab0faa

Vicențiu Ciorbaru authored May 02, 2015

MDEV-7038 Assertion `status_var.memory_used == 0' failed in THD::~THD() on disconnect after executing EXPLAIN for multi-table UPDATE

Added test case that caught this bug. It is no longer reproducible in
the current tree.

acab0faa

01 May, 2015 1 commit

MDEV-8079: Crash when running MariaDB Debug with InnoDB on Windows · 37093eb5

Jan Lindström authored May 01, 2015

The problem was that a std::vector was allocated using calloc instead
of new, so the vector constructor was never called and the vector
metadata was not initialized.
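
A minimal sketch of the difference, assuming a hypothetical struct `page_list_t` that owns a vector (the InnoDB type involved is not named in the commit message). The placement-new variant is an additional option shown for illustration; the message itself only mentions `new`.

```cpp
#include <cstdlib>
#include <new>
#include <vector>

// Hypothetical struct standing in for an object that owns a std::vector.
struct page_list_t
{
  std::vector<int> pages;
};

// Broken pattern (shown only as a comment): calloc() returns zeroed bytes,
// but the constructor of `pages` never runs, so using it is undefined.
//   page_list_t *p = static_cast<page_list_t *>(calloc(1, sizeof(page_list_t)));

// Option 1: let new allocate the memory and run the constructor.
page_list_t *create_with_new()
{
  return new page_list_t();
}

// Option 2: keep the C allocator, but construct in place with placement new.
page_list_t *create_with_calloc()
{
  void *mem = std::calloc(1, sizeof(page_list_t));
  if (!mem)
    return nullptr;
  return new (mem) page_list_t();
}

// Teardown matching option 2: run the destructor, then free the memory.
void destroy_calloced(page_list_t *p)
{
  if (!p)
    return;
  p->~page_list_t();
  std::free(p);
}
```

An object from `create_with_new()` is released with `delete`; one from `create_with_calloc()` must go through `destroy_calloced()` so the vector's own storage is freed as well.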

37093eb5

30 Apr, 2015 2 commits
- Alter online table x (no options) possible · 2bb0e713
  Daniel Black authored Mar 12, 2015
```
This no-op operation should be able to run online, without taking
locks.
```
  2bb0e713
- Merge test for bug#72594 from upstream · 320240be
  Nirbhay Choubey authored Apr 30, 2015
  
  320240be
29 Apr, 2015 1 commit
- MDEV-7802: group commit status variable addition · 9088f26f
  Kristian Nielsen authored Apr 29, 2015
```
Backport into 10.0
```
  9088f26f
28 Apr, 2015 1 commit

MDEV-7864: Slave SQL: stopping on non-last RBR event with annotations results in SEGV (signal 11) · ed701c6a

Kristian Nielsen authored Apr 28, 2015

The slave SQL thread was clearing serial_rgi->thd before deleting
serial_rgi, which could cause access to NULL THD.

The clearing was introduced in commit
2e100cc5 and is just plain wrong. So revert
that part (single line) of that commit.

Thanks to Daniel Black for bug analysis and test case.

ed701c6a

24 Apr, 2015 1 commit

MDEV-7130: MASTER_POS_WAIT(log_name,log_pos,timeout,"connection_name") hangs,... · 060ec5b6

f4rnham authored Apr 24, 2015

MDEV-7130: MASTER_POS_WAIT(log_name,log_pos,timeout,"connection_name") hangs, does not respect the timeout

Also changed the arg_count check for connection_name to prevent the
same bug if a fifth argument is introduced in the future.
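
The commit message does not show the code. As a rough sketch, assuming the bug came from testing the argument count for equality, optional positional arguments can be read with `>=` so that adding a later argument does not disable the earlier ones. `wait_args` and `parse_wait_args` are hypothetical names used only for this illustration.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical parsed form of the MASTER_POS_WAIT() arguments.
struct wait_args
{
  long long timeout;            // 0 means "wait without a timeout"
  std::string connection_name;  // empty means the default connection
};

// args[0] = log_name, args[1] = log_pos,
// args[2] = timeout (optional), args[3] = connection_name (optional).
wait_args parse_wait_args(const std::vector<std::string> &args)
{
  wait_args out{0, ""};
  const std::size_t arg_count = args.size();
  // ">=" rather than "==": an "arg_count == 3" test would silently drop
  // the timeout as soon as a fourth argument is supplied.
  if (arg_count >= 3)
    out.timeout = std::stoll(args[2]);
  // Same reasoning for a possible fifth argument in the future.
  if (arg_count >= 4)
    out.connection_name = args[3];
  return out;
}
```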

060ec5b6

23 Apr, 2015 1 commit

MDEV-8031: Parallel replication stops on "connection killed" error (probably... · b616991a

Kristian Nielsen authored Apr 23, 2015

MDEV-8031: Parallel replication stops on "connection killed" error (probably incorrectly handled deadlock kill)

There was a rare race, where a deadlock error might not be correctly
handled, causing the slave to stop with something like this in the error
log:

150423 14:04:10 [ERROR] Slave SQL: Connection was killed, Gtid 0-1-2, Internal MariaDB error code: 1927
150423 14:04:10 [Warning] Slave: Connection was killed Error_code: 1927
150423 14:04:10 [Warning] Slave: Deadlock found when trying to get lock; try restarting transaction Error_code: 1213
150423 14:04:10 [Warning] Slave: Connection was killed Error_code: 1927
150423 14:04:10 [Warning] Slave: Connection was killed Error_code: 1927
150423 14:04:10 [ERROR] Error running query, slave SQL thread aborted. Fix the problem, and restart the slave SQL thread with "SLAVE START". We stopped at log 'master-bin.000001 position 1234

The problem was incorrect error handling. When a deadlock is detected, it
causes a KILL CONNECTION on the offending thread. This error is then later
converted to a deadlock error, and the transaction is retried.

However, the deadlock error was not cleared at the start of the retry, nor
was the lingering kill signal. So it was possible to get another deadlock
kill early during retry. If this happened with particular thread
scheduling/timing, it was possible that the new KILL CONNECTION error was
masked by the earlier deadlock error, so that the second kill was not
properly converted into a deadlock error and retry.

This patch adds code that clears the old error and killed flag before
starting the retry. It also adds code to handle a deadlock kill caught in a
couple of places where it was not handled before.
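
The reset-before-retry idea can be sketched as below. This is a simplified model under stated assumptions, not the replication code: `worker_state`, `raise_error` and `run_with_retries` are hypothetical, and the model does not reproduce the thread-scheduling race itself. The error numbers are the ones shown in the log excerpt above.

```cpp
#include <functional>

// Error numbers as they appear in the log excerpt above.
const int ER_DEADLOCK = 1213;
const int ER_KILLED = 1927;

// Hypothetical per-worker state, not the server's own structures.
struct worker_state
{
  int  error = 0;       // first error raised by the current attempt, 0 if none
  bool killed = false;  // set when deadlock detection kills this worker
};

// Record an error unless one is already pending (the first error wins).
void raise_error(worker_state &st, int code)
{
  if (st.error == 0)
    st.error = code;
}

// Run `attempt` until it succeeds, fails with a non-deadlock error, or the
// retry budget is used up. Returns the final error (0 on success).
int run_with_retries(worker_state &st,
                     const std::function<void(worker_state &)> &attempt,
                     int max_retries)
{
  for (int tries = 0; ; ++tries)
  {
    // Start every attempt from a clean slate, so that a stale error or
    // kill flag cannot hide what happens during this attempt.
    st.error = 0;
    st.killed = false;

    attempt(st);

    // A kill that came from deadlock detection is treated as a deadlock.
    if (st.killed && st.error == ER_KILLED)
      st.error = ER_DEADLOCK;

    if (st.error != ER_DEADLOCK || tries >= max_retries)
      return st.error;
  }
}
```

Because `raise_error` keeps the first pending error, dropping the two reset lines would leave the old deadlock error in place and make a successful retry look like another deadlock.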

b616991a

21 Apr, 2015 1 commit
- MDEV-8029: test failure in rpl.rpl_parallel_temptable · 47605287
  Kristian Nielsen authored Apr 21, 2015
```
Fix a silly typo that caused the test to occasionally fail.
```
  47605287
20 Apr, 2015 2 commits
- MDEV-8016: Replication aborts on DROP /*!40005 TEMPORARY */ TABLE IF EXISTS · 519ad0f7
  Kristian Nielsen authored Apr 20, 2015
```
This was a regression from the patch for MDEV-7668.

A test was incorrect, so the slave would not properly handle re-using
temporary tables, which led to replication failure in this case.
```
  519ad0f7
- test case for install plugin on bootstrap · 0759568b
  Daniel Black authored Apr 20, 2015
  
  0759568b
19 Apr, 2015 1 commit
- Increase the version number · 87d54383
  Elena Stepanova authored Apr 20, 2015
  
  87d54383
14 Apr, 2015 2 commits

Merge MDEV-7975 into 10.0 · a8523559
Kristian Nielsen authored Apr 14, 2015

a8523559

MDEV-7975: sporadic failure in test case rpl.rpl_gtid_startpos · 5d2b85a2

Kristian Nielsen authored Apr 14, 2015

Add some suppressions that were missing. They cover the case where a STOP
SLAVE is executed early during IO thread startup, when it is negotiating with the
master. The master connection may be killed in the middle of a
mysql_real_query(), which is not a test failure if it is a network error.

This also caught one real code error, fixed with this commit: The I/O thread
would fail to automatically reconnect if a network error happened while
fetching the value of @@GLOBAL.gtid_domain_id.

5d2b85a2

13 Apr, 2015 3 commits

Merge MDEV-7936 into 10.0. · 17aff4b1
Kristian Nielsen authored Apr 13, 2015
```
Conflicts:
	sql/sql_base.cc
```
17aff4b1

MDEV-7936: Assertion `!table || table->in_use == _current_thd()' failed on... · 60d094ae

Kristian Nielsen authored Apr 13, 2015

MDEV-7936: Assertion `!table || table->in_use == _current_thd()' failed on parallel replication in optimistic mode

Make sure that in parallel replication, we execute wait_for_prior_commit()
before setting table->in_use for a temporary table. Otherwise we can end up
with two parallel replication worker threads competing with each other for
use of a temporary table.

Re-factor the use of find_temporary_table() to be able to handle errors
in the caller (as wait_for_prior_commit() can return error in case of
deadlock kill).

60d094ae

MDEV-7668: Intermediate master groups CREATE TEMPORARY with INSERT, causing... · c47fe0e9

Kristian Nielsen authored Mar 09, 2015

MDEV-7668: Intermediate master groups CREATE TEMPORARY with INSERT, causing parallel replication failure

[This commit cherry-picked to be able to merge MDEV-7936, of which it
is a pre-requisite, into both 10.0 and 10.1.]

Parallel replication depends on locking (table locks, row locks, etc.) to
prevent two conflicting transactions from running and committing in parallel.
But temporary tables are designed to be visible only to one thread, and have
no such locking.

In the concrete issue, an intermediate master could commit a CREATE TEMPORARY
TABLE in the same group commit as an INSERT into that table. Thus, a
lower-level slave could attempt to run them in parallel and get an error.

More generally, we need protection from parallel replication trying to run
transactions in parallel that access a common temporary table.

This patch simply causes use of a temporary table from parallel replication
to wait for all previous transactions to commit, serialising the replication
at that point.

(A more fine-grained locking could be added later, possibly. However,
using temporary tables in statement-based replication is in any case
normally undesirable; for example a restart of the server will lose
temporary tables and can break replication).

Note that row-based replication is not affected, as it does not use any
temporary tables on the slave side.

This patch also cleans up the locking around protecting the list of
temporary tables in Relay_log_info. This used to take the
rli->data_lock at the end of every statement, which is very bad for
concurrency. With this patch, the lock is not taken unless temporary
tables (with statement-based binlogging) are in use on the slave.

c47fe0e9

09 Apr, 2015 2 commits

Merge MDEV-7940 into 10.0 · 50d98e9c
Kristian Nielsen authored Apr 09, 2015

50d98e9c

MDEV-7940: Sporadic failure in rpl.rpl_gtid_until · 15a2b5aa

Kristian Nielsen authored Apr 09, 2015

Fix a race in the test case. When we do start_slave.inc immediately
followed by stop_slave.inc, it is possible to kill the IO thread while
it is still running inside get_master_version_and_clock(), and this
gives warnings in the error log that cause the test to fail.

15a2b5aa

08 Apr, 2015 4 commits

Merge MDEV-7910 into 10.0 · 670d4dd8
Kristian Nielsen authored Apr 08, 2015

670d4dd8

MDEV-7910: innodb.binlog_consistent fails sporadically in buildbot · b3c7c8cd

Kristian Nielsen authored Apr 08, 2015

The test case was missing --source include/wait_for_binlog_checkpoint.inc.
So it could occasionally fail if the checkpoint managed to occur just at the
right point in time between fetching the two binlog positions to compare.

b3c7c8cd

Merge MDEV-7888 and MDEV-7929 into 10.0. · accdabd6
Kristian Nielsen authored Apr 08, 2015

accdabd6

MDEV-7888, MDEV-7929: Parallel replication hangs sometimes on ANALYZE TABLE or DDL · 3b961347

Kristian Nielsen authored Apr 08, 2015

The hangs occur when the group_commit_orderer object is freed before the last
mark_start_commit() call on it - this loses the wakeup to other waiting worker
threads, causing them to hang until killed manually.

The object was freed because wakeup_subsequent_commits() was called too early
in two places. For MDEV-7888, during ANALYZE TABLE, and for MDEV-7929 during
record_gtid() after processing a DDL event. The group_commit_orderer object
can be freed when its last transaction has called wait_for_prior_commit().

Fix by implementing a suspend/resume mechanism for wakeup_subsequent_commits()
that can be used in places where a transaction is committed without this being
the commit of the actual replication event group.

Also add a protection mechanism (that asserts in debug builds) which can
prevent the too-early free and hang if other similar bugs should remain in
other parts of the code.

3b961347

06 Apr, 2015 1 commit

MDEV-7908: assertion in innobase_release_savepoint · e9c10f99

Jan Lindström authored Apr 06, 2015

The problem was that releasing a savepoint should still be possible in
the XA prepared state, but the assertions were too strict.

e9c10f99