Binlog Servers for Simplifying Point in Time Recovery

来源:互联网 时间:1970-01-01

A common way to implement point in time recovery capability is:

to regularly do a full backup of a database, and to save the binary logs of that database (or from its master if doing backups on a slave).

When point in time recovery is required you need to:

restore a backup, and apply the binary logs up to the point of recovery.

(Step # 2 and # b above are the ones that will be simplified by using Binlog Servers.)

There are many ways of doing a backup of a database:

mysqldump (or mysqlpump) on the master: no need for slaves but taking a dump of a large database without blocking writes and with a consistent binlog position is not trivial, XtraBackupon the master: no need for slaves, but we only have a physical backup, mysqldump (or mysqlpump) on a slave: needs extra hardware but very simple and we now have a logical backup: stop slave sql_thread; do a backup save show slave status/G output save show master status/G output start slave sql_thread; XtraBackup on the slave: this could be useful but I do not have an example in mind.

but saving the binary logs is tricky :

you can use a script to copy the binary logs (many corner cases to take into account), or you can use mysqlbinlog to stream the binary logs directly from a MySQL master (you will still need a wrapper script to restart mysqlbinlog in the case of disconnection from the master).

and applying the binary logs is complicated :

you need to pipe the output of a correctly executed mysqlbinlog to a mysql client (good luck in managing all error cases), or you need to copy the binary logs to your restored database and use CHANGE MASTER TO with RELAY_LOG_FILE and RELAY_LOG_POS (I never tried this myself, I am sure there are some pitfalls).

or you can use a Binlog Server

Binlog Servers download and store an exact copy of the binary logs of a master and serve them to slaves:

from the point of view of the master, a Binlog Server is a standard slave, from the point of view of the slave, the Binlog Server is an exact copy of the master.

You can find more information about Binlog Servers and their use-cases in the following posts from the Booking.com dev blog:

MySQL Slave Scaling (and more), Better Crash-safe replication for MySQL, Better Parallel Replication for MySQL, Abstracting Binlog Servers and MySQL Master Promotion without Reconfiguring all Slaves.

So, Binlog Servers allow to keep an exact copy of the binary logs of a database in a location external from the master: this is exactly what is needed for achieving point in time recovery .

When point in time recovery is needed, after restoring a backup, the restored MySQL is configured as a slave of a Binlog Server . That will bring the database to the required state (point in time) without using file copy or needing running commands on an external server. Moreover, all the error conditions and the corresponding retry logic is already implemented in replication, so you do not need to care about those.

A deployment with no slave (XtraBackup, mysqldump or mysqlpump from the master)would be the following (M is the master and X is a Binlog Server):

If backups are taken on a slave (S), the deployment would be the following:

After restoring a backup (on R below), the database would be brought to the required state by slaving R to X:

If more than one slave is needed and/or more than one copy of the binary logs needs to be kept, one can deploy more than one Binlog Servers (the Binlog Server Layer below). This also allows to take advantage of easy master promotion as described in the following post on the Booking.com dev blog: Abstracting Binlog Servers and MySQL Master Promotion without Reconfiguring all Slaves. The deployment would be the following:

This use-case of the Binlog Servers has been suggested to me by Alessandro Fustini. Alessandro attended my Binlog Server sessionat Percona Live Amsterdamand came up with this idea: thanks Alessandro for this new and original Binlog Server use-case.

If you are in the San Francisco Bay Area at the end of October and want to know more, I will be speaking about Binlog Serves and Replication at Oracle Open World(from October 25 to 29). My two sessions are on October 27:

Binlog Servers at Booking.com(talk) Riding the Binary Logs: Forthcoming Evolution in the Replication Stream(BoF)


相关阅读:
Top