Comments on: Thoughts on MaxScale automated failover (and Orchestrator) https://shlomi-noach.github.io/blog/mysql/thoughts-on-maxscale-automated-failover-and-orchestrator Blog by Shlomi Noach Wed, 08 Mar 2017 10:21:09 +0000 hourly 1 https://wordpress.org/?v=5.3.3 By: Replication Manager is Ready for Flashback and Much More! – Cloud Data Architect https://shlomi-noach.github.io/blog/mysql/thoughts-on-maxscale-automated-failover-and-orchestrator/comment-page-1#comment-385629 Wed, 08 Mar 2017 10:21:09 +0000 https://shlomi-noach.github.io/blog/?p=7439#comment-385629 […] in the roadmap. I guess this task is addressing some of our fellow ace director musings found here.  Also, Etcd is already in the roadmap and will be worked on in the future and receive […]

]]>
By: Prakash https://shlomi-noach.github.io/blog/mysql/thoughts-on-maxscale-automated-failover-and-orchestrator/comment-page-1#comment-355888 Tue, 19 Apr 2016 03:41:39 +0000 https://shlomi-noach.github.io/blog/?p=7439#comment-355888 @Dipti / @Shlomi,

Very interesting and nice presentation. Thank you for the detailed explanation.

Would like to know, when can we expect Multiple-Maxscale servers for deployment? Having single MaxScale is a single point of failure for critical applications. Similar to master-slave for MariaDB database, it’s better to have the same concept for MaxScale as well.

If MaxScale server is down, then there is no point of having master with many slaves. It will defeat the purpose.

If you recommend any work-around for this? Your suggestion is greatly appreciated. Thank you very much in advance.

]]>
By: shlomi https://shlomi-noach.github.io/blog/mysql/thoughts-on-maxscale-automated-failover-and-orchestrator/comment-page-1#comment-334919 Wed, 18 Nov 2015 20:11:42 +0000 https://shlomi-noach.github.io/blog/?p=7439#comment-334919 @Dipti,
Thank you for further clarifying the internals of MaxScale

]]>
By: Dipti Joshi https://shlomi-noach.github.io/blog/mysql/thoughts-on-maxscale-automated-failover-and-orchestrator/comment-page-1#comment-334911 Wed, 18 Nov 2015 18:57:53 +0000 https://shlomi-noach.github.io/blog/?p=7439#comment-334911 Shlomi:

Thanks for the write up and suggestions.

Few clarification and few comments on failover
(1) MaxScale does not trigger failover it self , but has mechanism to invoke the failover script based on configured events
(2) MaxScale differentiates between having lost communication with master (lost_master event) and having communication with master but the database being down on master (master_down event).
(3) MaxScale can be configured to trigger the external script only on master_down event and not lost_master event.
(4) MaxScale does not automatically invoke MariaDB Replication Manager – MaxScale can be configured to invoke any other external script – could be MHA or could be custom developed by any other user. Some of our users do not like automatic failover, but just notification of master failure – for such users the script could be simply a curl script to send email notification of failure.
(5) Yes, MariaDB Replication Manager script is for MariaDB based database server using MariaDB GTID.
(6) Since MaxScale has knowledge of before failure topology – I agree improvements can be made to MariaDB Replication Manager to use this information rather then rediscovering and not having the pre-failure topology information
(7) While MaxScale and scripts are decoupled, MaxScale does not automatically assume that the failover has been resolved upon triggering of script – Maxscale still continues to monitor the topology and detects when a new master has been promoted through its monitoring.
(8) MaxScale has configurable monitor_interval as well as backend_connect_timeout that can be used to control flapping – though another period can be added specifically for triggering failover script to minimize flapping further
(9) Single MaxScale can handle multiple topologies.
(10) We will take your input on shell script and what you have done in orchestrator, as well as tighter integration between MaxScale and the failover script/tool

On High Availability of MaxScale itself:
We are exploring, Multiple-MaxScale servers to load-balance on our roadmap – and will consider all your points on leader election, group communication, taking over from failed MaxScale

Thanks again for detailed blog about your experience and observation on what can be done with MaxScale to address operational scenarios.

]]>
By: shlomi https://shlomi-noach.github.io/blog/mysql/thoughts-on-maxscale-automated-failover-and-orchestrator/comment-page-1#comment-334839 Wed, 18 Nov 2015 11:11:14 +0000 https://shlomi-noach.github.io/blog/?p=7439#comment-334839 @Guillaume,

Thank you. Indeed you are not the first to raise this point, and I understand it. At this time I don’t just rely on MySQL at the backend, I rely on the relational model (so sometimes complex GROUP BY queries etc.). So migration to etcd will be non-trivial.

But there are thoughts in that direction; I don’t expect this to come up in the near future unless someone else steps up to do it.

]]>
By: Guillaume Lefranc https://shlomi-noach.github.io/blog/mysql/thoughts-on-maxscale-automated-failover-and-orchestrator/comment-page-1#comment-334836 Wed, 18 Nov 2015 10:56:20 +0000 https://shlomi-noach.github.io/blog/?p=7439#comment-334836 Hi Shlomi,

very interesting presentation, thank you. I’ll make sure to try Orchestrator in the near future!

However, something that puts me off is the reliance on a mysql datastore. I understand the temptation to use mysql for everything, but wouldn’t a distributed datastore such as consul or etcd be more simple and efficient? I just don’t want to setup a galera cluster for that. Plus consul/etcd also have leader election, so you might find that interesting.

]]>