Comments on: Percona Live 2013 keynotes: followup questions and discussion https://shlomi-noach.github.io/blog/mysql/percona-live-2013-keynotes-followup-questions-and-discussion Blog by Shlomi Noach Mon, 29 Apr 2013 17:50:54 +0000 hourly 1 https://wordpress.org/?v=5.3.3 By: Justin Swanhart https://shlomi-noach.github.io/blog/mysql/percona-live-2013-keynotes-followup-questions-and-discussion/comment-page-1#comment-203300 Mon, 29 Apr 2013 17:50:54 +0000 https://shlomi-noach.github.io/blog/?p=6309#comment-203300 There were a number of posts on Planet inviting people to take the survey. It wasn’t a random sampling, it was an opt-in survey which means it has no scientific value. Aslett can correct me if I’m wrong, but the advertised survey said it was going to be used in a Percona Live keynote, so I avoided taking it, as I’m a Percona Employee and that would create bias.

]]>
By: Marco Tusa https://shlomi-noach.github.io/blog/mysql/percona-live-2013-keynotes-followup-questions-and-discussion/comment-page-1#comment-203076 Sun, 28 Apr 2013 23:17:30 +0000 https://shlomi-noach.github.io/blog/?p=6309#comment-203076 Shlomi, I have start my career looong ago, writing procedures (on mainframe) for a company doing statistical analysis. I have being working in that field for years, writing models to read and interpret the data.

As a matter of fact statistics needs to be taken on a well selected set of representative identities.

When you consider a scenario like MySQL, so wildly use but not universal use (ie water consumption), covering several segments (like enterprise to single developer).
The set cannot be less then the 10 – 15% of the total number per segment.

Questionnaire and reported statistics, must be also differentiate by segment, given each one has a different trend, also if correlated.
Some questions can be generic, but then you need to also have question per segment to contextualize the analysis.

Finally the used set must be describe at the beginning (not at the end), to provide the correct information in order to have the correct in data interpretation.

In short what was present was… nothing.

No meaning, numbers have no sense in this way and taking that conclusions base on those number is simply nonsense.

What you report about you provide your feedback you will be include, shows that he has no idea of what he is talking about, given another characteristic of an study is TIME. You need to collect the information in a specific time frame, and then close.
You cannot add later… is (again) nonsense.

I am not use to be so drastic, but this is the kind of talk that should be review before, to avoid embarrassing situation like this where numbers coming from nowhere and with no meaning are reported as real.

all the best.
Marco

]]>
By: Colin Charles https://shlomi-noach.github.io/blog/mysql/percona-live-2013-keynotes-followup-questions-and-discussion/comment-page-1#comment-202977 Sun, 28 Apr 2013 14:29:19 +0000 https://shlomi-noach.github.io/blog/?p=6309#comment-202977 Hi Shlomi,

You have great questions.

I also wonder who the questionnaire answerers are/who was polled. I know I wasn’t. I’m guessing they are research clients (i.e people whom can afford to pay thousands for a subscription to the reports!).

But I also noted regular mention of the word “Drizzle”. Which implies that the research suggests that Drizzle is still something people consider deploying.

I did fill up the surveymonkey link at the end though. I do hope to participate in future surveys.

]]>
By: Mark Callaghan https://shlomi-noach.github.io/blog/mysql/percona-live-2013-keynotes-followup-questions-and-discussion/comment-page-1#comment-202967 Sun, 28 Apr 2013 13:34:38 +0000 https://shlomi-noach.github.io/blog/?p=6309#comment-202967 Great questions.

A few prominent NoSQL systems (Voldemort, Sherpa/PNUTS) use MySQL for single-node storage. I think the big reason to do that is to use InnoDB. So I wouldn’t be surprised if DynamoDB were to do the same. But this doesn’t mean much to me. As soon as something better comes along MySQL/InnoDB is likely to get swapped out.

Are the big deployments at FB, Google, Amazon 1 user or many users in such a survey?

]]>