What is your favorite "data analysis" cartoon?
This is one of my favorites:
One entry per answer. (This is in the vein of the Stack Overflow question What’s your favorite “programmer” cartoon?.)
P.S. Do not hotlink the cartoon without the site's permission please.
@sharpie: are jokes out? We obviously don't want the entire site to be humor, but everyone benefits from a little educational humor in small doses.
@Sharpie, feel free to close or reopen according to your feelings! I agree with Shane, a bit is ok, but not too much. For example, this question already included a funny cartoon. The jokes question not really a funny joke....
These cartoons are useful too; they can be included in a lecture on a particular topic where you are trying to explain a concept (e.g. correlation/causation above). A little humor can help to keep an audience engaged.
Also my question on the source of a statistical quote was closed too. See here.http://stats.stackexchange.com/questions/15739/who-said-tell-a-story-with-the-data
This question is awesome! it's basically a best of list of xckd and dilbert
Could we clarify the problem with hotlinking referenced in the P.S.? Might need to fix some of these answers, if it's worth worrying about.
According to the tour, this question should be closed, since it is a question that has "too many possible answers" and since it is "primarily opinion-based". I'm not complaining, just surprised it has stayed open for this long.
I decided to recreate this on serverfault. As it seems like a great idea for some levity in an otherwise serious environment. Here's hoping it survives!
Data Science analogy to cartoon in OP. Data Scientist: I went to data science bootcamp and learned how to find correlations in big data. Those insights can be converted into big money. Statistician: But many of those correlations are spurious. Correlation does not imply causation. Data Scientist: Don't give me none of that century old statistics mumbo-jumbo. This is big data. That means the data has everything. So by definition, all relationships in the data are correct. I ring the cash register while you snooze and lose, grandpa.
Was XKCD, so time for Dilbert:
Did anyone else notice that the tour guide changes colors between the second and third frames?
On RANDU: "We guarantee that each number is random individually, but we don't guarantee that more than one of them is random."
One more Dilbert cartoon:
This one reminds me of the recent bailout in the States, where they just made up 700 billion number - they said they just wanted a really large number. :)
From: A visual comparison of normal and paranormal distributions Matthew Freeman J Epidemiol Community Health 2006;60:6. Lower caption says 'Paranormal Distribution' - no idea why the graphical artifact is occuring.
this isn't really funny. it's more of a twist on english terms
Yeah - I think all those jokes suck. There is no underlying statistical humour. This joke should be put on the English stackexchange instead.
'So, uh, we did the green study again and got no link. It was probably a--' 'RESEARCH CONFLICTED ON GREEN JELLY BEAN/ACNE LINK; MORE STUDY RECOMMENDED!'
This is by far my favorite cartoon of all time. It's super educational. It really gets to the heart of the definition of a p-value. In fact, I bet that less than 10% the students who pass a college freshman "intro to stats" class get this joke, and this makes me sad.
Maybe so! Fortunately for freshmen, @Glen_b has offered an excellent breakdown here.
this is a pretty good joke as it clearly demonstrates why repeated multiple testing is dangerous. For anyone interested check out Bonferi correction to deal with this.