Twenty-nine research teams analyzed the same data, and they all reached different results.

Ragdoll X@lemmy.world · edit-2 5 months ago

Twenty-nine research teams analyzed the same data, and they all reached different results.

eestileib@sh.itjust.works · 5 months ago

The chart sure makes it look like there was an overall consensus that refs are about 1.5x as likely, though.

Eheran@lemmy.world · 5 months ago

What do you mean, all different? Most are exactly the same. The first 4 are a bit low and the last 3 a bit high, but last 2 and first also extremely wide, so irrelevant anyway. Everything else agrees, most within >99 % confidence with only slight differences on the absolute values.

bhmnscmm@lemmy.world · 5 months ago

9 of the teams reaching a different conclusion is a pretty large group. Nearly a third of the teams, using what I assume are legitimate methods, disagree with the findings of the other 20 teams.

Sure, not all teams disagree, but a lot do. So the issue is whether or not the current research paradigm correctly answers “subjective” questions such as these.

Eheran@lemmy.world · 5 months ago

If we only look that those with p <0.05 (green) and with 95 % confidence interval, then there are 17 teams left. And they all(!) agree with more than 95% conference.

litchralee@sh.itjust.works · 5 months ago

Obligatory link to Statistics Done Wrong: The Woefully Complete Guide, a book on how statistics can and has been abused in subtle and insidious ways, sometimes recklessly. Specifically, the chapters on the consequences of underpowered statistics and comparing statistical significance between studies.

I’m no expert on statistics, but I know enough that repeated experiments should not yield wildly different results unless: 1) the phenomenon under observation is extremely subtle so results are getting lost in noise, 2) the experiments were performed incorrectly, or 3) the results aren’t wildly divergent after all.