> it was one of the strongest causal signals in the team's GWAS analysis (2,801 ...

i_love_limes · 2025-01-29T11:29:59 1738150199

So, it is like that, but it's also not. A couple decades ago, with the advent of GWASes, because of all the multiple testing that was going on, there was an agreed new p-value threshold p < 5e-8. This was to account for multiple testing going on (how that number came to be requires more explanation of LD + other things).

That is the minimum threshold. This study found that peak was at p < 1e-37 or so. But that is where the biological analysis begins. Unlike social scientists, we don't stop with the statistical correlation, we then go on to look at what we know about that gene, the type of mutation, if it's a loss or gain of function, what role that gene has in various tissues, etc. And mendelian randomization is another way to unpick the causal direction of effect.

Not to say this is the truth or causal, but it's a lot closer to causal than what you are implying.

cluckindan · 2025-01-29T11:56:59 1738151819

In other words, there is a likelihood of 1/10,000,000,000,000,000,000,000,000,000,000,000,000 that the statistical result was due to random chance.

In more casual sciences p < 0.05 is considered the limit of significance, i.e. less than 1/20 likelihood of statistical testing favoring the tested hypothesis over the null hypothesis due to random chance

drawkward · 2025-01-29T18:13:11 1738174391

If you are testing a single hypothesis, sure. But nowadays, statistics training really weighs on bonferroni correction, or other methods to deal with the issue raised by the above-referenced XKCD, whentesting multiple hypotheses.

londons_explore · 2025-01-29T12:38:43 1738154323

> This study found that peak was at p < 1e-37 or so.

If true, this would be cause for someone to read through the study to check there are no maths errors, and if it holds up then to take action immediately.

This isn't a "wait for more science to come in and confirm" type thing.

formerly_proven · 2025-01-29T14:45:56 1738161956

> This study found that peak was at p < 1e-37

This is many orders of magnitudes better confidence than any physics experiment, it feels unlikely a biological result can even be this strong, so it makes it sound like a statistical error.

jakobnissen · 2025-01-29T16:25:59 1738167959

Bioinformatician here. These kinds of p-values are common in these kinds of experiments (GWAS, or association studies), and happens almost automatically once you get enough statistical power. The big problem is that once you have so much statistical power, you get very small p-values from small effects, and then the often-overlooked assumptions behind frequentist statistics begins to matter. Are your samples _really_ idenpendent and identically distributed values? No. Are they really normal distributed? No.

Also, things like gut microbiome and depression are linked through what some people call the _crud factor_, which are weak correlations between nearly all social aspects. For example, probably depressed people eat differently from non-depressed people, causing changes in their gut microbiome. Probably, there are variations in human population's depression rates and obesity rates (correlated with gut microbiome) that somewhat correlate. When you have enough statistical power, you see the crud factor everywhere.

gibusen · 2025-01-30T18:27:49 1738261669

I really appreciate your explanation regarding the crud factor. It adds a lot of intuition.

Out of curiosity, what is your perspective on gnotobiotic systems? I distinctly recall an example of a gnotobiotic mouse that, upon being provided a microbial sample from an obese mouse, started to have substantial increases in body fat despite a simultaneous reduction in feed. Would that type of experimental approach still run into the statistical difficulties you mentioned?

i_love_limes · 2025-01-29T17:13:18 1738170798

Well no. It's not uncommon for p-values to be even lower than that. We are talking about a specific SNP (allele) having a specific mutation being predictive of a phenotype / outcome.

So, a specific SNP mutation being predictive of a gene expression / protein is basically a p-value of 0.

Can't speak for physics experiments, but this is almost certainly not a statistical error

adrianN · 2025-01-29T11:00:11 1738148411

They are aware of this which is why they add that those strange lipids have independently been associated with depression.

genewitch · 2025-01-29T11:43:07 1738150987

I put a tally mark under "more bacteria than human"

It used to be called the gut feeling column, but once the Alzheimer's links were confirmed more strongly i felt it needed to be addressed with more gravitas.

I will read the papers tomorrow but this should be easy to test, I think? Immediately cease use of any product that contains dea or analogs.

And since it's lipidinous, maintain weight? Ia this heretical?