The Infinitesimal

Oct 14

I think there are two forces at play: as we add more genetic variation into GWAS analyses (specifically, rare variants which are the major class of variation not currently captured by GWAS) we should see the heritability estimate go up; as we understand the biases more and account for them, we should see the heritability estimate go down. In the case of height, as GWAS platforms have gotten better, the heritability estimate went from ~40% to ~45% with rare variants adding another 10-20%; with the Tan et al. paper showing that confounding on height is relatively minor. For IQ, the opposite has been true, as we've gotten better at isolating biases the estimates went from 50% (in one of the earliest studies, which had major stratification issues) to ~20% to ~12% direct with my analysis of the Tan et al. data; with the Tan et al. paper showing that confounding is a major issue. In contrast, the influence of rare variants on IQ (in the healthy population) has been negligible from the few studies that have analyzed it. So for IQ I think we are still on the "figuring out biases that inflate the estimate" end rather than the "finding new variants that deflated the estimate" end. As to the total heritability, I think the best estimate we have now is the ~12% estimate from the Tan et al. data with an additional ~1.5x boost from rare variants which gets you to (very crudely) ~18% or so.

Expand full comment

Nov 2

Great article!

Expand full comment

comment 78

Oct 14Edited

If two subjects of unknown relatedness have very different GWAS scores for a very complex trait involving thousands of SNPs, that would correlate with their being only distantly related. But with siblings the degree of relatedness is fixed, so for the same degree of difference in the GWAS score, their would be less chance that other genetic factors (SNPs not considered in the GWAs, SNPS effectiveness amplified by other SNPS) would also be different. So, these results do not mean that complex traits are not mostly heritable. It only means that we have not figured it all out .... yet.

Here is a post on my own substack about this: https://comment78.substack.com/p/bound-to-fail?r=3c6ol1

Expand full comment

Federico Soto del Alba

Jun 3

I am going to write something, respectfully, which is probably catastrophically wrong on something way outside my areas of expertise:

It seems to me some or a lot of this type of research trying to correlate Genetics with traits considered distinctly human might be suffering from:

-- Chimpanzee Confounding Biases.

I am inventing, I hope, a term to try to fight Epistemic Injustice in Epistemic issues.

Such research seems a specific way to try to explain, at some point way latter in the Future, what makes us Uniquely Human by looking at Genetic Differences between members of the Same Species.

I am para-viewing and paraphrasing, I hope correctly, the unstated, my guess, goals...

However, it looks to me, naïvely enough, it would have been better at some point in the Past to look for Genetic Differences making us uniquely human by comparing our Genomes with our closest species siblings: Bonobos and Chimpanzees. Even with Gorillas´and Orangutans´.

My guess, I am lazy and I haven´t searched for it, is probably such research failed to pinpoint exactly which Genes or their Variations explain causally things we considered as distinctively Human like Obesity, Smartness called Intelligence for "research" purposes, Mental Disorders, Accomplishment, etc. Since those other still with us Hominids probably experimentally represent a floor in the measurements of such things.

A lower bound to those Empirical Metrics, where I would have assumed the noise will be smaller and the signal stronger, but here I am way past my incompetence.

Such being said, if Coco the Gorilla learned sign language and expressed herself so eloquently even when explaining having dreams of flying cars, my guess, then probably Genetical differences among members of the same species won´t be explained easily enough. They are going to be harder to be found and their explanatory and predictable power will come as disappointing.

Which I think summarizes what actually happened!, so far...

IQ, Socioeconomic Class belonging, Academic Achievement and Mental Disorders being the most prominent ones So far...

So, if such were the case: No correlations, or lack thereof, between other Hominids Genomes and ours explain those things considered as making us distinctively human, a big IF, I am inclined, maybe forced to think such research projects must have some deep fundamental Epistemic flaw to begin with.

And to try to identify it I would have to dig deep into the methodological assumptions, the Epistemics used a basis for such research projects.

Which I probably won´t do, hence I am just putting it "out there" with my layperson´s comment to this Post, because I think it is a point of view which to my enough ignorance seems lacking and is relevant.

Expand full comment

Fergal McDonald

Jan 3

Have any of the popular hereditarians commented on or responded to these studies?

Expand full comment

Jan 3

The only comments I've seen are pointing to some of the technical issues this study raises about previous sibling GWAS (Howe et al) while completely ignoring the key findings about confounding on IQ and education. This is, in my opinion, par for the course for the modern hereditarian movement, which primarily seeks to shore up (or invent) evidence in support of a pre-existing hypothesis and disregard any evidence to the contrary.

Expand full comment

It annoys how often in scientific discussion, the issues of problems with measurement are avoided. The focus is all around, but almost never about the problems with measurement techniques and the sometimes serious limitations they embody. Heritability by way of SNP-GWAS is one of those. The geneticists at work tend to laugh at all those thousands of dollars/euros that are wasted on such extremely large samplesizes, but in the end, when you make the sample larger but don't do anything about the sensitivity of false positives in the methods, you only enlarge potential biases. Power of statistics is not everything that should be looked at.

Expand full comment

What are your thoughts on the SNP chips often used for these GWAS. From my understanding of genetics, SNP tend to be the most common mutations but also the least impactfull. And also when using SNP's as proxies for genes you walk into a whole other ballpark of potential biases, which include known vs unknown genetics in the whole genome. SNP are by their definition in those chips only known SNP's which even bias the estimates even further. I generally do not trust SNP based GWAS, just because they are cheap to use does not mean they are good to use.

Expand full comment

I'll need to write up a longer post on it but contemporary SNP array data followed by imputation tends to pick up effectively all common variants and is very difficult to outperform. Common SNP variants also tag common structural variants very well, so it is unlikely that there is untapped common SV signal out there (note: some of these associations could still be *driven* by SVs we haven't typed, but tagged very well by SNPs we did type). In terms of variation that is not captured by GWAS, I think it's either rare variants or interactions.

Expand full comment

Oké, that is good news. But is a SNP array not a biased and incomplete perspective of the exome? I. E. It only contains SNP's that have garnered interest in the past? Of course when you gather enough of them the bias would eventually decrease. But any type of copy number variation will not reliably be measured, and neither does the the rest of the genome that is not exome. Which from what I remember contains a lot of structural information which indirectly influences gene expressions as well.

But like I said, I am not all that up to date when it comes to genome measuring tech. I just hope that GWAS will move on from SNP chips to whole genome. Where larger variations will also be captured. The ones that have been done are all of smaller samples which in itself is of course a problem. I'd like you perspective. And if you ever get to writing an article about it I would love to read it.

Expand full comment

I don't think so. Modern SNP arrays are designed to be a comprehensive sampling of common variation *genome-wide*, often based on whole-genome data from many representative populations. Sometimes special exonic variants (or other ROIs) are added, but not at the cost of broad sampling. I don't know where the idea that they only/largely capture the exome comes from, it's just not the case and most GWAS heritability is non-coding.

Expand full comment