Hovatta et al.The Salk Institute for Biological Studies, La Jolla, CA
Genome Biology
February 26, 2007 [PubMed]
This paper was published in an open access journal and can be read in full without a subscription.
The Hovatta el al. paper appears to be another bioinformatic exercise investigating expression quantitative trait loci mapping (eQTL). It utilizes data that they have previously published and no new biological data was added in support of this work. The two main conclusions of the paper, that strain-specific expression differences are enriched for cis eQTLs and that the different regions of the brain show distinct clustering of expression patterns, are, respectively, unsupported and unoriginal. The main strength of this paper is that it is one of the first to show the viability of applying eQTL analyses to data collected from inbred strains of mice although it has been done more thoroughly before.
Using data from 5 regions of the brain, bed nucleus of the stria terminalis, hippocampus, hypothalamus, periaqueductal gray, and pituitary gland, from 6 inbred strains, 129S6/SvEvTac, A/J, C3H/HeJ, C57BL/6J, DBA/2J, and FVB/NJ, they used a simple ANOVA analysis to identify genes whose expression significantly varied by either strain or region of the brain. 5% of the genes showed strain-specific expression variation and 44% showed brain region-specific expression. Dendrograms showed correlation between genetic history of the strains and the averaged expression patterns of the strains.
The worst part of the paper was the contention that cis eQTLs are enriched for in inbred strains. This statement may be true but they provide no justification for it here. The origin of this hypothesis is unclear. It seems that is based on the idea that only a small fraction of genes were found to be strain-specific and cis eQTLs only effect single genes. This last notion presupposes that the cis eQTL gene does not interact with any other transcription in the genome which is wrong. In fact, many of the most prominent trans eQTLs that have been found to also contain a cis eQTL that likely initiated the cascade of transcriptional disregulation that resulted in the trans band. To support this claim, they compared the percentage of cis eQTLs found in the genes identified as having significant strain-specific differences with those that had significant brain region differences. This is an apples to oranges comparison without any real meaning. First, to generate eQTLs, you are comparing strain-specific differences in expression to strain-specific sequence differences. The strain-specific gene collection is primed with all of the transcripts that could potentially produce eQTLs. The region-specific gene collection is just a random collection as far as eQTL analysis is concerned, most of which have no expression differences across the strains and therefore have no ability to produce an eQTL. So, when they express cis eQTL values for each of the two collections as percentages of the number of genes within that collection, it is misleading. Holding up the fact that 48% of the strain-specific genes produce cis eQTL associations vs. 10% of the region-specific set as proof for their hypothesis is disingenuous. Especially if only 20% or less of the region-specific set had the capacity to produce an eQTL.
In conclusion, the only value that this paper provides is proof the eQTL analysis can be done with data from inbred strains. It would have been great if they shared the cis and trans eQTLs that they mapped but they failed to do so.
Other notes from the paper
They define a cis eQTL as an association between data from an Affymetrix probeset and a SNP that mapped within 4 MB of each other.
SNPs located within probesets can create data that looks like a cis eQTL but in reality is differential hybridization with the probe. Instead of doing a sequence search to identify problem probesets, the authors used a custom built algorithm to guess at the presence of probeset SNPs.
Correctly note that cis eQTLs are most trust worthy than trans eQTLs because the likelihood that both the associated SNP and the probeset could both fall into the same limited space of the genome is small.
They site a paper that places the inheritance of expression variation at a low 0.34.
Most of the data analysis was done using software from Teragenomics.
