GPC Members Login
If you have any problems or have forgotten your login please contact [email protected]

When one reference genome is not enough

Much of the research in the field of plant functional genomics to date has relied on approaches based on single reference genomes. But by itself, a single reference genome does not capture the full genetic variability of a species. A pan-genome, the non-redundant union of all the sets of genes found in individuals of a species, is a valuable resource for unlocking natural diversity. However, the computational resources required to produce a large number of high quality genome assemblies has been a limiting factor in creating plant pan-genomes.

Having plant pan-genomes for crops that are important for fuel and food applications would enable breeders to harness natural diversity to improve traits such as yield, disease resistance, and tolerance of marginal growing conditions. In a paper published in Nature Communications, an international team led by researchers at the U.S. Department of Energy (DOE) Joint Genome Institute (JGI), a DOE Office of Science User Facility at Lawrence Berkeley National Laboratory (Berkeley Lab), gauged the size of a plant pan-genome using Brachypodium distachyon, a wild grass widely used as a model for grain and biomass crops. As one of the JGI's Plant Flagship Genomes, B. distachyon ranks among the most complete plant reference genomes.

"There are a vast number of genes that are not captured in a single reference genome," added study senior author John Vogel, head of the JGI's Plant Functional Genomics group. "Indeed, about half of the genes in the pan-genome are found in a variable number of lines." Working toward the primary goal of accurately estimating the size of a plant pan-genome, Vogel and his colleagues performed whole-genome de novo assembly and annotation of 54 geographically diverse lines of B. distachyon, yielding a pan-genome containing nearly twice the number of genes found in any individual line.

"The genome of a species is a collection of genomes, each with their own unique twist," added JGI bioinformaticist and study first author Sean Gordon. "Now knowing that focusing on a single reference genome leads to incomplete and biased estimates of genetic diversity and ignores genes potentially important for breeding applications, we should better incorporate multiple references in future studies of natural diversity."

Moreover, genes found in only some lines tend to contribute to biological processes (e.g., disease resistance, development) that may be beneficial under some environmental conditions, whereas genes found in every line usually underpin essential cellular processes (e.g., glycolysis, iron transport).

"This means that the variable genes are being preferentially retained if they are beneficial under some conditions. These are exactly the types of genes that breeders need to improve crops." Vogel said.

In addition, genes found in only a subset of lines displayed faster rates of evolution, lay closer to transposable elements (thought to play a key role in pan-genome evolution), and were less likely to be found in the same chromosomal location as functionally equivalent genes in other grasses.

The sequence assemblies, gene annotations and related information can be downloaded from the project website BrachyPan: brachypan.jgi.doe.gov. The Brachypodium distachyon genome is available on the JGI Plant Portal Phytozome: phytozome.jgi.doe.gov.

Read the paper: Extensive gene content variation in the Brachypodium distachyon pan-genome correlates with population structure.

Article source: DOE JGI.

Image credit: John Vogel


Wetlands are key for accurate greenhouse gas measurements in the Arctic

The Arctic is rapidly warming, with stronger effects than observed elsewhere in the world. The Arctic regions are particularly important with respect to climate change, as permafrost soils store huge amounts of the Earth's soil carbon (C). Warming of Arctic soils and thawing of permafrost can have substantial consequences for the global climate, as the large C stored in soils could be released to the atmosphere as the greenhouse gases carbon dioxide (CO2) and methane (CH4). The release of these heat-trapping gases, in turn, has the potential to further enhance climate warming.

New approach to conserving tree species

Globally, forest trees are increasingly at risk from habitat destruction, pests and disease, and a changing climate. But the guidelines for effective preservation of a tree species' genetic diversity and adaptive potential have been limited to simple mathematical equations for crop collections from the 1970s, or best guesses based on intuitions.

Multidisciplinary team tackles agricultural threat to global food security

CLEMSON, South Carolina – Weak corn and sorghum stalks cause the loss of about 20 percent of the crops in the U.S. annually, and Rajan Sekhon and Christopher McMahan of Clemson University's College of Science are part of a multi-university consortium trying to find out why.