ICRISAT DG on the importance of genebanks

pre-breeding-graphThere’s a great blog post up on the ICRISAT website from its new Director General, Dr David Bergvinson. It basically says, though not in so many words, that the centre’s germplasm collections are the foundation of all its crop improvement work. Which is nicely illustrated by this diagram (click to embiggen), from Dr Shivali Sharma, who’s a senior scientist in the genebank. You can see more photos of her (and others’) wide crossing and pre-breeding work at ICRISAT on the Flickr album I put together after my visit there a couple of years back.

ICRISAT Visit

Dr Bergvinson closes his post by pointing to ICRISAT’s 100 Voices video series, the first instalment of which is on genomics as a tool to make even better use of genebank collections.

Getting into the weeds on wheat genotyping

ResearchBlogging.orgI don’t know about you, but I usually skip the methods section of genotyping studies. I know I shouldn’t, but life is way too short. Mostly, I just need the answer. However, two papers came across my desk last week which enticed me to bite that silver bullet. One study genotyped 460 bread wheat accessions from various genebanks in Europe and Australia, manly elite lines from Europe, North America and Australia, but also some Chinese landraces; the other, 1,423 bread wheat landraces from West Asia and synthetics (artificial crosses between the putative original parents of bread wheat), from the CIMMYT and ICARDA genebanks. Quite distinct material from different genebanks, 1 you’ll notice, so naturally I wondered to what extent the results would be comparable.

Well, this is the relevant bit from the materials and methods of the first article, by German researchers, which is catchily entitled Subgenomic Diversity Patterns Caused by Directional Selection in Bread Wheat Gene Pools.

For genome-wide marker analysis, DNA samples of all lines were genotyped using the 90,000-SNP wheat genotyping array (Illumina Inc.) described by Wang et al. (2014), which carries 81,587 functional and valid SNPs. Genotyping was outsourced to TraitGenetics GmbH (Gatersleben, Germany) and automated SNP scoring used a cluster file based on worldwide material described by Wang et al. (2014). Raw marker data was processed by first excluding all markers with more than two called alleles, more than 10% missing data, or minor allele frequency (MAF) less than 10%. This resulted in a total of 22,377 high-quality, polymorphic SNPs in the 450 genotypes that were used for population-structure analyses. For all analyses requiring positional information, we used a set of 18,681 SNPs with MAF ≥5% and known map positions on the consensus map described by Wang et al. (2014).

Phew. And this, for your sins, is the corresponding section from the thankfully more racy Exploring and Mobilizing the Gene Bank Biodiversity for Wheat Improvement, courtesy of CIMMYT and ICARDA scientists, mainly connected with the Seeds of Discovery (SeeD) project. 2

For genotypic characterization, a next-generation sequencing technique called DArTseq was employed. A complexity reduction method including two enzymes was used to generate a genome representation of the set of samples. PstI-RE site specific adapter was tagged with 96 different barcodes enabling multiplexing a plate of DNA samples to run within a single lane on Illumina HiSeq2500 instrument (Illumina Inc., San Diego, CA). The successful amplified fragments were sequenced up to 77 bases, generating approximately 500,000 unique reads per sample. Thereafter the FASTQ files (full reads of 77bp) were quality filtered using a Phred quality score of 30, which represent a 90% of base call accuracy for at least 50% of the bases. More stringent filtering was also performed on barcode sequences using a Phred quality score of 10, which represent 99.9% of base call accuracy for at least 75% of the bases. A proprietary analytical pipeline developed by DArT P/L was used to generate allele calls for SNP and presence/absence variation (PAV) markers. Then, a set of filtering parameter was applied to select high quality markers for this specific study. One of the most important parameters is the average reproducibility of markers in technical replicates for a subset of samples, which in this specific study was set at 99.5%. Another critical quality parameter is call rate. This is the percentage of targets that could be scored as ‘0’ or ‘1’, the threshold was set at 50%. PAV’s markers were not used in this study.

Double phew. But, cutting to the chase: they don’t sound that comparable, do they? I confess I needed help with this, but here’s the bottom line: quite different polymorphisms are being picked up by the two studies. The German work (call it method A) used a genotyping approach that is more expensive, but yields more complete data on a well-defined set of polymorphisms. The SeeD paper’s way (method B) is cheaper, much cheaper, and is better at finding new polymorphisms, but does result in more missing data. And that’s fine. Different research groups will always want to do things their own way, for a variety of both good and bad reasons.

But look at it from the point of view of the wheat community as a whole. One of the things other people who are interested in wheat — genebanks, breeders — will want to be able to do is to see how their material relates to other people’s material: whether it is more or less diverse, to what extent it overlaps in diversity, that kind of thing. So what is team C to do? Follow method A, or method B? Maybe method A and method B, just to be on the safe side? Or maybe it could use its own favourite method C, as long as at least a subset of the polymorphisms picked up by all the three methods was something that everyone agreed was an adequate common denominator.

Well, that’s just the kind of decisions that DivSeek is there to help team C (and D, and E…) make. The DivSeek steering committee met last month and a short report from Susan McCouch, the chair, is now available. She sees the committee’s main job in the next few months as coming up with specific ideas on how “many independent, stand-alone efforts … [can] work together under a common umbrella to apply state-of-the-art genomic, phenomic, molecular and bioinformatics tools and strategies to characterize crop diversity and to integrate and share data and information.” If that means I can skip methods sections with a clear conscience, it will be worth it.

Featured: More melons

My mate Danny knows a think or two about Maldivian melons:

Maldives is an interesting place for watermelons. I worked on the island of Thoddoo (in Alif Alif Atoll) back in the nineties on an IPM programme for watermelons. Thoddoo is the largest producer of watermelons (or was?) in the Maldives, strategically targeting the demand during the holy month of Ramadan in nearby Male.

But are they like Remaissance melons? Not that kind of melon, I guess.

Nutritional yield in the spotlight

Dr Jess Fanzo had a paper in the works on the topic when he asked a few days ago “How would you measure agricultural production?” But his pleas for measuring nutrition per hectare, rather than just calories or yield, certainly gets a boost from the article, and in Science, no less.

Here is what Prof. Ruth DeFries of Columbia University, who is the lead author, and others 3 think:

We propose a metric of “nutritional yield,” the number of adults who would be able to obtain 100% of their recommended DRI [daily dietary reference intake] of different nutrients for 1 year from a food item produced annually on one hectare.

Why? Because…

…nutritional needs for a wide range of essential nutrients in the human diet have generally not been included in considerations of sustainable intensification. Access to food with high nutritional quality is a primary concern, particularly for 2 to 3 billion people who are undernourished, overweight, or obese or deficient in micronutrients.

They even provide a worked example, highlighting the fact that many neglected staples are more nutrient-dense than boring old rice, wheat and maize.

In 2013, for example, on average one hectare of rice produced 4.5 metric tons/year, which is the equivalent of providing the annual energy requirement for 19.9 adults. Millet produced only 0.9 metric tons/ha per year, the annual energy requirement for 4.0 adults. However, a hectare of rice fulfills the annual iron requirement for only 7.6 adults, compared with 15.3 for millet.

Leave aside for a minute that, depending on what particular millet is meant, rice vs millet is an unusual comparison to be making. This does sound like a promising idea; but here’s the problem I see. You have to do the calculation for each damn nutritional factor: protein, iron, zinc, whatever. How do you know which to pick for any given comparison you want to make? Is there no way to come up with a more synoptic nutritional yield score? One that takes into account multiple nutrients at once, rather than one at the time. How about this, for example:

the number of adults who would be able to obtain at least 50% of their recommended DRI of all of X nutrients for 1 year from a food item produced annually on one hectare

Where X is whatever nutritionists think is a sensible basket of nutrients. After all, people rarely need just iron.

Tweeting up a storm in Minneapolis and Pullman

What did we do before Twitter? Had a life, probably. But also, it was undeniably more difficult to keep up to date with stuff. Absent heroic tweeps in Minneapolis, I would not have been able to follow Plant Biology 2015 quite so assiduously, and thus find out about, among many other things, the Legume Federation. Or indeed see their poster.

legumefederation

Likewise the annual meeting of the National Association of Plant Breeders would have slipped me by. And I would have missed this photo of two plant breeding legends.