Nibbles: Amazon, Aquaculture, Bees, ICTs, Food prices, Dates, Cats, Taro

Lost in genebank database hell

Navigating around germplasm databases can be a frustrating experience. A posting on the CropWildRelativesGroup alerted me to a Science Daily piece on tomato genomics which mentioned the wild relative Lycopersicon pennellii (or Solanum pennellii, but I’m not going there, at least not today). But how many accessions of this species are conserved ex situ? And where is it found in the wild?

Ok, so SINGER first, as that’s been much on my mind — and on this blog — of late. SINGER shows 61 accessions of L. pennellii, all from the AVRDC collection. Most of them are from Peru, although 7 accessions have USA, Mexico, Poland (?) or “unknown” as source country. None of these accessions seem to have geo-references, so no nice map from SINGER this time. Pity. But SINGER does give very neat summaries for your query results. ((Incidentally, AVRDC has its own Vegetable Genetic Resources Information System online, which has 65 records for L. pennellii.))

GRIN returns 51 accessions. I can’t find any easy way of working out the duplication between these and the AVRDC material, but I imagine it is significant. Again, most of the accessions are from Peru, but it’s kind of difficult to get summary information across all accessions in GRIN at the moment, though I know they are working on this. Now, tomato germplasm is conserved at the C.M. Rick Tomato Genetic Resources Center (GRIN tells you so), and they have a database of their own. Querying it results in 45 hits, but again there’s no easy way I can see of looking at summary information across all these. You have to look at each individual accession in turn to find out where they’re from, and if you do you get a little map too. The thing I don’t quite understand is why the accessions are geo-referenced in the Tomato Genetic Resources Center database, but not in GRIN. Maybe they’re upgrading the data gradually at the Centre and haven’t passed the latest version on to GRIN? That may also explain the discrepancy in accession numbers. It looks like they’re working on the geo-spatial part of the database, and it may well be possible to get a map of all the accessions of a particular species eventually.

You can of course do that in GBIF right now, but GBIF only has 8 geo-referenced L. pennellii records: from the Missouri Botanical Garden, the Dutch genebank and the European germplasm database, EURISCO. Too bad the Tomato Genetic Resources Center is not a GBIF data provider. And, indeed, that its geo-reference data is not included in GRIN, which is a GBIF provider.

So the answers to the questions I started with are: at least, and probably not much more than, 112, but that probably includes duplicates; and Peru. But I cannot produce a decent map of the distribution of L. pannellii online. I would have to mess around and download the data from the Tomato Genetic Resources Centre database, and then map it myself. Which I may well do, just to show it can be done. But this little exercise does show that there’s a lot of work to be done to improve the data in — and fully integrate — existing agrobiodiversity databases.

Nibbles: Poppies, Gardening, Milk, Grapes, Genebanks, Meat, Biotech, IK, Plant health

SINGER maps crop wild relatives

Putting the new SINGER interface through its paces, I find that it can do something interesting that GRIN cannot. Or at least I can’t see a way of doing it, let me know if you can. Below is a screenshot from SINGER showing a Google Map of the distribution of all wild Arachis accessions that the database knows about which have geographic coordinates. Very useful, I think. GRIN does map localities, but I could not manage to get it to do so for multiple species like this.

SINGER’s new tune

After my intemperate comment about SINGER the other day, I’m very happy to pass on the news that the new SINGER website is now online at http://singer.cgiar.org. According to the announcement made by Bioversity International, the new SINGER has the following features:

  • GIS maps using Google technology.
  • Users are able to search accessions using Google Maps.
  • Presentation of data in a cumulative format to help users do statistical analysis.
  • Improved presentation of distribution data including across genus and species.
  • Improved navigation and searching capabilities.
  • Free text search.
  • Links to the external databases hosted by partners to provide additional information about accessions. Example: IRRI. Click on the “IRRI Link” under links on this page to view the information.
  • Users are able to view the available pedigree information on the site. Example: IRRI, WARDA, CIP. Please look for the “Pedigree” field under passport information.
  • Users are able to view the availability of an accession before requesting germplasm. The “Availability” field has been added under passport information.
  • Users are able to view if the accession has been placed under long term storage in Svalbard. The “Svalbard” field has been added. Example: WARDA. Please look for the “Safety-duplicate in Svalbard” field under passport information.
  • Users are able to save the search history while they are navigating the site.
  • Users are able to download data in the “xml” and “csv” formats.
  • The shopping cart system has been incorporated – work is still in progress to incorporate the complete shopping cart for an ordering system.