How would you PageRank genebank accessions?

Various friends have sent me, over the past few days, different takes on a recent paper which used the Google PageRank algorithm to identify the most “important” species in food webs, perhaps because they know I’m a sucker for examples of cross-pollination between disciplines. The BBC had its say, and also ScienceDaily, among others. I posted the ScienceDaily article on Facebook, as I am wont to do when I think something is interesting — maybe even have a gut feeling it might be relevant to agrobiodiversity conservation — but don’t know quite what to make of it. Sure enough, someone left a comment that he thought the algorithm was a secret, which was also my understanding: Google don’t want people to manipulate the rank of their web pages. But then someone else came in and said that the basics of how the thing works are in the public domain.

To prove it, he provided a link to an American Mathematical Society article entitled How Google Finds Your Needle in the Web’s Haystack. Which is why I love social networking, but that’s another story. Now, that article is definitely NSFW, unless you work at the American Mathematical Society, so think twice before clicking, but here’s the lede:

Imagine a library containing 25 billion documents but with no centralized organization and no librarians. In addition, anyone may add a document at any time without telling anyone. You may feel sure that one of the documents contained in the collection has a piece of information that is vitally important to you, and, being impatient like most of us, you’d like to find it in a matter of seconds. How would you go about doing it?

And I thought to myself: just change that 25 billion, which of course refers to the number of pages on the internet, to 6.5 million or 7.2 million or whatever, and the guy could just as easily be talking about accessions in the world’s genebanks.

Now, basically we search for the germplasm we need by starting with a big dataset and applying filters: wheat, awnless wheat, awnless wheat with such and such resistance, awnless wheat with such and such resistance from areas with less than x mm of rainfall per annum, and so on. Would it make any sense to rank the accessions in that initial big dataset? On what basis would one do that anyway? That is, what is the equivalent of hyperlinks for accessions? Because the essence of PageRank is that important pages receive lots of hyperlinks from important pages. So, numbers of requests? Amount of data available on the accession? But wouldn’t that just mean that only the usual suspects would get picked all the time? Genetic uniqueness, perhaps, then? That would be turning the algorithm on its head. Looking for lack of connections rather than connections to other accessions. You could in fact have different ranking criteria for different purposes, I suppose.

Ok, now my brain hurts. This cross-pollination stuff can be fun, but it is hard work.

Featured: Tipping point

Jacob tackles tipping points:

I think that an explosion of local varieties, for instance, could count as variance amplification. An increase in variety names could be due to a fragmented knowledge system with lots of redundancies (the same variety having different names in different places) as farmers fail to trace each variety. At a given moment, the names become meaningless. This can then lead to genetic erosion, as farmers fail to find certain varieties due to the name confusion. Less common varieties that survive by going from hand to hand will be the first to go extinct. Has anyone observed this?

Well, has anyone observed this? Read the full comment, there’s also stuff in there about why a locally-based “genetic erosion monitoring portal” wouldn’t be much use.

Ghanaian buffet

Ghana has forty-seven different kinds of edible green leaves, each with a distinctive flavor.

I bet. And the diversity doesn’t stop there.

I think of Ghanaian cuisine as a kind of culinary jazz. The pepper, tomatoes, and onions, and possibly the oil, form the rhythm section. The stew is one musical form, like blues, the soup and one-pot dishes are others. Like a successful improvisation, the additional ingredients—vegetables, seeds and nuts, meat and fish—harmonize and combine into vibrant, mellow creations.

Dip into the sampler CD at Global Voices Online.

(New) things to do with pears

A little something silly for the weekend.

peras_budas.jpg

This bizarre story made it to Boing Boing and the Daily Mail, but the earliest I can find for it, and indeed where I first saw it (Thanks, Matt), was a website in Portuguese. Now, through the miracle of Google Translate, we bring you, How to make pear shaped Buddha!

I think this is the “How to” Strange as I posted here, after all, who is going to try this at home? Well, then, for purposes of curiosity, if you ever heard of pear-shaped Buddha or even the famous square watermelons in Japan knows that the process is simple, just put a cast on fruit when they are starting to rise because the fruit tends to take the shape of the mold as it unfolds.

I’m afraid of what marketers can do with it …

I’m more afraid of what the Mail did to it, but no matter. More to the point, even the square watermelon is not as new as all that. From Popular Science, January 1938:

lrg_pumpkin_face.jpg

Farmer Grows Pumpkins with Human Faces

Pumpkins with human faces have been produced by John M. Czeski, Ohio farmer, after four years of experimenting. To grow the novel fruit, Czeski fashions an aluminum mold of the head he wants to reproduce, and places it around a growing pumpkin approximately the size of a small grapefruit. After the pumpkin has expanded enough to fill the inside contours, the mold is removed. The print of the features remains as the pumpkin continues to grow, and the final result is a lifelike full-size image in the ripened fruit.

Nothing new etc.