Jack Grieve is a computational linguist at Aston University in Birmingham, England. I came across him on Twitter, where he occasionally posts fun maps showing the geographic distributions (usually within the USA) of different words, usually dialectical variants, based on their appearance in geocoded tweets. He very kindly ran a couple of crop names through his magic box for us, and this is what he got. I wanted to know if the distribution of crops could be inferred from where people tweet about it more than the average. I’ve placed his map for each crop side by side with the relevant distribution map from USDA.
Not a perfect match by any means, but not too bad. Except for cotton, that is. Any ideas why people should be tweeting so much about cotton in the northern Great Plains? They’re certainly not growing it.
Jack’s dataset apparently only covers the US and the UK at the moment, which means I can’t check whether Kenyans, say, are tweeting about maize particularly assiduously where they’re growing it, or indeed about maize lethal necrosis where they’re worried about it. Google famously tried to predict flu outbreaks from search patterns, but that seems to have fizzled out. Could tweeting trends help pinpoint crops (or livestock?) and their pests and diseases in space and time? I don’t see much of that kind of thing in the discussion of ICTs in agriculture.