Abstract
‘Long tail’ data are the difficult-to-get-at data that sit in libraries, institutes and on the computers of individual scientists. Informatics specialists like to contrast them with the smaller number of large, more accessible data sets (e.g. Sinha et al., 2013). The name ‘long tail’ derives from graphs that plot the size of data sets against their number: there are relatively few large data sets and a lot of smaller ones.