Talk:DataONE:Notebook/Reuse of repository data

From OpenWetWare
Revision as of 16:58, 14 June 2010 by Sarah Judson (talk | contribs) (New page: *'''~~~~''': Valerie, I don't know if you've already ruled this out as a search option, but when I got started on the project and was selecting journals, I thought of a technique for the d...)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigationJump to search
  • Sarah Judson 19:58, 14 June 2010 (EDT): Valerie, I don't know if you've already ruled this out as a search option, but when I got started on the project and was selecting journals, I thought of a technique for the depository approach. It's pretty simple I think and maybe too time consuming, but I'll outline it for you:
  1. On dryad (which is probably different for treebase), I clicked on "authors" and "Journal Title" to see which journals/authors had the most citations (i.e. most likely for data reuse)
  2. I then searched a few articles in ISI to see how many times they are cited and picked one that was cited many (10-20+ times)
  3. Then I quickly searched that paper for the last name of the original author (i.e. the paper picked in step 1) and made note of if the paper from ISI was citing the paper for conceptual purposes, or hopefully and more exciting, the dataset. I alternatively search "hdl" in the full text of the article (dryad's accession number).
  • This process didn't take too long and helped focus on most likely journals/authors that practice data reuse. However, it's probably biased towards groups I thought were more likely to have data reuse. That is probably ok for a preliminary study like this, but not as standardized as it possibly could be. Anyways, just an idea.
  • Sarah Judson 19:58, 14 June 2010 (EDT):Let me know when you're ready for my zotero full text database for test searching. I'm making note of common search terms as I go through articles and annotating/tagging articles with "Yes_DataCitation", "Yes_Treebase", "Yes_DataSharing" (i.e. not reuse) or "No_DataCitation" which can help validate if your test search catches articles that have been manually noted to have a data set citation of some sort.