DataONE:ArrayExpress metadata study

From OpenWetWare

Revision as of 16:12, 20 July 2010 by Heather A Piwowar (Talk | contribs)
Jump to: navigation, search

This DataONE OpenWetWare site contains informal notes for several research projects funded through DataONE. DataONE is a collaboration among many partner organizations, and is funded by the US National Science Foundation (NSF) under a Cooperative Agreement.

Home        People        Research        Summer 2010        Resources       


THIS PROJECT IS MID-DEVELOPMENT. RESULTS HERE ARE UNSTABLE, INCOMPLETE, AND PERHAPS WILDLY WRONG. That said, please enjoy your reading in the spirit of Open Notebook Science and I'd love to hear your thoughts and suggestions :)

Contents

Aim

Does metadata quantity correlate with increased reuse of datasets?  Is more compete metadata associated with increased benefit for investigators in the form of more citations?

Background

Methods

Downloaded ArrayExpress metadata using custom Python code on July 22, 2009. Open Source: <<link to git>>. (Note the one year gap. This was due to an intervening thesis. Also, updated metadata capture is not necessary because we would be ideally be capturing the metadata that existed at the time reusers would have been searching it... )

Identified ArrayExpress reuse in PubMed Central using the ArrayExpress variant of the DataONE:Protocols/Find_GEO_reuses protocol. Reuses captured on July 19, 2010

Downloaded Scopus citation counts for the PMIDs listed in the ArrayExpress metadata. Collected on July 19, 2010 using the DataONE:Protocols/Scopus_citation_counts_from_PMIDs protocol.

Results

  • <<link to ArrayExpress metadata dataset>>
  • <<link to ArrayExpress reuse dataset>>
  • <<link to Scopus data>>

Discussion

Personal tools