DataONE:Notebook/Data Citation and Sharing Policy/2010/07/27: Difference between revisions

From OpenWetWare
Jump to navigationJump to search
Line 9: Line 9:
[[Category:DataONE]]  
[[Category:DataONE]]  


Today I have cleaned up some of my code from [ http://www.openwetware.org/wiki/DataONE:Notebook/Data_Citation_and_Sharing_Policy/2010/07/26 yesterday] and updated my [ spreadsheet to reflect the changes in columns Publisher Code, and some cleaning in the Subscription Model column.  
Today I have cleaned up some of my code from [http://www.openwetware.org/wiki/DataONE:Notebook/Data_Citation_and_Sharing_Policy/2010/07/26 yesterday] and updated my [http://spreadsheets.google.com/ccc?key=0AmL-EJ5-i7x7dEtSRFJZLWtiM1Z6VGRmS1VWbDVSVnc&hl=en public dataset] to reflect the changes in columns Publisher Code, and some cleaning in the Subscription Model column.  


*To begin, I ran the following  code:
*To begin, I ran the following  code:

Revision as of 08:53, 27 July 2010

Project name <html><img src="/images/9/94/Report.png" border="0" /></html> Main project page
<html><img src="/images/c/c3/Resultset_previous.png" border="0" /></html>Previous entry<html>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</html>Next entry<html><img src="/images/5/5c/Resultset_next.png" border="0" /></html>

Cleaner Analysis

Today I have cleaned up some of my code from yesterday and updated my public dataset to reflect the changes in columns Publisher Code, and some cleaning in the Subscription Model column.

  • To begin, I ran the following code:

<html><script src="http://gist.github.com/491173.js"> </script></html>

  • This gave me the following significant results (Full Results from R below)
    • For P Values of Impact Factor, Society Affiliation, and all publishers other than Wiley, Springer, Elsevier and Taylor Francis Ltd.
Coefficients:
                Estimate Std. Error z value Pr(>|z|) 
log(ImFa)        1.04003    0.29007   3.585 0.000337 ***
Afil             1.06761    0.46302   2.306 0.021125 *  
PubCodeother     1.52884    0.71422   2.141 0.032309 * 
    • With confidence intervals of :
                      2.5 %     97.5 %
log(ImFa)              0.4998151  1.6443777
Afil                         0.1929881  2.0227830
PubCodeother     0.2325047  3.1117025
    • And exp of :
   log(ImFa)                 Afil                      PubCodeother 
     2.82930675            2.90840677      4.61284400 
    • With exp confidence intervals of:
                     2.5 %     97.5 %
log(ImFa)       1.648416488  5.1777868
Afil            1.212868348  7.5593332
PubCodeother    1.261756334 22.4592496
    • Below are the full Results for context
> summary(mylogit)

Call:
glm(formula = requests ~ log(ImFa) + SomeOA + Afil + PubCode + 
    is.Eco + is.EnvSci + is.EvoBio, family = binomial(link = "logit"), 
    na.action = na.omit)

Deviance Residuals: 
    Min       1Q   Median       3Q      Max  
-1.5987  -0.5199  -0.3057  -0.1653   2.9759  

Coefficients:
                Estimate Std. Error z value Pr(>|z|)    
(Intercept)     -4.47416    0.94554  -4.732 2.22e-06 ***
log(ImFa)        1.04003    0.29007   3.585 0.000337 ***
SomeOA          -0.02429    0.43966  -0.055 0.955949    
Afil             1.06761    0.46302   2.306 0.021125 *  
PubCodeother     1.52884    0.71422   2.141 0.032309 *  
PubCodespringer -0.62794    1.19405  -0.526 0.598963    
PubCodetaylor   -0.07862    1.20986  -0.065 0.948188    
PubCodewiley     1.11960    0.76456   1.464 0.143093    
is.Eco          -0.33555    0.57484  -0.584 0.559403    
is.EnvSci        0.46216    0.65644   0.704 0.481416    
is.EvoBio        0.75327    0.67710   1.112 0.265925    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1 

(Dispersion parameter for binomial family taken to be 1)

    Null deviance: 224.11  on 299  degrees of freedom
Residual deviance: 179.30  on 289  degrees of freedom
  (7 observations deleted due to missingness)
AIC: 201.3

Number of Fisher Scoring iterations: 6

> confint(mylogit)
Waiting for profiling to be done...
                     2.5 %     97.5 %
<b>(Intercept)     -6.4901304 -2.7394727</b>
log(ImFa)        0.4998151  1.6443777
SomeOA          -0.9051808  0.8301939
Afil             0.1929881  2.0227830
PubCodeother     0.2325047  3.1117025
PubCodespringer -3.6768695  1.5175314
PubCodetaylor   -3.1452976  2.1005325
PubCodewiley    -0.3119753  2.7709763
is.Eco          -1.4946319  0.7684235
is.EnvSci       -0.8165620  1.7687779
is.EvoBio       -0.5939997  2.0820360
> exp(mylogit$coefficients)
    (Intercept)       log(ImFa)          SomeOA            Afil    PubCodeother PubCodespringer   PubCodetaylor    PubCodewiley 
     0.01139980      2.82930675      0.97600675      2.90840677      4.61284400      0.53368914      0.92439051      3.06363807 
         is.Eco       is.EnvSci       is.EvoBio 
     0.71494624      1.58749158      2.12394266 
> exp(confint(mylogit)) # conf int for exp
Waiting for profiling to be done...
                      2.5 %     97.5 %
(Intercept)     0.001518351  0.0646044
log(ImFa)       1.648416488  5.1777868
SomeOA          0.404468731  2.2937636
Afil            1.212868348  7.5593332
PubCodeother    1.261756334 22.4592496
PubCodespringer 0.025302058  4.5609519
PubCodetaylor   0.043054111  8.1705199
PubCodewiley    0.731999586 15.9742218
is.Eco          0.224331162  2.1563641
is.EnvSci       0.441948475  5.8636828
is.EvoBio       0.552114556  8.0207830