DataONE:Notebook/Data Citation and Sharing Policy/2010/07/27: Difference between revisions
From OpenWetWare
Jump to navigationJump to search
(Autocreate 2010/07/27 Entry for DataONE:Notebook/Data_Citation_and_Sharing_Policy) |
|||
Line 6: | Line 6: | ||
| colspan="2"| | | colspan="2"| | ||
<!-- ##### DO NOT edit above this line unless you know what you are doing. ##### --> | <!-- ##### DO NOT edit above this line unless you know what you are doing. ##### --> | ||
== | ==Cleaner Analysis== | ||
[[Category:DataONE]] | [[Category:DataONE]] | ||
Today I have cleaned up some of my code from [ http://www.openwetware.org/wiki/DataONE:Notebook/Data_Citation_and_Sharing_Policy/2010/07/26 yesterday] and updated my [ spreadsheet to reflect the changes in columns Publisher Code, and some cleaning in the Subscription Model column. | |||
*To begin, I ran the following code: | |||
<html><script src="http://gist.github.com/491173.js"> </script></html> | |||
* This gave me the following significant results (Full Results from R below) | |||
** For P Values of Impact Factor, Society Affiliation, and all publishers other than Wiley, Springer, Elsevier and Taylor Francis Ltd. | |||
<pre>Coefficients: | |||
Estimate Std. Error z value Pr(>|z|) </pre> | |||
<pre>log(ImFa) 1.04003 0.29007 3.585 0.000337 *** | |||
Afil 1.06761 0.46302 2.306 0.021125 * | |||
PubCodeother 1.52884 0.71422 2.141 0.032309 * </pre> | |||
** With confidence intervals of : | |||
<pre> 2.5 % 97.5 % | |||
log(ImFa) 0.4998151 1.6443777 | |||
Afil 0.1929881 2.0227830 | |||
PubCodeother 0.2325047 3.1117025</pre> | |||
** And exp of : | |||
<pre> log(ImFa) Afil PubCodeother | |||
2.82930675 2.90840677 4.61284400 </pre> | |||
** With exp confidence intervals of: | |||
<pre> 2.5 % 97.5 % | |||
log(ImFa) 1.648416488 5.1777868 | |||
Afil 1.212868348 7.5593332 | |||
PubCodeother 1.261756334 22.4592496</pre> | |||
**Below are the full Results for context | |||
<pre>> summary(mylogit) | |||
Call: | |||
glm(formula = requests ~ log(ImFa) + SomeOA + Afil + PubCode + | |||
is.Eco + is.EnvSci + is.EvoBio, family = binomial(link = "logit"), | |||
na.action = na.omit) | |||
Deviance Residuals: | |||
Min 1Q Median 3Q Max | |||
-1.5987 -0.5199 -0.3057 -0.1653 2.9759 | |||
Coefficients: | |||
Estimate Std. Error z value Pr(>|z|) | |||
(Intercept) -4.47416 0.94554 -4.732 2.22e-06 *** | |||
log(ImFa) 1.04003 0.29007 3.585 0.000337 *** | |||
SomeOA -0.02429 0.43966 -0.055 0.955949 | |||
Afil 1.06761 0.46302 2.306 0.021125 * | |||
PubCodeother 1.52884 0.71422 2.141 0.032309 * | |||
PubCodespringer -0.62794 1.19405 -0.526 0.598963 | |||
PubCodetaylor -0.07862 1.20986 -0.065 0.948188 | |||
PubCodewiley 1.11960 0.76456 1.464 0.143093 | |||
is.Eco -0.33555 0.57484 -0.584 0.559403 | |||
is.EnvSci 0.46216 0.65644 0.704 0.481416 | |||
is.EvoBio 0.75327 0.67710 1.112 0.265925 | |||
--- | |||
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1 | |||
(Dispersion parameter for binomial family taken to be 1) | |||
Null deviance: 224.11 on 299 degrees of freedom | |||
Residual deviance: 179.30 on 289 degrees of freedom | |||
(7 observations deleted due to missingness) | |||
AIC: 201.3 | |||
Number of Fisher Scoring iterations: 6 | |||
> confint(mylogit) | |||
Waiting for profiling to be done... | |||
2.5 % 97.5 % | |||
<b>(Intercept) -6.4901304 -2.7394727</b> | |||
log(ImFa) 0.4998151 1.6443777 | |||
SomeOA -0.9051808 0.8301939 | |||
Afil 0.1929881 2.0227830 | |||
PubCodeother 0.2325047 3.1117025 | |||
PubCodespringer -3.6768695 1.5175314 | |||
PubCodetaylor -3.1452976 2.1005325 | |||
PubCodewiley -0.3119753 2.7709763 | |||
is.Eco -1.4946319 0.7684235 | |||
is.EnvSci -0.8165620 1.7687779 | |||
is.EvoBio -0.5939997 2.0820360 | |||
> exp(mylogit$coefficients) | |||
(Intercept) log(ImFa) SomeOA Afil PubCodeother PubCodespringer PubCodetaylor PubCodewiley | |||
0.01139980 2.82930675 0.97600675 2.90840677 4.61284400 0.53368914 0.92439051 3.06363807 | |||
is.Eco is.EnvSci is.EvoBio | |||
0.71494624 1.58749158 2.12394266 | |||
> exp(confint(mylogit)) # conf int for exp | |||
Waiting for profiling to be done... | |||
2.5 % 97.5 % | |||
(Intercept) 0.001518351 0.0646044 | |||
log(ImFa) 1.648416488 5.1777868 | |||
SomeOA 0.404468731 2.2937636 | |||
Afil 1.212868348 7.5593332 | |||
PubCodeother 1.261756334 22.4592496 | |||
PubCodespringer 0.025302058 4.5609519 | |||
PubCodetaylor 0.043054111 8.1705199 | |||
PubCodewiley 0.731999586 15.9742218 | |||
is.Eco 0.224331162 2.1563641 | |||
is.EnvSci 0.441948475 5.8636828 | |||
is.EvoBio 0.552114556 8.0207830 </pre> | |||
<!-- ##### DO NOT edit below this line unless you know what you are doing. ##### --> | <!-- ##### DO NOT edit below this line unless you know what you are doing. ##### --> |
Revision as of 08:51, 27 July 2010
Project name | <html><img src="/images/9/94/Report.png" border="0" /></html> Main project page <html><img src="/images/c/c3/Resultset_previous.png" border="0" /></html>Previous entry<html> </html>Next entry<html><img src="/images/5/5c/Resultset_next.png" border="0" /></html> |
Cleaner AnalysisToday I have cleaned up some of my code from [ http://www.openwetware.org/wiki/DataONE:Notebook/Data_Citation_and_Sharing_Policy/2010/07/26 yesterday] and updated my [ spreadsheet to reflect the changes in columns Publisher Code, and some cleaning in the Subscription Model column.
<html><script src="http://gist.github.com/491173.js"> </script></html>
Coefficients: Estimate Std. Error z value Pr(>|z|) log(ImFa) 1.04003 0.29007 3.585 0.000337 *** Afil 1.06761 0.46302 2.306 0.021125 * PubCodeother 1.52884 0.71422 2.141 0.032309 *
2.5 % 97.5 % log(ImFa) 0.4998151 1.6443777 Afil 0.1929881 2.0227830 PubCodeother 0.2325047 3.1117025
log(ImFa) Afil PubCodeother 2.82930675 2.90840677 4.61284400
2.5 % 97.5 % log(ImFa) 1.648416488 5.1777868 Afil 1.212868348 7.5593332 PubCodeother 1.261756334 22.4592496
> summary(mylogit) Call: glm(formula = requests ~ log(ImFa) + SomeOA + Afil + PubCode + is.Eco + is.EnvSci + is.EvoBio, family = binomial(link = "logit"), na.action = na.omit) Deviance Residuals: Min 1Q Median 3Q Max -1.5987 -0.5199 -0.3057 -0.1653 2.9759 Coefficients: Estimate Std. Error z value Pr(>|z|) (Intercept) -4.47416 0.94554 -4.732 2.22e-06 *** log(ImFa) 1.04003 0.29007 3.585 0.000337 *** SomeOA -0.02429 0.43966 -0.055 0.955949 Afil 1.06761 0.46302 2.306 0.021125 * PubCodeother 1.52884 0.71422 2.141 0.032309 * PubCodespringer -0.62794 1.19405 -0.526 0.598963 PubCodetaylor -0.07862 1.20986 -0.065 0.948188 PubCodewiley 1.11960 0.76456 1.464 0.143093 is.Eco -0.33555 0.57484 -0.584 0.559403 is.EnvSci 0.46216 0.65644 0.704 0.481416 is.EvoBio 0.75327 0.67710 1.112 0.265925 --- Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1 (Dispersion parameter for binomial family taken to be 1) Null deviance: 224.11 on 299 degrees of freedom Residual deviance: 179.30 on 289 degrees of freedom (7 observations deleted due to missingness) AIC: 201.3 Number of Fisher Scoring iterations: 6 > confint(mylogit) Waiting for profiling to be done... 2.5 % 97.5 % <b>(Intercept) -6.4901304 -2.7394727</b> log(ImFa) 0.4998151 1.6443777 SomeOA -0.9051808 0.8301939 Afil 0.1929881 2.0227830 PubCodeother 0.2325047 3.1117025 PubCodespringer -3.6768695 1.5175314 PubCodetaylor -3.1452976 2.1005325 PubCodewiley -0.3119753 2.7709763 is.Eco -1.4946319 0.7684235 is.EnvSci -0.8165620 1.7687779 is.EvoBio -0.5939997 2.0820360 > exp(mylogit$coefficients) (Intercept) log(ImFa) SomeOA Afil PubCodeother PubCodespringer PubCodetaylor PubCodewiley 0.01139980 2.82930675 0.97600675 2.90840677 4.61284400 0.53368914 0.92439051 3.06363807 is.Eco is.EnvSci is.EvoBio 0.71494624 1.58749158 2.12394266 > exp(confint(mylogit)) # conf int for exp Waiting for profiling to be done... 2.5 % 97.5 % (Intercept) 0.001518351 0.0646044 log(ImFa) 1.648416488 5.1777868 SomeOA 0.404468731 2.2937636 Afil 1.212868348 7.5593332 PubCodeother 1.261756334 22.4592496 PubCodespringer 0.025302058 4.5609519 PubCodetaylor 0.043054111 8.1705199 PubCodewiley 0.731999586 15.9742218 is.Eco 0.224331162 2.1563641 is.EnvSci 0.441948475 5.8636828 is.EvoBio 0.552114556 8.0207830 |