t-test: p=0.0157513
Neutral set average density: 0.00163451
Different set average density: 0.00194874
motif_present gene_set absent present different 587 407 neutral 1476 477
Fisher's exact test: p=6.2031e-20
Binomial GLM: p=4.86779e-20
Call: glm(formula = motif_present ~ different, family = binomial, data = daf) Deviance Residuals: Min 1Q Median 3Q Max -1.0264 -0.7484 -0.7484 1.3364 1.6790 Coefficients: Estimate Std. Error z value Pr(>|z|) (Intercept) -1.12957 0.05267 -21.447 <2e-16 *** differentTRUE 0.76336 0.08327 9.167 <2e-16 *** --- Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1 (Dispersion parameter for binomial family taken to be 1) Null deviance: 3600.3 on 2946 degrees of freedom Residual deviance: 3516.6 on 2945 degrees of freedom AIC: 3520.6 Number of Fisher Scoring iterations: 4
Binomial GLM adjusted for length: p=0.144768
Call: glm(formula = motif_present ~ length + different, family = binomial, data = daf) Deviance Residuals: Min 1Q Median 3Q Max -2.5871 -0.6788 -0.5168 0.6433 2.1692 Coefficients: Estimate Std. Error z value Pr(>|z|) (Intercept) -2.5207648 0.0894101 -28.193 <2e-16 *** length 0.0065376 0.0003063 21.341 <2e-16 *** differentTRUE 0.1434957 0.0984019 1.458 0.145 --- Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1 (Dispersion parameter for binomial family taken to be 1) Null deviance: 3600.3 on 2946 degrees of freedom Residual deviance: 2863.2 on 2944 degrees of freedom AIC: 2869.2 Number of Fisher Scoring iterations: 4