Statistics
Statistics
This page is a placeholder, or under current development; it is here principally to establish the logical framework of the site. The material on this page is correct, but incomplete.
Statistics is indispensable to describe and analyse large data sets, such as those commonly encountered in computational biology. Roughly, three areas apply: descriptive statistics, to describe features of sets of data, inferential statistics, to quantify the significance of observations, and probability theory, to provide the theoretical basis for drawing such inferences. In practice we often apply procedures of Exploratory Data Analysis to find interesting features of our data and devise hypotheses and strategies for its analysis.
Contents
Introductory reading
Nicholls (2011) What do we know?: simple statistical techniques that help. Methods Mol Biol 672:531-81. (pmid: 20838984) |
Further reading and resources
Johnson (2013) Revised standards for statistical evidence. Proc Natl Acad Sci U.S.A 110:19313-7. (pmid: 24218581) |
Xu et al. (2010) Categorical data analysis in experimental biology. Dev Biol 348:3-11. (pmid: 20826130) |
Xu et al. (2010) Categorical data analysis in experimental biology. Dev Biol 348:3-11. (pmid: 20826130) |
Cumming et al. (2007) Error bars in experimental biology. J Cell Biol 177:7-11. (pmid: 17420288) |
Explorations in Statistics series
Curran-Everett (2008) Explorations in statistics: standard deviations and standard errors. Adv Physiol Educ 32:203-8. (pmid: 18794241) |
Curran-Everett (2009) Explorations in statistics: hypothesis tests and P values. Adv Physiol Educ 33:81-6. (pmid: 19509391) |
Curran-Everett (2009) Explorations in statistics: confidence intervals. Adv Physiol Educ 33:87-90. (pmid: 19509392) |
Curran-Everett (2009) Explorations in statistics: confidence intervals. Adv Physiol Educ 33:87-90. (pmid: 19509392) |
Curran-Everett (2009) Explorations in statistics: the bootstrap. Adv Physiol Educ 33:286-92. (pmid: 19948676) |
Curran-Everett (2010) Explorations in statistics: power. Adv Physiol Educ 34:41-3. (pmid: 20522895) |
Curran-Everett (2010) Explorations in statistics: correlation. Adv Physiol Educ 34:186-91. (pmid: 21098385) |
Curran-Everett (2011) Explorations in statistics: regression. Adv Physiol Educ 35:347-52. (pmid: 22139769) |
Curran-Everett (2012) Explorations in statistics: permutation methods. Adv Physiol Educ 36:181-7. (pmid: 22952255) |
General
Nick (2007) Descriptive statistics. Methods Mol Biol 404:33-52. (pmid: 18450044) |
Tu (2007) Basic principles of statistical inference. Methods Mol Biol 404:53-72. (pmid: 18450045) |
Perkins (2007) Statistical inference on categorical variables. Methods Mol Biol 404:73-88. (pmid: 18450046) |
Wittkowski & Song (2010) Nonparametric methods for molecular biology. Methods Mol Biol 620:105-53. (pmid: 20652502) |
Alonzo & Pepe (2007) Development and evaluation of classifiers. Methods Mol Biol 404:89-116. (pmid: 18450047) |
Berman (2007) Comparison of means. Methods Mol Biol 404:117-42. (pmid: 18450048) |
Eberly (2007) Correlation and simple linear regression. Methods Mol Biol 404:143-64. (pmid: 18450049) |
Eberly (2007) Multiple linear regression. Methods Mol Biol 404:165-87. (pmid: 18450050) |
Ip (2007) General linear models. Methods Mol Biol 404:189-211. (pmid: 18450051) |
Oberg & Mahoney (2007) Linear mixed effects models. Methods Mol Biol 404:213-34. (pmid: 18450052) |
Grady (2007) Analysis of change. Methods Mol Biol 404:261-71. (pmid: 18450054) |
Nick & Campbell (2007) Logistic regression. Methods Mol Biol 404:273-301. (pmid: 18450055) |
Jiang & Fine (2007) Survival analysis. Methods Mol Biol 404:303-18. (pmid: 18450056) |
Glickman & van Dyk (2007) Basic Bayesian methods. Methods Mol Biol 404:319-38. (pmid: 18450057) |
Wilkinson (2007) Bayesian methods in bioinformatics and computational systems biology. Brief Bioinformatics 8:109-16. (pmid: 17430978) |
D'Agostino (2007) Overview of missing data techniques. Methods Mol Biol 404:339-52. (pmid: 18450058) |
Case & Ambrosius (2007) Power and sample size. Methods Mol Biol 404:377-408. (pmid: 18450060) |
Sabatti (2007) Avoiding false discoveries in association studies. Methods Mol Biol 376:195-211. (pmid: 17984547) |
Berman & Gullíon (2007) Working with a statistician. Methods Mol Biol 404:489-503. (pmid: 18450064) |