EDA
Jump to navigation
Jump to search
EDA (Exploratory Data Analysis)
This page is a placeholder, or under current development; it is here principally to establish the logical framework of the site. The material on this page is correct, but incomplete.
Exploratory Data Analysis (EDA) is a collection of statistical strategies to assist in the preparation of data for further processing and the generation of hypotheses for rigorous follow-up. It employs methods both from descriptive- as well as from inferential statistics, and, since one of its core objectives is to find relationships within datasets, it employs a large variety of data visualization techniques. (See also: Exploratory data analysis)
Contents
Introductory reading
Wu & Wu (2010) Exploration, visualization, and preprocessing of high-dimensional data. Methods Mol Biol 620:267-84. (pmid: 20652508) |
Strategies
...
Visualization
Krzywinski et al. (2009) Circos: an information aesthetic for comparative genomics. Genome Res 19:1639-45. (pmid: 19541911) |
Data Reduction
...
Model Based Exploration
...
Further reading and resources
Schreiber (2008) Visualization. Methods Mol Biol 453:441-50. (pmid: 18712318) |
Foulkes & Au (2011) R statistical tools for gene discovery. Methods Mol Biol 760:73-90. (pmid: 21779991) |
Teo (2010) Exploratory data analysis in large-scale genetic studies. Biostatistics 11:70-81. (pmid: 19828557) |
Azuaje et al. (2005) Non-linear mapping for exploratory data analysis in functional genomics. BMC Bioinformatics 6:13. (pmid: 15661072) |
Ivakhno & Armstrong (2007) Non-linear dimensionality reduction of signaling networks. BMC Syst Biol 1:27. (pmid: 17559646) |
Speed (2011) Mathematics. A correlation for the 21st century. Science 334:1502-3. (pmid: 22174235) |
Reshef et al. (2011) Detecting novel associations in large data sets. Science 334:1518-24. (pmid: 22174245) |
- (Also see: Supporting Online Material)