Difference between revisions of "Clustering"

From "A B C"
Jump to navigation Jump to search
(Created page with "<div id="APB"> <div class="b1"> Clustering and Classification </div> {{dev}} {{WP|Cluster analysis|'''Clustering'''}} and {{WP|Statistical classification|'''Classification...")
 
 
(2 intermediate revisions by the same user not shown)
Line 12: Line 12:
  
 
__TOC__
 
__TOC__
 
 
<!--
 
==Contents==
 
  
  
 
==Introductory reading==
 
==Introductory reading==
 
<section begin=reading />
 
<section begin=reading />
 +
{{#pmid: 20652512}}
 
<section end=reading />
 
<section end=reading />
 +
 +
 +
<!--
 +
==Contents==
  
  
Line 35: Line 36:
 
==Further reading and resources==
 
==Further reading and resources==
  
 +
{{#pmid: 22275205}}
 +
{{#pmid: 22144159}}
 +
{{#pmid: 19957146}}
 +
{{#pmid: 19381534}}
 
{{#pmid: 18712317}}
 
{{#pmid: 18712317}}
{{#pmid: 20652512}}
 
{{#pmid: 22144159}}
 
 
{{#pmid: 18450047}}
 
{{#pmid: 18450047}}
 
<!--
 
<!--

Latest revision as of 15:02, 17 October 2013

Clustering and Classification


This page is a placeholder, or under current development; it is here principally to establish the logical framework of the site. The material on this page is correct, but incomplete.


Clustering and Classification are conceptually related statistical techniques in that clustering attempts to identify groupings in existing data and classification asks how best to assign new data into existing groups. However, the techniques that are employed are quite different. Clustering techniques find partitions such that relationships within a set are greater than between members of different sets; they are often divided into connectivity based approaches (e.g. "hierarchical clustering") and centroid based approaches (e.g. K-means), but other approaches such as density based clustering (eg. DBSCAN) or the flow-based MCL algorithm are increasingly important in our field. Classification techniques rely heavily on machine learning methods, often in a Bayesian framework: neural networks, support vector machines, hidden Markov models, decision trees etc'.



Introductory reading

Nugent & Meila (2010) An overview of clustering applied to molecular biology. Methods Mol Biol 620:369-404. (pmid: 20652512)

PubMed ] [ DOI ]


Further reading and resources

Xu & Wunsch (2010) Clustering algorithms in biomedical research: a review. IEEE Rev Biomed Eng 3:120-54. (pmid: 22275205)

PubMed ] [ DOI ]

van Dongen & Abreu-Goodger (2012) Using MCL to extract clusters from networks. Methods Mol Biol 804:281-95. (pmid: 22144159)

PubMed ] [ DOI ]

Frades & Matthiesen (2010) Overview on techniques in cluster analysis. Methods Mol Biol 593:81-107. (pmid: 19957146)

PubMed ] [ DOI ]

Yona et al. (2009) Comparing algorithms for clustering of expression data: how to assess gene clusters. Methods Mol Biol 541:479-509. (pmid: 19381534)

PubMed ] [ DOI ]

McLachlan et al. (2008) Clustering. Methods Mol Biol 453:423-39. (pmid: 18712317)

PubMed ] [ DOI ]

Alonzo & Pepe (2007) Development and evaluation of classifiers. Methods Mol Biol 404:89-116. (pmid: 18450047)

PubMed ] [ DOI ]