Difference between revisions of "Information theory"

From "A B C"
Jump to navigation Jump to search
 
Line 43: Line 43:
 
==Further reading and resources==
 
==Further reading and resources==
 
<div class="reference-box">[http://cm.bell-labs.com/cm/ms/what/shannonday/shannon1948.pdf Shannon's "Mathematical Theory of Communication"] (at Bell labs)</div>
 
<div class="reference-box">[http://cm.bell-labs.com/cm/ms/what/shannonday/shannon1948.pdf Shannon's "Mathematical Theory of Communication"] (at Bell labs)</div>
 +
<div class="reference-box">[http://evfold.org/ EVfold homepage]</div>
 
<div class="reference-box">[http://weblogo.threeplusone.com/ WebLogo server]</div>
 
<div class="reference-box">[http://weblogo.threeplusone.com/ WebLogo server]</div>
  

Latest revision as of 00:16, 8 November 2012

Information theory


This page is a placeholder, or under current development; it is here principally to establish the logical framework of the site. The material on this page is correct, but incomplete.


This is an introduction to information theory for the bioinformatics lab.



 

Contents

Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle H = - \sum_{i=0}^n p_i \log_{2} p_i}

Failed to parse (MathML with SVG or PNG fallback (recommended for modern browsers and accessibility tools): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle I = H_{ref} - H_{obs}}


   

Further reading and resources

Hopf et al. (2012) Three-dimensional structures of membrane proteins from genomic sequencing. Cell 149:1607-21. (pmid: 22579045)

PubMed ] [ DOI ]

Marks et al. (2011) Protein 3D structure computed from evolutionary sequence variation. PLoS ONE 6:e28766. (pmid: 22163331)

PubMed ] [ DOI ]

Thomsen & Nielsen (2012) Seq2Logo: a method for construction and visualization of amino acid binding motifs and sequence profiles including sequence weighting, pseudo counts and two-sided representation of amino acid enrichment and depletion. Nucleic Acids Res 40:W281-7. (pmid: 22638583)

PubMed ] [ DOI ]

Johansson & Toh (2010) A comparative study of conservation and variation scores. BMC Bioinformatics 11:388. (pmid: 20663120)

PubMed ] [ DOI ]

Dou et al. (2010) Several appropriate background distributions for entropy-based protein sequence conservation measures. J Theor Biol 262:317-22. (pmid: 19808039)

PubMed ] [ DOI ]

Capra & Singh (2007) Predicting functionally important residues from sequence conservation. Bioinformatics 23:1875-82. (pmid: 17519246)

PubMed ] [ DOI ]

Wang & Samudrala (2006) Incorporating background frequency improves entropy-based residue conservation measures. BMC Bioinformatics 7:385. (pmid: 16916457)

PubMed ] [ DOI ]

Vingron & Sibbald (1993) Weighting in sequence space: a comparison of methods in terms of generalized sequences. Proc Natl Acad Sci U.S.A 90:8777-81. (pmid: 8415606)

PubMed ] [ DOI ]

Henikoff & Henikoff (1994) Position-based sequence weights. J Mol Biol 243:574-8. (pmid: 7966282)

PubMed ] [ DOI ]

Schneider & Stephens (1990) Sequence logos: a new way to display consensus sequences. Nucleic Acids Res 18:6097-100. (pmid: 2172928)

PubMed ] [ DOI ]