Difference between revisions of "Reference species for fungi"
m |
|||
Line 285: | Line 285: | ||
;RBMs to MBP1_SACCE | ;RBMs to MBP1_SACCE | ||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
+ | <table cellpadding="5"> | ||
+ | <tr class="sh"><td>name</td><td><small>''Originally...''</small></td><td>RefSeqID</td><td> UniProtID </td></tr> | ||
+ | <tr class="s2"><td>MBP1_ASPNI</td><td>AN3154</td><td>[https://www.ncbi.nlm.nih.gov/protein/67525393 XP_660758]</td><td>[ Q5B8H6]</td></tr> | ||
+ | <tr class="s1"><td>MBP1_BIPOR</td><td>COCMIDRAFT_338</td><td>[https://www.ncbi.nlm.nih.gov/protein/627818929 XP_007682304]</td><td>[ W6ZM86]</td></tr> | ||
+ | <tr class="s2"><td>MBP1_NEUCR</td><td>Swi4</td><td>[https://www.ncbi.nlm.nih.gov/protein/85075775 XP_955821]</td><td>[ Q7RW59]</td></tr> | ||
+ | <tr class="s1"><td>MBP1_SACCE</td><td>Mbp1</td><td>[https://www.ncbi.nlm.nih.gov/protein/6320147 NP_010227]</td><td>[ P39678]</td></tr> | ||
+ | <tr class="s2"><td>MBP1_SCHPO</td><td>Res2</td><td>[https://www.ncbi.nlm.nih.gov/protein/19113944 NP_593032]</td><td>[ P41412]</td></tr> | ||
+ | <tr class="s1"><td>MBP1_COPCI</td><td> </td><td>[https://www.ncbi.nlm.nih.gov/protein/299748003 XP_001837394]</td><td>[ A8NYC6]</td></tr> | ||
+ | <tr class="s2"><td>MBP1_CRYNE</td><td> </td><td>[https://www.ncbi.nlm.nih.gov/protein/58263360 XP_569090]</td><td>[ Q5KMQ9]</td></tr> | ||
+ | <tr class="s1"><td>MBP1_PUCGR</td><td>PGTG_08863</td><td>[https://www.ncbi.nlm.nih.gov/protein/403167277 XP_003327086]</td><td>[ E3KED4]</td></tr> | ||
+ | <tr class="s2"><td>MBP1_USTMA</td><td>UMAG_11222</td><td>[https://www.ncbi.nlm.nih.gov/protein/758987770 XP_011392621]</td><td>[ A0A0D1DP35]</td></tr> | ||
+ | <tr class="s1"><td>MBP1_WALME</td><td> </td><td>[https://www.ncbi.nlm.nih.gov/protein/588255750 XP_006957051]</td><td>[ I4YGC0]</td></tr> | ||
+ | </table> | ||
{{Vspace}} | {{Vspace}} |
Revision as of 15:00, 4 October 2016
Reference fungi data
Explanation and definition for the "reference species" we use for the course.
Many bioinformatics procedures depend on the comparison of sequences between species. To make good use of evolutionary information, we should choose species that span the breadth of observations, and that are not biased towards a particular branch of the phylogenetic tree. To keep procedures manageable, the number of species cannot be "too large". For fungi, we make use of recent phylogenetic studies that establish the branching order of the entire kingdom, and we choose ten representatives for clades at the Class or subphylum level. To illustrate the "class" level: for animals the class levels include e.g. bony and cartilaginous fishes, segmented worms, amphibians, reptiles, birds and mammals - the familiar, very broad categories. I.e. a reference species list of animals, divided along class levels, might include zebrafish, african claw frog, the fruit fly, humans, the raven, the oyster etc. etc. Even though they are all fungi, our reference species are no more similar to each other than the former.
Contents
Reference species
To select a set of diverse species, the whole set of names of genome-sequenced fungi was loaded into the NCBI's Common Taxonomic Tree tree tool. Then ten representative species were manually selected as being well distributed across the tree. The selected species are:
Name | BICODE | tax ID | Classification |
Phylum Ascomycota | |||
Aspergillus nidulans | ASPNI | 162425 | Subphylum Pezizomycotina; Class Eurotiomycetes |
Bipolaris oryzae | BIPOR | 101162 | Subphylum Pezizomycotina; Class Dothideomycetes |
Neurospora crassa | NEUCR | 5141 | Subphylum Pezizomycotina; Class Sordariomycetes |
Saccharomyces cerevisiae | SACCE | 4932 | Subphylum Saccharomycotina |
Schizosaccharomyces pombe | SCHPO | 4896 | Subphylum Taphrinomycotina |
Phylum Basidiomyceta | |||
Coprinopsis cinerea | COPCI | 5346 | Subphylum Agaricomycotina; Class Agaricomycetes |
Cryptococcus neoformans | CRYNE | 5207 | Subphylum Agaricomycotina; Class Tremellomycetes |
Puccinia Graminis | PUCGR | 5297 | Subphylum Pucciniomycotina |
Ustilago maydis | USTMA | 5270 | Subphylum Ustilaginomycotina |
Wallemia mellicola | WALME | 1708541 | Subphylum Wallemiales incertae sedis |
Entrez
- Entrez selection code, e.g. for BLAST searches
"Wallemia mellicola"[organism] OR
"Puccinia Graminis"[organism] OR
"Ustilago maydis"[organism] OR
"Cryptococcus neoformans"[organism] OR
"Coprinopsis cinerea"[organism] OR
"Schizosaccharomyces pombe"[organism] OR
"Aspergillus nidulans"[organism] OR
"Neurospora crassa"[organism] OR
"Bipolaris oryzae"[organism] OR
"Saccharomyces cerevisiae"[organism]
Tax ID
- Taxonomy IDs, e.g. for the NCBI taxonomy browser
4896
4932
5141
5270
5297
5346
5207
101162
162425
1708541
Trees
- Text tree, based on the NCBI Taxonomy Common Tree
Dikarya
|
+--Basidiomycota
| |
| +-Agaricomycotina
| | +-Wallemia mellicola
| | +-Coprinopsis cinerea
| | +-Cryptococcus neoformans
| |
| +-Puccinia graminis
| +-Ustilago maydis
|
+--Ascomycota
|
+-Schizosaccharomyces pombe
|
+-saccharomyceta
+-Saccharomyces cerevisiae
|
+-leotiomyceta
+-Aspergillus nidulans
+-Neurospora crassa
+-Bipolaris oryzae
- Phylip tree format, e.g. to plot cladograms
(
(
'Wallemia mellicola':4,
'Puccinia graminis':4,
'Ustilago maydis':4,
(
'Coprinopsis cinerea':4,
'Cryptococcus neoformans':4
)Agaricomycotina:4
)Basidiomycota:4,
(
(
(
'Aspergillus nidulans':4,
'Bipolaris oryzae':4,
'Neurospora crassa':4
)leotiomyceta:4,
'Saccharomyces cerevisiae':4
)saccharomyceta:4,
'Schizosaccharomyces pombe':4
)Ascomycota:4
)Dikarya:4;
- Cladogram, drawn with the Phylip program
retree
┌──────────── Schizosaccharomyces pombe
│
│ ┌───────────── Aspergillus nidulans
┌─────────────+ │
│ │ ┌────────────+───────────── Bipolaris oryzae
│ │ │ │
│ └─────────────+ └───────────── Neurospora crassa
│ │
──+ └──────────── <span style="background-color:#EEEEBB;">Saccharomyces cerevisiae</span>
│
│ ┌──────────── Cryptococcus neoformans
│ ┌─────────────+
│ │ └───────────── Coprinopsis cinerea
│ │
└─────────────+───────────── Ustilago maydis
│
├───────────── Puccinia graminis
│
└───────────── Wallemia mellicola
R
- A vector of binomial species names.
REFspecies <- c("Aspergillus nidulans",
"Bipolaris oryzae",
"Coprinopsis cinerea",
"Cryptococcus neoformans",
"Neurospora crassa",
"Puccinia graminis",
"Saccharomyces cerevisiae",
"Schizosaccharomyces pombe",
"Ustilago maydis",
"Wallemia mellicola"
)
- A data frame of binomial species and TaxIDs.
refTaxa <- data.frame(
ID = as.integer(c(162425,
101162,
5141,
4932,
4896,
5346,
5207,
5297,
5270,
1708541)),
species = c("Aspergillus nidulans",
"Bipolaris oryzae",
"Neurospora crassa",
"Saccharomyces cerevisiae",
"Schizosaccharomyces pombe",
"Coprinopsis cinerea",
"Cryptococcus neoformans",
"Puccinia Graminis",
"Ustilago maydis",
"Wallemia mellicola"),
stringsAsFactors = FALSE)
Mbp1 orthologues
- RBMs to MBP1_SACCE
name | Originally... | RefSeqID | UniProtID |
MBP1_ASPNI | AN3154 | XP_660758 | [ Q5B8H6] |
MBP1_BIPOR | COCMIDRAFT_338 | XP_007682304 | [ W6ZM86] |
MBP1_NEUCR | Swi4 | XP_955821 | [ Q7RW59] |
MBP1_SACCE | Mbp1 | NP_010227 | [ P39678] |
MBP1_SCHPO | Res2 | NP_593032 | [ P41412] |
MBP1_COPCI | XP_001837394 | [ A8NYC6] | |
MBP1_CRYNE | XP_569090 | [ Q5KMQ9] | |
MBP1_PUCGR | PGTG_08863 | XP_003327086 | [ E3KED4] |
MBP1_USTMA | UMAG_11222 | XP_011392621 | [ A0A0D1DP35] |
MBP1_WALME | XP_006957051 | [ I4YGC0] |
Further reading and resources
Ebersberger et al. (2012) A consistent phylogenetic backbone for the fungi. Mol Biol Evol 29:1319-34. (pmid: 22114356) |