Difference between revisions of "Glossary"

From "A B C"
Jump to navigation Jump to search
Line 16: Line 16:
 
([http://en.wikipedia.org/wiki/Fasta_format Detailed information])
 
([http://en.wikipedia.org/wiki/Fasta_format Detailed information])
  
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
 +
 +
 
  
 
====multi FASTA format====
 
====multi FASTA format====

Revision as of 01:20, 17 November 2006




FASTA format

FASTA is a simple, ASCII based, text-file format for biological sequences. Minimally a FASTA file comprises a header line, initiated with the ">" character, followed by one or more lines containing nucleic acid or protein sequence in one-letter code. This is the most common input format for bioinformatics analysis programs and services.
Example
>gi|3402004|pdb|1MB1|  Mbp1 From Saccharomyces Cerevisiae
MSNQIYSARYSGVDVYEFIHSTGSIMKRKKDDWVNATHILKAANFAKAKRTRILEKEVLKETHEKVQGGF
GKYQGTWVPLNIAKQLAEKFSVYDQLKPLFDFTQTDGSASPPPAPKHHHASKVDHHHHHH

(Detailed information)


 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

multi FASTA format

A sequence file that contains more than one FASTA formatted sequence. The sequences are simply concatenated. This is a common input format for multiple sequence alignment or motif-finding programs.
Example
>Homeobox associated Leucine Zipper from gi|3868845  (134..178)
KQTEVDCELLRKCCASLTEENRRLQMEVDQLRALSTTQLHFSDFV
>Homeobox associated Leucine Zipper from gi 21264431 (168..212)
KQTEVDCEFLKKCCETLADENIRLQKEIQELKTLKLTQPFYMHMP
>Homeobox associated Leucine Zipper from gi|6634483  (212.. 256)
KQTEVDCELLKRCCETLTDENRRLHRELQELRALKLATAAAAPHH

(Detailed information)