Difference between revisions of "User:Boris/BCB hackathon 2018"

From "A B C"
Jump to navigation Jump to search
Line 66: Line 66:
 
{{Smallvspace}}
 
{{Smallvspace}}
  
Our imagination of the genome has matured tremendously. '''What is the image we will find for it?'''  
+
Our imagination of the genome has matured tremendously.  
 +
 
 +
{{Smallvspace}}
 +
 
 +
'''What is the image we will find for the Human Genome – 20 years on?'''  
  
  

Revision as of 01:53, 7 February 2018

BCB Hackathon 2018

(Topic Proposal: The Human Genome - 20 years later)


 


Abstract:

This topic proposal for the 2018 BCB hackathon is to explore new ways to provide a holistic overview of the contents of the human genome, for the occasion of the 20th anniversary of its sequence.


 


 

Background

 

The first two draft sequences of the human genome were published in February of 2001[1][2]. Three years from now will mark the twentieth anniversary of this accomplishment that like now other has shaped the landscape of bioinformatics, computational biology and molecular medicine.


 

In 2001, Celera - a private company founded three years earlier to commercialize genome information - published an iconic poster summarizing their version of the genome. It is still fascinating today.

CeleraPoster.med.jpg

This poster is significant, not so much for its interpretable content, but for the unique perspective it gives us on the entirety of information that constitutes our molecular identity.

CeleraPosterTop.med.jpg

The details are rich, in fact, surprisingly "modern", presenting features like CpG islands and SNP density, and exon transcripts with Gene Ontology functional categories colour coded, for forward and reverse strand, accurately plotted on the nucleotide backbone at about 500 kB per centimetre. This was computed from gff records with Josep Abril's gff2ps software[3].

CeleraPosterChr4.med.jpg


 

But we know so much more today. The number of sequenced genomes has exploded - selected individuals at first, then we envisioned the 1,000 genomes project (2008, completed 2012); quickly set our sights on 100,000 genomes (2012, almost completed), and as of today more than 500,000 human genomes have been sequenced overall. We have sequenced cancers, and genetic diseases. We have sequenced representatives of virtually all ethnicities on the planet. We have even sequenced Neanderthals and Denisovians, and we have sequenced other species far and wide to acquire a sense of where we fit into the landscape of evolution. We have annotated the contents of the genome in the ENCODE project. We have built databases that carefully dissect all proteins into their domains, such as InterPro. We have started to outline how things work together in functional networks such as the STRING data, or in modules as published by KEGG, and we are beginning to translate our insights into actionable information for medicine, at the OICR, at Sick Kids' TCAG.


 

Our imagination of the genome has matured tremendously.


 

What is the image we will find for the Human Genome – 20 years on?


 

Goals

 


 

Process

 


 

Perspectives

 


 

Notes

 
  1. Lander et al. (2001) Initial sequencing and analysis of the human genome. Nature 409:860-921. (pmid: 11237011)

    PubMed ] [ DOI ]

  2. Venter et al. (2001) The sequence of the human genome. Science 291:1304-51. (pmid: 11181995)

    PubMed ] [ DOI ]

  3. Abril & Guigó (2000) gff2ps: visualizing genomic annotations. Bioinformatics 16:743-4. (pmid: 11099262)

    PubMed ] [ DOI ]


 



About ...

 

Last update:

2017-02-06

Version:

1.0

Version history:

  • 1.0 First proposal


 


CreativeCommonsBy.png This copyrighted material is licensed under a Creative Commons Attribution 4.0 International License. Follow the link to learn more.