CSB Assignment Week 3
Assignments for Week 3
Collaboration tools, initializing our project.
< Assignment 2 | Assignment 4 > |
Note! This assignment is currently inactive. Major and minor unannounced changes may be made at any time.
Assigned material - concepts, exercises and reading - will be reflected in next week's evaluation and feedback session. Please remember to contribute to self-evaluation questions by Tuesday at noon.
Warm up
In many London Underground tube stations there are two escalators going up but only one going down.
Why? [I don't know...]
Seriously?
Could it have something to do with the required capacity?
Then just model the expected flow. Is it symmetric?
If you think so, you might need a hint... [Ok. A hint please...]
When do people use the escalator?[Sorry. I still don't get it: aren't there as many going down as coming up?]
We'll we'd hope that the number of people going down is the same as coming up. If there were twice as many people coming up, the must have been produced in the underground - an army of tube zombies being bred in the tunnels, spreading through the town to do their evil deeds...
No: the answer has to do with how the streams of passengers distribute over time. Passengers trickle into the station at more or less a constant rate, but when a train arrives, there's a mad scramble by a horde of passengers to get out - and for that the transport needs greater capacity.
Exercises
In this exercise we will attempt to extract a set of relevant genes for the pluripotency network from deposited expression data.
Task:
A recent paper has highlighted the lineage-specific roles of SOX2, OCT4 and NANOG in human cells.
Wang et al. (2012) Distinct lineage specification roles for NANOG, OCT4, and SOX2 in human embryonic stem cells. Cell Stem Cell 10:440-54. (pmid: 22482508) |
[ PubMed ] [ DOI ] Nanog, Oct4, and Sox2 are the core regulators of mouse (m)ESC pluripotency. Although their basic importance in human (h)ESCs has been demonstrated, the mechanistic functions are not well defined. Here, we identify general and cell-line-specific requirements for NANOG, OCT4, and SOX2 in hESCs. We show that OCT4 regulates, and interacts with, the BMP4 pathway to specify four developmental fates. High levels of OCT4 enable self-renewal in the absence of BMP4 but specify mesendoderm in the presence of BMP4. Low levels of OCT4 induce embryonic ectoderm differentiation in the absence of BMP4 but specify extraembryonic lineages in the presence of BMP4. NANOG represses embryonic ectoderm differentiation but has little effect on other lineages, whereas SOX2 and SOX3 are redundant and repress mesendoderm differentiation. Thus, instead of being panrepressors of differentiation, each factor controls specific cell fates. Our study revises the view of how self-renewal is orchestrated in hESCs. |
- First, we will access the relevant data series on GEO, the NCBI's database for expression data.
- Navigate to the pubMed page of the article via the link provided in the reference box above.
- Follow the link to associated GEO records in the right hand side of the PubMed page (under Related Information). The top hit is a Superseries, composed of a number of Subseries of experiments.
- Open its link in a new tab.
- Examine the samples that are included in this study by expanding the list of samples. You will notice that the sample titles tell you a bit about the experiment, the actual Subseries page describes more about the experiment, but here, and in general, for a reasonable understanding of the experimental variables, you will need to read the actual paper.
- Not for this first-look exercise however – just note: shXXX samples are knock-downs (KD) using a lentiviral short-hairpin RNA, OE is overexpression, H1 and H9 are human embryonal stem-cell lines.
We can pursue the question: if any or all of the pluripotency maintaining transcription factors are knocked down – presumably a surrogate for a differentiation signal – what are the downstream targets and what do they have in common; conversely, what complementary effects are observed when these factors are overexpressed? The first step therefore is to identify differentially expressed genes. Conveniently, GEO offers the GEO2R utility to help perform differential expression analysis.
- View the GEO2R video tutorial on youtube.
- Now proceed to apply this to the stem-cell transcription factor study
- On the Superset page, click on the Analyze with GEO2R link.
- Click on the Treatment column header to sort the series by experimental variable.
- Define meaningful groups: you could name them SOX2 KD, SOX2 OE, the same for NANOG and OCT4, and CTRL. (Note that these are just names, you could also have called the groups Capitoline, Palatine, Esquiline, Aventine, Caelian, Viminal, and Quirinal – if you remember what the names stand for.)
- Then associate the group names with relevant experiments, as shown in the video. For the control samples, you can combine the H1 "controls" and the H1 "untreated" samples from the BMP4 treatment series.
- Confirm that the value distributions are unbiased - overall, in such experiments, the bulk of the expression values should not change and thus means and quantiles of the expression levels should be about the same. You should note that the OE samples are systematically different from the others, and that one of the NANOG samples has very low values. Remove that series from your list and rerun the distribution to confirm that the data is no longer in the list.
- In the GEO2R tab, click on the Top 250 button to execute the analysis of significantly differentially expressed genes.
- By clicking on a few of the gene names in the Gene.symbol column, you can view the expression profiles that tell you why the genes were found to be differentially expressed. Can you identify a gene that increases in expression in response to all three factors?
- Finally, review the R script for your analysis. Check if there are any aspects of the code that you don't understand. That will give you an idea of the level to which you ought to bring your R skills. But not right now – and: no worries, R code analysis will not be required on Wednesday's quiz.
Pre-reading
No pre-reading: Open Project visions are due in class!
- That is all.
Footnotes and references
- Ask, if things don't work for you!
- If anything about the assignment is not clear to you, please ask on the mailing list. You can be certain that others will have had similar problems. Success comes from joining the conversation.
- Do consider how to ask your questions so that a meaningful answer is possible. the following two links:
- How to create a Minimal, Complete, and Verifiable example on stackoverflow and ...
- How to make a great R reproducible example
- ... are required reading.
< Assignment 2 | Assignment 4 > |