Expected Preparations:
|
|||||||||||
|
|||||||||||
Keywords: A small-molecule structure tutorial | |||||||||||
|
|||||||||||
Objectives:
This unit will …
|
Outcomes:
After working through this unit you …
|
||||||||||
|
|||||||||||
Deliverables: Time management: Before you begin, estimate how long it will take you to complete this unit. Then, record in your course journal: the number of hours you estimated, the number of hours you worked on the unit, and the amount of time that passed between start and completion of this unit. Journal: Document your progress in your Course Journal. Some tasks may ask you to include specific items in your journal. Don’t overlook these. Insights: If you find something particularly noteworthy about this unit, make a note in your insights! page. |
|||||||||||
|
|||||||||||
Evaluation:
|
Creating with small molecule structures, finding complexes in the PDB that contain the molecule, and superimposing model and structure.
Task…
“Small” molecules are solvent, ligands, substrates, products, prosthetic groups, drugs - in short, essentially everything that is not made by DNA-, RNA-polymerases or the ribosome. Whereas the biopolymers are still front and centre in our quest to understand molecular biology, small molecules are crucial for our quest to interact with the inventory of the cell, create useful products, or advance medicine.
A number of public repositories make small-molecule information available, such as PubChem at the NCBI, the ligand collection at the PDB, the ChEBI database at the European Bioinformatics Institute, the Canadian DrugBank, or the NCI database browser at the US National Cancer Institute. One general way to export topology information from these services is to use SMILES strings(W) —a shorthand notation for the composition and topology of chemical compounds.
Task…
Caffeine at PubChem 1. Access PubChem. 1. Enter “caffeine”
as a search term in the Compound tab. A number of
matches to this keyword search are returned. 1. Click on the top hit -
1,3,7-Trimethylxanthine, the Caffeine molecule. Note that the page
contains among other items: 1. # A 2D structural sketch; 1. # An
idealized 3D structural conformer, for which you can download
coordinates in several formats; 1. # The IUPAC name:
1,3,7-trimethylpurine-2,6-dione
; 1. # The CAS identifier
58-08-2
which is a unique identifier and can be used as a
cross-reference ID; 1. # The SMILES string(W)
CN1C=NC2=C1C(=O)N(C(=O)N2C)C
; 1. # … and much more.
Task…
Caffeine at DrugBank 1. Access DrugBank. 1. Enter “Caffeine” in the
search form and. . 1. Click on the hit to “Caffeine”
itself. Note that the page contains among other items: 1. # A
detailed description 1. # A 2D structural sketch with a link to 3D
options; 1. # Synonyms, including the IUPAC name:
1,3,7-trimethylpurine-2,6-dione
; 1. # … and much more. 1.
Follow the link to
the 3D options and note the options for downloading information,
including the SMILES string and PDB formatted coordinates.
That’s great, but let’s sketch our own version of caffeine. Several versions of Peter Ertl’s Java Molecular Editor (JME)(W) are offered online, PubChem offers this functionality via its Sketcher tool and the PDB has a similar sketching tool on its ligand search page]
Task…
ChimeraX can translate SMILES strings to coordinates1.
Task…
surface gridSpacing 0.1
to increase
the resolution five-fold from its default 0.5A.lighting soft
camera
sbs
.{{Stereo|Caffeine_stereo.jpg|Wall-eye stereo view of the caffeine structure, surrounded by a transparent molecular surface. The image for the left eye is on the left side.}}
To investigate a small molecule structure variant in the context of a complex, we need to superimpose it with an existing ligand.
Task…
3G6M
in ChimeraX. This is one of the
hits returned from the PDB search for caffeine - a fungal chitinase for
which caffeine is a potent inhibitor.To superimpose the structures, we can’t use the standard “match” option, because that only works for protein or DNA molecules. Instead, we need to explicitly define matching pairs of atoms through ChimeraX’s command line interface. The command line interface is a very powerful way to issue ChimeraX commands, but it has a bit of a learning curve since we need to use a precise model/residue/atom selection syntax.
align
, and we need to feed the
command atoms in exactly the order of the pairs that the superposition
algorithm should superimpose. To identify the atom numbers, we can hover
over them with the mouse, or we can select the residue/atom and choose
Actions ▸ Label ▸
name. If we superimpose the four nitrogen atoms, the
correct command may be:align #1@N1,N2,N4,N3 to #2:427@N3,N1,N7,N9
to superimpose the model we
built from the SMILES string onto the structure - but the exact atom
names in the model structure depend on how the SMILES string was
written.
If in doubt, ask! If anything about this contents is not clear to you, do not proceed but ask for clarification. If you have ideas about how to make this material better, let’s hear them. We are aiming to compile a list of FAQs for all learning units, and your contributions will count towards your participation marks.
Improve this page! If you have questions or comments, please post them on the Quercus Discussion board with a subject line that includes the name of the unit.
[END]
There are also several online servers that translate SMILES strings to idealized structures, for example the online SMILES translation service at the NCI.↩︎