| In Spring 2011 the course is taught by dr. Erwin Bakker. IN4173 / MCBDelft - Leiden BioInformatics Master track
»Molecular Computational Biology 2009shouldn't that be "Computational Molecular Biology"?
A.P. Gultyaev (Sacha)Instituut voor Biologie Leiden (IBL)
 a.p.gultyaev (a) biology.leidenuniv.nl
 
H.J. Hoogeboom (Hendrik Jan) »Institute for Advanced Computer Science (LIACS)
 hoogeboom (a) liacs.nl
 
Schedule:
Tuesday Feb 3 - Apr 28, 9.00-10.45, Snellius  174.
classes
 
Exam:
2 June 2009, 14.00-17.00, Snellius 174To practise: old exams.
 Scheduletentative! 
In principle we will follow the same plan as last year.
To be updated during the semester.
| date |  |  | topic | handout |  
|  |  | APG | deadline exercises | see below |  
| 03.02.09 | 1 | APG | Basic molecular biology Databases
 | Lecture Notes (covering the full range of APG lectures) With exercises!
 |  
| 10.02.09 | 2 | HJH | String alignment by Dynamic programming | Copies of transparencies 
                [pdf]
	14.2'09 copies of Alignment notes
        (Ron Shamir
        Lecture 2)
 Eddy S.R. (2004a, Dynamic programming)
 (new, under development:) 
	lecture notes
	12.02'09
 |  
| 17.02.09 | 3 | APG | Alignment & database search |  |  
| 24.02.09 | 4 | HJH | Hidden Markov Models | Copies of transparencies
                [pdf]
                10.3'09 copies of Hidden Markov Models notes
        (Ron Shamir
        Lecture 5)
 Eddy S.R. (2004b, BLOSUM62).
 Eddy S.R. (2004c, hidden Markov).
 |  
| 03.03.09 | 5 | APG | Multiple alignment, profiles, Gene finding
 |  |  
| 10.03.09 | 6 | HJH | Applications of HMM | Copies of transparencies
                [pdf]
		                10.3'09 copies of HMM application notes Gene Finding
	(Ron Shamir 
        Lecture 7)
 |  
| 17.03.09 | 7 | APG | Structure prediction (1) |  |  
| 24.03.09 | 8 | APG | Structure prediction (2) |  |  
| 31.03.09 | 9 | HJH | Phylogeny | Copies of transparencies [pdf] 1MB
	5.4'06 lecture notes
	12.02'09
 alternatively, e.g.,
        copies of Phylogeny notes 
	(Ron Shamir 
        Lecture 8)
 or 
	Mona Singh
 |  
| 07.04.09 | 10 | APG | RNA |  |  
| 14.04.09 | 11 | HJH | Physical Mapping (PQ trees),
	sequencing | copies of  Physical Mapping
	(Ron Shamir 
        Lecture 9, sections 9.1, 9.2) and DNA sequencing, shortest common superstring
	(Saad Mneimneh
	Lecture 15)
 transparencies [pdf] 1MB
	16.4'08 
	[see also Mneimneh]
 (new, under development:) 
	lecture notes
	05.03'09
 |  
| 21.04'09 | 12 | HJH | Exact pattern matching: Aho-Corasick,
	Suffix trees
 | transparencies [pdf] 206kB
	21.4'09 |  
| 28.04'09 | - | - | available when needed |  |  
 Recommended reading and websitesFree text availability, e.g. via PubMedCentral     
PubMedCentral  (PMC) is indicated. Other articles
are available via Leiden University 
Digital Library  
or (within University)  at journal sites.
Altschul S.F. et al. 
The statistics of sequence similarity scores. (BLAST tutorial). 
www.ncbi.nlm.nih.gov/BLAST/tutorial/Altschul-1.html
Brudno M. et al. (2003). 
 LAGAN and Multi-LAGAN: efficient tools for large-scale multiple alignment of genomic DNA. Genome Res. 13:721-731. 
doi: 10.1101/gr.926603 
(free full-text at www.genome.org)
Durbin R., Eddy S.R., Krogh  A., Mitchison G. added (HJH)
 
Biological Sequence Analysis, Cambridge University Press, 1998.
(see e.g., amazon.co.uk)
Eddy S.R. (2004a). 
 What is dynamic programming? Nature Biotechnology 22:909-910
(doi: 10.1038/nbt0704-909)
Eddy S.R. (2004b). 
 Where did the BLOSUM62 alignment score matrix come from? 
Nature Biotechnology 22:1035-1036.
(doi: 10.1038/nbt0804-1035)
Eddy S.R. (2004c). added (HJH)
 What is a hidden Markov model? Nature Biotechnology 22:1315-1216
(doi: 10.1038/nbt1004-1315)
 This and other introductions to Computational Biology, as
Primer in Nature Biotechnology.
 
Gribskov M., McLachlan A.D. & Eisenberg D. (1987). 
 Profile analysis: detection of distantly related proteins. Proc. Natl. Acad. Sci. USA 84:4355-4358. 
(PMCID: 305087)
Pearson W.R. & Lipman D.J. (1988). 
 Improved tools for biological sequence comparison. Proc. Natl. Acad. Sci. USA 85:2444-2448. 
(PMCID: 280013)
Sippl M.J. (1993). 
 Boltzmann's principle, knowledge-based mean fields and protein folding. An approach to the computational determination of protein structures. Journal of Computer-Aided Molecular Design 7:473-501. 
www.came.sbg.ac.at/Publications/JCAMD.7-473-501.pdf
Schwartz S. et al. (2003). 
 Human-mouse alignments with BLASTZ. Genome Res. 13:103-107. 
doi: 10.1101/gr.809403 (free full-text at www.genome.org)
Thompson J.D., Higgins D.G. & Gibson T.J. (1994). 
 CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22:4673-4680. 
(PMCID: 308517)
Zuker M. (2000). 
 Calculating nucleic acid secondary structure. Curr. Opinion Struct. Biol. 10:303-310.
doi: 10.1016/S0959-440X(00)00088-9 
(within University free under Elsevier contract)
 WWWDatabases publicly available on the web were covered in the
first lecture.
The following resources are from 
the handout.
 
Some other addresses from the lecture notes:
 
Section 1.1: 
Sample GenBank Record
 
Section 5.2.1:
For instance, 
SWISS-MODEL, one of the first servers for protein structure predictions, initiated in 1993 and accessible via the ExPASy web server.
(swissmodel.expasy.org)
 
Section 6.1: 
Many of these parameters have been estimated from thermodynamic experiments (e.g. D.Turner et al.)  and are available in the Internet.
frontend.bioinfo.rpi.edu/zukerm/rna/energy/
 
The algorithm used for energy minimization by dynamic programming is somewhat similar to the algorithm used for alignment problem. The program mfold (M. Zuker) is the most frequently used.
Mfold web server:   
www.bioinfo.rpi.edu/applications/mfold
 
Section 6.2:
Kinefold server of H. Isambert allows one to visualise kinetics of RNA folding.
   kinefold.u-strasbg.fr
 
Section 6.4:
An example of user-friendly server is GPRM (genetic programming for RNA motifs) server (Hu, 2003).
 (bioinfo.cis.nctu.edu.tw/service/gprm).
 Exercises/assignments 2009Two out of ten points for the final grade are based
on the exercises from the hand-out (in other
words, without assignments the maximum mark is 8). 
The answers to the problems formulated in the assignments may
be either sent to a.p.gultyaev /at/ biology.leidenuniv.nl or submitted
after the lecture hours. 
The deadlines for submission in spring 2009:
| exercise | date |  
| 1 - 6 | March 10 |  
| 7 - 10 | March 20 |  
| 11 - 12 | April 20 |  
To avoid tedious re-typing of the sequences, we made the
assignments available
on the web.
 this page: http://www.liacs.nl/home/hoogeboo/mcb/ (jan 09)
 Comments to HJH
 |