printlogo
http://www.ethz.ch/index_EN
CBRG - Computational Biochemistry Research Group
 
print
  

Constructing Multiple Sequence Alignments - Exercise Week 9

ClustalW is a very popular software program to build multiple sequence alignments. ClustalW is available over many web-interfaces. One of them can be found at: http://www.ch.embnet.org/software/ClustalW.html. You can pass the sequences to ClustalW in the so-called Fasta format. Within this format each sequence has a description line starting with  > and the sequence starts on the next line and is all subsequent lines until the next line that starts with a > indicating that a new sequence starts. For more information on the Fasta format, see: http://www.ch.embnet.org/software/ClustalW_help.html

A newer and much faster MSA program is Mafft, also available as a web service.

Darwin has also some implemented algorithms to construct MSAs. These algorithms are implemented in the Darwin procedure called MAlign.  See ?MAlign and ?MAlignment for help. The result from MAlign is a MAlignment data structure. Create a multiple sequence alignment with the 'prob' method.  Using print will pretty print the alignment, the optimal score and the pairwise scores.

Task

Use the Triosephosphate Isomerase sequences from Human, Monkey, Mosquito and Rice (or any set of homologous sequences of your choice) and build a multiple sequence alignment with ClustalW, Mafft and Darwin's MAlign algorithm. The accession numbers (AC) of those sequences in SwissProt are:

Organism AC
Human P00938
Monkey P15426
Mosquito P92119
Rice P48494

Compare the alignments with respect to where they placed the gaps. Let us assume that you want to use these MSA's to do protein structure prediction. Would you predict similar or different structures based on the MSA's from the different methods?

 

Wichtiger Hinweis:
Diese Website wird in älteren Versionen von Netscape ohne graphische Elemente dargestellt. Die Funktionalität der Website ist aber trotzdem gewährleistet. Wenn Sie diese Website regelmässig benutzen, empfehlen wir Ihnen, auf Ihrem Computer einen aktuellen Browser zu installieren. Weitere Informationen finden Sie auf
folgender Seite.

Important Note:
The content in this site is accessible to any browser or Internet device, however, some graphics will display correctly only in the newer versions of Netscape. To get the most out of our site we suggest you upgrade to a newer browser.
More information

© 2012 ETH Zurich | Imprint | Disclaimer | 19 November 2010
top