Dotlet Exercises
Your name: Your email address:
The Swiss Institute for Bioinformatics provides a JAVA applet that perform interactive dot plots. It is called Dotlet (if site doesn't load, try here, here, here, or here [you might need to set the JAVA security setting to allow a particular site to run a script; see here for instructions. Under MacOSX, you also might need to go to the general security settings in system preferences and click on the "open anyhow" button]). The main use of dot plots is to detect domains, duplications, insertions, deletions, and, if you work at the DNA level, inversions (excellent illustrations of the use of dot plots are given on the examples page).
Jalview is a JAVA application to inspect and edit multiple sequence alignments. It also allows inspection of protein space for the aligned sequences. This works surprisingly well. The Jalview Homepage contains a lot of additional information.
Go to the jalview applet page and either start the Jalview desktop (link on top), or a jalview applet (links in the middle of page). If this does not work, download this file, unarchive it, and double-click on Jalview.app. Preferably, you want to load the JaLVIEWdesctop, but the Jalview lite version is just as fine, except the sequence input is more difficult (delete the sequences from the example, add the sequences form the file).
Close the windows that may have opened as a demonstration, except for the multiple sequence alignment window.
Load the sequences from the ATP-ase Subunit alignment into Jalview (either load from file, if the desktop application runs, or paste into the input window -- select new window after you are done pasting.).
Explore the different coloring options (COLOUR menu). Which one seems to work best (most meaningful - scroll through the alignment to a more conserved region). Note: You can change/edit the alignment by clicking on an amino acid residue and dragging it to the right or left using the arrow keys. Try it, but leave the sequences in an aligned state before you move on. (If this doesn't work, presss F2 and try again) Select all sequences. CALCULATE an AVERAGE DISTANCE TREE USING % identity Click somewhere in the resulting tree to color groups of related sequences in the same color. You can right-click (or command click) on a node to change color for a group of sequences. Chose a color scheme that colors all subunits of the same type in the same color CALCULATE the PRINCIPAL COMPONENT ANALYSIS. In a principal component analyses, the new dimensions are calculated as a linear combination of the original dimensions, so that greatest variance by any projection of the data set comes to lie on the first axis, etc. for the following dimensions. Can you find a higher dimension that breaks up the vacuolar ATPase A subunits? (Their names start with A.). Which of the A subunit sequences cluster together, if you use this dimension (1, 2 and 4 worked for me)?
Check the appropriate radio button below before pressing the submit button:
Send email to your instructor (and yourself) upon submit Send email to yourself only upon submit (as a backup) Show summary upon submit but do not send email to anyone.