MCB/EEB372
Exam 1 (take
home, return by March 29)
An electronic copy of this exam is available on the MCB372 - homepage
1) Assume that the following protein sequence were
obtained from the translation of a putative ORF of a thermophilic
bacterium:
>ORF
with unknown function
MIALILSREAIAGSCGGLRDVKQRRLILELPSDNICYMNPAGSSIHYKGRKLPHAVFDIPRIGVDGPLRDTAALRQFEMLGS
YPLNESVAIARARDKLRSMQLLARQGIDLPVTGIAHSPDDTSDLIDMVGGAPLVVKLVEKGTGIGVVLAETRQAAESVIDAF
QEYRGLNAHILVIKEAQGCDIRCVSGDTPVLLDAGERRIGDLFMEAIRPKERGEIGQNEEIRVLHDWRIYSGSMVETVEIVS
IYHVKIGNAHGKSVRTENGREVRVTPVHKLFNAIVNSVIERPASEVNEGDEIAWPSVSENGDSQTVTTTFLVLDRVVSEMKH
SGVFDVYDLMVPDYGYNFIGGNGLIVLHNCLVVGDEVVGCYARNIPPNDFRSNLNIGGAASVASITPQEREIAIAVGRTMAL
DVAGVDIVLMENDRAGNGPPGILVSLETTGGEKIAIDGKMIRYWITRHETTEAGLGCK
What might be the catalytic activity or function(s) of this protein?
Is
there a common
functional or structural feature among the homologous proteins? What does this tell you
about the protein encoded by this ORF?
Which catalytic activities would you expect for the encoded protein?
Briefly describe how you went about to find homologues to this sequence.
2) Background information: The Pyrococcus A-ATPases have been sequenced as part of
three different genome projects. In all
cases an additional sequence, an intein, was found inserted into the A
subunits. The A-ATPase A-subunit is
homologous to the F-ATPase beta subunits.
Where is the intein located in the Pyrococcus
A-ATPase A-subunits?
Utilizing the homology to the F-ATPases, determine which place of the F1-ATPase structure
(1bmf) corresponds to the intein integration point.
Your
short written answer should include a short description of the tools you used to address this question, and a picture of
the F1-ATPase with the residues homologous to the intein integration point
indicated.
It is
sufficient to use a pen and to draw an arrow pointing to the integration points
in this picture.
3) The following file (here)
contains sequences of several vacuolar ATPase A-subunits and prokaryotic
V/A-ATPase catalytic subunit. Using the
tools of bootstrapping, ml-mapping and/or ml ratio tests address the following
questions:
Do all of the bacterial sequences (Chlamydia,
Chlamydiophila, Enterococcus, Thermus, Deinococcus, Treponema and
Borrelia) form a monophyletic group?
Can one confidently exclude this possibility? Would the answer change, if
one only considers the sequences from Chlamydia, Chlamydiophila, Treponema and
Borrelia?
To incorporate among site rate variation into your model, use a Gamma
distribution with 8 categories and a shape parameter alpha of 0.6. Good starting point to design hypotheses are
the following (names truncated to 10 characters):
A completely resolved tree calculated via neighbor joining from pairwise distances
(((Deinococcu,Thermus),((Enterococc,(((Archaeoglo,(BMethanosa,MMethanosa)),(Haloferax,Halobacter)),
(Thermoplas,(((Thermococc,Desulfuroc),(aPyrococcu,hPyrococcu)),(jMethanoco,Methanobac))))),
((1Treponema,(Aeropyrum,(aSulfolobu,sSulfolobu))),((Chlamydia,Chlamydoph),(2Treponema,Borrelia))))),
(Giardia,(Plasmodium,((Entamoeba,((Dictyostel,((Ascidia,(Bos1,Manduca)),(Nosema,(Saccharomy,
Neurospora)))),(Cyanidium,(Arabidopsi,(2Acetabula,Scherffeli))))),(Trypanosom,Trichomona)))));
A tree that has all branches collapsed that were not significantly positive is the following:
(((Deinococcu,Thermus),(Enterococc,(Archaeoglo,(BMethanosa,MMethanosa),(Haloferax,Halobacter)),
Thermoplas,((Thermococc,Desulfuroc),(aPyrococcu,hPyrococcu)),(jMethanoco,Methanobac),(1Treponema,
(Aeropyrum,(aSulfolobu,sSulfolobu)),((Chlamydia,Chlamydoph),(2Treponema,Borrelia))))),(Giardia,
(Plasmodium,Trichomona,Trypanosom,Entamoeba,((Dictyostel,((Ascidia,Bos1,Manduca),(Nosema,(Saccharomy,Neurospora)))),(Cyanidium,(Arabidopsi,(2Acetabula,Scherffeli)))))));