Problem Set 1
Use the links and their instructions from the Applications section to help answer
the following questions.
1. Find the nucleotide sequences of the following genes:
a. Human ornithine aminotransferase (OAT), nuclear gene encoding
mitochondrial protein
b. YEL076C
gene in chromosome V of Saccharomyces cerevisiae.
c. CDC13p,
Saccharomyces cerevisiae single-stranded telomeric binding protein.
d. Homo
sapiens herpesvirus protein B (HVEB) mRNA
e. Simian immunodeficiency virus of Chimpanzee (Pan
troglodytes), related to HIV.
2. Find the possible origin and gene function of the following nucleotide sequences:
a. Sequence 1
CACGGTGATCAAAGTGAGAATGAGCTCCCAGGATTGGGGGGCAAGGAAGATAGGAGGGTCAAACAGAGTCGGGGAGAAGCCAGGGA
GAGCTACAGAGAAACCGGGTCCAGCAGAGCAAGTGATGAGAGAGCTGCCCATCTTCCAACCAGCACACCCCTAGACATTGACACTG
CATCGGAGTCAGGCCAAGATCCGCAGGACAGTCGAAGGTCAGCTGACGCCCTGCTCAGGCTGCAAGCCATGGCAGGAATCTTGGAG
GAACAAGGCTCAGACACGGACACCCCTAGGGTG
b. Sequence 2
CCCTGTGGAGCCACACCCTAGGGTTGGCCAATCTACTCCCAGGAGCAGGGAGGGCAGGAGCCAGGGCTGGGCATAAAAGTCAGGGC
AGAGCCATCTATTGCTTACATTTGCTTCTGACACAACTGTGTTCACTAGCAACCTCAAACAGACACCATGGTGCACCTGACTCCTG
AGGAGAAGTCTGCCGTTACTGCCCTGTGGGGCAAGGTGAACGTGGATGAAGTTGGTGGTGAGGCCCTGGGCAGGTTGGTATCAAGG
TTACAAGACAGGTTTAAGGAGACCAATAGAAACTGGGCATGTGGAGACAGAGAAGACTCTTGGGTTTCTGATAGGCACTGACTCTC
TCTGCCTATTGGTCTATTTTCCCACCCTTAGGCTGCTGGTGATCTACCCTTGGACCCAGAGGTTCTTTGAGTCCTTTGGGGATCTG
TCCACTCCTGATGCTGTTATGGG
c. Sequence 3
AGTACGGTACACCAATTATTTTGAAGGCCGCTTATGGAGGAGGTGGTCGTGGAATTCGTCGTGTTGATAAATTGGAAGAAGTTGA
AGAGGCATTCCGTAGATCCTACTCGGAAGCTCAAGCTGCGTTTGGAGACGGAAGTCTTTTCGTTGAAAAGTTTGTTGAGAGACCA
AGACATATTGAAGTTCAGCTGCTTGGAGACCATCATGGAAATATTGTTCATTTGTATGAGCGTGATTGTTCAGTGCAACGTCGTC
ATCAAAAGGTTGTTGAAATTGCTCCAGCGCCAGCTCTCCCAGAAGGTGTTCGTGAGAAAATTTTGGCAGACGCTCTTCGACTTGC
AAGACATGTTGGATACCAAAATGCTGGTACAGTCGAATTCCTGGTTGATCAGAAGGGCAACTACTATTTCATCGAAGTGAATGCA
CGTCTTCAAGTCGAGCATACAGTAACTGAAGAGATCACTGGTGTGGATCTTGTCCAAGCTCAAATTCGTATCGCCGAAGGAAAAT
CTCTGGATGATCTGAAGCTTTCACAGGAAACTATTCAAACTACTGGCTCAGCTATTCAATGTCGTGTCACAACTGAAGATCCAGC
TAAAGGATTCCAGCCAGATTCCGGAAGAATTGAAGTTTTCCGATCCGGAGAGGGAATGGGAATTCGTCTTGATTCAGCATCTGCC
TTCGCAGGATCAGTCATTTCACCTCACTACGATTCTTTGATGGTCAAAGTAATTGCATCGGCTAGAAATCATCCGAACGCTGCCG
CAAAAATGATTCGTGCCCTCAAAAAGTTCCGTATCCGAGGCGTAAAGACAAACATCCCATTTCTGCTCAACGTTCTTCGCCAGCC
CAGCTTCCTTGATGCATCCGTCGATACGTATTTCATTGATGAGCATCCAGAGTTGTTCCAATTCAAACCAAGCCAAAACCGTGCT
CAAAAGTTGTTGAACTATTTGGGAGAAGTTAAGGTGAACGGTCCAACTACTCCTCTTGCTACTGACCTGAAACCAGCAGTTGTTT
3. Align the following sequences (single sequence alignment):
a. Sequence 1
CAGGGGCAGTGCGGGAGGTGCTGGGCTTTCTCAACAGCTGAGGTGATTTCCGATCGAACATGTATTGCAAGCAATGGTACCCAACAACCAATCATCTCCCCAACTGATCTGCTCACTTGTTGTGGAATGTCATGCGGAGAGGGCTGTAACGGCGGCTAT
b. Sequence 2
ATGTCTACTATTCCATCAGAAATAATCAATTGGACAATTTTGAACGAGATAATTTCCATGGATGACGATG
ACAGTGATTTTAGCAAAGGATTAATCATTCAATTCATAGATCAGGCTCAAACCACGTTTGCACAAATGCA
GAGACAACTAGACGGCGAAAAGAATCTTACTGAACTGGATAACTTGGGGCATTTCTTAAAAGGATCGTCT
GCCGCGCTCGGCTTGCAAAGGATCGCTTGGGTTTGTGAGCGTATTCAGAATTTAGGGAGAAAGATGGAAC
ACTTTTTCCCTAACAAAACAGAACTAGTAAATACCCTTTCAGATAAGTCCATTATAAATGGAATCAACAT
TGACGAGGATGATGAAGAAATAAAAATTCAAGTCGACGATAAAGATGAAAATAGTATCTATTTGATTTTA
ATAGCAAAGGCCCTGAACCAATCTCGATTGGAGTTTAAATTAGCTAGAATTGAACTATCAAAGTACTATA
ATACTAATCTT
c. Sequence 3
ATTCTTTTGAGTCGGGAGAACTAGGTAACAATTCGGAAACTCCAAAGGGTGGATGAGGGGCGCGCGGGGTGTGTGTGGGGGATACTCTGGTCCCCCGTGCAGTGACCTCTAAGTCAGAGGCTGGCACACACACACCTTCCATTTTTTCCCAACCGCAGGATGGCGCCTCATCCCTTGGATGCGCTCACCATCCAAGTGTCCCCAGAGACACAACAACCTTTTCCCGGAGCCTCGGACCACGAAGTGCTCAGTTCCAATTCCACCCCACCTAGCCCCACTCTCATACCTAGGGACTGCTCCGAAGCAGAAGTGGGTGACTGCCGAGGGACCTCGAGGAAGCTCCGCGCCCGACGCGGAGGGCGCAACAGGCCCAAGAGCGAGTTGGCACTCAGCAAACAGCGAAGAAGCCGGCGCAAGAAGGCCAATGATCGGGAGCGCAATCGCATGCACAACCTCAACTCGGCGCTGGATGCGCTGCGCGGTGTCCTGCCCACCTTCCCGGATGACGCCAAACTTACAAAGATCGAGACCCTGCGCTTCGCCCACAACTACATCTGGGCACTGACTCAGACGCTGCGCATAGCGGACCACAGCTTCTATGGCCCGGAGCCCCCTGTGCCCTGTGGAGAGCTGGGGAGCCCCGGAGGTGGCTCCAACGGGGACTGGGGCTCTATCTACTCCCCAGTCTCCCAAGCGGGTAACCTGAGCCCCACGGCCTCATTGGAGGAATTCCCTGGCCTGCAGGTGCCCAGCTCCCCATCCTATCTGCTCCCGGGAGCACTGGTGTTCTCAGACTTCTTGTGAAGAGACCTGTCTGGCTCTGGGTGGTGGGTGCTAGTGGAAAGGGAGGGGACCACAGCC
4. Translate the following nucleotide sequences in 3 reading frames:
a. Sequence 1
CACGGTGATCAAAGTGAGAATGAGCTCCCAGGATTGGGGGGCAAGGAAGATAGGAGGGTCAAACAGAGTCGGGGAGAAGCCAGGGA
GAGCTACAGAGAAACCGGGTCCAGCAGAGCAAGTGATGAGAGAGCTGCCCATCTTCCAACCAGCACACCCCTAGACATTGACACTG
CATCGGAGTCAGGCCAAGATCCGCAGGACAGTCGAAGGTCAGCTGACGCCCTGCTCAGGCTGCAAGCCATGGCAGGAATCTTGGAG
GAACAAGGCTCAGACACGGACACCCCTAGGGTG
b. Sequence 2
CAGGGGCAGTGCGGGAGGTGCTGGGCTTTCTCAACAGCTGAGGTGATTTCCGATCGAACATGTATTGCAA
GCAATGGTACCCAACAACCAATCATCTCCCCAACTGATCTGCTCACTTGTTGTGGAATGTCATGCGGAGA
GGGCTGTAACGGCGGCTAT
c. Sequence 3
CCCTGTGGAGCCACACCCTAGGGTTGGCCAATCTACTCCCAGGAGCAGGGAGGGCAGGAGCCAGGGCTGGGCATAAAAGTCAGGGC
AGAGCCATCTATTGCTTACATTTGCTTCTGACACAACTGTGTTCACTAGCAACCTCAAACAGACACCATGGTGCACCTGACTCCTG
AGGAGAAGTCTGCCGTTACTGCCCTGTGGGGCAAGGTGAACGTGGATGAAGTTGGTGGTGAGGCCCTGGGCAGGTTGGTATCAAGG
TTACAAGACAGGTTTAAGGAGACCAATAGAAACTGGGCATGTGGAGACAGAGAAGACTCTTGGGTTTCTGATAGGCACTGACTCTC
TCTGCCTATTGGTCTATTTTCCCACCCTTAGGCTGCTGGTGATCTACCCTTGGACCCAGAGGTTCTTTGAGTCCTTTGGGGATCTG
TCCACTCCTGATGCTGTTATGGG
5. Find three article/journal citations for each the following topics:
a. Nanotechnology in medicine and the biological sciences.
b. Human Immunodeficiency Virus (HIV).
6. Does the restriction enzyme XhoI cut this sequence, and if so how many times does it cut, where does it cut (give approx. base pair region), and does the cut leave a blunt ends or a staggered ends?
ATTCTTTTGAGTCGGGAGAACTAGGTAACAATTCGGAAACTCCAAAGGGTGGATGAGGGGCGCGCGGGGTGTGTGTGGGGGATACTCTGGTCCCCCGTGCAGTGACCTCTAAGTCAGAGGCTGGCACACACACACCTTCCATTTTTTCCCAACCGCAGGATGGCGCCTCATCCCTTGGATGCGCTCACCATCCAAGTGTCCCCAGAGACACAACAACCTTTTCCCGGAGCCTCGGACCACGAAGTGCTCAGTTCCAATTCCACCCCACCTAGCCCCACTCTCATACCTAGGGACTGCTCCGAAGCAGAAGTGGGTGACTGCCGAGGGACCTCGAGGAAGCTCCGCGCCCGACGCGGAGGGCGCAACAGGCCCAAGAGCGAGTTGGCACTCAGCAAACAGCGAAGAAGCCGGCGCAAGAAGGCCAATGATCGGGAGCGCAATCGCATGCACAACCTCAACTCGGCGCTGGATGCGCTGCGCGGTGTCCTGCCCACCTTCCCGGATGACGCCAAACTTACAAAGATCGAGACCCTGCGCTTCGCCCACAACTACATCTGGGCACTGACTCAGACGCTGCGCATAGCGGACCACAGCTTCTATGGCCCGGAGCCCCCTGTGCCCTGTGGAGAGCTGGGGAGCCCCGGAGGTGGCTCCAACGGGGACTGGGGCTCTATCTACTCCCCAGTCTCCCAAGCGGGTAACCTGAGCCCCACGGCCTCATTGGAGGAATTCCCTGGCCTGCAGGTGCCCAGCTCCCCATCCTATCTGCTCCCGGGAGCACTGGTGTTCTCAGACTTCTTGTGAAGAGACCTGTCTGGCTCTGGGTGGTGGGTGCTAGTGGAAAGGGAGGGGACCACAGCC
7. Determine the restriction digest map of the following sequence with restriction enzymes that recognize six base pair sequences.
>gi|7524758|ref|NC_001776.1|| Oryza sativa mitochondrial
plasmid B2, complete sequence
CCGAGGTACCAGAGGATAGCCTTTAGGATATCCGTCGGTAGGAGATATCGAGTCGTTAGAGCACATATCTCCGTGATCCCTTCCACAGCTAAACGTAGATCTCTATCTTGTAGTGGAGTGTAGAGTAAAGATATACCGGAGTGGAGTGCAAGAAGGTTATAGGGGAGTGTGGCGCCCGTTGCCCGCCTTTCGTCTCGGTTGAAAAACAAAACTGTTGCTTGTTTTGTTCATTCAAGTAGTGGTTTAGCCCTCGAATCTATAGCGAAAGAGTTAGCTCAACCTATCTACTACTACCCTTGGAAAGGGGGTGTGTGTGGTATACCCGTTACGACTCCCCTCGAACTGAGCCATTTTCATGATTTTACAATCAAAGAAAGTTCCATACTTTTTTATTCCCGGTATGGAGCCAACGTCGCTTCGCTTTCGCAGGACAATTTCCTTCTGGAAAAATATCCAGTGGCATATCCCATCTAGCTCTGGGAATCTTTCCACAACCAGCGCCTTCTTGTCCGATGAAGTTCAAGTCTCTAACCTGTTCTTGCCCGGGAATGTCCCTCCTTCGAACTCGAAGTCAGAGTTTTCCGGTTATCCGATGGGCGATTTATGATAGTTCCGATGCACTTCTCATGCTAGCCTATCGACAAGATCAGGGTTCAACTACTGTTGCCTCAACGGGTTCGCTGGTCACTTTAACAGTTTTTTAAGCAGGCATTTTCGTGGTTCTCGTCATATAAACCAGTCAGGTACTAGCGTGATTTCATGCATTAATCTATTGCTCGAAGGAACCTTTCGTCAGAAAACTTATTCAAAAGGTACCCGGTGTTCTAGCGCATTTCTGTTAAGAGAAGTACCCCTTGTAGGCCTGAGATGCCCTCTATCGAGCGATATAACGACGGTTGCTATTTTTAGGTACACGTGTTTTTTGAAGAAAGCACTTTTAAAAAGGAAAGTAGTAAAATGAAAATACTCAATTTAATAAATCCTAAATTATCTGTGGCATTGAATTATGAAAGTAAACACTGAAAGTTTACCACTTACAAAAGTAAATGTACTACCCGACTAAAAGGAGGAAATCCAATTAGAGGTGTAAACTCTTTTCTTTTCTATCGAGTAGATTCCCTTTTATTTATAAATAGTAAACTAACGAATAGATAATTAGTATTCCCTCATATAATTAGTACTAGCTGTTCCTTCATTAATGGTACGTGTATTCACGAGGGCTCTGCTTCGCTCTATTCCACCTCCCGGGTGTATGGAAAAATCTTTAAAAGAAATAAATCATGCGGTACTTCTTGTTTTAATCACTCTTCCTTTCCTCCCGGAACTGGGGAAGAACCCTTCTGAAAAGGAATGGAGGTCTATCTTTACTCGGAGTGCAACGCAGAGATAGAGATCGAAGAGCCGCTGTGCTTGGGGACGAGGGGACTCACTCAACCTCAAAGTTGAGGGAATCCCGAAGCCTCGGAATCTAGCGTTAGCGTAAGG
8. Convert the following protein sequences back to their
nucleotide sequences using the correct codon usage table. For example, use the
Homo sapien codon usage table if the protein sequence corresponds to a human
gene product.
>gi|126001|sp|P00709|LCA_HUMAN ALPHA-LACTALBUMIN
PRECURSOR (LACTOSE SYNTHASE B PROTEIN)
MRFFVPLFLVGILFPAILAKQFTKCELSQLLKDIDGYGGIALPELICTMFHTSGYDTQAIVENNESTEYGLFQISNKLWCKSSQVPQSRNICDISCDKFLDDDITDDIMCAKKILDIKGIDYWLAHKALCTEKLEQWLCEKL
>gi|2119314|pir||I61690 myosin - human (fragment)
CVIISGESGAGKTVAAKYIMSYISRVSGGGTKVQHVKDIILQSNPLLEAFGNAKTVRNNNSSRFGKYFEIQFSPGGEPDGGKISNFLLEK
>gi|746398|gb|AAA97505.1| ampicillin-binding protein (E.
coli)
MRFSRFIIGLTSCIAFSVQAANVDEYITQLPAGANLALMVQKVGASAPAIDYHSQQMALPASTQKVITALAALIQLGPDFRFTTTLETKGNVENGVLKGDLVARFGADPTLKRQDIRNMVATLKKSGVNQIDGNVLIDTSIFASHDKAPGWPWNDMTQCFSAPPAAAIVDRNCFSVSLYSAPKPGDMAFIRVASYYPVTMFSQVRTLPRGSAEAQYCELDVVPGDLNRFTLTGCLPQRSEPLPLAFAVQDGASYAGAILKDELKQAGITWSGTLLRQTQVNEPGTVVASKQSAPLHDLLKIMLKKSDNMIADTVFRMIGHARFNVPGTWRAGSDAVRQILRQQAGVDIGNTIIADGSGLSRHNLIAPATMMQVLQYIAQHDNELNFISMLPLAGYDGSLQYRAGLHQAGVDGKVSAKTGSLQGVYNLAGFITTASGQRMAFVQYLSGYAVEPADQRNRRIPLVRFESRLYKDIYQNN
>gi|476966|pir||A47398 serotonin transporter - human
METTPLNSQKQLSACEDGEDCQENGVLQKVVPTPGDKVESGQISNGYSAVPSPGAGDDTRHSIPATTTTLVAELHQGERETWGKKVDFLLSVIGYAVDLGNVWRFPYICYQNGGGAFLLPYTIMAIFGGIPLFYMELALGQYHRNGCISIWRKICPIFKGIGYAICIIAFYIASYYNTIMAWALYYLISSFTDQLPWTSCKNSWNTGNCTNYFSEDNITWTLHSTSPAEEFYTRHVLQIHRSKGLQDLGGISWQLALCIMLIFTVIYFSIWKGVKTSGKVVWVTATFPYIILSVLLVRGATLPGAWRGVLFYLKPNWQKLLETGVWIDAAAQIFFSLGPGFGVLLAFASYNKFNNNCYQDALVTSVVNCMTSFVSGFVIFTVLGYMAEMRNEDVSEVAKDAGPSLLFITYAEAIANMPASTFFAIIFFLMLITLGLDSTFAGLEGVITAVLDEFPHVWAKRRERFVLAVVITCFFGSLVTLTFGGAYVVKLLEEYATGPAVLTVALIEAVAVSWFYGITQFCRDVKEMLGFSPGWFWRICWVAISPLFLLFIICSFLMSPPQLRLFQYNYPYWSIILGYCIGTSSFICIPTYIAYRLIITPGTFKERIIKSITPETPTEIPCGDIRLNAV
9. Determine all possible properties of the 4 protein sequences from Problem #8. Molecular weight, isoelectric point, extinction coefficients, titration curve, etc.
This site is funded by the National Science Foundation .