Janelle N. Ruiz Assignment 8

From OpenWetWare

Working with Protein Sequences In-class Activity

Task 1

Chapter 2: Retrieving Protein Sequences/Retrieving a list of related protein sequences (pp. 42-51 in the second edition). The example worked through in the book uses the sequence of an enzyme called dUTPase. Follow the book example yourself and then work through the example again, this time using the HIV gp120 envelope protein instead.

  • We searched for “HIV gp120 envelope protein”, excluding the TrEMBL entries (unreviewed, computer-generated translations), and obtained four pages of results. We selected P04578 (ENV_HV1H2), Human immunodeficiency virus type 1 (isolate HXB2 group M subtype B) (HIV-1), and viewed its amino acid sequence in FASTA format:
  • >sp|P04578|ENV_HV1H2 Envelope glycoprotein gp160 OS=Human immunodeficiency virus type 1 (isolate HXB2 group M subtype B) GN=env PE=1 SV=2
  • MRVKEKYQHLWRWGWRWGTMLLGMLMICSATEKLWVTVYYGVPVWKEATTTLFCASDAKAYDTEVHNVWATHACVPTDPNPQEVVLVNVTENFNMWKNDMVEQMHEDIISLWDQSLKPCVKLTPLCVSLKCTDLKNDTNTNSSSGRMIMEKGEIKNCSFNISTSIRGKVQKEYAFFYKLDIIPIDNDTTSYKLTSCNTSVITQACPKVSFEPIPIHYCAPAGFAILKCNNKTFNGTGPCTNVSTVQCTHGIRPVVSTQLLLNGSLAEEEVVIRSVNFTDNAKTIIVQLNTSVEINCTRPNNNTRKRIRIQRGPGRAFVTIGKIGNMRQAHCNISRAKWNNTLKQIASKLREQFGNNKTIIFKQSSGGDPEIVTHSFNCGGEFFYCNSTQLFNSTWFNSTWSTEGSNNTEGSDTITLPCRIKQIINMWQKVGKAMYAPPISGQIRCSSNITGLLLTRDGGNSNNESEIFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTKAKRRVVQREKRAVGIGALFLGFLGAAGSTMGAASMTLTVQARQLLSGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQQLLGIWGCSGKLICTTAVPWNASWSNKSLEQIWNHTTWMEWDREINNYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWFNITNWLWYIKLFIMIVGGLVGLRIVFAVLSIVNRVRQGYSPLSFQTHLPTPRGPDRPEGIEEEGGERDRDRSIRLVNGSLALIWDDLRSLCLFSYHRLRDLLLIVTRIVELLGRRGWEALKYWWNLLQYWSQELKNSAVSLLNATAIAVAEGTDRVIEVVQGACRAIRHIPRRIRQGLERILL
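The FASTA header above packs several fields into one line: the database (`sp` for Swiss-Prot), the accession number, the entry name, the protein name, and `KEY=value` tags such as `OS` (organism) and `GN` (gene name). As a rough sketch of how that line can be taken apart, the Python below splits it into labelled parts. The function name `parse_uniprot_header` is our own invention for illustration, and the regular expression assumes that every tag is exactly two capital letters followed by `=`, which holds for this header but is only an assumption in general.

```python
import re


def parse_uniprot_header(header: str) -> dict:
    """Split a UniProt-style FASTA header into labelled parts (illustrative helper)."""
    line = header.lstrip(">").strip()
    # The first two '|' characters separate database, accession, and the rest.
    db, accession, rest = line.split("|", 2)
    entry_name, _, description = rest.partition(" ")
    # The free-text protein name runs up to the first KEY=value tag.
    first_tag = re.search(r"\s[A-Z]{2}=", description)
    protein_name = description[: first_tag.start()] if first_tag else description
    # Each tag's value runs until the next tag or the end of the line.
    tags = dict(re.findall(r"([A-Z]{2})=(.*?)(?=\s[A-Z]{2}=|$)", description))
    return {
        "db": db,
        "accession": accession,
        "entry_name": entry_name,
        "protein_name": protein_name,
        **tags,
    }


header = (
    ">sp|P04578|ENV_HV1H2 Envelope glycoprotein gp160 "
    "OS=Human immunodeficiency virus type 1 (isolate HXB2 group M subtype B) "
    "GN=env PE=1 SV=2"
)
info = parse_uniprot_header(header)
for key, value in info.items():
    print(f"{key}: {value}")
```

Note that the protein name in the header is gp160, the precursor protein; gp120 is one of the products cut from it, which is why a search for gp120 leads to this entry.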

Task 2

Chapter 4: Reading a SWISS-PROT entry (pp. 110-123 in the second edition). The example worked through in the book is the epidermal growth factor receptor. Work through this example and then do it again with the HIV gp120 envelope protein instead.
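When a SWISS-PROT entry is viewed in its plain-text (flat-file) form, each line begins with a two-letter line code (for example `ID`, `AC`, `DE`, `FT`, `SQ`), the content starts in column 6, and the entry ends with `//`. The sketch below groups the lines of an entry by that code. The record in `TOY_ENTRY` is made up and heavily simplified purely for illustration; it is not the real P04578 entry, and `group_by_line_code` is a hypothetical helper name.

```python
from collections import defaultdict

# A made-up, simplified record used only to show the line layout.
TOY_ENTRY = """\
ID   TOY_EXAMPLE             Reviewed;          12 AA.
AC   X00000;
DE   RecName: Full=Hypothetical toy protein;
FT   TRANSMEM        3..8
SQ   SEQUENCE   12 AA;
     MKTAYIAKQR QI
//
"""


def group_by_line_code(entry_text: str) -> dict:
    """Collect the content of each line under its two-letter line code."""
    sections = defaultdict(list)
    for line in entry_text.splitlines():
        if not line.strip():
            continue  # ignore blank lines
        if line.startswith("//"):
            break  # '//' marks the end of the entry
        # Sequence data lines start with blanks instead of a code.
        code = line[:2].strip() or "seq"
        sections[code].append(line[5:].rstrip())
    return dict(sections)


sections = group_by_line_code(TOY_ENTRY)
for code, lines in sections.items():
    print(code, lines)
```

Grouping by line code makes it easier to jump straight to the parts of the entry the task asks about, such as the description (`DE`) and the feature table (`FT`).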

Task 3

Chapter 5: ORFing your DNA sequence (pp. 146-147 in the second edition). In the previous section of the course, we worked with DNA sequences encoding the HIV gp120 envelope protein. Take one of your DNA sequences and follow the instructions to find the open reading frames in it. Because you were working with only a portion of the sequence for the envelope protein, you may get some strange results. Compare your results with the SWISS-PROT entry you found for the protein above to decipher what the output means. Besides the NCBI Open Reading Frame Finder described in the book, ExPASy also has a translation tool you can use.
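To make the idea of reading frames concrete, here is a minimal sketch that translates a DNA sequence in all six frames (three on each strand) using the standard genetic code and reports the stretches between stop codons. It is not how the NCBI or ExPASy tools are implemented. One deliberate assumption: it does not require a start codon, because a fragment taken from the middle of a gene will not contain the original ATG. The sequence `demo_dna` is a short made-up example, not one of the course sequences.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(dna: str) -> str:
    """Translate complete codons; unknown codons become 'X', stops become '*'."""
    return "".join(
        CODON_TABLE.get(dna[i : i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


def open_stretches(dna: str, min_len: int = 10):
    """Yield (strand, frame, peptide) for stop-free stretches in all six frames."""
    dna = dna.upper()
    for strand, seq in (("+", dna), ("-", reverse_complement(dna))):
        for frame in range(3):
            for piece in translate(seq[frame:]).split("*"):
                if len(piece) >= min_len:
                    yield strand, frame + 1, piece


demo_dna = "ATGAAATTTGGGTAACCC"  # made-up sequence for illustration
results = list(open_stretches(demo_dna, min_len=3))
for strand, frame, peptide in results:
    print(strand, frame, peptide)
```

With a real gene fragment, the frame whose translation matches part of the SWISS-PROT protein sequence is the correct one; the other frames usually break up into short pieces interrupted by stop codons.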

Task 4

Chapter 6: Working with a single protein sequence (pp. 159-195 in the second edition). Work through the following examples in this chapter using the entire HIV gp120 envelope protein sequence that you obtained from SWISS-PROT. We will then compare the results of these analyses with the actual structure of the gp120 protein obtained by X-ray crystallography.

    • ProtParam
    • Looking for transmembrane segments
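As a rough illustration of what these two analyses look at, the sketch below computes the amino acid composition of a sequence (one of the statistics ProtParam reports) and a sliding-window hydropathy profile using the Kyte-Doolittle scale. This is a simple stand-in for the web tools used in the chapter, not their actual algorithms. The window of 19 residues and the cutoff of 1.6 follow the commonly quoted Kyte-Doolittle suggestion for membrane-spanning segments; treat both as adjustable assumptions. The `fragment` is a short piece copied from the envelope sequence retrieved in Task 1.

```python
from collections import Counter

# Kyte-Doolittle hydropathy values (positive = hydrophobic).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def composition(seq: str) -> dict:
    """Fraction of the sequence made up by each amino acid."""
    counts = Counter(seq)
    return {aa: counts[aa] / len(seq) for aa in sorted(counts)}


def hydropathy_profile(seq: str, window: int = 19) -> list:
    """Average hydropathy of every window of the given length."""
    scores = [KD[aa] for aa in seq]
    return [
        sum(scores[i : i + window]) / window
        for i in range(len(seq) - window + 1)
    ]


def candidate_tm_windows(seq: str, window: int = 19, threshold: float = 1.6) -> list:
    """1-based start positions of windows whose average exceeds the threshold."""
    profile = hydropathy_profile(seq, window)
    return [i + 1 for i, avg in enumerate(profile) if avg > threshold]


# Piece of the envelope sequence from Task 1 (a hydrophobic stretch and its flank).
fragment = "LFIMIVGGLVGLRIVFAVLSIVNRVRQGYSPLSFQTHLPTPRGPDRPEG"
print(composition(fragment))
print(candidate_tm_windows(fragment))
```

Windows that score above the cutoff are only candidates for transmembrane segments; comparing them with the feature annotations in the SWISS-PROT entry is a reasonable way to check whether the prediction makes sense.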