J'aime C. Moehlman's Week 8: Difference between revisions

From OpenWetWare
Jump to navigationJump to search
(New page: ===Analyzing Protein Structure===)
 
 
(8 intermediate revisions by the same user not shown)
Line 1: Line 1:
===Analyzing Protein Structure===
===Analyzing Protein Structure===
*This week we are using the ''Bioinformatics for Dummies'' book in order to analyze the protein structure of gp120.
====Chapter 2: Retrieving Protein Sequences for the HIV gp120 Envelope Protein====
*Searched the protein knowledge base for the gp120 protein.
*Picked envelope glycoprotein 120 with the accession number of A0A897, its entry name is A0A897_9HIV1.
*We retrieved the FASTA sequence for the fragment:NLTDNAKTIIVHLNESVEINCTRPFNNTRTSXRIGPGQVFYRTGDITGSIRRAYCEINGT
KWNKVLXQVTEKLXEH
*We then conducted an advanced search to retrieve a protein sequence.
====Chapter 4:Reading a SWISS-PROT entry====
[[Image:Screenshotjaime.jpg|thumb|center|upright=4.0]]
*As seen in the screen shot above we know that the protein name is envelope glycoprotein 120
*The gene name is env
*We looked at the reference section and it shows that there is only one reference which doesn't imply major significance.
*The cross reference page link to entries within other databases:
**Screen shot from EMBL sequence database:
[[Image:Ch4screenshot.jpg|thumb|center|upright=3.5]]
*The keywords for the gp120 protein are; envelope protein and virion.
*In the features section we see non-terminal residues at positions 1 and 76.
====Chapter 5:ORFing your DNA sequence====
*Picked the DNA sequence from Subject 10 at visit 4 at clone 1.
*SCREENSHOT for ORF:
[[Image:Orfscreenshot.jpg|thumb|center|upright=3.5]]
*compared to the SWISS-PROT entry the orf sequence shows very small differences in the sequence.
====Chapter 6: Working with a Single Protein Sequence====
'''Predicting the main physico-chemical properties of a protein'''
*We used the expasy tool page in order to carry out a primary structure analysis.
*We used the accession number A0A987 from Swiss-Prot into the ProtParam.
*ProtParam generated the parameters of the entire gp120 sequence that we selected.
*The results include many things, such as; how many amino acids there are in the sequence, the molecular weight, the overall number of positively charged residues, and the total number of negatively charged residues.
'''Digesting a protein in a computer'''
*We pasted the gp120 seguence into the ExPasy website again in order to cut the protein.
'''Looking for transmembrane segmenting'''
*We used the accession number A0A987 again and put it into the ExPASy ProtScale site and conducted a full range analysis.
*The image was retrieved in GIF format.
'''Interpreting ProtScale results'''
*A piece of paper was used to help us locate the strongest peaks on the graph.
*We determined that there were four important transmembrane regions.
'''Running TMHMM'''
*We generated the TMHMM results by using a FASTA sequence of the gp120 protein.
'''Looking for PROSITE patterns'''
*Used the accession number A0A897 to determine which proteins we wanted to be scanned and then started the scan.
'''Finding domains with InterProScan'''
*We again used the same gp120 protein segment that we have been using throughout the entire assignment.
*We used the our class textbook to interpret the results that were generated by the InterProScan.
'''Finding domains with the CD server'''
*We used the same FASTA gp120 protein sequence.
*This image shows some of our results:
[[Image:SEQUENCEDATA2.JPG|thumb|center|upright=3.5]]
'''Finding domains with Motif Scan'''
*We pasted the FASTA sequence into the Input box.
*Selected PROSITE to conduct the search that we wanted.
*With the results, we saw different things: including a matches map, a list of matches, and a matches details section.
{{J'aime C. Moehlman}}

Latest revision as of 01:58, 14 March 2010

Analyzing Protein Structure

  • This week we are using the Bioinformatics for Dummies book in order to analyze the protein structure of gp120.

Chapter 2: Retrieving Protein Sequences for the HIV gp120 Envelope Protein

  • Searched the protein knowledge base for the gp120 protein.
  • Picked envelope glycoprotein 120 with the accession number of A0A897, its entry name is A0A897_9HIV1.
  • We retrieved the FASTA sequence for the fragment:NLTDNAKTIIVHLNESVEINCTRPFNNTRTSXRIGPGQVFYRTGDITGSIRRAYCEINGT

KWNKVLXQVTEKLXEH

  • We then conducted an advanced search to retrieve a protein sequence.

Chapter 4:Reading a SWISS-PROT entry

  • As seen in the screen shot above we know that the protein name is envelope glycoprotein 120
  • The gene name is env
  • We looked at the reference section and it shows that there is only one reference which doesn't imply major significance.
  • The cross reference page link to entries within other databases:
    • Screen shot from EMBL sequence database:
  • The keywords for the gp120 protein are; envelope protein and virion.
  • In the features section we see non-terminal residues at positions 1 and 76.

Chapter 5:ORFing your DNA sequence

  • Picked the DNA sequence from Subject 10 at visit 4 at clone 1.
  • SCREENSHOT for ORF:
  • compared to the SWISS-PROT entry the orf sequence shows very small differences in the sequence.

Chapter 6: Working with a Single Protein Sequence

Predicting the main physico-chemical properties of a protein

  • We used the expasy tool page in order to carry out a primary structure analysis.
  • We used the accession number A0A987 from Swiss-Prot into the ProtParam.
  • ProtParam generated the parameters of the entire gp120 sequence that we selected.
  • The results include many things, such as; how many amino acids there are in the sequence, the molecular weight, the overall number of positively charged residues, and the total number of negatively charged residues.

Digesting a protein in a computer

  • We pasted the gp120 seguence into the ExPasy website again in order to cut the protein.

Looking for transmembrane segmenting

  • We used the accession number A0A987 again and put it into the ExPASy ProtScale site and conducted a full range analysis.
  • The image was retrieved in GIF format.

Interpreting ProtScale results

  • A piece of paper was used to help us locate the strongest peaks on the graph.
  • We determined that there were four important transmembrane regions.

Running TMHMM

  • We generated the TMHMM results by using a FASTA sequence of the gp120 protein.


Looking for PROSITE patterns

  • Used the accession number A0A897 to determine which proteins we wanted to be scanned and then started the scan.

Finding domains with InterProScan

  • We again used the same gp120 protein segment that we have been using throughout the entire assignment.
  • We used the our class textbook to interpret the results that were generated by the InterProScan.

Finding domains with the CD server

  • We used the same FASTA gp120 protein sequence.
  • This image shows some of our results:


Finding domains with Motif Scan

  • We pasted the FASTA sequence into the Input box.
  • Selected PROSITE to conduct the search that we wanted.
  • With the results, we saw different things: including a matches map, a list of matches, and a matches details section.


My Pages

User: J'aime C. Moehlman

BIOL398-01/S10

Class Journals

Class Journal Week 2

Class Journal Week 3

Class Journal Week 4

Class Journal Week 5

Class Journal Week 6

Class Journal Week 7

Class Journal Week 8

Class Journal Week 9

Class Journal Week 11

Class Journal Week 12

Class Assignments

Week 2 Assignment

Week 3 Assignment

Week 4 Assignment

Week 5 Assignment

Week 6 Assignment

Week 7 Assignment

Week 8 Assignment

Week 9 Assignment

Week 11 Assignment

Week 12 Assignment

My Assignments

J'aime C. Moehlman's Week 2

J'aime C. Moehlman's Week 3

J'aime C. Moehlman's Week 4

J'aime C. Moehlman's Week 5

J'aime C. Moehlman's Week 6

J'aime C. Moehlman's Week 7

J'aime C. Moehlman's Week 8

J'aime C. Moehlman's Week 9

J'aime C. Moehlman's Week 11

J'aime C. Moehlman's Week 12

J'aime C. Moehlman's Week 13