BIOL368/F14:Chloe Jones Week 9

Defining Your HIV Structure Research Project

Project Partners: Isabel Gonzaga and Nicole Anguiano Chose to analyze Isabel's question into further detail, information below is taken from Isabel Gonzaga Week 8

Question

How does HIV status (diagnosed, progressing or non-trending) affect the structure of the V3 protein region?

Hypothesis

We hypothesize that diagnosed groups will express greater variability in the V3 region in their protein structure, in comparison to the non-trending groups. Initial comparisons show that diagnosed groups and progressing groups expressed greater genetic variability than non-trending groups. These changes may affect the third variable region, affecting the host's ability to adapt to the changes and generate sufficient immune response.

Subject Data

According to the BEDROCK HIV Sequence Data Table, I was able to determine which of the subjects used within my study actually developed aids. All 3 AIDS diagnosed were confirmed with the disease by their final visit. In the AIDS progressing groups, subjects developed AIDS within 1 year after their final visit. The Non-Trending groups all maintained high CD4 T Cell Counts above the threshold, even after the study was conducted. Sequences were for each visit and subject were chosen using a Random Integer Generator, to eliminate selection bias.

The following sequences was taken from the BEDROCK HIV Problem Space Database, from the Markham et al. (1998) study.
Table 1: Sequences analyzed

Group	Subject	Visit	Sequences
AIDS Diagnosed	3 10 15	1 6 1 6 1 4	1, 2, 4 3, 4, 5 3, 6, 7 2, 4, 8 2, 3, 4 5, 8, 10
AIDS Progressing	7 8 14	1 5 1 7 1 9	2, 3, 9 2, 8, 9 1, 4, 5 1, 6, 7 2, 3, 4 9, 10, 11
No Trend	5 6 13	1 5 1 9 1 5	1, 3, 8 4, 5, 2 1, 2, 3 6, 7, 9 1, 3, 4 3, 5, 4

Results and Methods

For this group project we were each assigned a group with three subjects and three clones for each subject to be analyzed. Nicole Anguiano is working with the AIDS Diagnosed patients, Isabel Gonzaga is working with the AIDS Progressing patients, and on this page I will be working with the patients that have no trend, referred to as "No Trend."
Convert DNA sequences into protein sequences:
- To convert the DNA sequences into Protein sequences, we could obtain the DNA sequences from BEDROCK HIV Problem Space Database, and then take the DNA sequences and code the nucleic acids into amino acids using the triplet code. The three nucleotide codon would encode one amino acid and not be time effective. Therefore, to do the job more efficiently we obtained the protein sequences from BEDROCK HIV Problem Space.

Multiple Sequence Alignment

Using Biology Workbench, I imputed the Amino Acid sequences in FASTA format into the program. To create multiple sequences alignment for proteins the “Protein tools” was selected. With the appropriate clones selected for Subjects 5, 6, and 13 I ran a ClustalW (multiple sequence alignment). I then ran it on all the subjects and the appropriate clones for the first visit, and the last visit. I followed the same procedure for the DNA sequences, however the “Nucleic Acid tool” was selected and DNA sequences were entered in FASTA format.

Consensus Key
- *-single, fully conserved residue

Subject 5

Protein

**Figure 2.Phylogenetic tree of amino acid sequences for Subject 5 at the first and final visit.**

DNA

**Figure 3.Multiple Sequence alignment comparing nucleic acid sequences for Subject 5 at the first and final visit.**

**Figure 4.Phylogenetic tree of nucleic acid sequences for Subject 5 at the first and final visit**

Subject 6

Protein

**Figure 6.Phylogenetic tree of amino acid sequences for Subject 6 at the first and final visit.**

DNA

**Figure 7.Multiple Sequence alignment comparing nucleic acid sequences for Subject 6 at the first and final visit.**

**Figure 8.Phylogenetic tree of nucleic acid sequences for Subject 6 at the first and final visit.**

Subject 13

Protein

**Figure 10.Phylogenetic tree of amino acid sequences for Subject 13 at the first and final visit.**

DNA

**Figure 11.Multiple Sequence alignment comparing nucleic acid sequences for Subject 13 at the first and final visit.**

**Figure 12.Phylogenetic tree of nucleic acid sequences for Subject 13 at the first and final visit.**

Table 1:Showing the percent difference in the Protein Sequence and DNA Sequence for Subject 5,6,13 for the first visit, final visit, and all the visits.

Visit	Type	Number of Differences	Percentage Different
1	Amino Acids	36	36/95 = 37.9%
1	DNA	67	67/285 = 23.5%
Final	Amino Acids	38	38/95 = 40%
Final	DNA	67	67/285 = 23.5%
Both	Amino Acids	43	43/95 = 45.3%
Both	DNA	79	79/ 285=27.7%

The percent difference remained relatively the same throughout the visits in regards to amino acid and DNA sequences. Indicative of a Non-progressor or Non-trending.

Locate V3 Region

After reading the Huang et al. paper and giving the presentation I had some insight as the V3 region would look like on the gp120 protein. After downloading Huang et al. (2005) Structure 2B4C into StarBiochem and adjusting the size of atoms and secondary structure, I was able to obtain a clear picture of the V3 region on the protein.
In StarBiochem, I selected import, then typed in the protein sequence 2B4C which is indicative of the Huang et al. protein presented in his paper
The V3 region protrudes from the protein and can be identified by its loop and its adjacent beta sheets.
In Figure. 20, the V3 region is highlighted with Star Biochem database tools. With the tools it has the ability to enlarge particular areas of interest, which in this case was the V3 region residues 296-331.

**Figure 19.Depicting the structure 2B4C Huang et al. (2005), using Star Biochem. Area of interest is the V3 region which is highlighted in the image**

**Figure 21.Depicting the structure 2B4C Huang et al. (2005), using Cn3D software site Area of interest is the V3 region which is highlighted in the image .**

To determine the N-terminus and C-terminus I again used Star Biochem and clicked on the quaternary tab which broke the protein into 4 chains: Pink (chain c), Salmon (chain H), Green (Chain L), and Yellow (Chain G)

Yellow (Chain G:84:G-492:G)
- N-terminus: [VAL]84:G
- C-terminus: [GLU]492;G
Pink (Chain C:1:C-175:C)
- N-terminus: [LYS]1:C
- C-terminus: [VAL]175;C
Green (Chain L:1:L- 214:L)
- N-terminus: [GLU]1:L
- C-terminus:[CYS]214:L
Salmon(Chain 2:H -216:H)
- N-terminus: [GLN]2:H
- C-terminus: [CYS]216:H

V3 Amino Acid Properties

Nonpolar: 12
Polar: 11
Positively charged: 6
Negatively charged: 2
Aromatic: 2

Secondary Structure Elements:

Beta Sheets: 22
Alpha Helices: 17
Random Coil: 84

Secondary Structure Prediction on the V3 Fragment

The gp120 protein from Huang et al. was ran on PSIpred in order to predict secondary structures. The V3 region sequence was identified at residues 293 through 326. This was compared to the PSIPred's for the Markham et al. sequences. The amino acid for the Markham et al. sequences were identified from residues 29-63. In each of the PSIPred runs, a corresponding beta sheet and alpha helix exists, as expected. This shows that the proper portion of the Markham et al. sequences were identified as the V3 regions.

In Figure. 20 the V3 region is depicted as having two antiparallel beta sheets, however the PsiPred results predict that their is an helix at the end which leads us to believe that their is some discrepancy. However, the confidence level for the helix prediction is low and denoted by a lower bar in comparison to tbe beta sheets.

BIOL368/F14:Chloe Jones Week 9

Contents

Defining Your HIV Structure Research Project

Question

Hypothesis

Subject Data

Results and Methods

Multiple Sequence Alignment

Locate V3 Region

Secondary Structure Prediction on the V3 Fragment

Presentation Link

Electronic Lab Notebook

Weekly Assignments

Class Journals

Navigation menu

Page actions

Page actions

Personal tools

Navigation

Search

research

Tools