IGEM:Imperial/2010/Modelling: Difference between revisions

From OpenWetWare
Jump to navigationJump to search
No edit summary
Line 281: Line 281:
#Immediate cell death slows down production of coloured output. Depending on the threshold concentration this can delay the detectable response by a few minutes.  
#Immediate cell death slows down production of coloured output. Depending on the threshold concentration this can delay the detectable response by a few minutes.  
#If Catechol is added before t=1000s, then cell death slows down the response considerably.
#If Catechol is added before t=1000s, then cell death slows down the response considerably.
===Constants for Modelling===
{| class="wikitable" style="text-align: center; width: 80%; height: 170px;" border="1"
|-
  ! Type of constant !! Derivation of value
|-
  | TEV Enzyme dynamics
  | Enzymatic Reaction:
E + S <-> ES -> E + P
Let
*k1 = rate constant for E + S -> ES
*k2 = rate constant for E + S <- ES
*kcat = rate constant for ES -> E + P
We know that Km = (kcat + k2)/k1
Assuming that kcat << k2 << k1, we can rewrite Km ~ k2/k1
From this [http://peds.oxfordjournals.org/cgi/reprint/14/12/993 paper] constants for TEV can be found:
*e.g. wildtype TEV
*Km = 0.061 +/- 0.010 mM
*kcat = 0.16 +/- 0.01 s^-1
These values correspond with our assumption that kcat ~ 0.1 s^-1 and Km ~ 0.01 mM.
Hence, we can estimate the following orders of magnitude for the rate constants:
*k1 ~ 10^8 M^-1 s^-1
*k2 ~ 10^3 s^-1
Using these values should be a good approximation for our model.
|-
  | Degradation rate
(common for all)
  | Assumption: To be approximated by cell division (dilution of media) as none of the proteins are involved in any active degradation pathways
Growth rate (divisions/h): [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC235685/pdf/jbacter00326-0019.pdf 0.53<G.r.<2.18]
Hence on average,  g.r.=1.5 divisions per hour
=> 1 division every 1/1.5=0.6667 of an hour (40mins)
To deduce degradation rate use the following formula:
τ_(1⁄2)=ln2/k
Where τ_(1⁄2)=0.667 hour, k – is the degradation rate
k=ln2/τ_(1⁄2) = 0.000289 s^(-1)
|-
  | Production rate
(TEV and dioxygenase)
  |
We had real trouble finding the production rate values in the literature and we hope to be able to perform experiments to obtain those values for (TEV protease and catechol 2,3-dioxygenase). The experiments will not be possible to be carried out soon, so for the time being we suggest very simplistic approach for estimating production rates.
We have found production rates for two arbitrary proteins in E.Coli. We want to get estimates of production rates by comparing the lengths of the proteins (number of amino-acids).
As this approach is very vague, it is important to realise its limitations and inconsistencies:
*Found values are taken from E.Coli not B.sub.
*The two found rates are of the same value for quite different amino-acid number which indicates that protein folding is limiting the production rates (we use the chosen approach as the only way of getting the estimate of order of reaction)
LacY production = [http://bionumbers.hms.harvard.edu/bionumber.aspx?s=y&id=100738&ver=0&hlid=29205 100 molecules/min]  ([http://www.uniprot.org/uniprot/P02920 417 Amino Acids])
LacZ production = [http://bionumbers.hms.harvard.edu/bionumber.aspx?s=y&id=100737&ver=0&hlid=29206 100 molecules/min] ([http://www.uniprot.org/uniprot/P00722 1024 AA])
Average production ≈ 100molecules/min 720 AA
That gives us:
*TEV production ≈ 24 molecules/min = 0.40 molecules/s ([http://www.uniprot.org/uniprot/P04517 3054 AA])
As production rate needs to be expressed in concentration units per unit volume, the above number is converted to mols/s and divided by the cell volume → 2.3808*10^(-10) mol*dm^(-3)*s^(-1)
*C23D production ≈ 252 molecules/min = 4.2 molecules/s ([http://www.uniprot.org/uniprot/P54721#section_x-ref 285 AA]) → 2.4998*10^(-9) mol*dm^(-3)*s^(-1)
We will treat these numbers as guiding us in terms of range of orders of magnitudes. We will try to run our models for variety of values and determine system’s limitations.
|-
|Kinetic parameters
of dioxygenase
|
Initial velocity of the enzymatic reaction was investigated at pH 7.5 and 30 °C.
*Wild type (we use that one)
Km = 10μM
kcat = 52 s^(−1)
*Mutated type
Km = 40μM
kcat = 192 s^(−1)
Consequently, the kcat/Km 4.8 of the mutant was slightly lower than kcat/Km 5.2 of the wild type, indicating that the mutation has little effect on catalytic efficiency.
[http://www.springerlink.com/content/e3718758m5052214/fulltext.pdf reference]
|-
|Dimensions of
Bacillus subtillis cell
|
Dimensions of Basillus subtilis (cylinder/rod shape) in reach media (ref. bionumbers):
# diameter: d= 0.87um
# length: l=4.7 um in rich media
This gives:  Volume= π∙(d/2)^2∙l=2.793999 μm^3≈ 2.79∙10^(-15) dm^3 
|-
| Split TEV
production rates
|*Assume the both parts of split TEV are half of size of the whole TEV (3054/2=1527 AA)
*The length of the coil is 90 AA
The whole construct is then: 1617 AA
→ split TEV production rate ≈ 1.2606*10^(-10) mol*dm^(-3)*s^(-1)
|-
| Relevant concentrations of catechol
| We have catechol in the lab in powder form so we're limited only by catechol's solubility.
For concentration of '''0.1 M''' with built up levels of dioxygenase the colour change happens within seconds!
We will run our models for 0.1M ± several orders of magnitude to determine the smallest catechol concentration that will give appreciable difference between simple production response and the amplified response.
|}

Revision as of 08:45, 1 September 2010

Objectives

Output Amplification Model

Protein Display Model

Notebook

Week 4 and 5

Output Amplification Model

Motivation: We have come up with a simple concept of amplification of output done by enzymes. Before the final constructs are assembled within the bacterial ogranism, it is beneficial for us to model the behaviour of our design.

The questions that need to be answered:

  1. How beneficial is the use of amplification? (compare speeds of response of transcription based output to amplified outputs)
  2. How many amplification steps are beneficial to have? (if too many amplification steps are involved, the associated time delay with expressing even amplfiied output may prove not to be beneficial.)
  3. Does mixing of amplfication levels have a negative influence on the output? Is it better to use TEV all the way or HIV1? Modelling should allows us to make a decision on which design is more efficient.

First attempt

A
At each stage of amplification a distinct protease is being used
A
At each stage of amplification a distinct protease is being used

A
TEV is used at both stages of amplification
A
TEV is used at both stages of amplification

Second attempt

A
Model improved to account for the enzymes (protease action)

Implementation in Matlab

The Matlab code for the different stages of amplification and diagrams can be found here.

Kinetic constants

Quality GFP TEV split TEV split GFP
Km and Kcat Doesn't apply TEV constants (Km and kcat) 40% of whole TEV Doesn't apply
half-life or degradation rate Half-life of GFP in Bacillus = 1.5 hours - ref. Chris ? ? Half-life shorter than GFP
production rate in B.sub ? ? ? ?

Conclusions

We couldn't obtain all the necessary constants. Hence, we decided to make educated guesses about possible relative values between the constants as well as varying them and observing the change in output.

As the result, we concluded that the amplification happens at each amplification level proposed. It's magnitude varies depending on the constants. There doesn’t seem to be much difference in substitution of TEV with HIV1.

Week 6

Output Amplification Model - Modified version

Michaelis Menten kinetics does not apply

We cannot use Michaelis-Menten kinetics because of its preliminary assumptions, which our system does not fulfil. These assumptions are:

  • Vmax is proportional to the overall concentration of the enzyme.

But we are producing enzyme, so Vmax will change! Therefore, the conservation E0 = E + ES does not hold for our system.

  • Substrate >> Enzyme.

Since we are producing both substrate and enzyme, we have roughly the same amount of substrate and enzyme.

  • Enzyme affinity to substrate has to be high.

Therefore, the model above is not representative of the enzymatic reaction. As we cannot use the Michaelis-Menten model we will have to solve from first principle (which just means writing down all of the biochemical equations and solving for these in Matlab).

Changes in the system

GFP is not used any more as an output. It is dioxygenase acting on the catechol (activating it into colourful form). Catechol will be added to bacteria, it won't be produced by them. Hence, basically in our models dioxynase is going to be treated as an output as this enzyme is recognised as the only activator of catechol in our system. This means that change of catechol into colourful form is dependent on dioxygenase concentration.

Models:

Model preA: Production of Dioxygenase

This model includes transcription and translation of the dioxygenase. It does not involve any amplification steps. It is our control model against which we will be comparing the results of other models.

Model A: Activation of Dioxygenase by TEV enzyme

The reaction can be rewritten as: TEV + split Dioxygenase <-> TEV-split Dioxygenase -> TEV + Dioxygenase. This is a simple enzymatic reaction, where TEV is the enzyme, Dioxygenase the product and split Dioxygenase the substrate. Choosing k1, k2, k3 as reaction constants, the reaction can be rewritten in these four sub-equations:

  1. [T'] = -k1[T][sD] + (k2+k3)[TsD] + sT - dT[T]
  2. [sD']= -k1[T][sD] + k2[TsD] + ssD - dsD[sD]
  3. [TsD'] = k1[T][sD] - (k2+k3)[TsD] - dTsD[TsD]
  4. [D'] = k3[TsD] - dD[D]

These four equations were implemented in Matlab, using a built-in function (ode45) which solves ordinary differential equations. The Matlab code for this module can be found here.

A
Results of the Matlab simulation, setting all constants to 1

Implementation in TinkerCell

  • Another approach to model the amplification module would be to implement it in a program such as TinkerCell (or CellDesigner). It would also be useful to check whether the Matlab model works.
A
LHS: Network implemented in TinkerCell, RHS: constants and results

Model B: Activation of Dioxygenase by TEV or activated split TEV enzyme

  1. T + sTa + sTb + sD <-> T-sD --> T + D + sTa + sTb
  2. T + D + sTa + sTb <-> T-sTa --> T + Tsa + D + sTb
  3. T + D + Tsa + sTb <-> T-sTb --> T + Tsb + Tsa + D
  4. T + Tsa + Tsb + D <-> Tsa-Tsb --> Ts + T + D
  5. T + sTa + sTb + sD -- T-sTa --> T + Tsa + sTb + sD
  6. T + Tsa + sTb + sD -- T-sD --> T + Tsa + sTb + D
  7. T + Tsa + sTb + D -- T-sTb --> T + Tsa + Tsb + D
  8. T + Tsa + Tsb + D -- Tsa-Tsb --> T + Ts + D
  9. T + sTa + sTb + sD -- T-sTb --> T + Tsb + sTa + sD
  10. T + Tsb + sTa + sD -- T-sD --> T + Tsb + sTa + D
  11. T + Tsb + sTa + D -- T-sTa --> T + Tsb + Tsa + D
  12. T + Tsb + Tsa + D -- Tsa-Tsb --> T + Ts + D
  13. T + sTa + sTb + sD -- T-sTa --> T + Tsa + sTb + sD
  14. Ts + T + sD -- Ts-sD --> Ts + T + D

This version includes the following features:

  • 2 amplification steps (TEV and split TEV)
  • Split TEV is specified to have a and b parts
  • TEVa is forbidden to interact TEVa (though in reality there could be some affinity between the two). Same for interaction between Tevb and Tevb
  • Both TEV and TEVs are allowed to activate dioxugenase molecule
  • Dioxugenase is assumed to be active as a monomer
  • Activate split TEV (TEVs) is not allowed to activate sTEVa or sTEVb (this kind of interaction is accounted for in the next model version)
  • There is no specific terms for time delays included

The MatLab code can be found here. Note that no final conclusions should be drawn before realistic estimates for kinetic constants are included. It wasn’t done so far.

A
All chemical species appearing in the model
A
Network of the improved model
A
Resulting graphs part 1. Compare the production graphs of TEV (transcribed and translated from scratch and the Dioxugenase which is the final species in the whole cascade
A
Resulting graphs part 2.

Model C: Further improvement

This model is not implemented yet.

This version adds the following features:

  • activated split TEV (TEVs) is allowed to activate not only sD but sTEVa and sTEVb
A
Network of the further improved model
A
Network of the further improved model (continued)

Week 7 and 8

Output Amplification Model - Results

The major concern of the results that we got (in particular for small concentrations < 10^(-4) mol*dm^(-3)) was that the solver was oscillating about positive or zero values but marking concentration values below zero too. It was recognised as faulty and probably leading the solver to false solutions.

Trying to prevent the ode solver going crazy, the following precuations were implemented:

  • function preventing solver from going to negative values (does it really work) - still some marginally negative values show
  • Scaling - all the values were scaled up by a factor of 10^6 as working on small numbers could be problematic for matlab (especially at the beggining of the cascade). Once the result is generated by the solver the resulting matrix is scaled back down by 10^6

Model pre-A

This is the result for the simulation of simple production of Dioxygenase. It can be seen that the concentration will tend towards a final value of around 8*10^-6 mol dm^(-3). This final value is dependent on the production rate (which we have just estimated!).

A
Results of the Matlab simulation of Model preA

Model A

When we entered production and degradation rates into our model, it did not seem to work properly (e.g. we got negative concentrations as our output). We found out that this is due to our set of differential equations being stiff. Since ode45 cannot solve stiff differential equations, we had to switch to using ode15s.

  • Initial Concentration

The initial concentration of split Dioxygenase, c0, determines whether the system is amplifying. The minimum concentration for any amplification to happen is 10^-5 mol dm^(-3). If the initial concentration of split Dioxygenase is higher, then the final concentration of Dioxygenase will be higher as well (see graphs below). Note that the obtained threshold value is above the maximum value that can be generated in the cell according to Model pre-A!!!

A
Comparison between Model pre-A and Model A. Initial concentration of split Dioxygenase: 10^-5 mol dm^(-3)
  • Changing Km:

Km is indirectly proportional to the "final concentration" (which is the concentration at the end of the simulation), i.e. the bigger the evalue of Km, the smaller the "final concentration" will be. Different Km values determine how quickly the amplification will take place.

(Also, it was found that the absolute value of k1 and k2 entered into Matlab does not change the outcome as long as the ratio between them (Km~k2/k1) is kept constant. This is important when simulating (in case entering very high values for k1 and k2 takes too long to simulate).

  • Changing kcat

Model predicts the concentration values for dioxygenase to raise quicker and to higher values for increasing values of kcat=k3. That indicates that k3 is actually the slowest step in enzymatic reaction and allows us to appreciate how our system is dependent on kinetic properties of enzyme.

  • Changing production rate

At the moment, our biggest source of error could be the production rate, which we weren't able to obtain from literature. So, we had to estimate (see below) the value of the production rate. We hope to be able to take rough measurements of that value in the lab as it has big effect on models' behaviour.

Sensitivity of Model A (20/08/2010)

We want to determine how our system reacts if different parameters are changed. This is to find out which parameters our system is very sensitive to.

Parameter Sensitivity
Initial concentration of split Dioxygenase Change of one order of magnitude in the initial concentration, c0, gives change of oen order of magnitude in the output concentration (range: 1>c0>10^-5). Loses sensitivity for extremely high or low values.
Km Change of one order of magnitude results in change of output concentration by one order of magnitude (0.01<Km<100). At values smaller than 0.01, the sensitivity is lost. For higher values than 100 the sensitivity is at least the same as the change of order of magnitude.
kcat kcat proportional to dioxygenase production (1-to-1 sensitivity for all values) for an initial concentratio of 0.01 mol/dm^3. For very high initial concentrations, the system is very sensitive to changes in kcat.
Production rate of TEV 1-1 sensitivity for most values. At some point the system’s response is limited by the initial concentration of sD, so for very high TEV production rates not much change is observed.
Production rate of split Dioxygenase Not much influence on 1-step amplification. However, the value seems to be crucial for simple production of Dioxygebase (1-1 order of magnitude sensitivity).
Degradation rates Sensitive within the relevant range. Not very sensitive for values smaller than 10^-6;. For high degradation rates (1>degradation rate>0.01): unexplainable behaviours.

Hence, the system is sensitive to most of the constants (given a particular range of values). The most crucial one, however, seems to be the initial concentration of split Dioxygenase.

Model B

  • Initial Concentration

The initial concentration of split Dioxygenase, c0, determines whether the system is amplifying. .

The behaviour in varying the initial concentration obeys similar relationship as the one of Model A: If the initial concentration of split Dioxygenase is higher, then the final concentration of Dioxygenase will be higher as well (see graphs below).

  • Model A vs. B

Having run both models with the same initial conditions (c0=10^-5 mol dm^(-3)). It has been noted that Model B does not generate very siginificant amplfication over the Model A. Hence, it would be more sensible to integrate a one step amplification module.

A
Comparison between Models pre-A, A and B

Colour response model (25/08/2010)

Initially, dioxygenase was being treated as "output" just because we decided to model what is different between the models in order to determine which one is better. We have found that 2 step amplification (Model B) added to a system presented little improvement over the 1 step amplification (Model A). This conclusion was worrying as action of dioxygenase on catechol to produce colour compound is enzymatic, hence a amplification step by itself. Adding it to all models meant that Model A becomes a 2 step amplifier and Model B becomes a 3 step amplifier. This was worrying having previously drawn the conclusion that 2 step amplifier is not much better than 1 step one. We decided to model that in order to check whether that deduction was true. If it was true that would mean that our construct is not innovative at all.

The important information about colour production is that the coloured compound kills the cells. It is not catechol killing them but the coloured compound.

It is suggested that product of Catechol destroys the cell membrane by inhibiting lipid peroxidation. It causes significant changes in the structure and functioning of membrane components (e.g. disruption of membrane potential, removal of lipids and proteins, loss of magnesium and calcium ions). These effects cause the loss of membrane functions, leading to cell death.

Since the product of Catechol acts on the cell membrane, it will not affect our enzymatic reaction immediately. In our simulation, we will try to model immediate cell death as well as neglect the effect that Catechol has on the cell and compare the two if there is a significant difference in results. If the differences will be appreicable we will try to model the slow cell death in more detail.

Initial conclusions(26/08/2010)

Despite our model not working entirely correctly (the simulator goes a bit funny), we deduced several points.

  • The images presented below show cathecol being added at 3 different points in time. Cross section refers to a point in time at which concentration of dioxygenase in amplified systems crosses to be above the concentration of the non-amplified system. From those graphs it is clear that amplification in output is visible only after the cross-section has been reached. Note that those simulations were run for 1M solutions of catechol (which is quite high) which allows to see the differences between various amplification models easily.
A
Concentration of coloured compound for catechol being added before the cross section is reached
A
Concentration of coloured compound for catechol being added when the cross section is reached
A
Concentration of coloured compound for catechol being added after the cross section has been reached
  • We noticed that amplification has point only for quite high catechol initial concentrations (>0.01M). For smaller concentrations of catechol the dioxygenase conetrations in different system do not seem to be decisive about the speed of response (no difference between all 3 models). That means that basically, in systems with small catechol concentration added the amplified systems end up being redundant (dioxygenase is overproduced) as concnetration of dioxygenase from simple production seems to high enough to convert catechol almost instantenously. Amplification models become only meaningful when they have a lot of substrate to act on (ie. high concentration of catechol). This leads us back to determination of colour compound concentration threshold for visibility. It will a crucial factor in deciding whether amplifiers designed by our team obtain the fast response or not.

Troubles

Once we have implmented the colour change into models, we have noticed that there is a problem with it. After adding catechol, some concentrations were jumping to negative values, which was worrying. We checked our equations and constants but could not find the mistake. Hence, we concluded that there is something wrong with the way that MatLab evaluates the equation or treats numbers. The trouble seemed to have origined from the very rapid concentration change of catechol which disrupts the whole system Prospective solutions to problem:

Implementation in TinkerCell (31/08/2010)

We hoped TinkerCell will have non-negative conditions imposed on its solutions and will do better job than raw solver manipulating the equations. Hence, we implemented the whole amplification model (including coloured output) in TinkerCell. However, we realized that TinkerCell does not deal very well with very high or low numbers (For example, values higher than 10^5 are not acceptable - this is important since our rate constants (k1) are usually bigger than 10^5. Also, the low degradation rates (10^-9) result in a zero output line). However, TinkerCell can still be used for testing that our Matlab programs behave the way we anticipated (by using default values of 1), as well as producing diagrams of our system.

Varying ODE solver options in MatLab (31/08/2010)

We had a close look at the ODE solver options in MatLab. We those that could have helped us. In the end we found that decreasing Relative and Aboslute tolerances (to values as small as 1E-15) was significantly improving the situation. However, this is not an ultimate solution as in simulations still negative numbers appear of order of 10^(-15). As no better solution than that was found, we dicded to accept that approach. We decided as well that we are just interested in first 100 to 150 seconds after adding catechol, while concentrations hit the negatve values way after that. The species that were concerned with hitting negative values were approaching to 0.

THe images below show the influence of the relative and absolute tolerance values on the model. Note it was important to allow ODE solver to adjust the time step automatically, as big time steps (1second) were generating false answers for the catechol model. Adjustign manually time steps to really small values was not efficient (the whole simulation does not require very high definition simluation). Function of automatic choice of step showed to be able to cope well with that issue.

A
Graphs representing conctrations of catechol, dioxygenase, catechol-dioxygenase complex and colur compound when adding catechol at t=2000s. Note the really negative values that concentration of catechol hits.
A
Here the Relative and the Absolute Tolerances are lowers to 1E-18 and 1E-12, respectively. From graphs it is not clear that any of the concentration parameters hits negative values. However, after having a look at raw data it is clear that catechol and catechol-dioxygenase complex get slightly negative values like -2E-120
Using SimBiology to model (31/08/2010)

We again hoped that SimBiology package could be more suited to mdelling our situation then raw ODE solvers, so we implemented our some models in it. This package offers interactive user interface similar to Tinker Cell, but uses MatLab to simulate. Initially, we have confirmed that our simple production model (Model PreA) and 1 step amplification model (Model A) implemented in visual manner in MatLab generated exactly the same results as our manually derived ODE equation based models. The interface allowed us to have clearer control over paramters. As well it allowed modelling special events, like adding catechol at certain point in time. Previously we had to split simulation into to parts.

Final Conclusions (31/08/2010)

The conclusions obtained after fixing the error of negative concentrations in the model, did not change our initial conclusions. However, they are stated here clearly:

  • Changing time when catechol is added:

If Catechol is added before t= 1000s, then the coloured output will reach its threshold value faster by simple production. If Catechol is added when t>1000s, then the coloured output will increase (marginally) faster through the amplification step in Model A. This does not seem to be appreciable difference between the two models in given time scale. These observations are true for intial concentration of dioxygenase equal to 1E-5 mol*dm^(-3). However, we noticed that if the initial concentration is raised to 1E-4 mol dm^(-3), then Model A can be advategous over Model preA within as little as 100 seconds.

  • Changing concentration of catechol added:

There seem to be 3 regions of catechol concentration that influence the system in different ways. Those regions are: c>1M, 1M>c>0.01M, 0.01M>c. The boundatries of these regions tend to vary depending on the choice of other initial conditions. The values given above apply to boundary conditions that currently are considered physiologically relevant. Varying initial concentration of catechol within the highest region doesn't result in any change in colour output response (probably all enzymes are occupied, the solution is over saturated with catechol). In the middle region the catechol ocncentration has influence on the amplficaytion. Amplification becomes less and less towards c=0.01M. When 3 region is entered, there is no difference in output production by the two models

  • Cell death

The coloured product of catechol kills cells by destroying the cell membrane. However, we don't know how quickly the cells will die. Therefore, we examined two different cases: immediate cell death and negligible cell death (i.e. cells death is negligible because it takes too long)

Running the simluation in Matlab (not Simbiology!), our conclusions are:

  1. Immediate cell death slows down production of coloured output. Depending on the threshold concentration this can delay the detectable response by a few minutes.
  2. If Catechol is added before t=1000s, then cell death slows down the response considerably.