BioSysBio:abstracts/2007/Imtiaz Khan: Difference between revisions

From OpenWetWare
Jump to navigationJump to search
No edit summary
Line 3: Line 3:
'''Affiliations:''' <sup>1</sup>Biostatistics and Bioinformatics Unit, School of Medicine, Cardiff University, Cardiff, CF14 4XN, UK. <sup>2</sup>Department of Pathology, School of Medicine, Cardiff University, Cardiff, CF14 4XN, UK. <sup>3</sup>Department of Medical Biochemistry and Immunology, Cardiff University, Cardiff, CF14 4X, UK.
'''Affiliations:''' <sup>1</sup>Biostatistics and Bioinformatics Unit, School of Medicine, Cardiff University, Cardiff, CF14 4XN, UK. <sup>2</sup>Department of Pathology, School of Medicine, Cardiff University, Cardiff, CF14 4XN, UK. <sup>3</sup>Department of Medical Biochemistry and Immunology, Cardiff University, Cardiff, CF14 4X, UK.
<br>
<br>
'''Contact:''' wpciak@cf.ac.uk <br>
'''Contact:''' mailto:wpciak@cf.ac.uk Imtiaz Khan <br>
'''Keywords:''' Cell Cycle, Cell lineage, Bioinformatics, Timelapse microscopy.  
'''Keywords:''' Cell Cycle, Cell lineage, Bioinformatics, Timelapse microscopy.  



Revision as of 06:11, 29 September 2006

A Bioinformatics approach for the interrogation of molecular events in single cells: transforming fluorescent timelapse microscopy images into numbers
Author(s): I. A. Khan 1, J. Fisher 2, P. J. Smith 2 and R. J. Errington 3
Affiliations: 1Biostatistics and Bioinformatics Unit, School of Medicine, Cardiff University, Cardiff, CF14 4XN, UK. 2Department of Pathology, School of Medicine, Cardiff University, Cardiff, CF14 4XN, UK. 3Department of Medical Biochemistry and Immunology, Cardiff University, Cardiff, CF14 4X, UK.
Contact: mailto:wpciak@cf.ac.uk Imtiaz Khan
Keywords: Cell Cycle, Cell lineage, Bioinformatics, Timelapse microscopy.

Abstract

Cell-based assays, for high-content-screening, demand unique solutions which enable image encoding and interrogation of spatio-temporal cellular events. Our overall aim is to develop data mining tools and mathematical models capable of producing predictive cell response profiles for use in pre-clinical screening, experimental therapeutics and hypothesis-driven research. Our principal paradigm is that cellular bioinformatics interrogation of kinetic microscopy data opens a route to revealing the nature and time frames for the modulation of the cell cycle in disease and under stress. A high level informatics enables linking of multi-scalar events that comprise innate and induced population heterogeneity in dynamic cellular systems. We have developed a bioinformatics environment where analysis tools efficiently encode microscopy images into numbers and deposit the extracted descriptors into a relational database for sequential data access and data visualisation. In this current work we show that this bioinformatics approach to image derived cell-based measurements reveals inter-event relationships such as novel cellular phenotypic and molecular bifurcation patterns.

Introduction

Exploring and exploiting the enormous potential for pharmacological modulation of the mammalian cell cycle are key goals for basic research and drug discovery. We have developed a critical advance - the high temporal resolution monitoring of cell cycle progression enabling the tracking of single cell checkpoint transitions in a non-invasive manner even within heterogeneous populations. The green fluorescent protein (GFP)-based probe has expression, location and destruction characteristics that shadow cyclin B1 dynamics in living cells (Thomas 2003). The non-perturbing stealth reporter performance has been validated on high content to high throughput detection platforms comprising multi-well HTS imaging, single cell kinetic-tracking and multi-parameter flow cytometry (Thomas and Goodyer, 2003; Thomas et al 2005). Cyclin-B1 tracking provides sub-phase information on cell cycle progression, cell-cycle regulator dynamics in parallel with morphological landmarks and DNA content analysis. We have sought to track the functional operation of checkpoints within a given cell cycle compartments and track the outcomes in bifurcating lineages.

Single cell lineages which underlie the basic concept provide an elegant assay for determining the evolving and complex interplay for tumour survival at the single cell level. Cell lineage mapping has been most comprehensively described for Caenorhabditis elegans to elucidate developmental mechanisms and nematode evolution (Sternberg and Horvitz, 1981; Borgonie et al., 2000). Lineage tracking imposes a labour-intensive task and a limited amount of software tools have been developed to ease this burden (Braun et al, 2003). In experimental terms a cell lineage is defined as descent in a line from a common progenitor that was exposed to a given influence for a discrete period. The behaviour of both the progenitor and the descending line of offspring reveal the time-integrated pharmacodynamic response (e.g. changes in inter-generation cell division time or cell death). For example, this would have direct relevance to how viable populations, representing resistant fractions, might be maintained in drug-treated tumour cell populations. Timelapse imaging and analysis has been used in this context extensively by others to examine the viability and proliferation of uninucleated and multinucleated giant cells formed after 6 Gy X irradiation (Prieur-Carrillo et al., 2003) or apoptotic induction post-irradiation in p21 gene knock out cell lines (Chu et al., 2004). However we have experience with our studies that these type of analysis encounter sever hurdles without a meta-data structure that allows for parameter management. It also severely restricts the approaches to data analysis to a time-orientated analysis rather than a lineage or relationship analysis.

Our primary premise is that a bioinformatics approach to cell-based measurements provides an essential route to determining inter-event relationships revealing novel cellular and molecular event patterns. Our overall aim is to develop an integrated bioinformatics environment where analysis tools efficiently encode microscopy images into numbers and deposit the encoded data into a relational database. The intention is to provide a web-based interface with the database called CycleTRAK which provides data access enabling robust hypothesis-driven data-mining and drug signature queries for data interrogation.

Materials and Methods

Cell molecular reporter system: The parental cell line used in these studies was a human osteosarcoma cell line derived from a 15 year old Caucasian female U-2 OS (ATCC HTB-96)23. U-2 OS cells was transfected with G2M Cell Cycle Phase Marker (GE Healthcare, UK) using Fugene (Roche) according to the manufacturers instructions. Following selection with 1000 μg/ml Geneticin (Sigma G7040) the expressing cells were enriched using high speed FACS (Mo flow (Cytomation) and sorted into 96 well plates (1 green fluorescent cell/well). Colonies were expanded and clones whose green fluorescence varied with the cell cycle as predicted and as determined by flow cytometry were selected for the current study.

Cell culture and maintenance condition: The stably transfected cells were maintained at 37oC and 5 % CO2 using standard tissue culture techniques. Media used was McCoys 5A modified (Sigma) supplemented with 2 mM glutamine, 100 units/ml penicillin, 100 μg/ml streptomycin, 10 % fetal calf serum and 1000 μg/ml geneticin.

Single cell timelapse screening: High resolution fluorescence cell tracking was performed with cells seeded into a 12 well coverslip-bottomed multi-well plate. Immediately post-addition (of a drug perturbation) the cultured dishes were placed on to a timelapse instrument designed to capture bright-field phase images and GFP fluorescence (480/25 nm excitation and 525/30 nm emission). An Axiovert 100 microscope (Carl Zeiss, Welwyn Garden City, UK), was fitted with an incubator for 37oC/5% CO2 maintenance (Solent Scientific, Portsmouth, UK), and an ORCA-ER 12-bit, CCD camera (Hamamatsu, Reading, UK). Illumination was controlled by means of a shutter in front of the transmission lamp, and an an x,y positioning stage with separate z-focus (Prior Scientific, Cambridge, UK) controlled multi-field acquisition. Image capture was controlled by AQM 2000 (Kinetic Imaging Ltd). All images were collected with a 40x, 0.75 NA air apochromat objective lens providing a field size of 125x125 μm. Sequences were captured every 20 minutes for 48 hours, ordinarily at least three fields per treatment regime.

At the end of the experiment the images were stacked and saved as *.stk or *.AVI format. MetaMorph (Molecular Devices, California) was used to view the stacked image as sequence of images. We then developed Quercus – Fluor (in-house software) to work in conjunction with MetaMorph to encode and transform the images into a parametised database.

Quercus – Fluor: For each microscopy or image based screen, all experimental descriptors were documented into an accessible but simple excel file. This screen file acts as a digital laboratory notebook where all experimental descriptors for all drug screens are recorded with some details being recorded manually, while many, such as the progenitor cell morphological descriptors derived from the raw images, were acquired semi-automatically via a Perl script which communicates with MetaMorph. The principal objective of this screen file is to attribute a unique encryption tag for each origin or starting cell within the field which we term a progenitor cell. This is critical, since the complexity evolves as each starting cell divides or dies and hence produces progeny which populate a full lineage. Therefore every subsequent event within the lineage can be rooted or associated to the progenitor cell including tagged parameters.

Quercus – Fluor is part of an ‘in-house’ software encoding suite ‘Quercus’ designed to parametise fluorescent timelapse microscopy image sequences. Quercus – Fluor has been written in a single Perl script and the Perl-Tk module has been used to generate the graphical user interface and the canvas upon which the lineage is displayed. The software is divided into three interlinked parts. The first part interacts with the digital laboratory notebook mentioned above and directs users to a specific progenitor cell location, this part of the programme also generates the tag through which the progenitor cell becomes indexed later in the database. The second part interacts with MetaMorph and draws the evolving lineage to the canvas. Finally, the third part writes the image derived parameters associated with each cell of the lineage into the database called CycleTRAK.

Encoding the fluorescent images: A user sequentially selects experimental attributes based on experiment, well, field and cell position, additionally a graphical display of the multi well plate facilitates users in navigating a sequential sieving process. Quercus – Fluor dynamically interacts with the digital laboratory notebook and generates the informative graphical display. For example, when a particular experiment has been chosen by the user, Quercus – Fluor interacts with the screen file and both reads and display all information regarding each well within the specific screen the process continues up till the selection cell level. When the tagging is complete, a cell is created in the canvas of Quercus – Fluor, the raw image counter part of this newly created cell is located in the MetaMorph video window and tracked. MetaMorph is interfaced with Quercus – Fluor via a dynamic data exchange link log file. For the cell of interest, in each frame three regions-of-interest (ROIs) are used to extract parameters from the raw image sequence viewed in MetaMorph. The first ROI is always positioned on the nucleus and the other two ROIs are positioned on the cytoplasmic regions, usually on the opposite side of the nucleus of the cell of interest. For each ROI, MetaMorph extracts 10 parameters from the raw image and the parameter includes – Frame number, X coordinate, Y coordinate, width of ROI in pixel, average intensity, Intensity standard deviation, intensity signal/noise ratio, integrated intensity, minimum intensity, maximum intensity. Once the ROIs are positioned, the cell of interest is tracked frame by frame starting at frame one. Increment of the frame is automatic when the user presses the ‘Log Data’ button in the MetaMorph, for any frame if the cell of interest moved considerably from its last frame position, users need to reposition the ROIs manually. Additionally when ‘Log Data’ button is pressed, major and minor events are also logged for that frame. Both event types are displayed in the ‘Labeled Logged Data’ window of MetaMorph, by default the major event is ‘N’ and the minor event is ‘null’ meaning no major and minor event respectively. However with the progression of logging in a frame by frame manner, when some changes (rounding up) happens to the cell, users need to change the minor event to ‘start’ from its default ‘null’ label indicating some event has started to happen and finally when the event (mitosis, death) ends users changes the major event label accordingly and minor event label to ‘end’, indicating some major event has ended. According to the major event and the time associated with it, Quercus – Fluor draws the lineage within the canvas. While encoding, it is often required to revisit the bifurcating points (where one cell divides into two daughter cells) in the image, because only one cell can be tracked at any given time. Quercus – Fluor has this feature through which any bifurcating point of a lineage can be indexed in the raw image sequence viewed under Metamorph. Once a lineage for a progenitor cell is complete it is saved as a text file. Any complete or partially encoded lineage can be visualized just by selecting the appropriate lineage text file and editing (delete part of the lineage) is also possible. The editing feature of Quercus – Fluor provides the opportunity to users to delete any part of the lineage which contributes an added layer of accuracy to the encoded lineage data. Quercus – Fluor provides complete flexibility as it can map lineages based on all possible outcomes of a cell division, for example unusual circumstances such as the generation of three or four daughters due to abnormal cytokinesis, or the generation of a polyploidy cell. Quercus – Fluor also assigns to each cell a unique identifier, the start or the progenitor cell is named as ‘B’ if this cell divides into two daughter cells then the two daughters are named as ‘BN’ and ‘BS’ respectively. For a re-fused or polyploidy outcome the designation is ‘BE’. Three individual daughters are named ‘BN’, ‘BE’ and ‘BS’ while four daughters are named ‘BN’, ‘BU’, ‘BL’ and ‘BS’ respectively. This identification pattern also establishes the relationship between different cells within a lineage.

This semi-automated and user directed fashion of lineage encoding from the raw images is indeed time consuming, depending upon the size of the lineage, expertise of the user and cell density in the image, it may take few minutes to an hour to encode a lineage. This semi-automated manner of encoding is undeniably the rate limiting step but user’s interaction ensures the highest precision of the data being encoded. A combination of automated and user-interactive bioinformatics software has been suggested within a recent review (Taylor and Giuliano, 2005) as the challenge and opportunity for the next generation high content screening. Once the encoding of a lineage is complete, the completed lineage dataset is placed into the temporary lineage database where all lineage data are stored in tab delimited text file format. One lineage constitutes a single text file and the name of the text file is the tag assigned to the progenitor cell. The tag or name of a lineage has 23 parameters associated with it, which makes it distinguishable from all other lineages of the database. Within the text file each row represents a cell in a particular frame and 30 columns of data represents the data from 3 ROIs (10 data points for each ROI) for that cell in that frame. Unique nomenclature of the cells within a lineage enables any computer language to access the data while maintaining lineage relationships, and moreover the nomenclature of the lineage itself facilitates lineage classification based on user defined conditions, e.g. drug dose. All lineages accumulated into the lineage database have passed through an automated but rigid quality control check which ensures that all lineages are stored in the correct data structure. Data from these text files are converted into a MySQL database which can be accessed via the web.

Result

Continuous cell cycle tracking at the single cell level. The fluorescent G2M Cell Cycle Phase Marker reporter system depends on the control of expression levels and location of GFP as a cell progresses to the later cell cycle stages and negotiates mitotic entry and exit. This is achieved by using the functional components from cyclin B1 to confer switch-like properties to the stealth reporter. Expression is driven by the promoter region, removal via the destruction box (D-box) and translocation from the cytoplasm to the nucleus compartment via the cytoplasmic retention signal (CRS). Cyclin B1 expression is tightly regulated and acts as a major control switch suitable for following the transition from S-phase through the G2 phase into mitosis. Importantly since the cyclin box is absent from the reporter it does not interfere with or perturb cell cycle progress.

An important aspect of the cyclin B1 signal readout is whether it is amenable to parameterisation and hence potentially incorporated into algorithms for automated analysis and signature identification. Time-based tracking of the cyclin B1 fluorescence at the single cell level revealed an average intermitotic time of 27.3 ± 6 hours, with an overall relative increase of fluorescence of 3.4 ± 0.8 fold from basal expression in G1 to G2 levels (pre-mitosis). A movie sequence of cells traversing the cell cycle and dividing (Figure 1) shows the fluorescence changes as the cell progresses to mitosis from G1, individual cells ramp up cyclin B1 expression (become brighter), a translocation event (cytoplasm to nucleus) occurs just before mitosis. Finally, signal switch-off during cell division triggered at metaphase occurrs on average over 2 ± 0.8 hours.

Figure 1: Fluorescent timelapse microscopy image sequence


Figure 2: Shows an example lineage where the progenitor cell divides into two daughter cells at 5 hours of start of the experiment. The north daughter (BN) again divide at 27.66 hours into two daughter cells BNN and BNS which is labelled as Track 1 and Track 2. The south daughter (BS) failed to divide within the duration of the experiment (48 hours) and labelled as Track 3.


Figure 3: Shows the intensity profile and the motility of the three tracks of figure 1. The Upper panel depicts the Cyclin-B intensity profile, Red line shows the intensity of the nucleous while the Black lines show the intensity of two cytoplasmic regions. The lower panel depicts corresponding movement of the cell (nucleolus) . Distance travelled by the cell in each frame was calculated using Pythagorean theorem for distance measurement and presented in cumulative fashion.

Constructing a bioinformatics environment to access single cell molecular fingerprints from a proliferation bifurcation map. This lineage is an illustration of the molecular dynamics and the phenotypic variations at a single cell level. This lineage consists of 3 tracks as described in figure 1. For the first 5 hours all three tracks are identical, since they are from a single cell, for the same reason track 1 and track 2 are identical up till 27.6 hours. Track 1 and Track 2 illustrate a typical cell cycle dynamics where the cell cycle time is around 22 hours. The probe fluorescence intensity in the cytoplasm starts to rise from G1 phase, however the intensity from the nucleus remained lower than the cytoplasmic intensity till the late G2 phase when the cell rounds up and the cytoplasmic region become indistinguishable from nucleus and soon after the mitosis the intensity falls to the basic level both in cytoplasm and nucleus. The motility along these two tracks remains linear through-out the experimental duration, except at bifurcating point (27.6 hour), this abrupt increment is solely attributed to translocation of the mitotic cell. Comparing the intensity and mobility profile of track 3 (BS) with that of track 1 and 2 (BN), it is interesting to observe and associate the intensity profile remains flat compared to the increasing intensity of track 1 and 2 (BN), this would be classified as a G1 arrested cell. However the motility of cell BS remains almost identical to the sister counterpart (BN) up to 27 hour which went through mitosis and divides into two daughter cells. Following the failure to commit to mitosis, the motility of cell BS dropped exceptionally and remains as such till the end of the experiment. This failure to commit to mitosis followed by a decrease of motion links these two phenotypic responses which would have important implications for wound healing and requires further dissection at the molecular level to determine the mechanisms which underpins this cellular interaction.

Conclusion

In the current study we describe a novel cell lineage encoding method which enables us to parametise molecular signatures derived from stealth fluorescence reporters on a bifurcation map which represents cellular proliferation phenotypic responses. Quercus – Fluor provides a step change in our ability to encode and access information on multi-scalar dynamic cell behaviour. We believe that kinetic measurements provide an essential route to revealing important time windows and informative cells to study the mechanism of action of individual agents and their response pathways. This encoding process encapsulates the critical features of cell-cell heterogeneity, molecular dynamics and time-dependent events. The multi-level descriptors and parameters attributed to each cell within the resultant cell lineage maps provide a unique framework for applying bioinformatics-like query algorithms such as those used for genomic databases, and the ability to locate with high temporal resolution cell cycle phase traverse and checkpoint responses. Cells responding to pharmacologically active agents in a non-invasive manner provides a means of linking causative events with later outcomes at the molecular level and the data generated creates the opportunity for pharmacokinetic (PK) and pharmacodynamic (PD) modelling and validation of intracellular dynamics in response to drug. The lineage map importantly provides a functional map upon which other information can be linked, such as proteomic and genomic expression data. The approach may also address the significant challenge of tracking the evolution of clonal variation in tumour cell populations using micro-array approaches.

Future

Another aspect of this work is to develop predictive algorithms which link our ‘real’ screen data with mathematical models describing drug-target interaction and cellular driven resistance mechanisms at the PK and PD level. (Evans, et al., 2004). Importantly, the biological outputs from our lineage databases translate directly to clinically relevant indicators of the therapeutic response of tumour cell populations in terms of initial response, growth delay and the appearance of surviving (‘resistant’) cells with distinct progenitor characteristics or evolving phenotypes.

Establishing Quercus – Fluor in such a generic way has allowed us to adapt to any type of high-content screening assay, work is underway to interlink multi scalar data generated from different assay techniques. Furthermore we are using different analytical processes to search for drug response fingerprints to inform the temporal windows for a simplified analysis of drug action suitable for micro-scale formats on optical biochip platforms. A significant challenge for us is to convert Quercus – Fluor into an automated encoding programme while maintaining the current highly robust event identification element. This is not a trivial undertaking and will require some novel cell tracking algorithms appropriate for fluorescent microscopy image sequences.

Implementation and update

Work is in progress to establish the lineage information for the CycleTRAK database using the Quercus tools described above and the outcome of this work will be located at our main CyMart site, which contains other databases such as ProgeniTRAK. CyMART is built on an Apache web server and the MySQL4 database system running on a Linux platform. CyMART offers a query approach for data sieving and navigation through the appropriate experimental screens. A detailed description of the basic concepts behind the CyMART family of databases is available. The contents of the databases will be updated quarterly. New functionalities such as visualisation and data mining modules for determining functional relationships between phenotypic behaviour and molecular profiling will be pursued.

Web address for CyMART: http://biodiversity.cs.cf.ac.uk/quercus/

References

  • Borgonie,G., K. Jacobsen, and A. Coomans (2000), Embryonic lineage evolution in nematodes, Nematology, 2, 65–69.
  • Braun V, R.B. Azevedo, M. Gumbel, P.M. Agapow, A.M. Leroi, H.P. Meinzer (2003). ALES: cell lineage analysis and mapping of developmental events, Bioinformatics, 19(7), 851-8.
  • Chu, K., N. Teele, N. Albright and W.C. Dewey (2004). Computerized Video Time Lapse Study of Cell Cycle Delay and Arrest, Mitotic Catastrophe, Apoptosis and Clonogenic Survival in Irradiated 14-3-3s and CDKN1A (p21) Knockout Cell Lines, Radiation Research, 162, 270–286.
  • Evans, N.D., R.J. Errington, M. Shelley, G.P. Feeney, M.J. Chapman, K.R. Godfrey, P.J. Smith and M.J. Chappell (2004). A mathematical model for the in vitro kinetics of the anti-cancer agent topotecan, Math Biosci, 189(2),185-217.
  • Prieur-Carrillo, G., K. Chu, J. Lindqvist and W.C. Dewey (2003). Computerized Video Time-Lapse (CVTL) Analysis of the Fate of Giant Cells Produced by X-Irradiating EJ30 Human Bladder Carcinoma Cells. Radiation Research, 159, 705–712 .
  • Sternberg, P.W. and H.R. Horvitz, (1981) Gonadal cell lineages of the nematode Panagrellus redivivus and implications for evolution by the modification of cell lineage, Dev. Biol., 88, 147– 166.
  • Taylor, D.L. and K.A. Giuliano (2005). Multiplexed high content screening assays create a systems cell biology approach to drug discovery. Drug Discov Today: Technologies, 2(2),149-154.
  • Thomas, N. and I.D. Goodyear (2003). Stealth sensors: real-time monitoring of the cell cycle, Targets, 2, 26-23.
  • Thomas, N. (2003) Lighting the circle of life. Cell Cycle 2(6): 545-549.