![]() |
Artificial Intelligence Research Laboratory Department of Computer Science Iowa State University |
Algorithmic Tools for Gene Expression Analysis
Personnel
Project Summary
Funding
Publications
Additional Information
Projects
AI Lab
The central dogma of modern biology states that the functional state of an organism is determined largely by the pattern of expression of genes. Thus, the function of a cell, how well a cell performs its function, and even the determination of a cell's type is controlled by the level at which genes are expressed in the cell. Different gene expression levels can, for example, differentiate a neuron from a muscle cell or a normal cell from a cancerous cell of the same type. However, many biological processes of interest (e.g., cellular differentiation during development, aging, disease) are controlled by complex interactions over time between hundreds of genes. Furthermore, each gene is involved in multiple functions. Given the fact that thousands of genes are involved in determining the functional state of an organism, the task of assigning functions to genes, identifying underlying genetic signalling pathways, genetic control and regulatory networks is a formidable task.
The advent of microarray technology provides biologists with the ability to measure the expression levels of thousands of genes in a single experiment. Microarrays come in several flavors. In a common type of microarray, typically called a DNA microarray, several thousand DNA samples are fixed to a glass slide, each at a known position in the array. Each probe (a DNA sample) corresponds to a single gene within the organism under investigation. Messenger RNA samples are then collected from a population of cells under a given set of experimental conditions. These samples are converted into cDNA via reverse transcription and labeled with a fluorescent dye. The cDNA sample is hybridized with the microarray. The level of expression of a particular gene is roughly proportional to the amount of cDNA that hybridizes with the DNA affixed to the slide. By repeating this process under different experimental conditions, using different fluorescent dyes to distinguish between any pair of experimental conditions, it is possible to measure the expression levels of different thousands of genes under the chosen conditions. Thus, for example, the relative levels of expression of a particular gene under any pair of experimental conditions can be obtained by measuring the ratio of the amount of the two dyes present at the position of each DNA sequence on the slide using laser scanning technology.
With the increasing use of DNA microarray and related technologies for gathering gene expression data from plants and animals, there is a growing need for sophisticated computational tools for extracting biologically significant information from gene expression data, assigning functions to genes, and identifying shared signalling pathways and control circuits (e.g., genetic signalling and genetic regulatory networks). Against this background, our research is aimed at precise characterization of the information requirements of these tasks and the design and implementation of suitable algorithms for gene expression analysis. Algorthms are being developed for:
The resulting computational tools for gene expression analysis will be applied to a variety of problems in computational biology including:
To appear.
© Vasant Honavar, 1999, 2000.