ABSTRACTS

ESHG - Posters: P 11 Genomics and Bioinformatics

P0587 

Comparative mapping of human chromosome 22 

A. Aleyasin 1, W. Barendse 2;
1National Research Center for Genetic Engineering and Biotechnology, Tehran, IRAN (ISLAMIC REPUBLIC OF), 2TAG, CSIRO, Brisbane, Qld, AUSTRALIA. 

 

Human chromosome 22 is one of the smallest autosomes involved in many human genetic disease. Comparative mapping of human chromosome 22 has not been very well described in most unrelated species. This work was to choose homologous genes from human chromosome 22 (HSA22) and map them in cattle to understand the homology between human chromosome 22 and cattle chromosomes. In this way polymorphisms for six genes (ADSL, BZRP, CYP2D, CRYBA4, DIA1 & PDGFB) that showed strong sequence homology with their human counterparts and their locations are well known in human have been developed in cattle. Polymorphisms were developed for introns or 3'UTRs of those genes. Those polymorphisms have been genotyped and genetically mapped in cattle using International Bovine Reference panel containing individuals related to three generations. Those genes have been mapped by genetic linkage analysis in the frame work provided by International cattle genome mapping facility in CSIRO, Brisbane Australia. They have been mapped on cattle chromosome 5 and 17 and developed understandings of conservation between human chromosome 22 and cattle chromosome 5 and 17.

 

P0588 

Molecular characterisation of pericentric inversion breakpoints on chimpanzee chromosome 19 compared to human 

H. Kehrer-Sawatzki 1, S. Tänzer 2, M. Platzer 2, H. Hameister 1;
1Department of Human Genetics, University of Ulm, GERMANY, 2Department of Genome Analysis, Institute for Molecular Biotechnology, Jena, GERMANY. 

 

Structural differences of the primate chromosomes include heterochromatin variability, inversions, and the telomeric fusion of two ancestral chromosomes, which resulted in the generation of human chromosome 2. Since genetic disparities are assumed to be responsible for phenotypic differences between hominoid species, molecular characterisation of the genomic regions harbouring evolutionary breakpoints is important. These analyses will help to understand the mechanisms involved in chromosome evolution. Additionally, the molecular definition of evolutionary breakpoints will reveal whether they overlap with human genomic regions susceptible to constitutional or somatic rearrangements and whether owing to intrinsic sequence features these breakpoints have been utilised more than once during evolution. So far, only the fusion event, which gave rise to human chromosome 2 has been characterized on the molecular level. Currently, we are investigating the pericentric inversion of the chimpanzee (PTR) chromosome 19, orthologous to human chromosome 17. Human BAC clones from 17p13 and 17q21.3 were identified, which span the inversion breakpoints. FISH with these BACs resulted in split hybridization signals on PTR19. Subfragments of these human BACs were used to isolate chimpanzee BACs, which in turn cover the inversion breakpoints. Sequence analysis of the junction and the respective normal sequences revealed that the inversion took place in 5-7kb spanning regions rich of repetitive elements and that nonhomologous recombination between Alu elements has been involved. We specified the sequence environment flanking the breakpoints, identified the genes next to the breakpoints on either site, and addressed the contribution of this inversion to the structure of these genes.

 

P0589 

Human data in SWISS-PROT and TrEMBL 

Y. Karavidopoulou, S. Contrino, P. Kersey, M. Magrane, A. Williams, R. Apweiler;
European Bioinformatics Institute (EBI), Cambridge, UNITED KINGDOM. 

 

TrEMBL was created as a supplement to the SWISS-PROT Protein Knowledgebase. Whereas SWISS-PROT is a non-redundant, manually curated protein sequence database, TrEMBL is a computer-annotated database containing the translations of all coding sequences present in the EMBL-Bank nucleotide sequence database that are not present in SWISS-PROT.
SWISS-PROT contains 7,652 annotated human sequences. 31,673 exist in TrEMBL awaiting annotation and/or merging. We aim to fully annotate and describe these human protein sequences and distribute them to the scientific community as soon as possible, thus enriching the biological knowledge of the human genome. More about the 'Human Proteomics Initiative' (HPI) at http://www.ebi.ac.uk/swissprot/hpi/ .
A human SWISS-PROT/TrEMBL non-redundant proteome set was built, consisting of 24,049 entries (from a total of 39,325 human entries). Proteome analysis was performed, offering statistical, structural, functional and comparative analyses, http://www.ebi.ac.uk/proteome/ .
As all human coding sequences are not yet in the nucleotide sequence database, the SWISS-PROT and Ensembl teams jointly constructed a draft complete non-redundant proteome, comprising 23,377 SWISS-PROT/TrEMBL non-redundant entries and additional 9,136 Ensembl predictions.
Gene sets for the human chromosomes were constructed, providing a comprehensive reference of the human data in SWISS-PROT/TrEMBL. Each set is an alphabetic listing of the genes with the HUGO approved gene symbol (or the NCBI LocusLink provisional symbol) encoded on that chromosome, together with the chromosome position, the protein it encodes and useful links to other databases, such as GeneCards, OMIM, Ensembl, InterPro and CluSTr.
Currently, 14,932 gene mappings have been made, 8,427 of which with an official HUGO gene name.

 

P0590 

Biomax PEDANTÔ Human Genome Database - Automatic and Manual Functional Annotation of the Human Genome 

A. Fritz 1, B. Geier 1, V. Solovyev 2, M. Firsov 3, C. M. E. Schüller 1;
1Biomax Informatics AG, Martinsried, GERMANY, 2Softberry Inc., Mount Kisco, NY, 3Petrogen Ltd., St. Petersburg, RUSSIAN FEDERATION. 

 

Public and private efforts have begun to extract valuable information from the wealth of human genome data. Biomax Informatics systematically analyzed data for all human genes (both known and unknown) in a consistent manner to determine (possible) protein function and, when necessary, refined the annotation manually . All information has been stored in a relational database, which can be accessed via a Web-based user interface.
Using the publicly available working draft assembly of the human genome, we identified the location of known and putative genes with the exclusive Fgenesh++ software from Softberry, Inc. The Pedant-ProÔ Sequence Analysis Suite from Biomax was used to perform systematic, comprehensive and consistent analysis for in-depth functional and structural characterization of the predicted proteome. This characterization includes functional class assignments according to a functional catalogue (with more than 1500 functional categories) and assignment of EC numbers, PROSITE patterns, Pfam domains and SCOP classifications.
The PEDANTÔ Human Genome Database based on the 12 December 2000 working draft assembly of the human genome and contains 44,403 gene models. Using sequence similarity to manually annotated proteomes, proteins have been assigned to functional categories. In addition, the Pedant-Pro software performed further functional and structural analyses including similarity searches, annotation of functional domains, and assignment of keywords, EC numbers and transmembrane domains. Report pages display all important automatic and manual annotations for each protein and hyperlinks to the corresponding databases. The easy-to-use graphical user interface provides search tools as well as DNA and protein viewers.

 

P0591 

The dynamic process of gene discovery: characterization of 19 novel transcripts from human chromosome 21 

S. Deutsch 1, A. Camargo 2, A. Reymond 1, B. J. Stevenson 3, R. B. Parmigiani 2, C. Ucla 1, F. Bettoni 2, C. Iseli 3, V. Jongeneel 4, P. Bucher 3, A. J. G. Simpson 2, S. E. Antonarakis 1;
1Division of Medical Genetics, University of Geneva Medical School, Geneva, SWITZERLAND, 2Ludwig Institute for Cancer Research, Sao Paulo, BRAZIL, 3Swiss Institute of Bioinformatics, Epalinges, SWITZERLAND, 4Ludwig Institute for Cancer Research, Epalinges, SWITZERLAND. 

 

The identification of all human chromosome 21 (HC21) genes is a necessary step in understanding the molecular pathogenesis of trisomy 21 (Down syndrome). The first analysis of the sequence of 21q included 127 previously characterized genes, and predicted an additional 98 novel anonymous genes. Recently we evaluated the quality of this annotation by characterizing a set of C21orfs and PREDs, identified by mapping spliced ESTs to the genome and only by in silico analysis, respectively. This study underscored the limit of in silico-only gene prediction, as many PREDs were incorrectly predicted. To refine the HC21 annotation, we have developed a reliable algorithm to extract and stringently map sequences that contain bona fide 3’ transcript ends (3’-tags) to the genome. We then created a specific 21q Acedb that incorporates new ESTs as well as features such as CpG islands, repeats and gene predictions. Using these tools we identified 27 new putative genes. To validate these, we sequenced previously cloned cDNAs, performed RT-PCR, 5’ and 3’RACE procedures and comparative mapping. These approaches substantiated 19 new transcripts, thus increasing the HC21 gene count by 9.5%. These transcripts were not previously identified probably because they are small and encode small proteins. We also identified 4 transcriptional units that are spliced but contain no obvious open reading frame. The HC21 data presented here further emphasize that current gene predictions algorithms miss a substantial number of transcripts that nevertheless can be identified using a combination of experimental approaches and multiple refined algorithms.

 

P0592 

GENATLASThird Millenium: a database dedicated to gene and disease annotation 

C. Mugnier 1, M. Chauvet 2, K. Tanin 2, F. Barbet 2, A. Munich 3, M. Roux-Rouquie 4, J. Frezal 5;
1Université de Paris V, Paris, FRANCE, 2Université de Paris V - Génopole Ile de France, Paris, FRANCE, 3U393 INSERM - Université de PARIS V - Génopole Ile de France, Paris, FRANCE, 4Institut PASTEUR, Paris, FRANCE, 5Hôpital Necker - Université de Paris V - Génopole Ile de France, Paris, FRANCE. 

 

We designed a new conceptual model to improve data storage and retrieval in GENATLAS
Data structure in GENATLASTM : The new relational GENATLAS, noted GENATLASTM, was developed with ORACLE and contains five major sections which consist in the GENATLASTM core. The section GENE stores information on DNA type, arrangement and structure for a specific gene or DNA sequence. The section RNA contains data on the major gene transcript type and isoforms as well as tissue expression or associated pathologies (stored in the special section VARIANT/PATHOLOGY). Protein products are categorized according to their structures: secondary structure, structure motifs/domains, structure homology, etc., and their activity categories in the PROTEIN section. In addition, the protein EXPRESSION section displays ontologies about cellular localization (extra- and subcellular), tissue type, stage, etc.
The GENATLASTM core is linked to three distinct directories which are the CITATION directory (containing more than 40.000 occurrences linked to Medline abstracts), the PHENOTYPE directory gathering information of mendelian disorders, susceptibility genes, somatic genetic disorders (tumors, malformations, etc.) and the LINKAGE directory includes 28.000 pairs At the moment, GENATLASTM contains more than 12.000 genes, and 2.600 diseases are instantiated.
Querying GENATLASTM The strategies to query GENATLASTM implement multicriterion approaches according to the core sections (GENE, VARIANT/PATHOLOGY, RNA, PROTEIN, EXPRESSION) or particular fields within these sections. Actually, criterion combinatory querying constitutes one of main features of GENATLASTM; more than one hundred criterions can be used to query for gene structure, function and diseases.
GENATLASTM would be accessible at : http://www.dsi.univ-paris5.fr/genatlas

 

P0593 

In man trinucleotide repeats are underrepresented but some motifs are overexpanded: a novel signature for eukaryotic genomes? 

P. Astolfi 1, D. Bellizzi 2, V. Sgaramella 2 ,3;
1Dept. Genetics and Microbiology, University of Pavia, Pavia, ITALY, 2Dept. Cell Biology, University of Calabria, Cosenza, ITALY, 3Centro Linceo Interdisciplinare, Rome, ITALY. 

 

Aiming at assessing whether the reported shortage in trinucleotide repeats could be ascribed to specific motifs, such as those found dynamically mutable in man, we used the Tandem Repeat Finder program and detected approximate di-, tri- and tetra-repeats in human chromosomes 21 and 22 and in five organisms (M. musculus, D. melanogaster, C. elegans, A. thaliana, S. cerevisiae). Di-repeats are always the most represented, tetra-repeats are more represented in mammals than in the other organisms, whereas tri-repeats are consistently scarce. In man tri-repeat representation is less than ¼ of tetra-; their frequency is 6.4 repeat/Mb and their coverage 473 b/Mb. Motifs such as ACG, ACT, CCG are scarce, while the others present taxa-specific variations. Motif representation seems related to base sequence rather than content, with AA contributing positively and CG negatively. In man ACG is the least represented, ATC has the highest coverage, AAT the most frequent. Tri-repeat coverage generally increases linearly with frequency, except for AAG, ACC, AGG, ATC in man, and AAG, AGG in mouse: these repeats tend to expand in the former more than in the latter and occasionally, but never in exons, exceed copy number of 40, the observed limit for tri-repeat pathological expansions. Tri-repeat scarcity seems a feature of mammals and the variable representation of their motifs may constitute a novel signature for eukaryotic genomes: since both probably result from a structural control on DNA synthesis, the aberrant elongations found in dynamic mutations may be due to the derangement of one or more control elements.

 

P0594 

DNA-Protein Interaction DataBaseProject: Bioinformatic DataBase of Third Genertion. 

S. A. Vasil'ev 1, A. V. Alexeevski 1, S. A. Spirin 1, V. N. Tashlitski 2, A. S. Karyagina-Zhulina 3, Y. Kalaidzidis 1;
1Belozersky Institute of Physico-Chemical Biology, MSU, Moscow, RUSSIAN FEDERATION, 2Chemical department, MSU, Moscow, RUSSIAN FEDERATION, 3Institute of Agricultural Biotechnology RAAS, Moscow, RUSSIAN FEDERATION. 

 

Increasing significance of theoretical modeling in the field of molecular medicine and molecular biology makes databases (DB) for three-dimensional (3D) structures of biological macromolecules particularly important. Currently available DB for 3D structures of biological macromolecules can be divided by three generations. First generation (G1) is DB-depositories. To the moment Protein Data Bank (PDB) is the most successful and actively developing G1 DB. Second generation (G2) is specially designed DB. Nucleic Acid Database (NDB) is an example of G2 bioinformatics DB (BDB). Along with information stored in PDB this type of DB allows linking to other biological DB (PFAM; GenBank, etc.). Third generation (G3) is highly specialized informational systems (IS). This type is presented by classified IS like Protein-DNA Interaction and PDBSum projects that provide both detailed classification and analysis of the information stored in PDB and NDB. Currently we develop G3 BDB -DNA-Protein Interaction DataBase (DPIDB) version 3.0.1. based on Pluk (object oriented) technology.
DPIDB contains complete information on structure of DNA-protein complexes (derived from PDB and NDB) in both usual and specially designed by us data format. Our data format allowed revealing specific features of DNA-protein interaction in complexes. The up to date classification of DNA-protein complexes is used for DPIDB. The Internet site of DPIDB v 3.0.1 (http://www.dpidb.belozersky.msu.ru) is now under construction. It will include DPIDB as well as the results of our own researches, for example, the statistical analysis of DNA-protein interactions. To the moment a review regarding two DNA-binding protein groups are presented on the site.

 

P0595 

Aggregation of post-transcriptionally coregulated mRNAs in cytoplasmic ribonucleoprotein particles 

A. Totaro, M. D'Apolito, A. Notarangelo, L. Zelante, A. Quattrone;
IRCCS CSS, S. Giovanni Rotondo, ITALY. 

 

A cytoplasmic ribonucleoprotein (mRNP) infrastructure works in the cell as an interface between the transcriptome and the proteome, being responsible for transcript transport, localization, turnover and translational control. This complex is composed by aggregates of distinct subsets of mRNAs with regulatory RNA binding proteins, which finally interact with ribosomes and the cytoskeletal apparatus to allow protein synthesis. It has been proposed that the clusters of monocistronic mRNAs composing individual mRNPs function like poycistronic mRNAs transcribed from operons in procariotic organisms, adding with their combinatorial nature a further level of regulative complexity. If these mRNPs are really regulative units, the bound mRNAs should be endowed with uniform turnover and translation rate. Here we show that a clusterization of human primary fibroblast mRNAs based on their individual half life (measured by an high density microarray approach) produces groups of mRNAs which can be demonstrated to be colocalized, and probably belong to the same mRNP type. The understanding of this new level of cell organization has profound implications for the pathogenesis of those mendelian diseases, like myotonic dystrophy and X fragile syndrome, in which a single alteration in a mRNP component results in a complex clinical phenotype.

 

P0596 

A Web-based SNP search tool 

T. M. Strom 1 ,2, C. Glöckner 1 ,2, T. Meitinger 1 ,2;
1Institute of Human Genetics, GSF National Research Center, Munich, GERMANY, 2Institute of Human Genetics, Klinikum rechts der Isar, Technical University, Munich, GERMANY. 

 

We constructed a database using dbSNP and the GoldenPath data which can be accessed via a Web-browser (http://ihg.gsf.de). The search tool allows to retrieve SNPs in the genomic region of RefSeq and Ensembl transcripts and to download the genomic sequence around each SNP as given by the GoldenPath. SNPs were masked by an 'N' in the genomic sequence in order to avoid primers to be positioned across SNPs.
The database contains all entries of dbSNP mapped to the GoldenPath by the NCBI. Entries are only displayed when they hit a single chromosome, a single contig and maximally twice within a certain contig. The proportion of repeat sequence within the region -100 bp to +100 bp was calculated for each SNP. The number of exons for each gene is derived from the GoldenPath annotation.
The database can be searched by i) chromosome coordinates, ii) RefSeq or Ensembl accession numbers, or iii) rsSNP IDs. A non-redundant dataset of RefSeq and Ensembl transcripts is used for the chromosome search. All SNPs up to 10,000 bp upstream and downstream of the transcripts are displayed. Transcripts without SNPs are indicated. The search can be narrowed by the proportion of repeats and the average estimated heterozygosity.
The tool allows seven features to be downloaded: i) rsSNP ID, ii) RefSeq or Ensembl accession number, iii) chromosome position on the GoldenPath, iv) average estimated heterozygosity, v) proportion of repeats, vi) observed polymorphism (dbSNP) and vii) sequence around the SNPs either as given in dbSNP or in the GoldenPath.

 

P0597 

UMD-STA and UMD-LMNA: Locus Specific DataBases for emerinopathy and laminopathy 

C. Beroud 1, R. Ben Yaou 2, A. Helbling-Leclerc 2, S. Llense 1, J. C. Kaplan 1, F. Leturcq 1, D. Recan 1, G. Bonne 2;
1Institut Cochin de Génétique Moléculaire, Paris, FRANCE, 2INSERM U523 - Institut de Myologie, Paris, FRANCE. 

 

About 75 neuro-muscular disease genes have been identified so far. Among them, Emery-Dreifuss syndrome is due to defects in the emerin gene (STA on chromosome X), or in the lamin A/C gene (LMNA on chromosome 1), both coding for proteins associated to the nuclear envelope. A large clinical intra- and inter-family variability is associated to both emerinopathy and laminopathy. In addition, up to five different clinical entities are associated to LMNA mutations. To handle and exploit the numerous data collected by the “French Clinical and Research network for Emery-Dreifuss muscular dystrophies and other nucleopathies" connected to "EUROMEN" network on emerinopathy and laminopathy (a subsidiary of the “Myo-Cluster” EC initiative), we developed a Locus Specific DataBase (LSDB). It was adapted from the Universal Mutation Database (UMD) package previously used for many LSDB's such as p53, APC, FBN1, LDLR, VHL, MEN1, ATP7B... The specific features of the present software are: (i) access to the various routines via internet; (ii) inclusion of multi-parametric analytic tools allowing optimized searching of correlations. This requires a full implementation of relevant clinical, para-clinical and biological features. We have developed a specific module to display phenotype-genotype and genotype-phenotype correlations. To date the UMD-STA database contains 130 records and the UMD-LMNA database 250 records.
This work was supported by the AFM/INSERM French Network "Clinical and research network for Emery–Dreifuss muscular dystrophies and other nucleopathies" and the European "Myo-Cluster/EUROMEN" Network.

 

P0598 

HC Forum (r): a generic bioinformatic platform dedicated to medical genetics 

O. Cohen;
Universite Joseph Fourier, Medical school of Grenoble, FRANCE. 

 

HC Forum® is the first secure and generic informatic platform dedicated to medical genetics, in which information is structured according to pedigrees and the different components of medical genetics. It includes a database accessible from a Web site. User-friendly interfaces require no specialised computer skills, but only an access with a simple Web navigator. Graphic tools allow inputting and search for information in the database. A high level of security includes authentication with smart cards, certified high level encryption (128 bits), medical data coding to avoid any patronymic names, and digital tattooing of images.
HC Forum® is approved by the French "Informatics and Freedom Commission " and by security audits, a medical ethics laboratory (Pr C. HERVE, Paris) and by specialized lawyer.
HC Forum® offers geneticists private working spaces dedicated to cytogenetics, monogenic diseases and dysmorphological syndromes, including LDDB thesaurus (R. WINTER) and international nomenclatures. It allows the inputting of genetic records with pedigree and a standardised textual description of diseases, enriched with images from the different clinical and paraclinical investigations.
Furthermore, HC Forum® allows collaborative working: (i) shared records allow several doctors to work on the same record, (ii) thematic networks allow professionals to share data related to a same disease.
HC Forum® establishes a link between medical genetics departments, research laboratories and teaching centres. HC Forum® is academic, free of charge and available to the international scientific and medical communities.
URL: https://HCForum.imag.fr

 

P0599 

Genomic Assays Project: Building an SNP-Based Linkage Disequilibrium Map and Ready-to-Use 5’ Nuclease Assays for 200,000 SNPs 

P. Dong, F. de la Vega, E. Spier, J. Stevens, K. Guegler, D. Dailey, G. Nunn, L. Wogan, J. Ziegle, M. Rhodes, D. Madden, D. Gilbert;
Applied Biosystems, Foster City, CA. 

 

The human genome project has identified the 3 billion base human genome sequence in the past decade. SNPs in the human genome, encoding the human variance, are expected to become a powerful tool in deciphering complex traits such as disease susceptibilities and drug responses.
We describe the project to create a ready-to-use solution for association studies in the process of gene discovery. We are building a genome wide linkage disequilbrium (LD) map based on data from both the Celera and the public sequencing efforts. The SNPs selected for the LD map are validated in regards to both assay performance and allelic frequencies in at least two populations. During the project, we will make available 200,000 SNP ready-to-use assays named Assays-On-DemandTM.
All SNP assays created in this project will be of high information content, quality and performance, based on the process we have introduced. These SNP assays will be easily accessible via a simple interface, enabling researchers to assemble the list of SNP assays tailored for their individual projects.
We will be presenting details of the genomic assays project including SNP selection , assay validation and LD map creation. The ultimate goal of this project is make genetic studies faster and more efficient.

 

P0600 

Comparative functional genomics of an entire chromosome: coding and regulatory annotation of HC21 using the mouse genome 

E. T. Dermitzakis 1, A. Reymond 1, S. Deutch 1, B. J. Stevenson 2, C. Iseli 2, P. Bucher 2, V. Jongeneel 2, S. E. Antonarakis 1;
1Division of Medical Genetics, University of Geneva, Geneva, SWITZERLAND, 2Ludwig Institute for Cancer Research, Lausanne, SWITZERLAND. 

 

Chromosome 21 (HC21) is the smallest human chromosome. Several diseases are associated with genes or genomic regions of HC21 including Down syndrome, the most common cause of mental retardation. The availability of the mouse genomic sequence gives the opportunity to combine whole chromosome homologous sequence comparisons with experimental assays to reveal novel functional regions that correspond either to putative genes or regulatory regions. We used PipMaker to perform long genomic alignments of the entire human chromosome 21q sequence with the homologous sequences from mouse chromosomes 16, 17 and 10 obtained from the Celera mouse genome assembly. 3896 blocks of >100bps in length and >70% sequence identity were extracted and fragments corresponding to known HC21 exons were excluded. From the remaining 2383 unassigned conserved blocks, 10% correspond to deposited usually singly represented human ESTs, 37% contain significant ORFs and several have strong similarity to deposited coding sequences. These suggest that many of the conserved blocks are unknown exons of known or unknown genes that were neither experimentally identified nor predicted by the sequence analysis of HC21, probably due to short length and restricted spatial and temporal expression. Conserved blocks corresponding to putative exons or regulatory regions are currently being tested experimentally for their functional role. Additionally, coding and intergenic sequences are being analyzed in order to reveal their evolutionary properties. We will present a combination of comprehensive computational and experimental analysis in an effort to improve and expand the coding and regulatory annotation of HC21 and study its evolutionary history.

 

P0601 

Transcriptome variations in Caco-2 cells during their differentiation : a model for studying intestinal iron absorption ? 

H. Ferran Bédrine 1, I. Gicquel 1, M. Lecunff 2, I. Guisle 2, N. Soriano 1, P. Fergelot 1, J. Y. Le Gall 1, J. Leger 2, J. Mosser 1;
1UMR 6061CNRS, Rennes, FRANCE, 2INSERM U533, Nantes, FRANCE. 

 

Iron homeostasis is controlled by intestinal absorption. Many factors implicated in this absorption have been recently characterized without explaining its adaptation to the iron store of the whole organism. This adapted absorption is probably due to an unknown sensibilization of intestinal crypts undifferentiated cells. These cells differentiate during their migration along the crypt-villosity axis and would become fully differentiated enterocytes with an appropriate iron absorption capacity. Caco-2 cells, a colon adenocarcinoma cell line, seem to be a good model for studying iron effects on gene expression during the enterocytic differentiation.
A transcriptomic approach has been implemented to identify genes involved in the regulation of iron absorption in the intestine : 1536 cDNA were selected by subtractive suppressive hybridization ( SSH ) and spotted on micro arrays. Their sequences reveal that each array represents 700 genes, half of them with an unknown function. mRNA from Caco-2 cells at different stages of differentiation were hybridised to the microarrays. These experiments show that 400 out of the 700 genes can be classified in seven distinct expression profiles that seem to correlate with functional role, most of iron metabolism genes being indeed overexpressed in differentiated cells. The effects of intracellular iron concentration on the expression of those genes are currently being studied. These observations should enable us to better understand the mechanisms underlying primary iron overload. (genetic heterogeneity, incomplete penetrance of the C282Y mutation of the HFE1 gene.)

 

P0602 

The use of multiple algorithms and a novel quality metric system for high throughput and accurate microsatellite- and SNP-based genotyping 

D. C. Siu, D. Bishop, T. Hatch, C. Frantz, Y. Lou, M. Rhodes, C. Brown;
Applied Biosystems, Foster City, CA. 

 

With the advent of automated fluorescent genotyping using medium-to-high throughput systems such as the ABI PRISM ® 3700 and 3100 Genetic Analyzers, a fast and accurate genotyping software tool is needed to perform accurate allele calling with minimal human interventions.
The ABI PRISM® GeneMapperÔ software, containing a multiple algorithm module and a novel quality metric system, was developed to meet the requirements of these automated, microsatellite- and SNP-based genotyping processes. In GeneMapperÔ software, an algorithm module containing multiple allele calling algorithms - size calling, binning, allele calling, has been developed to optimize the accuracy of final genotype assignments. Coupled with a novel Process Based Quality Value (PQV) system, which contains over 20 quality metrics to pinpoint the origins of genotyping failures, GeneMapper software enables users to choose to manually examine only allele calls with low quality scores. For projects that require frequent modifications of markers, a novel panel management feature has been developed to allow on-the-fly editing of bins within markers.
A collection of genotyping data has been processed using GeneScan® software, GenotyperÔ software and GeneMapperÔ software, and analysis results were compared. Allele calling accuracy and time required for complete analysis and data editing will be presented.

 

P0603 

Genomic Assays To Enable High Throughput Genomics 

M. D. Rhodes, S. Eddins, A. Dodge, K. Rogers, D. Dailey, P. Dong, D. Madden, D. Gilbert;
Applied Biosystems, Foster City, CA. 

 

The draft of the human genome enables scientific discoveries in many fields including gene expression and genotyping. The challenge is to make this wealth of genome information available to the laboratory. Applied Biosystems has launched two complementary product offerings that together provide a complete solution for ready to use assays for quantification of gene expression and genotyping via SNPs These assays use the 5‘ nuclease assay with Taqman® MGB probes. All assays are provided in an easy to use single tube format. Experimental setup is simple consisting of addition of two reagents (assay mix and universal master mix) to the Target DNA.
The Assays-by-Design SM service, will allow customers to submit a target sequence of interest. Applied Biosystems uses proprietary algorithms to design an assay for each target. Following synthesis and formulation, the assay will be analytically tested for integrity before shipping to the customer. The second offering, the Assays-on-Demand TM products, will provide ready-to-use assays to pre-defined targets which customers may select by, SNP or transcript ID, gene name or attribute. Genomic assays provided by both Both Assays-by-Design SM and Assays-on-Demand TM are provided in an easy to use single tube format. Experimental setup is minimized consisting of addition of two reagents (assay mix and universal master mix) to the Target DNA. The process is highly automatable and signal detection can be carried out unattended using the ABI Prism® 7900HT with robotic accessories
Details on the design, production and testing of these assays will be presented.