Log on / register
BioMed Central home | Journals A-Z | Feedback | Support | My details
Open AccessMethodology

LD-Spline: Mapping SNPs on genotyping platforms to genomic regions using patterns of linkage disequilibrium

William S Bush1 email, Guanhua Chen2 email, Eric S Torstenson1 email and Marylyn D Ritchie1 email

Center for Human Genetics Research, Department of Molecular Physiology and Biophysics, Vanderbilt University, Nashville, TN, USA

Department of Biostatistics, University of North Carolina, Chapel Hill, NC, USA

author email corresponding author email

BioData Mining 2009, 2:7doi:10.1186/1756-0381-2-7

Published: 3 December 2009

Abstract

Background

Gene-centric analysis tools for genome-wide association study data are being developed both to annotate single locus statistics and to prioritize or group single nucleotide polymorphisms (SNPs) prior to analysis. These approaches require knowledge about the relationships between SNPs on a genotyping platform and genes in the human genome. SNPs in the genome can represent broader genomic regions via linkage disequilibrium (LD), and population-specific patterns of LD can be exploited to generate a data-driven map of SNPs to genes.

Methods

In this study, we implemented LD-Spline, a database routine that defines the genomic boundaries a particular SNP represents using linkage disequilibrium statistics from the International HapMap Project. We compared the LD-Spline haplotype block partitioning approach to that of the four gamete rule and the Gabriel et al. approach using simulated data; in addition, we processed two commonly used genome-wide association study platforms.

Results

We illustrate that LD-Spline performs comparably to the four-gamete rule and the Gabriel et al. approach; however as a SNP-centric approach LD-Spline has the added benefit of systematically identifying a genomic boundary for each SNP, where the global block partitioning approaches may falter due to sampling variation in LD statistics.

Conclusion

LD-Spline is an integrated database routine that quickly and effectively defines the genomic region marked by a SNP using linkage disequilibrium, with a SNP-centric block definition algorithm.


© 1999-2010 BioMed Central Ltd unless otherwise stated. Part of Springer Science+Business Media.