23
Computational resources to facilitate variant discovery and analysis

Tuesday, March 18, 2014: 4:10 PM
308-309 (Community Choice Credit Union Convention Center)
James M Reecy , Iowa State University, Ames, IA
C. Baes , Qualitas AG, Zug, Switzerland
E. Fritz-Waters , Iowa State University, Ames, IA
James E. Koltes , Iowa State University, Ames, IA
M. Dolezal , Università degli Studi di Milano, Milano, Italy
B. Bapst , Qualitas AG, Zug, Switzerland
C. Flury , Bern University of Applied Sciences, Zollikofen, Switzerland
H. Signer-Hasler , Bern University of Applied Sciences, Zollikofen, Switzerland
C. Stricker , agn Genetics GmbH, Davos, Switzerland
Rohan L Fernando , Iowa State University, Ames, IA
Dorian J. Garrick , Iowa State University, Ames, IA
Fritz Schmitz-Hsu , Swissgenetics, Zollikofen, Switzerland
B. Gredler , Bern University of Applied Sciences, Zollikofen, Switzerland
Matt Vaugh , Texas Advance Computing Center, University of Texas, Austin, TX
Abstract Text:

Next generation sequencing has facilitated the sequencing of large numbers of individuals for variant detection. A challenge facing the livestock industry is the establishment of efficient workflows to process raw sequence data to accurate variant data such that implementation of whole genome sequence information in livestock breeding programs can be accomplished. Within the iPlant infrastructure, we have implemented currently available variant calling techniques and have applied them to a dairy cattle data set.  The Burrows-Wheeler aligner (BWA) was used to align paired end Illumina reads from 66 bulls to the bos taurus UMD3.1 reference assembly. A pipeline, integrating data preparation, insertion and deletion realignment, base quality score recalibration and variant calling, was created to manage variant calling comparisons. Variant calling was done using three different calling methods: UnifiedGenotyper of the Genome Analysis Tool Kit, SAMTools Mpileup and Platypus. A total of nine different options were compared with these methods.  Significant variation in the ability of the different methods calling of variants was observed. Furthermore, tremendous variation in the concordance of variant calling and SNPchip genotyping were observed as well.  These results indicate that a more detailed analysis and/or the use of a combination of callers is necessary if the livestock industry is to fully utilize genome re-sequencing information to improve livestock breeding.

The authors gratefully acknowledge financial support from the Swiss Cattle Breeders Association (ASR) and the Swiss Commission for Technology and Innovation (CTI), and USDA-NIFA project 2013-01001.

Keywords: next generation sequencing