GPU based Eulerian Assembly of Genomes

dc.contributor.advisorRangwala, Huzefa
dc.contributor.authorMahmood, Syed Faraz
dc.creatorMahmood, Syed Faraz
dc.date2012-12-04
dc.date.accessioned2013-02-18T17:32:09Z
dc.date.availableNO_RESTRICTION
dc.date.available2013-02-18T17:32:09Z
dc.date.issued2013-02-18
dc.description.abstractAdvances in sequencing technologies have revolutionized the eld of genomics by providing cost e ective and high throughput solutions. In this paper, we develop a parallel sequence assembler implemented on general purpose graphic processor units (GPUs). Our work was largely motivated by a growing need in the genomic community for sequence assemblers and increasing use of GPUs for general purpose computing applications. We investigated the implementation challenges, and possible solutions for a data parallel approach for sequence assembly. We implemented an Eulerian-based sequence assembler (GPU-Euler) on the nVidia GPUs using the CUDA programming interface. GPU-Euler was benchmarked on three bacterial genomes using input reads representing the new generation of sequencing approaches. Our empirical evaluation showed that GPU-Euler produced lower run times, and comparable performance in terms of contig length statistics to other serial assemblers. We were able to demonstrate the promise of using GPUs for genome assembly, a computationally intensive task. An error correction step was also incorporated into GPU-Euler to be able to process reads containing some errors. Error correction output was benchmarked on simulated read on three bacterial genomes with different read length.
dc.identifier.urihttps://hdl.handle.net/1920/8013
dc.language.isoen
dc.subjectSequence assembly
dc.subjectEuler Tour
dc.subjectGPU
dc.subjectError correction
dc.subjectCUDA
dc.subjectSpectral alignment
dc.titleGPU based Eulerian Assembly of Genomes
dc.typeThesis
thesis.degree.disciplineComputer Science
thesis.degree.grantorGeorge Mason University
thesis.degree.levelMaster's
thesis.degree.nameMaster of Science in Computer Science

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Mahmood_thesis_2012.pdf
Size:
3.41 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.65 KB
Format:
Item-specific license agreed upon to submission
Description: