Background: Computing alignments between two or more sequences are common operations frequently performed in computational molecular biology. The continuing growth of biological sequence databases establishes the need for their efficient parallel implementation on modern accelerators. Results: This paper presents new approaches to high performance biological sequence database scanning with the Smith-Waterman algorithm and the first stage of progressive multiple sequence alignment based on the ClustalW heuristic on a Xeon Phi-based compute cluster. Our approach uses a three-level parallelization scheme to take full advantage of the compute power available on this type of architecture; i.e. cluster-level data parallelism, thread-level coarse-grained parallelism, and vector-level fine-grained parallelism. Furthermore, we re-organize the sequence datasets and use Xeon Phi shuffle operations to improve I/O efficiency. Conclusions: Evaluations show that our method achieves a peak overall performance up to 220 GCUPS for scanning real protein sequence databanks on a single node consisting of two Intel E5-2620 CPUs and two Intel Xeon Phi 7110P cards. It also exhibits good scalability in terms of sequence length and size, and number of compute nodes for both database scanning and multiple sequence alignment. Furthermore, the achieved performance is highly competitive in comparison to optimized Xeon Phi and GPU implementations. Our implementation is available at http://ift.tt/2a8p57s.
from #MedicinebyAlexandrosSfakianakis via xlomafota13 on Inoreader http://ift.tt/2acU2dI
via IFTTT
Αρχειοθήκη ιστολογίου
-
►
2020
(289)
- ► Φεβρουαρίου (28)
-
►
2019
(9071)
- ► Δεκεμβρίου (19)
- ► Σεπτεμβρίου (54)
- ► Φεβρουαρίου (3642)
- ► Ιανουαρίου (3200)
-
►
2018
(39872)
- ► Δεκεμβρίου (3318)
- ► Σεπτεμβρίου (3683)
- ► Φεβρουαρίου (2693)
- ► Ιανουαρίου (3198)
-
►
2017
(41099)
- ► Δεκεμβρίου (3127)
- ► Σεπτεμβρίου (2173)
-
▼
2016
(13807)
- ► Δεκεμβρίου (700)
- ► Σεπτεμβρίου (600)
-
▼
Ιουλίου
(1200)
-
▼
Ιουλ 19
(50)
- Sick leave and disability across three decades aft...
- Thrombocytosis in splenic trauma: in-hospital cour...
- Education, exposure and experience of prehospital ...
- Clinical significance of anterior humeral line in ...
- Safe and easy access technique for the first troca...
- Surface-enhanced Raman scattering
- Prevalence and risk factors of HCV infection in a ...
- Rotavirus vaccination and infection induce VP6-spe...
- Rapid protein immobilization for thin film continu...
- Host-guest binding motifs based on hyperbranched p...
- What is the effect of soft tissue thickness on cre...
- Teacher Quality and Learning Outcomes in Kindergarten
- Digitising the British Library’s Collection of Heb...
- Discrete element modelling of methane hydrate soil...
- Neuroprotective effects of heat shock proteins in ...
- Mechanical Properties of Plasterboards: Experiment...
- Monitoring and predicting actions and their conseq...
- Extracting human antibody sequences from public da...
- Exogenous Administration of Recombinant MIF at Phy...
- Out-of-plane seismic performance of plasterboard p...
- Co-dependence of the neural and humoral pathways i...
- Caffeoylquinic Acid Derivatives Extract of Erigero...
- New holostean fishes (Actinopterygii: Neopterygii)...
- TGF-β1 factor in the cerebrovascular diseases of A...
- The Comparison of Dietary Behaviors among Rural Co...
- Ερευνητικά νέαΥπερκινησίες και οι μύες της κατάποσ...
- Extremely Robust and Post-Functionalizable Gold Na...
- A method for exploring implicit concept relatednes...
- Does encoding matter? A novel view on the quantita...
- RefSelect: a reference sequence selection algorith...
- Integrating unified medical language system and as...
- Parallel algorithms for large-scale biological seq...
- Body Mass Changes Across a Variety of Running Race...
- Decitabine inhibits tumor cell proliferation and u...
- Epstein-barr virus strains and variations: Geograp...
- Acute viral respiratory infections among children ...
- Reactions in ultra-small droplets by tip-assisted ...
- Men at risk for paradoxical adipose hyperplasia af...
- Neural stem cells in lead toxicity
- Volumes of Cochlear Nucleus Regions in Rodents
- ACR Manual Version 2016 for Contrast Media: Summary
- Association of Living Arrangement Conditions and S...
- Esophageal Stent for Refractory Variceal Bleeding:...
- Delivery after Operation for Deeply Infiltrating E...
- Surface patterning of polyacrylamide gel using sca...
- Chromophore-immobilized luminescent metal-organic ...
- Awareness and attitude toward using dental magnifi...
- Mineralising and antibacterial effects of modified...
- Neural stem cells in lead toxicity
- Tunable Reactivity of Geminal Bis(silyl) Enol Deri...
-
▼
Ιουλ 19
(50)
- ► Φεβρουαρίου (1350)
- ► Ιανουαρίου (1400)
-
►
2015
(1500)
- ► Δεκεμβρίου (1450)
Ετικέτες
Τρίτη 19 Ιουλίου 2016
Parallel algorithms for large-scale biological sequence alignment on Xeon-Phi based clusters
Εγγραφή σε:
Σχόλια ανάρτησης (Atom)
Δεν υπάρχουν σχόλια:
Δημοσίευση σχολίου