Background: Given a set of biallelic molecular markers, such as SNPs, with genotype values encoded numerically on a collection of plant, animal or human samples, the goal of genetic trait prediction is to predict the quantitative trait values by simultaneously modeling all marker effects. Genetic trait prediction is usually represented as linear regression models which require quantitative encodings for the genotypes. There are lots of work on the prediction algorithms, but none of the existing work investigated the effects of the encodings on the genetic trait prediction problem. Methods: In this work, we view the genetic trait prediction problem from a novel angle: a multiple regression on categorical data problem, which requires encoding the categorical data into numerical data. We further proposed two novel encoding methods and we show that they are able to generate numerical features with higher predictive power.Results and DiscussionOur experiments show that our methods are superior to the other encoding methods for both single marker model and epistasis model. We showed that the quantitative genetic trait prediction problem heavily depends on the encoding of genotypes, for both single marker model and epistasis model. Conclusions: We conducted a detailed analysis on the performance of the hybrid encodings. To our knowledge, this is the first work that discusses the effects of encodings for genetic trait prediction problem.
from #MedicinebyAlexandrosSfakianakis via xlomafota13 on Inoreader http://ift.tt/2a8oX7R
via IFTTT
Αρχειοθήκη ιστολογίου
-
►
2020
(289)
- ► Φεβρουαρίου (28)
-
►
2019
(9071)
- ► Δεκεμβρίου (19)
- ► Σεπτεμβρίου (54)
- ► Φεβρουαρίου (3642)
- ► Ιανουαρίου (3200)
-
►
2018
(39872)
- ► Δεκεμβρίου (3318)
- ► Σεπτεμβρίου (3683)
- ► Φεβρουαρίου (2693)
- ► Ιανουαρίου (3198)
-
►
2017
(41099)
- ► Δεκεμβρίου (3127)
- ► Σεπτεμβρίου (2173)
-
▼
2016
(13807)
- ► Δεκεμβρίου (700)
- ► Σεπτεμβρίου (600)
-
▼
Ιουλίου
(1200)
-
▼
Ιουλ 19
(50)
- Sick leave and disability across three decades aft...
- Thrombocytosis in splenic trauma: in-hospital cour...
- Education, exposure and experience of prehospital ...
- Clinical significance of anterior humeral line in ...
- Safe and easy access technique for the first troca...
- Surface-enhanced Raman scattering
- Prevalence and risk factors of HCV infection in a ...
- Rotavirus vaccination and infection induce VP6-spe...
- Rapid protein immobilization for thin film continu...
- Host-guest binding motifs based on hyperbranched p...
- What is the effect of soft tissue thickness on cre...
- Teacher Quality and Learning Outcomes in Kindergarten
- Digitising the British Library’s Collection of Heb...
- Discrete element modelling of methane hydrate soil...
- Neuroprotective effects of heat shock proteins in ...
- Mechanical Properties of Plasterboards: Experiment...
- Monitoring and predicting actions and their conseq...
- Extracting human antibody sequences from public da...
- Exogenous Administration of Recombinant MIF at Phy...
- Out-of-plane seismic performance of plasterboard p...
- Co-dependence of the neural and humoral pathways i...
- Caffeoylquinic Acid Derivatives Extract of Erigero...
- New holostean fishes (Actinopterygii: Neopterygii)...
- TGF-β1 factor in the cerebrovascular diseases of A...
- The Comparison of Dietary Behaviors among Rural Co...
- Ερευνητικά νέαΥπερκινησίες και οι μύες της κατάποσ...
- Extremely Robust and Post-Functionalizable Gold Na...
- A method for exploring implicit concept relatednes...
- Does encoding matter? A novel view on the quantita...
- RefSelect: a reference sequence selection algorith...
- Integrating unified medical language system and as...
- Parallel algorithms for large-scale biological seq...
- Body Mass Changes Across a Variety of Running Race...
- Decitabine inhibits tumor cell proliferation and u...
- Epstein-barr virus strains and variations: Geograp...
- Acute viral respiratory infections among children ...
- Reactions in ultra-small droplets by tip-assisted ...
- Men at risk for paradoxical adipose hyperplasia af...
- Neural stem cells in lead toxicity
- Volumes of Cochlear Nucleus Regions in Rodents
- ACR Manual Version 2016 for Contrast Media: Summary
- Association of Living Arrangement Conditions and S...
- Esophageal Stent for Refractory Variceal Bleeding:...
- Delivery after Operation for Deeply Infiltrating E...
- Surface patterning of polyacrylamide gel using sca...
- Chromophore-immobilized luminescent metal-organic ...
- Awareness and attitude toward using dental magnifi...
- Mineralising and antibacterial effects of modified...
- Neural stem cells in lead toxicity
- Tunable Reactivity of Geminal Bis(silyl) Enol Deri...
-
▼
Ιουλ 19
(50)
- ► Φεβρουαρίου (1350)
- ► Ιανουαρίου (1400)
-
►
2015
(1500)
- ► Δεκεμβρίου (1450)
Ετικέτες
Τρίτη 19 Ιουλίου 2016
Does encoding matter? A novel view on the quantitative genetic trait prediction problem
Εγγραφή σε:
Σχόλια ανάρτησης (Atom)
Δεν υπάρχουν σχόλια:
Δημοσίευση σχολίου