Publication:
Shrinking a large dataset to identify variables associated with increased risk of Plasmodium falciparum infection in Western Kenya

dc.contributor.author: TREMBLAY, M.
dc.contributor.author: DAHM, J. S.
dc.contributor.author: WAMAE, C. N.
dc.contributor.author: GLANVILLE, W. A. DE
dc.contributor.author: FÈVRE, E. M.
dc.contributor.author: DÖPFER, D.
dc.date.accessioned: 2018-02-13T10:08:03Z
dc.date.available: 2018-02-13T10:08:03Z
dc.date.issued: 2015-04-16
dc.description.abstract: Large datasets are often not amenable to analysis using traditional single-step approaches. Here, our general objective was to apply imputation techniques, principal component analysis (PCA), elastic net and generalized linear models to a large dataset in a systematic approach to extract the most meaningful predictors for a health outcome. We extracted predictors for Plasmodium falciparum infection from a large covariate dataset with a limited number of observations, using data from the People, Animals, and their Zoonoses (PAZ) project to demonstrate these techniques. The data, collected from 415 homesteads in western Kenya, contained over 1500 variables that describe the health, environment, and social factors of the humans, livestock, and the homesteads in which they reside. The wide, sparse dataset was simplified to 42 predictors of P. falciparum malaria infection, and wealth rankings were produced for all homesteads. The 42 predictors make biological sense and are supported by previous studies. The systematic data-mining approach we used would make many large datasets more manageable and informative for decision-making processes and health policy prioritization. [en_US]
dc.identifier.uri: http://erepository.mku.ac.ke/handle/123456789/5550
dc.language.iso: en_US [en_US]
dc.publisher: Cambridge University Press [en_US]
dc.subject: Cattle, data mining, Kenya, malaria, zoonotic diseases [en_US]
dc.title: Shrinking a large dataset to identify variables associated with increased risk of Plasmodium falciparum infection in Western Kenya [en_US]
dc.type: Article [en_US]
dspace.entity.type: Publication

Files

Original bundle

Name: Article_Shriking large dataset.pdf
Size: 111 KB
Format: Adobe Portable Document Format
Description: Full-text

License bundle

Name: license.txt
Size: 1.71 KB
Format: Item-specific license agreed upon to submission
Description:
