Genomics big data hybrid depositories architecture to unlock precision medicine: a conceptual framework

Ummul H. Mohamad; Mohamad T. Ijab; Rabiah A. Kadir

doi:10.14419/ijet.v7i4.16893

Authors

Ummul H. Mohamad
INSTITUTE OF VISUAL INFORMATICS, UNIVERSITI KEBANGSAAN MALAYSIA
Mohamad T. Ijab
INSTITUTE OF VISUAL INFORMATICS, UNIVERSITI KEBANGSAAN MALAYSIA
Rabiah A. Kadir
INSTITUTE OF VISUAL INFORMATICS, UNIVERSITI KEBANGSAAN MALAYSIA

Received date: August 5, 2018

Accepted date: September 18, 2018

Published date: September 24, 2018

DOI:

https://doi.org/10.14419/ijet.v7i4.16893

Keywords:

Architecture Design of Hybrid Depositories, Data Driven Genomics, Personalized Medicine Framework.

Abstract

As the genome sequencing cost becomes more affordable, genomics studies are extensively carried out to empower the ultimate healthcare goal which is the precision medicine. By tailoring each individual medical treatment through precision medicine, it will potentially lead to nearly zero occurrence of the drugs side effects and treatment complications. Unfortunately, the complexity of the genomics data has been one of the bottlenecks that deter the advances of healthcare practices towards precision medicine. Therefore, based on the extensive literature review on the data driven genomics challenges towards precision medicine, this paper proposes two new contributions to the field; the conceptual framework for the genomics-based precision medicine and the architectural design for the development of hybrid depositories as the initial step to bridge the gap towards precision medicine. The genomics big data hybrid depositories architecture design is composed of few components; storage layer and service layer interconnected system such as visualization, data protection modeling, event processing engine and decision support, to carry out their purpose of merging the genomics data with the healthcare data.
Â

References

[1] J. Jameson and D. Longo, â€œPrecision medicineâ€”personalized, problematic, and promising,â€ Obstet. Gynecol. Surv., vol. 70, no. 10, pp. 612â€“614, 2015. https://doi.org/10.1097/01.ogx.0000472121.21647.38.
[2] E. A. Ashley, â€œThe precision medicine initiative: a new national effort,â€ Jama, vol. 313, no. 21, pp. 2119â€“2120, 2015. https://doi.org/10.1001/jama.2015.3595.
[3] I. Ezkurdia et al., â€œMultiple evidence strands suggest that there may be as few as 19 000 human protein-coding genes,â€ Hum. Mol. Genet., vol. 23, no. 22, pp. 5866â€“5878, 2014. https://doi.org/10.1093/hmg/ddu309.
[4] M. Grossglauser and H. Saner, â€œData-driven healthcare: from patterns to actions,â€ Eur. J. Prev. Cardiol., vol. 21, no. 2_suppl, pp. 14â€“17, Nov. 2014.
[5] G. Mendel, â€œMendelâ€™s Journey from Peas to Petabytes,â€ Biol. Imagin. Innov. Biosci., p. 121, 2014.
[6] A. Oâ€™Driscoll, J. Daugelaite, and R. Sleator, â€œâ€˜Big dataâ€™, Hadoop and cloud computing in genomics,â€ J. Biomed. Inform., vol. 46, no. 5, pp. 774â€“781, 2013. https://doi.org/10.1016/j.jbi.2013.07.001.
[7] T. A. Peterson, E. Doughty, and M. G. Kann, â€œTowards precision medicine: advances in computational approaches for the analysis of human variants,â€ J. Mol. Biol., vol. 425, no. 21, pp. 4047â€“4063, 2013. https://doi.org/10.1016/j.jmb.2013.08.008.
[8] S. Zhao et al., â€œRainbow: a tool for large-scale whole-genome sequencing data analysis using cloud computing,â€ BMC Genomics, vol. 14, no. 1, p. 425, 2013. https://doi.org/10.1186/1471-2164-14-425.
[9] M. Chen, S. Mao, and Y. Liu, â€œBig data: A survey,â€ Mob. Networks Appl., vol. 19, no. 2, pp. 171â€“209, 2014. https://doi.org/10.1007/s11036-013-0489-0.
[10] Z. D. Stephens et al., â€œBig Data: Astronomical or Genomical?,â€ PLoS Biol., vol. 13, no. 7, p. e1002195, Jul. 2015. https://doi.org/10.1371/journal.pbio.1002195.
[11] M. Viceconti, P. Hunter, and R. Hose, â€œBig data, big knowledge: big data for personalized healthcare,â€ IEEE J. Biomed. Heal. Informatics, vol. 19, no. 4, pp. 1209â€“1215, 2015. https://doi.org/10.1109/JBHI.2015.2406883.
[12] J. Andreu-Perez, C. C. Y. Poon, R. D. Merrifield, S. T. C. Wong, and G.-Z. Yang, â€œBig data for health,â€ IEEE J. Biomed. Heal. informatics, vol. 19, no. 4, pp. 1193â€“1208, 2015.
[13] F. S. Collins and V. A. McKusick, â€œImplications of the Human Genome Project for medical science,â€ Jama, vol. 285, no. 5, pp. 540â€“544, 2001. https://doi.org/10.1001/jama.285.5.540.
[14] K. Offit, â€œPersonalized medicine: new genomics, old lessons,â€ Hum. Genet., vol. 130, no. 1, pp. 3â€“14, 2011. https://doi.org/10.1007/s00439-011-1028-3.
[15] P. Muir, S. Li, S. Lou, and D. Wang, â€œThe real cost of sequencing: scaling computation to keep pace with data generation,â€ Genome, vol. 17, no. 1, p. 53, 2016.
[16] M. H.-Y. Fritz, R. Leinonen, G. Cochrane, and E. Birney, â€œEfficient storage of high throughput DNA sequencing data using reference-based compression,â€ Genome Res., vol. 21, no. 5, pp. 734â€“740, 2011. https://doi.org/10.1101/gr.114819.110.
[17] N. Khan et al., â€œBig data: survey, technologies, opportunities, and challenges,â€ Sci. World J., vol. 2014, 2014.
[18] N. S. Mauthner and O. Parry, â€œOpen Access Digital Data Sharing: Principles, Policies and Practicesâ˜†,â€ Soc. Epistemol., vol. 27, no. 1, pp. 47â€“67, 2013. https://doi.org/10.1080/02691728.2012.760663.
[19] J. L. Jennings and T. J. Hudson, â€œAbstract 130: International Cancer Genome Consortium (ICGC),â€ Cancer Res., vol. 76, p. 130, 2016. https://doi.org/10.1158/1538-7445.AM2016-130.
[20] V. Marx, â€œBiology: The big challenges of big data,â€ Nature, p. 255, 2013. https://doi.org/10.1038/498255a.
[21] E. S. Dove, Y. Joly, and A. TassÃ©, â€œGenomic cloud computing: legal and ethical points to consider,â€ Eur. J. Hum. Genet., vol. 23, no. 10, pp. 1271â€“1278, 2015. https://doi.org/10.1038/ejhg.2014.196.
[22] S. Kaisler, F. Armour, J. A. Espinosa, and W. Money, â€œBig data: Issues and challenges moving forward,â€ in System Sciences (HICSS), 2013 46th Hawaii International Conference on, 2013, pp. 995â€“1004.
[23] N. Levin, R. M. Salek, and C. Steinbeck, â€œFrom Databases to Big Data,â€ Metab. Phenotyping Pers. Public Healthc., p. 317, 2016.
[24] S. Choudhury, J. R. Fishman, M. L. McGowan, and E. T. Juengst, â€œBig data, open science and the brain: lessons learned from genomics,â€ Front. Hum. Neurosci., vol. 8, 2014.
[25] D. Kim, S. Song, and B.-Y. Choi, â€œIntroduction,â€ in Data Deduplication for Data Optimization for Storage and Network Systems, Springer, 2017, pp. 3â€“21. https://doi.org/10.1007/978-3-319-42280-0_1.
[26] D. Kim, S. Song, and B.-Y. Choi, â€œExisting Deduplication Techniques,â€ in Data Deduplication for Data Optimization for Storage and Network Systems, Springer, 2017, pp. 23â€“76. https://doi.org/10.1007/978-3-319-42280-0_2.
[27] H. H. Do, J. Jansson, K. Sadakane, and W.-K. Sung, â€œFast relative Lempelâ€“Ziv self-index for similar sequences,â€ Theor. Comput. Sci., vol. 532, pp. 14â€“30, 2014. https://doi.org/10.1016/j.tcs.2013.07.024.
[28] S. Deorowicz, A. Danek, and M. Niemiec, â€œGDC 2: Compression of large collections of genomes.,â€ Sci. Rep., vol. 5, p. 11565, Jun. 2015. https://doi.org/10.1038/srep11565.
[29] W. Christopher and M. Simon, â€œReview on Genomics APIs,â€ Comput. Struct. Biotechnol. J., 2016.
[30] E. Wang, N. Zaman, S. Mcgee, J.-S. Milanese, A. Masoudi-Nejad, and M. Oâ€™Connor-McCourt, â€œPredictive genomics: a cancer hallmark network framework for predicting tumor clinical phenotypes using genome sequencing data,â€ in Seminars in cancer biology, 2015, vol. 30, pp. 4â€“12. https://doi.org/10.1016/j.semcancer.2014.04.002.
[31] N. Tung, C. Battelli, B. Allen, R. Kaldate, and S. Bhatnagar, â€œFrequency of mutations in individuals with breast cancer referred for BRCA1 and BRCA2 testing using nextâ€generation sequencing with a 25â€gene panel,â€ Cancer, vol. 121, no. 1, pp. 25â€“33, 2015. https://doi.org/10.1002/cncr.29010.
[32] T. Cooke, J. Reeves, A. Lanigan, and P. Stanton, â€œHER2 as a prognostic and predictive marker for breast cancer,â€ Ann. Oncol., pp. 23â€“28, 2001. https://doi.org/10.1093/annonc/12.suppl_1.S23.
[33] M. West, G. S. Ginsburg, A. T. Huang, and J. R. Nevins, â€œEmbracing the complexity of genomic data for personalized medicine,â€ Genome Res., vol. 16, no. 5, pp. 559â€“566, 2006. https://doi.org/10.1101/gr.3851306.
[34] L. Chin, W. C. Hahn, G. Getz, and M. Meyerson, â€œMaking sense of cancer genomic data,â€ Genes Dev., vol. 25, no. 6, pp. 534â€“555, 2011. https://doi.org/10.1101/gad.2017311.
[35] J. G. Dunn and J. S. Weissman, â€œPlastid: nucleotide-resolution analysis of next-generation sequencing and genomics data,â€ BMC Genomics, vol. 17, no. 1, p. 958, 2016. https://doi.org/10.1186/s12864-016-3278-x.
[36] D. C. Koboldt, K. M. Steinberg, D. E. Larson, R. K. Wilson, and E. R. Mardis, â€œThe next-generation sequencing revolution and its impact on genomics,â€ Cell, vol. 155, no. 1, pp. 27â€“38, 2013. https://doi.org/10.1016/j.cell.2013.09.006.
[37] C. Castaneda et al., â€œClinical decision support systems for improving diagnostic accuracy and achieving precision medicine,â€ J. Clin. Bioinforma., vol. 5, no. 1, p. 4, 2015. https://doi.org/10.1186/s13336-015-0019-3.
[38] L. Schriml, C. Arze, S. Nadendla, and Y. Chang, â€œDisease Ontology: a backbone for disease semantic integration,â€ academia.edu, vol. 40, no. D1, pp. 910â€“946, 2011.
[39] D. Gomez-Cabrero et al., â€œData integration in the era of omics: current and future challenges,â€ BMC Syst. Biol., vol. 8, no. 2, p. I1, 2014. https://doi.org/10.1186/1752-0509-8-S2-I1.
[40] G. O. Consortium, â€œExpansion of the Gene Ontology knowledgebase and resources,â€ Nucleic Acids Res., vol. 45, no. D1, pp. 331â€“338, 2017. https://doi.org/10.1093/nar/gkw1108.
[41] M. Subhani, A. Anjum, and A. Koop, â€œClinical and genomics data integration using meta-dimensional approach,â€ Proc. 9th, pp. 416â€“421, 2016.
[42] B. Louie, P. Mork, F. Martin-Sanchez, and A. Halevy, â€œData integration and genomic medicine,â€ J. Biomed. Inform., vol. 40, no. 1, pp. 5â€“16, 2007. https://doi.org/10.1016/j.jbi.2006.02.007.
[43] P. Appleby, â€œLinking Genomic Data with Phenotypes Derived from Electronic Health Records.,â€ Int. J. Popul. Data Sci., vol. 1, no. 1, 2017.
[44] M. D. Ritchie, M. De Andrade, and H. Kuivaniemi, â€œThe foundation of precision medicine: integration of electronic health records with genomics through basic, clinical, and translational research,â€ Front. Genet., vol. 6, 2015.
[45] P. Khatri and S. DrÄƒghici, â€œOntological analysis of gene expression data: current tools, limitations, and open problems,â€ Bioinformatics, vol. 21, no. 18, pp. 3587â€“3595, 2005. https://doi.org/10.1093/bioinformatics/bti565.
[46] S. Palaniappan and N. Y. Huey, â€œA tool for healthcare information integration,â€ J. ICT, vol. 5, pp. 29â€“44, 2006.
[47] M. Dugas, A. Meidt, P. Neuhaus, M. Storck, and J. Varghese, â€œODMedit: uniform semantic annotation for data integration in medicine based on a public metadata repository.,â€ BMC Med. Res. Methodol., vol. 16, p. 65, 2016. https://doi.org/10.1186/s12874-016-0164-9.
[48] J. MarÃ©s et al., â€œp-medicine: A medical informatics platform for integrated large scale heterogeneous patient data,â€ in AMIA Annual Symposium Proceedings, 2014, vol. 2014, p. 872.
[49] F. Schera, G. Weiler, E. Neri, S. Kiefer, and N. Graf, â€œThe p-medicine portalâ€”a collaboration platform for research in personalised medicine,â€ Ecancermedicalscience, vol. 8, 2014.
[50] A. Alyass, M. Turcotte, and D. Meyre, â€œFrom big data analysis to personalized medicine for all: challenges and opportunities,â€ BMC Med. Genomics, vol. 8, no. 1, p. 33, 2015. https://doi.org/10.1186/s12920-015-0108-y.
[51] F. Cheng, J. Zhao, and Z. Zhao, â€œAdvances in computational approaches for prioritizing driver mutations and significantly mutated genes in cancer genomes,â€ Brief. Bioinform., vol. 17, no. 4, pp. 642â€“656, Jul. 2016. https://doi.org/10.1093/bib/bbv068.
[52] J. Howison and J. Bullard, â€œSoftware in the scientific literature: Problems with seeing, finding, and using software mentioned in the biology literature,â€ J. Assoc. Inf. Sci. Technol., vol. 67, no. 9, pp. 2137â€“2155, 2016. https://doi.org/10.1002/asi.23538.
[53] S. Goodwin, J. D. McPherson, and W. R. McCombie, â€œComing of age: ten years of next-generation sequencing technologies,â€ Nat. Rev. Genet., vol. 17, no. 6, pp. 333â€“351, 2016. https://doi.org/10.1038/nrg.2016.49.
[54] M.-A. Madoui et al., â€œGenome assembly using Nanopore-guided long and error-free DNA reads,â€ BMC Genomics, vol. 16, no. 1, p. 327, 2015. https://doi.org/10.1186/s12864-015-1519-z.
[55] T. Madden, â€œThe BLAST sequence analysis tool,â€ 2013.
[56] R. Wilton, T. Budavari, B. Langmead, S. J. Wheelan, S. L. Salzberg, and A. S. Szalay, â€œArioc: high-throughput read alignment with GPU-accelerated exploration of the seed-and-extend search space,â€ PeerJ, vol. 3, p. e808, 2015. https://doi.org/10.7717/peerj.808.
[57] F. E. Faisal, L. Meng, J. Crawford, and T. MilenkoviÄ‡, â€œThe post-genomic era of biological network alignment,â€ EURASIP J. Bioinforma. Syst. Biol., vol. 2015, no. 1, p. 3, 2015. https://doi.org/10.1186/s13637-015-0022-9.
[58] R. Margolis, L. Derr, M. Dunn, and M. Huerta, â€œThe National Institutes of Healthâ€™s Big Data to Knowledge (BD2K) initiative: capitalizing on biomedical big data,â€ J. Am. Med. Informatics Assoc., vol. 21, no. 6, pp. 957â€“958, 2014. https://doi.org/10.1136/amiajnl-2014-002974.
[59] T. Barreto, A. Mand, M. Spielberg, D. MacKenzie, and S. Ghods, â€œManaging updates at clients used by a user to access a cloud-based collaboration service.â€ Google Patents, 21-Apr-2015.
[60] T. Takai-Igarashi et al., â€œSecurity controls in an integrated Biobank to protect privacy in data sharing: rationale and study design,â€ BMC Med. Inform. Decis. Mak., vol. 17, no. 1, p. 100, 2017. https://doi.org/10.1186/s12911-017-0494-5.
[61] E. S. Dove, â€œBiobanks, Data Sharing, and the Drive for a Global Privacy Governance Framework,â€ J. Law, Med. Ethics, vol. 43, no. 4, 2015.
[62] F. Carrasco-Ramiro, R. PeirÃ³-Pastor, and B. Aguado, â€œHuman genomics projects and precision medicine,â€ Gene Ther., vol. 24, no. 9, p. 551, 2017. https://doi.org/10.1038/gt.2017.77.
[63] T. Schultz, â€œTurning healthcare challenges into big data opportunities: A useâ€case review across the pharmaceutical development lifecycle,â€ Bull. Assoc. Inf. Sci. Technol., vol. 39, no. 5, pp. 34â€“40, 2013. https://doi.org/10.1002/bult.2013.1720390508.
[64] J. Luo, M. Wu, D. Gopukumar, and Y. Zhao, â€œBig data application in biomedical research and health care: A literature review,â€ Biomed. Inform. Insights, vol. 8, p. 1, 2016. https://doi.org/10.4137/BII.S31559.
[65] A. Alzuâ€™bi, L. Zhou, and V. Watzlaf, â€œPersonal genomic information management and personalized medicine: challenges, current solutions, and roles of HIM professionals.,â€ Perspect. Heal. Inf. Manag., vol. 11, no. Spring, p. 1c, 2014.
[66] M. Beck, V. Haupt, J. Roy, J. Moennich, and R. JÃ¤kel, Genecloud: Secure cloud computing for biomedical research. Springer, Cham., 2014.
[67] M. D. AssunÃ§Ã£o, R. N. Calheiros, S. Bianchi, M. A. S. Netto, and R. Buyya, â€œBig Data computing and clouds: Trends and future directions,â€ J. Parallel Distrib. Comput., vol. 79, pp. 3â€“15, 2015. https://doi.org/10.1016/j.jpdc.2014.08.003.
[68] A. P. Heath et al., â€œBionimbus: a cloud for managing, analyzing and sharing large genomics datasets,â€ J. Am. Med. Informatics Assoc., vol. 21, no. 6, pp. 969â€“975, Nov. 2014. https://doi.org/10.1136/amiajnl-2013-002155.
[69] S. Datta, K. Bettinger, and M. Snyder, â€œPractical Guidelines for Secure Cloud Computing using Genomic Data,â€ bioRxiv, p. 34876, 2015.
[70] Q. Jiang, M. K. Khan, X. Lu, J. Ma, and D. He, â€œA privacy preserving three-factor authentication protocol for e-Health clouds,â€ J. Supercomput., vol. 72, no. 10, pp. 3826â€“3849, 2016. https://doi.org/10.1007/s11227-015-1610-x.
[71] A. Park et al., â€œThe Blockchain for Personalized Medicine,â€ 2017.
[72] Z. Shae and J. J. P. Tsai, â€œOn the Design of a Blockchain Platform for Clinical Trial and Precision Medicine,â€ in Distributed Computing Systems (ICDCS), 2017 IEEE 37th International Conference on, 2017, pp. 1972â€“1980.
[73] D. Milius et al., â€œThe International Cancer Genome Consortiumâ€™s evolving data-protection policies,â€ Nat. Biotechnol., vol. 32, no. 6, pp. 519â€“523, 2014. https://doi.org/10.1038/nbt.2926.
[74] R. C. Green, D. Lautenbach, and A. L. McGuire, â€œGINA, genetic discrimination, and genomic medicine,â€ N. Engl. J. Med., vol. 372, no. 5, pp. 397â€“399, 2015. https://doi.org/10.1056/NEJMp1404776.
[75] C. Auffray et al., â€œMaking sense of big data in health research: towards an EU action plan,â€ Genome Med., vol. 8, no. 1, p. 71, 2016. https://doi.org/10.1186/s13073-016-0323-y.
[76] U. H. Mohamad, M. T. Ijab, and R. A. Kadir, â€œBridging the Gap in Personalised Medicine Through Data Driven Genomics,â€ in International Visual Informatics Conference, 2017, pp. 88â€“99. https://doi.org/10.1007/978-3-319-70010-6_9.
[77] A. Shachak, K. Shuval, and S. Fine, â€œBarriers and enablers to the acceptance of bioinformatics tools: a qualitative study,â€ J. Med. Libr. Assoc. JMLA, vol. 95, no. 4, p. 454, 2007. https://doi.org/10.3163/1536-5050.95.4.454.
[78] L. Samuel, â€œDrug dosing goes digital with new algorithm,â€ Stat, 2016. [Online]. Available: https://www.statnews.com/2016/04/06/tailoring-dosages-patients/. [Accessed: 19-Jan-2018].
[79] L. Wang, R. Ranjan, J. Kolodziej, A. Y. Zomaya, and L. Alem, â€œSoftware Tools and Techniques for Big Data Computing in Healthcare Clouds.,â€ Futur. Gener. Comp. Syst., vol. 43, pp. 38â€“39, 2015. https://doi.org/10.1016/j.future.2014.11.001.
[80] S. Wilson et al., â€œDeveloping Cancer Informatics Applications and Tools Using the NCI Genomic Data Commons API,â€ Cancer Res., vol. 77, no. 21, pp. e15â€“e18, 2017. https://doi.org/10.1158/0008-5472.CAN-17-0598.
[81] I. V Hinkson, T. M. Davidsen, J. D. Klemm, I. Chandramouliswaran, A. R. Kerlavage, and W. A. Kibbe, â€œA Comprehensive Infrastructure for Big Data in Cancer Research: Accelerating Cancer Research and Precision Medicine,â€ Frontiers in Cell and Developmental Biology, vol. 5. p. 83, 2017. https://doi.org/10.3389/fcell.2017.00083.
[82] A. Bisnajak, â€œThe Bio-Nespresso Project: The design of a small-scale manufacturing unit for personalized medicine production,â€ 2018.
[83] A. B. of Directors, â€œLaboratory and clinical genomic data sharing is crucial to improving genetic health care: a position statement of the American College of Medical Genetics,â€ Genet. Med., 2017.
[84] A. C. Resnick et al., â€œAbstract LB-008: The Pediatric Brain Tumor Atlas: building an integrated, multi-platform data-rich ecosystem for collaborative discovery in the cloud.â€ AACR, 2017.
[85] E. R. Hsu, J. D. Klemm, A. R. Kerlavage, D. Kusnezov, and W. A. Kibbe, â€œCancer Moonshot Data and Technology Team: Enabling a National Learning Healthcare System for Cancer to Unleash the Power of Data,â€ Clin. Pharmacol. Ther., vol. 101, no. 5, pp. 613â€“615, 2017. https://doi.org/10.1002/cpt.636.
[86] A. Palmisano, Y. Zhao, M.-C. Li, E. C. Polley, and R. M. Simon, â€œOpenGeneMed: a portable, flexible and customizable informatics hub for the coordination of next-generation sequencing studies in support of precision medicine trials,â€ Brief. Bioinform., vol. 18, no. 5, pp. 723â€“734, 2016. https://doi.org/10.1093/bib/bbw059.
[87] D. R. Leff and G.-Z. Yang, â€œBig data for precision medicine,â€ Engineering, vol. 1, no. 3, pp. 277â€“279, 2015. https://doi.org/10.15302/J-ENG-2015075.
[88] K. Lauter, A. LÃ³pez-Alt, and M. Naehrig, â€œPrivate Computation on Encrypted Genomic Data.,â€ in International Conference on Cryptology and Information Security in Latin America, 2014, pp. 3â€“27.
[89] J. D. Tenenbaum et al., â€œAn informatics research agenda to support precision medicine: seven key areas,â€ J. Am. Med. Informatics Assoc., vol. 23, no. 4, pp. 791â€“795, 2016. https://doi.org/10.1093/jamia/ocv213.
[90] D. PÃ©rez-Rey et al., â€œONTOFUSION: Ontology-based integration of genomic and clinical databases,â€ Comput. Biol. Med., vol. 36, no. 7â€“8, pp. 712â€“730, 2006. https://doi.org/10.1016/j.compbiomed.2005.02.004.
[91] N. R. Sperber et al., â€œChallenges and strategies for implementing genomic services in diverse settings: experiences from the Implementing GeNomics In pracTicE (IGNITE) network.,â€ BMC Med. Genomics, vol. 10, no. 1, p. 35, May 2017. https://doi.org/10.1186/s12920-017-0273-2.
[92] B. M. Welch, K. Eilbeck, G. Del Fiol, L. J. Meyer, and K. Kawamoto, â€œTechnical desiderata for the integration of genomic data with clinical decision support,â€ J. biomed. info, vol. 51, pp. 3â€“7, 2014.
[93] X. Wu, X. Zhu, G.-Q. Wu, and W. Ding, â€œData mining with big data,â€ IEEE Trans. Knowl. Data Eng., vol. 26, no. 1, pp. 97â€“107, 2014. https://doi.org/10.1109/TKDE.2013.109.
[94] H. Chang and M. Choi, â€œBig data and healthcare: building an augmented world,â€ Healthc. Inform. Res., vol. 22, no. 3, pp. 153â€“155, 2016. https://doi.org/10.4258/hir.2016.22.3.153.
[95] N. V Chawla and D. A. Davis, â€œBringing big data to personalized healthcare: a patient-centered framework,â€ J. Gen. Intern. Med., vol. 28, no. 3, pp. 660â€“665, 2013. https://doi.org/10.1007/s11606-013-2455-8.
[96] C. W. Tsao and R. S. Vasan, â€œCohort Profile: The Framingham Heart Study (FHS): overview of milestones in cardiovascular epidemiology,â€ Int. J. Epidemiol., vol. 44, no. 6, pp. 1800â€“1813, 2015. https://doi.org/10.1093/ije/dyv337.
[97] T. Gordon, W. P. Castelli, M. C. Hjortland, W. B. Kannel, and T. R. Dawber, â€œHigh density lipoprotein as a protective factor against coronary heart disease: the Framingham Study,â€ Am. J. Med., vol. 62, no. 5, pp. 707â€“714, 1977. https://doi.org/10.1016/0002-9343(77)90874-9.
[98] I. S. Kohane, â€œTen things we have to do to achieve precision medicine,â€ Science (80-. )., vol. 349, no. 6243, pp. 37â€“38, 2015.
[99] M. J. Van De Vijver et al., â€œNo TitleA gene-expression signature as a predictor of survival in breast cance,â€ N. Engl. J. Med., vol. 347, no. 25, pp. 1999â€“2009, 2002. https://doi.org/10.1056/NEJMoa021967.
[100] I. Kotenko, O. Polubelova, A. Chechulin, and I. Saenko, â€œDesign and implementation of a hybrid ontological-relational data repository for siem systems,â€ Futur. internet, vol. 5, no. 3, pp. 355â€“375, 2013.
[101] H. Garcia-Molina, Database systems: the complete book. Pearson Education India, 2008.
[102] D. Marco, â€œBuilding and managing the meta data repository,â€ A full lifecycle Guid., 2000.
[103] J. W. Smoller et al., â€œAn eMERGE clinical center at partners personalized medicine,â€ J. Pers. Med., vol. 6, no. 1, p. 5, 2016. https://doi.org/10.3390/jpm6010005.
[104] M. D. Ritchie et al., â€œElectronic medical records and genomics (eMERGE) network exploration in cataract: several new potential susceptibility loci,â€ Mol. Vis., vol. 20, p. 1281, 2014.
[105] M. I. Babar, M. Jehanzeb, M. Ghazali, D. N. A. Jawawi, F. Sher, and S. A. K. Ghayyur, â€œBig data survey in healthcare and a proposal for intelligent data diagnosis framework,â€ in 2nd IEEE International Conference on Computer and Communications (ICCC), 2016, pp. 7â€“12.
[106] A. V Fedorchenko, I. V Kotenko, E. V Doynikova, and A. A. Chechulin, â€œThe ontological approach application for construction of the hybrid security repository,â€ in Soft Computing and Measurements (SCM), 2017 XX IEEE International Conference on, 2017, pp. 525â€“528.

Genomics big data hybrid depositories architecture to unlock precision medicine: a conceptual framework

Authors

Ummul H. Mohamad

Mohamad T. Ijab

Rabiah A. Kadir

How to Cite

DOI:

Keywords:

Abstract

References

Downloads

How to Cite