TY  - JOUR
AU  - McPadden, Jacob
AU  - Durant, Thomas JS
AU  - Bunch, Dustin R
AU  - Coppi, Andreas
AU  - Price, Nathaniel
AU  - Rodgerson, Kris
AU  - Torre Jr, Charles J
AU  - Byron, William
AU  - Hsiao, Allen L
AU  - Krumholz, Harlan M
AU  - Schulz, Wade L
PY  - 2019
DA  - 2019/04/09
TI  - Health Care and Precision Medicine Research: Analysis of a Scalable Data Science Platform
JO  - J Med Internet Res
SP  - e13043
VL  - 21
IS  - 4
KW  - data science
KW  - monitoring, physiologic
KW  - computational health care
KW  - medical informatics computing
KW  - big data
AB  - Background: Health care data are increasing in volume and complexity. Storing and analyzing these data to implement precision medicine initiatives and data-driven research has exceeded the capabilities of traditional computer systems. Modern big data platforms must be adapted to the specific demands of health care and designed for scalability and growth. Objective: The objectives of our study were to (1) demonstrate the implementation of a data science platform built on open source technology within a large, academic health care system and (2) describe 2 computational health care applications built on such a platform. Methods: We deployed a data science platform based on several open source technologies to support real-time, big data workloads. We developed data-acquisition workflows for Apache Storm and NiFi in Java and Python to capture patient monitoring and laboratory data for downstream analytics. Results: Emerging data management approaches, along with open source technologies such as Hadoop, can be used to create integrated data lakes to store large, real-time datasets. This infrastructure also provides a robust analytics platform where health care and biomedical research data can be analyzed in near real time for precision medicine and computational health care use cases. Conclusions: The implementation and use of integrated data science platforms offer organizations the opportunity to combine traditional datasets, including data from the electronic health record, with emerging big data sources, such as continuous patient monitoring and real-time laboratory results. These platforms can enable cost-effective and scalable analytics for the information that will be key to the delivery of precision medicine initiatives. Organizations that can take advantage of the technical advances found in data science platforms will have the opportunity to provide comprehensive access to health care data for computational health care and precision medicine research.
SN  - 1438-8871
UR  - https://www.jmir.org/2019/4/e13043/
UR  - https://doi.org/10.2196/13043
UR  - http://www.ncbi.nlm.nih.gov/pubmed/30964441
DO  - 10.2196/13043
ID  - info:doi/10.2196/13043
ER  - 