NIH funds new tools to crack genomic big data

NIH cloud-based Data Commons pilot launches

The National Institutes of Health announced the launch of its NIH Data Commons Pilot Phase, a cloud-based project to test ways researchers can access, share and store biomedical data and associated tools.

During the four-year pilot phase, researchers will be working with three high-value datasets to determine what policies, processes and architectures they need to so ensure resources are findable, accessible, interoperable and reusable (FAIR) and to accelerate new biomedical discoveries.

“Poor data accessibility is a major barrier to translating data into understanding," said NIH Director Dr. Francis S. Collins. "The NIH Data Commons Pilot Phase is an important effort to remove that barrier. Harvesting the wealth of information in biomedical data will advance our understanding of human health and disease,”

The NIH Data Commons will be a community-controlled cloud infrastructure to support collective uses of computing, storage and data for biomedical research by NIH and its academic and industry collaborators.

To start developing the capabilities needed to make the Data Commons a reality, NIH made 12 awards, totaling $9 million, for projects that will help make data transparent and interoperable, safeguard  patient data and encourage community buy-in for standards. The 12 awardees will form the core of an NIH Data Commons Pilot Phase Consortium -- a group of data scientists, IT engineers, cloud service providers and biomedical researchers. This consortium will develop processes for FAIR data management and create a roadmap for building the Data Commons.  

NIH also will be working with MITRE Corp. to assure cost-effective cloud-based computing and storage for scientific data; analyses related to usage, cost, and comparative business models; and other considerations to assure long-term viability of NIH data science efforts.

The pilot phase also will inform future data management models based on how frequently particular data sets are used, who uses them and whether or how end-users could contribute to the costs of sustaining the resource.

The pilot is expected to span fiscal years 2017-2020, with an estimated total budget of approximately $55.5 million, pending available funds, NIH said.

About the Author

Susan Miller is executive editor at GCN.

Over a career spent in tech media, Miller has worked in editorial, print production and online, starting on the copy desk at IDG’s ComputerWorld, moving to print production for Federal Computer Week and later helping launch websites and email newsletter delivery for FCW. After a turn at Virginia’s Center for Innovative Technology, where she worked to promote technology-based economic development, she rejoined what was to become 1105 Media in 2004, eventually managing content and production for all the company's government-focused websites. Miller shifted back to editorial in 2012, when she began working with GCN.

Miller has a BA from West Chester University and an MA in English from the University of Delaware.

Connect with Susan at smiller@gcn.com or @sjaymiller.

Featured

Charter Sponsors