The PARADIM and NanoHub 2D data framework (2DDF) supplement was critical to creating the infrastructure that now underpins the MIP’s data and data-driven efforts including centralized hosting, a rich semantic data model, a high-performance computational platform providing modern data-science tools, API access, a mature python library for remote access to NanoHub tools, FAIR-compliant data releases, and training materials. Infrastructure development has facilitated machine learning applications and the first streaming-data platform for materials data with end-to-end data encryption.
Figure 1: Goals and Building blocks for the development of a collaborative data investigation platform that brings all aspects of data analysis to the Materials Innovation Platform (MIP) and its users.
|Figure 1: Goals and Building blocks for the development of a collaborative data investigation platform that brings all aspects of data analysis to the Materials Innovation Platform (MIP) and its users.|
Embracing the vision of placing MIPs at the center of the 2DDF, PARADIM took a leadership role and partnered with NIST to create the first 2DDF Training Workshop. The workshop covered fundamental data topics over 4.5 days and was facilitated by PARADIM infrastructure. PARADIM’s leadership role in materials data now reaches beyond the 2DDF and has been central to the recent creation of the Materials Research Data Alliance (MaRDA, https://www.marda-alliance.org) which focuses on the high-priority, MGI strategic goal of building a network of materials data stakeholders to identify community needs and bring together infrastructure providers to leverage strengths while pushing forward better capabilities and stronger integration of materials data resources.
Figure 2: The FAIR principles (findable, accessible, interoperable, and reuseable) for materials data lead to the creation of three levels of compliance that users can choose from to share the generated data.
What Has Been Achieved:
As we develop better, more meaningful ways to open materials data, PARADIM users can provide the generated data in three ways:
- Publicly available through browsing at https://www.paradim.org/publications/data_sets; Example: https://data.paradim.org/176
- Publicly available with the addition of citation though PARADIM minting of Digital Object Identifiers (DOIs) that provide DataCite schema compliant metadata, and permanent landing pages; Example: https://doi.org/10.34863/1lk1-pd01
- Publicly available, citable, and highly curated which adds richer information including associated analysis codes or insights as befits the study, Example: https://doi.org/10.34863/g4wa-0j57.
Importance of the Achievement: FAIR compliance for materials data is evolving with need for improvements in metadata richness, standards for interoperability, and integration with publishers. Despite these challenges, PARADIM’s 2DDF supplement work provides meaningful FAIR compliance through DataCite DOIs, use of open or standard formats, and a data policy that defaults to open data with an explicit, Creative Commons license.