Computational technologies and tools are vital to clinical and translational research; however, hubs currently develop, deploy, and manage them independently. As a result, these processes are tedious, costly, heterogeneous, and less secure. This core is establishing a common tool and cloud computing architecture to provide CTSA hubs with an affordable, easy-to-use, and scalable deployment paradigm. Hubs can promote and deploy their own tools as well as adopt others’ products, thereby transcending long-standing “boundaries” and solving common and recurring information needs.
Create a common cloud computing architecture that enables the rapid deployment and sharing of reusable software components by CTSA hubs.
Demonstrate the use of shared tools and platforms for the collaborative analysis of clinical data in a manner that transcends individual CTSA hub “boundaries.”
Disseminate a common set of tools that can be employed for both local and collaborative querying of common data warehousing platforms and their underlying data models.
Pilot the “cloudification” of software artifacts that can be shared across CTSA hubs to address common and recurring information needs.
EXPLORE OUR CURRENT AND PAST PROJECTS & CORE DETAILS
CLOUD-based DUA: This project is based on a pilot with the FDA and will create a cloud-based data use agreement toolkit to support the entry of de-identified EHR data from partner institutions into the sandboxes. As a demonstration, the project will leverage a preconfigured FHIR repository maintained either on the CD2H/NCATS cloud or behind the partner institution’s firewall. The team will work with the community to write governance documents, SOPs, and policies for CTSA informatics community collaboration. A pan-sandbox governance group will include CD2H and community representatives who contribute subject-matter expertise for specific domains.
Competitions: Stabilization, Enhancement, and Dissemination: As a continuation of a Phase II project, Competitions provides a platform for robust peer review across CTSAs with a cloud-based, single sign-on software tool for investigators, reviewers, and administrators.
Tools and Cloud Core Architecture: This ongoing core infrastructure project focuses on establishing a CTSA tool registry to facilitate the discovery, confident adoption, and implementation of tools and algorithms by CTSA investigators, including the provision of standard mechanisms for cloud-based data access and use.
CONTINUING & PHASE II:
PHASE III SANDBOXES:
Cloud-based Sandbox for Text Analytics: A continuation of Phase II collaborative work with the iEC working group, this project aims to deploy a suite of natural language processing (NLP) tools and to establish evaluation measures, evaluation tools, and best practices.
Cloud-based Sandbox for Best Practices in Clinical Machine Learning: A new sandbox project designed to create a best practices platform for deploying and evaluating clinical machine learning tools and algorithms. Goals include provisioning community-vetted solutions to common clinical machine learning challenges including data preparation, analysis of bias sources, and evaluation/validation of algorithms.
Cloud-based Sandbox for the Evaluation of Data Quality Assessment Methods: A new sandbox project designed to develop, evaluate, and share tools and methods for data quality assessment. This sandbox project will include a pilot that leverages the Accrual to Clinical Trials (ACT) Network data to understand the quantity and completeness of ACT data and differences in coding practices across institutions.
We are actively seeking community members to provide feedback on architecture, contribute test cases, and help build infrastructure.
Core community meetings occur on the last Tuesday of each month at 12 pm PT/3 pm ET and are the primary way to engage. (Contact firstname.lastname@example.org for a meeting invitation.)
Signal your interest at cd2h.org/onboard