BIG DATA for DISCOVERY SCIENCE
infographic
The Big Data for Discovery Science Center (BDDS) - comprised of leading experts in biomedical imaging, genetics, proteomics, and computer science - is taking an "-ome to home" approach toward streamlining big data management, aggregation, manipulation, integration, and the modeling of biological systems across spatial and temporal scales.
 
Under Development

BIG DATA REPOSITORY (BDR)

What is BDR?

TThe BDDS Data Repository (BDR) is a set of data services built to support the publication and discovery of biomedical data. BDR can be used to store and publish biomedical data from many domains, including neuroimaging, proteomics, and genomics. The production-ready data publication service offers a scalable repository where researchers can publish, preserve, and share even the largest biomedical research data. Data deposited in the BDR is associated with a persistent identifier that can be uniquely referenced in publications. Researchers themselves manage access to published data, including options to share data publically or with specific groups of users. Published data can be described with a variety of standard, customized and domain-specific metadata, all of which is indexed in the BDR discovery service and may be searched by other users.

Features

Publication of large datasets
BDR offers researchers access to reliable and high-performance data storage
Customizable metadata descriptions
BDR collection owners can define and use their own metadata schemas to describe their published datasets
Flexible access control
Published datasets may be private, shared with a particular group of users, or shared publicly
Flexible user-oriented curation workflows
BDR offers researchers access to reliable and high performance data storage
Choice of persistent identifier (DOI, Handle)
Data deposited in the BDR is associated with a persistent identifier that can be uniquely referenced in publications
Powerful search capabilities
Indexed data can be described with a variety of standard, customized, and domain-specific metadata facilitating search

How It Works

BDR is built upon scalable Globus data publication capabilities. These capabilities are delivered through a hosted service with metadata stored and indexed in the cloud and data storage provided by an allocation at Argonne National Laboratory or any other specified remote storage provider made accessible via Globus Connect.

Published datasets are organized by "communities" and their member "collections". A variety of specific policies can be set on communities or collections to manage:

BDR users can create and manage their own collections directly through the BDR website.