BIG DATA for DISCOVERY SCIENCE
infographic
The Big Data for Discovery Science Center (BDDS) - comprised of leading experts in biomedical imaging, genetics, proteomics, and computer science - is taking an "-ome to home" approach toward streamlining big data management, aggregation, manipulation, integration, and the modeling of biological systems across spatial and temporal scales.
 
 

Peptide Atlas BDBag Minid Use Case

The Peptide Atlas mass spectrometry proteomics data respository has begun adding support for the BDDS toolset, including BDBags and Minids. This enables users to obtain data from PeptideAtlas more easily especially when using BDBag- and Minid-enabled software such as the TPP and Globus Galaxy.

The Tiered Human Integrated Search Proteomes (THISPs) are now being made available via BDBags and Minids. Simply visit the THISP home page, click the little clipboard icon next to the release you want to download, and paste the Minid into your Minid-enabled software (such as the TPP). The THISP release will be automatically downloaded, unpacked, and validated for you. This can easily be scripted so that each new release, available on the first of the month, is downloaded automatically.

Individual datasets are now also being made available via BDBags and Minids. Simply visit the PeptideAtlas Raw Data Repository, find a suitable dataset, find the Minid of the dataset you want to download, and paste the Minid into your Minid-enabled software (such as the TPP). The dataset will then be automatically downloaded, unpacked, and validated for you. Certain software (such as the TPP) can then use the information in the BDBag manifest to set up a potential data analysis plan for inspection, modification, and launch by the user. New datasets are being encoded into DBBags and Minids. Older datasets are being repackaged, but this has not been completed.