Software tools and data resources developed for the management, manipulation, and mining of large-scale biomedical data can be used in a wide variety of educational and training contexts.These include workshops, symposia, formal university courses, as well as forming a basis for a variety of online materials.
The Big Data for Discovery Science Center (BDDS) - comprised of leading experts in biomedical imaging, genetics, proteomics, and computer science - is taking an "-ome to home" approach toward streamlining big data management, aggregation, manipulation, integration, and the modeling of biological systems across spatial and temporal scales.


BDDS and ISB researcher Gustavo Glusman was interviewed about the genome fingerprints method for The Bioinformatics Chat podcast.April 7, 2018

In this episode Gustavo Glusman explains his method of reducing a VCF file to a small “fingerprint”, which could be then used to detect duplicate genomes, infer relatedness, map the population structure, and more.

For more information, here is the link:

BDDS Big Data Technologies for Biomedical Knowledge Discovery Seminar for the American Medical Informatics Association (AMIA) Annual Symposium 2016September 30, 2016

Conference url:

When: Saturday, Nov. 12, 8:30 am to 12:00 pm (half-day session)

Where: Hilton Chicago, Chicago, Il

Instructors: Ravi Madduri, Kyle Chard, Ian Foster

The Big Data for Discovery Science (BDDS) project will present various big data technologies to assist in discovery science.

BBDS Big Data Tutorial at the ACM Bioinformatics, Computational Biology and Health Informatics Conference 2016June 7, 2016

Conference URL:

When: Sunday, Oct. 2, 1:00 to 3:00 pm

Where: Motif Seattle, 1415 5th Avenue, Seattle, WA 98101 (Room TBD)

Instructors: Naveen Ashish, Jack Van Horn, Gustavo Glusman and Ben Heavner

The Big Data for Discovery Science (BDDS) project will conduct a tutorial on big data technologies with a particular focus on data harmonization and automated data mapping. BDDS is supported by the NIH’s BD2K program.

BBDS Full Day Training Symposium at the Society for Neuroscience (SFN) Annual Meeting 2016June 7, 2016

Conference URL:

When: Thursday November 17th 2016, 10 am – 4 pm

Where: San Diego Convention Center (Room TBD)

Instructors: Naveen Ashish, Mike D’Arcy, Jack Van Horn, Carl Kesselman and Arthur W. Toga

The Big Data for Discovery Science (BDDS) project will conduct a day-long educational and training symposium on big data processing and analysis technologies. Scientific use cases using the BDDS discovery science tools and platform will also be presented. BDDS is supported by the NIH’s BD2K program.

Beta Release of BD Bag SoftwareApril 6, 2016

BDDS has released the initial public beta pre-release of its BD Bag software. The BD Bag software allows researchers to address a significant Big Data challenge of assembling, identifying, and providing access to subsets of data in a large and complex data collection workflow such as from a catalog search to an analysis pipeline and to a publication service. The beta release can be found here:

BDDS launches Minimal Viable Identifier (minid)April 6, 2016

The Big Data for Discovery Science has created the Minimal Viable Identifier (minid) pilot, which enables unambiguous naming and identification of research data products. Minids are designed to be lightweight, enabling simple creation and use, and requiring only minimal metadata to be specified about the object. For more information:

NIH Awards 32 Million Dollars for Big Data ProjectsOctober 9, 2014

Hoping to tame the torrent of data churning out of biology labs, the National Institutes of Health (NIH) today announced $32 million in awards in 2014 to help researchers develop ways to analyze and use large biological data sets.

First Recipients Biomedical Big Data GrantsOctober 9, 2014

In March 2012, the Obama Administration launched the Big Data Research and Development Initiative. To catalyze new biomedical Big Data research, the Obama Administration and the National Institutes of Health launched the Big Data to Knowledge (BD2K) initiative in April 2013. Today, NIH awarded a total of $32 million in new BD2K grants.

NIH Invests 32 Million in Big DataOctober 9, 2014

Obama Administration and the National Institutes of Health launched the Big Data to Knowledge (BD2K) initiative in April 2013. Today, NIH awarded a total of $32 million in new BD2K grants.

LA Scientists Big DataOctober 9, 2014

Of the 12 "centers of excellence" to be established under the BD2K initiative, four California institutions -- UCLA, USC, UC Santa Cruz and Stanford University -- will be tapped to play a major role. Collectively, the four universities are to be awarded $7 million in 2014 and are slated to receive close to $38 million over the next four years.

The Human Health RenaissanceOctober 9, 2014

It's a new academic year at USC, and that means yet another new season of progress and possibility across all our campuses. Four years ago, I had just taken office as USC’s president. And I came here to deliver my first major address. Even before I had delivered my inaugural speech to the larger Trojan Family, I wanted to speak to you right here on the Health Sciences Campus.

NIH AwardsOctober 9, 2014

The National Institutes of Health on Thursday awarded almost $32-million in grants to more than two dozen institutions to devise innovative ways of helping researchers handle huge sets of data seen as increasingly central to future medical discoveries.

Neuroscientists Claim Rare Pair of Research GrantsOctober 9, 2014

In a rare distinction for one university, neuroimaging world leaders and USC Professors Arthur Toga and Paul Thompson will receive two major research center awards to advance their exploration of the human brain.