National Science Foundation - Biological Sciences (BIO)
Integrative Organismal Systems (IOS)

"Bottom-up" proteomics

NSF-funded supercomputer helps researchers interpret genomes

Illustration showing a map of links between the genes of the mustard plant Arabidopsis thaliana

AraNet: a genome-wide gene function association network for Arabidopsis thaliana.

July 1, 2014

[The following is part seven in a series of stories that highlight recent discoveries enabled by the Stampede supercomputer. Read parts one, two, three, four, five and six to find out how Stampede is making a difference through science and engineering.]

Tandem protein mass spectrometry is one of the most widely used methods in proteomics, the large-scale study of proteins, particularly their structures and functions.

Researchers in the Marcotte group at the University of Texas at Austin are using the Stampede supercomputer to develop and test computer algorithms that let them more accurately and efficiently interpret proteomics mass spectrometry data.

The researchers are midway through a project that analyzes the largest animal proteomics dataset ever collected (data equivalent to roughly half of all currently existing shotgun proteomics data in the public domain). These samples span protein extracts from a wide variety of tissues and cell types sampled across the animal tree of life.

The analyses consume considerable computing cycles and require the use of Stampede's large-memory nodes, but they allow the group to reconstruct the 'wiring diagrams' of cells by learning how all of the proteins encoded by a genome are associated into functional pathways, systems, and networks. Such models let scientists better define the functions of genes and link genes to traits and diseases.

"Researchers would usually analyze these sorts of datasets one at a time," Edward Marcotte said. "TACC let us scale this to thousands."

Marcotte's work was featured in the New York Times in August 2012.

-- Aaron Dubrow, (703) 292-4489, adubrow@nsf.gov

Investigators
Pamela Ronald
Edward Marcotte

Related Institutions/Organizations
University of Texas at Austin
University of California-Davis

Locations
Austin, Texas

Related Programs
Plant Genome Research Program
Petascale Computing Resource Allocations

Related Awards
#1134872 Enabling, Enhancing, and Extending Petascale Computing for Science and Engineering
#1237975 Network-Guided Predictions and Characterization of Genes Governing Pattern Recognition Receptor-Mediated Immunity in Cereals

Years Research Conducted
2013 - 2017

Related Websites
Stampede supercomputer: https://www.tacc.utexas.edu/stampede/

