Email Print Share

Fact Sheet

Impacts Range from Fundamental Biology to Disease

Molecular functions go according to form as 3-D structures do life's work

structure of DNA

The structure of DNA in its most common configuration (B-form).

January 21, 2004

This material is available primarily for archival purposes. Telephone numbers or other contact information may be out of date; please see current contact information at media contacts.


Notes to editors and news directors: The Protein Data Bank, a publicly accessible collection of nearly 24,000 macromolecules, is administered by the Research Collaboratory for Structural Bioinformatics with support from the National Science Foundation and seven other federal agencies. A detailed news release announces a new five-year agreement with the RCSB.

Molecule images are available here:
For additional images, contact the Protein Data Bank by email at or visit its web site at

Each with its role in a biological process, and many with several roles, every structure in the Protein Data Bank has value. The structures range from small pieces of protein or DNA to complex machines, such as the ribosome.

Among the most intricate and remarkable of nature's machines, the ribosome's structure includes more than 100,000 atoms, reflecting its complex and crucial function as the cell's protein-assembly factory.

A complex cell can have many thousands of different types of protein, each of which is a long chain of amino acids assembled according to the precise instructions found in the DNA sequence of a gene. To use these instructions, the cell first makes a copy of the DNA information in the form of another molecule, a single-stranded kin to DNA called messenger RNA (mRNA).

The mRNA is then threaded through the ribosome, which reads the instructions one step at a time, adding each amino acid in the exact order prescribed by the gene sequence. The result is a specific protein molecule with the precise shape and chemical properties that allow it to provide an important function for the cell. This stepwise process for converting information stored in DNA into functional, biological molecules is referred to as 'gene expression.'

Also represented in the Protein Data Bank's database are enzymes, viruses and molecular assemblies (such as the nucleosome). Their structures provide insight into these molecules' roles in fundamental biological processes and, in some cases, into their possible roles in disease or drug interactions.

The diversity and complexity of the collection is illustrated by those structures highlighted in the PDB's "Molecule of the Month" series, authored by David S. Goodsell of The Scripps Institute of Oceanography (

Examples taken from "Molecule of the Month" follow (with their PDB data ID and a link to profiles).

Additional examples follow to illustrate the breadth of impact of the structural information contained in the Protein Data Bank:

Photosynthesis: The structures of the last of the 3 major assemblies involved in photosynthesis has now been determined revealing for the first time exactly how plants and some bacteria harness sunlight to produce energy for growth. (1JB0, 1IZL, 1UM3)

Breath of Life: Carbonic anhydrase enzymes are essential to a remarkable spectrum of biological processes, ranging from breathing in animals to photosynthesis in plants. The functions performed by these enzymes are so crucial that they may have been invented by nature 3 or more separate times in the course of evolution. (See 1CA2, 1DDZ, 1THJ for examples; also

An RNA World: Catalysis - directing a chemical reaction - once thought to be the province of proteins alone is now known to be a property of RNAs, which may have preceded proteins in early evolution. (See 488D for an example.)

Channel Surfing: The structures of channels that control the entry and exit of water and ions from cells reveal how cells control their internal environment. Research on cell membrane channels was recognized by the 2003 Nobel Prize in Chemistry. (1FQY, 1BL8)

HIV (Human Immunodeficiency Virus): Structures such as reverse transcriptase and HIV protease have already been used to develop new drugs used in the fight against AIDS. (7HVP, 1HSG, 1HXB, 1HXW)

Mad Cow disease: The structure of segments of the prion protein (that may form the infectious agent for this disease) has been solved. An understanding of its conversion from normal to the abnormal disease-causing form will help scientists understand the progression of the disease. (1DX0, 1E1W, 1G6W)

SARS (Severe Acute Respiratory Syndrome): The structures of the protease and polymerase proteins of this virus have been used to design drugs against this virus. (1Q2W, 1UK4)


Media Contacts
Sean Kearns, NSF, (703) 292-7963, email:

The National Science Foundation (NSF) is an independent federal agency that supports fundamental research and education across all fields of science and engineering. In fiscal year (FY) 2018, its budget is $7.8 billion. NSF funds reach all 50 states through grants to nearly 2,000 colleges, universities and other institutions. Each year, NSF receives more than 50,000 competitive proposals for funding and makes about 12,000 new funding awards.

mail icon Get News Updates by Email 

Useful NSF Web Sites:
NSF Home Page:
NSF News:
For the News Media:
Science and Engineering Statistics:
Awards Searches: