Much of the information for this section was provided by the Manufacturing and Construction Division of the U.S. Bureau of the Census, which collected and compiled the survey data for NSF. Copies of the technical papers cited can be obtained from NSF's Research and Development Statistics Program in the Division of Science Resources Statistics.
The reporting unit for the Survey of Industrial Research and Development is the company,[5] defined as a business organization of one or more establishments under common ownership or control. The survey includes two groups of enterprises: (1) companies known to conduct R&D and (2) a sample of companies for which information on the extent of R&D activity is uncertain.
The Standard Statistical Establishment List (SSEL), a Bureau of the Census compilation that contains information on more than 3 million establishments with paid employees, was the target population from which the frame used to select the 2001 survey sample was created (see table B-1 for population and sample sizes). For companies with more than one establishment, data were summed to the company level and the resulting company record was used to select the sample and to process and tabulate the survey data.
After data were summed to the company level, each company then was assigned a single North American Industrial Classification System (NAICS) code based on payroll. The 1999 survey was the first year that companies were classified using NAICS. Prior to 1999, the Standard Industrial Classification (SIC) system was used. The two systems are discussed later under Comparability of Statistics. The method used followed the hierarchical structure of the NAICS. The company was first assigned to the economic sector, defined by a two-digit NAICS code representing manufacturing, mining, trade, etc., that accounted for the highest percentage of its aggregated payroll. Then the company was assigned to a subsector, defined by a three-digit NAICS code, that accounted for the highest percentage of its payroll within the economic sector. Finally, the company was assigned a four-digit NAICS code within the subsector, again based on the highest percentage of its aggregated payroll. Assignment below the four-digit level was not done because of the concentration of R&D in relatively few industries and disclosure concerns. Both issues are discussed later in this section.
The frame from which the survey sample was drawn included all for-profit companies classified in nonfarm industries. For surveys prior to 1992, the frame was limited to companies above certain size criteria based on number of employees (U.S. Bureau of the Census 1994d). These criteria varied by industry. Some industries were excluded from the frame because it was believed that they contributed little or no R&D activity to the final survey estimates. For the 1992 sample, new industries were added to the frame (listed and discussed below under Comparability of Statistics), and the size criteria were lowered considerably and applied uniformly to firms in all industries. As a result, nearly 2 million enterprises with five or more employees were given a chance of selection for subsequent samples, including the 2001 sample. For comparison, the frame for the 1987 sample included 154,000 companies of specified sizes and industries.
A fundamental change initiated in 1995 and repeated for subsequent samples was the redefinition of the sampling strata. For the survey years 199294, 165 sampling strata were established, each stratum corresponding to one or more three-digit-level SIC codes. The objective was to select sufficient representation of industries to determine whether alternative or expanded publication levels were warranted. For the 199598 surveys, the sampling strata corresponded to publication level industry aggregations. For each year, 40 publication levels were defined. These corresponded to the original 25 groupings of manufacturing industries used as sampling strata before 1992 and an additional 15 groupings of nonmanufacturing industries. Beginning with the 1999 survey and the conversion to NAICS, 29 manufacturing and 20 nonmanufacturing strata were defined corresponding to the four-digit industries and groups of industries for which statistics were developed and published.
Since industrial R&D is performed by relatively few companies and the national estimate is dependent primarily on large R&D performers concentrated in a small number of industries, it is important to capture and retain large performers for the sample. For this reason some companies are selected with certainty. Certainty companies fall into two categories. The first (arbitrary certainties) includes all companies whose total R&D expenditures in the prior year's survey was $5 million or more. The $5 million dollar cutoff was first set in 1996. The second (analytical certainties) includes those companies whose size is sufficient to guarantee their inclusion given the predefined sampling error constraints set for the R&D industry groupings.
Before 1994, companies with 1,000 or more employees had been selected with certainty, but it was observed that the level of spending varied considerably and that many of these companies reported no R&D expenditures each year. For these reasons, it was determined that these companies should be given chances of selection based upon the size of their R&D spending if they were in the previous survey or upon an estimated R&D value if they were not. Consequently, the size criterion based on the number of employees was dropped for surveys after 1994. With a fixed total sample size, there was concern that the representation of the very large noncertainty universe by a smaller sample each year would be inadequate. So, to limit the growth occurring each year in the number of certainty cases within the total sample, the certainty criterion was raised for the 1996 survey from $1 million to $5 million in total R&D expenditures based on data gathered from the 1995 survey.
Frame Partitioning
Partitioning of the frame for noncertainty companies into large and small companies was first introduced in 1994 because of concern arising from a study of 1992 survey results that showed that a disproportionate number of small companies was being selected for the sample and often assigned very large weights. These small companies seldom reported R&D activity. This disproportion was a result of the minimum probability rule (see Sample Size below) used as part of the independent probability proportionate to size (pps) sampling procedure employed exclusively prior to 1994 (pps is discussed in detail later under Sample Selection). This rule increased the probabilities of selection for several hundred thousand smaller companies. For the 1994 and subsequent surveys, simple random sampling (srs) was applied to the small company partition causing the smaller companies to be sampled more efficiently than with independent pps sampling since there was little variability in their size (srs also is discussed in detail later under Sample Selection). The large company partition continued to be sampled using independent pps sampling until 1998 when fixed sample size pps sampling was begun.
For the 1994 and 1995 surveys, total company payroll was the basis for partitioning the noncertainty frame. For each industry grouping, the largest companies representing the top 90 percent of the total payroll for the industry grouping were included in the pps frame. The balance, the smaller companies comprising the remaining 10 percent of payroll for the industry grouping, was included in the srs frame.
Beginning with the 1996 survey, total company employment became the basis for partitioning the frame. The total company employment levels defining the partitions were based on the relative contribution to total R&D expenditures of companies in different employment size groups in both the manufacturing and nonmanufacturing sectors. In the manufacturing sector, all companies with total employment of 50 or more were included in the large company partition. In the nonmanufacturing sector, all companies with total employment of 15 or more were included in the large company partition. Companies in the respective sectors with employment below these values were included in the small company partition. In the 2001 survey, the large company partition contained about 650,000 companies and the small company partition contained approximately 1.3 million companies. For comparison, these counts in the 2000 survey were about 632,000 and 1.3 million, respectively.
One final modification in the frame development for 1996, which was repeated for the 1997 and 1998 surveys, was the designation of "zero" industries in the large company partition. Zero industries were those three-digit SIC industries having no R&D expenditures reported in survey years 199294the years when estimates by three-digit SIC industry were formed. These industries remained within the scope of the survey, but only a limited sample was drawn from them because it was unlikely that these industries conducted R&D. Simple random sampling was used to control the number of companies selected from these industries. For the 1999 through 2001 surveys, no zero industries were defined because of the conversion to NAICS. For the next several cycles of the survey, NAICS industries will be evaluated to ascertain if any of them should be designated zero industries.
Beginning with the 1996 cycle of the survey and repeated for subsequent surveys, a significant revision in the procedure for selecting samples from the partitions led to a change in the development and presentation of estimates. For the 1995 survey, the sample of companies from the large company partition was selected using probability proportionate to size sampling (discussed in detail below) in each of the 40 strata (discussed previously under Defining Sampling Strata). Likewise, the simple random sampling of the small company partition was done for each of the 40 strata. However, beginning in 1996, the number of strata established for the small company partition was reduced to two. One stratum consisted of small companies classified in manufacturing industries and the second stratum consisted of small companies classified in nonmanufacturing industries. Simple random sampling continued as the selection method for these two strata.
The purpose of selecting the small company panel from these two strata was to reduce the variability in industry estimates largely attributed to the random year-to-year selection of small companies by industry and the high sampling weights that sometimes occurred. As a consequence of this change, estimates for industry groups within manufacturing and nonmanufacturing were not made from these two strata as noted on affected tables. The statistics for the detailed industry groups were based only on the sample from the large company partition. Estimates from the small company partition were included in statistics for total manufacturing, total nonmanufacturing, and all industries. For completeness, in the affected tables for 199698 the estimates also were added to the categories "other manufacturing" and "other nonmanufacturing." For 1999 and 2000, the estimates were published separately in the "small manufacturing companies" and "small nonmanufacturing companies" categories.
For 2001, the sampling of the small companies was again limited to the two strata. However, because of the increasing growth of the R&D activity observed in these strata, it was decided to code the reporting companies into their appropriate industries and include them in the industry estimates as was done prior to 1996. As a result the small manufacturing companies and small nonmanufacturing companies categories were eliminated. In future sampling operations, it is probable that the small companies will again be partitioned into and sampled by the 47 industry strata so as to insure their representation in all industries. Increased year-to-year variability of the industry estimates is to be expected.
Probability Proportionate to Size
Imputing R&D. Except for the companies that were in a previous survey or for which there is information from external sources, it is impossible to know the R&D expenditures for every firm in the universe because R&D information is not available from the Standard Statistical Establishment List (SSEL). Consequently, the probability of selection for most companies is based on estimated R&D expenditures. Since total payroll is known for each company in the universe (payroll information is available from the SSEL), it is possible to estimate R&D from payroll using relationships derived from previous survey data. Imputation factors relating these two variables are derived for each industry grouping. To impute R&D for a given company, the imputation factors are applied to the company payroll in each industry grouping. A final measure is obtained by adding the industry grouping components. The effect, in general, is to give firms with large payrolls higher probabilities of selection in agreement with the assumption that larger companies are more likely to perform R&D. Estimated R&D values are computed for companies in the small company partition as well. The aggregate of reported and estimated R&D from each company in both the large and small company partitions represent a total universe measure of the previous year's R&D expenditures. However, assigning R&D to every company results in an overstatement of this measure. To adjust for the overstatement, the universe measure is scaled down using factors developed from the relationship between the frame measure of the prior year's R&D and the final prior-year survey estimates. These factors, computed at levels corresponding to published industry levels, are used to adjust the originally imputed R&D values so that the new frame total for R&D at these levels approximates the prior year's published values. This adjustment provides for better allocation of the sample among these levels.
For 2001, the distribution of companies by payroll and estimated R&D in the large company partition was skewed as in earlier frames (i.e., the correlation of payroll and estimated R&D was high because estimated R&D had been calculated based on payroll). Because of this skewness, pps sampling remained the appropriate selection technique for this group.[6] That is, large companies had higher probabilities of selection than did small companies. However, a different approach to pps sampling was introduced beginning with the 1998 survey. Historically, pps sampling had been accomplished using an independent sampling methodology, i.e., the selection (or nonselection) of a given company was independent of the sampling result (select or nonselect) of any other company. This implied that over repeated samplings in a given stratum, different size samples would result, adding more variability to the sample estimates. For 1998, a fixed sample size pps method was introduced. This method ensured that the sample size desired for a given stratum was achieved, thus eliminating error because of sample size variation from the sample estimates. For a given sample size, the fixed sample size method produces more precise estimates on average than the independent method. The fixed sample size methodology has been used every year since 1998.
Simple Random Sampling
As described earlier, only two major strata were defined for samples in the small company partition, manufacturing and nonmanufacturing. The use of srs implied that each company within a stratum had an equal probability of selection with the exception of the preassigned arbitrary and analytical certainties (discussed previously). The total sample allocated to the small company partition was dependent upon the total sample specified for the survey and upon the total sample necessary to satisfy criteria established for the large partition. Once determined, the allocation of this total by stratum was made proportionate to the stratum's payroll contribution to the entire partition. For 2001, there was also a third srs stratum that contained 9,975 company records where the NAICS code was unknown at the time the sample was selected. Companies that were missing or had incomplete North American Industrial Classification System (NAICS) codes at the time of sampling were assigned to an "unclassified" industry category temporarily. If an unclassified company reported R&D expenditures, its primary industrial activity was investigated and a NAICS code was assigned during statistical processing.
Sample Stratification and Relative Standard Error Constraints
The particular sample selected was one of a large number of samples of the same type and size that by chance might have been selected. Statistics resulting from the different samples would differ somewhat from each other. These differences are represented by estimates of sampling error or variance. The smaller the sampling error, the more precise is the statistic, that is, the closer it is to the result that would have occurred had a complete canvass been done under the same definitions and procedures. The accuracy of the estimate, that is, how close it is to the true value, is a function not only of sampling error, but also of nonsampling error.
Controlling Sampling Error. Historically, it has been difficult to achieve control over the sampling error of survey estimates. Efforts were confined to controlling the amount of error due to sample size variation, but this was only one component of the overall sampling error. The other component depended on the correlation between the data from the sampling frame used to assign probabilities (namely R&D values either imputed or reported in the previous survey) and the actual current year reported data. The nature of R&D is such that these correlations could not be predicted with any reliability. Consequently, precise controls on overall sampling error were difficult to achieve.
For recent surveys, primary concern was placed on controlling error for the large company partition since nearly all of the R&D activity was identified from that portion of the sample. Since 1998, with the introduction of the fixed sample size sampling procedure, the component of sampling error due to sample size variation was eliminated. However, the amount of error attributable to the remaining component of the sample remained. Since there was still no way to predict how well the data from the sampling frame would correlate with actual survey data, the approach taken to allocate the sample across the various strata was to assign probabilities in the same manner as in the past when independent sampling was used. The probabilities resulting from this allocation technique determined the sample sizes to be selected from each stratum subject to the overall sample size constraint dictated by the survey budget. Although the actual survey sampling errors could not be predicted, the parameters used to assign probabilities and the use of the minimum probability rule resulted in a desirable number of companies being sampled from the large company partition (see Sample Size below).
Sampling Strata and Standard Error Estimates. A limitation of the sample allocation process for the large company partition should be noted. The constraints used to control the sample size in each stratum were based on a universe total that was improvised. That is, as previously noted, an R&D value was assigned to every company in the frame, even though most of these companies actually may not have had R&D expenditures. The value assigned was imputed for the majority of companies in the frame and, as a consequence, the estimated universe total and the distribution of individual company values, even after scaling, did not necessarily reflect the true distribution. Assignment of sampling probability was nevertheless based on this distribution. The presumption was that actual variation in the sample design would be less than that estimated, because many of the sampled companies have true R&D values of zero, not the widely varying values that were imputed using total payroll as a predictor of R&D. Previous sample selections indicate that in general this presumption held, but exceptions have occurred when companies with large sampling weights have reported large amounts of R&D spending. See table B-2 for a list by industry of the relative standard error estimates for selected items and table B-3 for a list of the relative standard error estimates of total R&D by state.[7]
Nonsampling Error. In addition to sampling error, estimates are subject to nonsampling error. Errors are grouped in five categories: specification, coverage, response, nonresponse, and processing. For detailed discussions on the sources, control, and measurement of each of these types of error, see U.S. Bureau of the Census (1994b and 1994f).
The parameters set to control sampling error discussed above resulted in a sample size of 18,880 companies from the large company partition. For the small company partition, two strata (manufacturing and nonmanufacturing) were identified. Also included was a separate stratum of small companies that could not be classified into a NAICS industry because of incomplete industry identification in the SSEL. In 2001, as in the 1994 through 2000 surveys, a small number of companies were selected from this group in the hope that an accurate industry identification could be obtained at a later point (as discussed above). Ultimately, a final sample of 6,086 companies was selected from the small company and unclassified partitions. Companies in the small manufacturing and unclassified partitions received weights slightly less than 100 and their sample size accounted for one one-hundredth of the population in each partition (see Weighting and Maximum Weights later in this section). The sample size of the "small nonmanufacturing companies" category was the difference between the desired total sample size of 25,000 and the sum of the large manufacturing, small manufacturing, large nonmanufacturing, and unclassified partitions. This total included an adjustment to the sample size based on a minimum probability rule and changes in the operational status of some companies. With the use of fixed sample size pps sampling for the large company partition and simple random sampling for the small company partition (and with no zero-industry stratum for 2000), the target sample size was met.
Minimum Probability Rule. A minimum probability rule was imposed for both partitions. As noted earlier, for the large partition, probabilities of selection proportionate to size were assigned to each company, where size was the reported or imputed R&D value assigned to each company. Selected companies received a sample weight which was the inverse of their probability. Selected companies that ultimately report R&D expenditures vastly larger than their assigned values can have adverse effects on the statistics, which were based on the weighted value of survey responses. To lessen the effects on the final statistics, the maximum weight of a company was controlled by specifying a minimum probability that could be assigned to the company. If the probability, based on company size, was less than the minimum probability, then it was reset to this minimum value. The consequence of raising these original probabilities to the minimum probability was to raise the sample size. Similarly, a maximum weight for each stratum was established for the simple random sampling of the small company partition. If the sample size initially allocated to a stratum resulted in a stratum weight above this maximum value, then the sample size was increased until the maximum weight was achieved.
Changes in Operational Status. Between the time that the frame was created and the survey was prepared for mailing, the operational status of some companies changed. That is, they were merged with or acquired by another company, or they were no longer in business. Before preparing the survey for mailing, the operational status was updated to identify these changes. As a result, the number of companies mailed a survey form was somewhat smaller than the number of companies initially selected for the survey.
Weights were applied to each company record to produce national estimates. Within the pps partitions of the sample, the maximum company weight allowed regardless of NAICS category was 50. Within the srs partitions, company records classified in the "small nonmanufacturing companies" category were given weights up to a maximum of 250; company records classified in the remaining NAICS categories were given maximum weights of 100.
Two forms are used each year to collect data for the survey. Known large R&D performers are sent a detailed survey form, Form RD-1. The Form RD-1 requests data on sales or receipts, total employment, employment of scientists and engineers, expenditures for R&D performed within the company with Federal funds and with company and other funds, character of work (basic research, applied research, and development), company-sponsored R&D expenditures in foreign countries, R&D performed under contract by others, federally funded R&D by contracting agency, R&D costs by type of expense, domestic R&D expenditures by state, energy-related R&D and foreign R&D by country. Because companies receiving the Form RD-1 have participated in previous surveys, computer-imprinted data reported by the company for the previous year are supplied for reference. Companies are encouraged to revise or update this imprinted data if they have more current information; however prior-year statistics that previously have been published are revised only if large disparities are reported. (See Revisions to Historical and Immediate Prior-Year Statistics later in this section.)
Small R&D performers and firms included in the sample for the first time are sent Form RD-1A. This form collects the same information as Form RD-1 except for five items: Federal R&D support to the firm by contracting agency, R&D costs by type of expense, domestic R&D expenditures by state, energy-related R&D, and foreign R&D by country. It also includes a screening item that allows respondents to indicate that they do not perform R&D. No prior-year information is made available since the majority of the companies that receive the Form RD-1A have not been surveyed in the previous year.
Recent Survey Form Content Changes
Beginning with the 1997 survey, data on federally funded R&D performed under contract to others (or "contracted out") were collected to better measure the amount of R&D performed both within and between companies. For earlier years, data were collected only on nonfederally funded contracted-out R&D.[8]
A new item, R&D depreciation costs, was added to the 1998 Form RD-1. In prior years R&D depreciation was included in the "other costs" category of R&D expenditures. Also beginning with the 1998 survey, items used to collect detailed information on the allocation of R&D expenditures by field of science and engineering and by product class, and R&D expenditures for pollution abatement were eliminated. Further, the amount of detail requested for energy-related R&D was reduced. Item nonresponse on each of these items was unacceptably high relative to their response burden.
To control burden and continuity during the transition to NAICS, the 1999 and 2000 survey forms remained as they were for 1998.
For 2001, an existing item was expanded and two new items were added. In response to inquiries from data users, a reporting category for "fringe benefits" was added to the existing "type of R&D" item. To probe two other areas of increasing interest, an item asking for the distribution of R&D by technology area and an item asking for the distribution of R&D performed by others by type of performing organization were added.
Number of Survey Forms Sent
Form RD-1 was mailed to companies that reported R&D expenditures of $5 million dollars or more in the 2000 survey and to "analytical certainty" companies (discussed previously). Approximately 1,800 companies received Form RD-1 and approximately 23,000 received Form RD-1A. Both survey forms and the instructions provided to respondents are available in section C, Survey Documents.
Survey Nonresponse
For various reasons, some firms did not choose to return the survey form or returned it with one or more blank items. (For detailed discussions on the sources, control, and measurement error resulting from item nonresponse, see U.S. Bureau of the Census (1994b).) For some firms, internal accounting systems and procedures may not have allowed quantification of specific expenditures. Others may have refused to answer any voluntary questions as a matter of company policy. All but five itemstotal R&D, Federal R&D, net sales, total employment, and R&D by state, which are included in the Census Bureau's annual mandatory statistical programare voluntary. The R&D by state item was designated mandatory for the first time for the 2001 cycle of the survey. (See further discussion under Response Rates and Mandatory Versus Voluntary Reporting later in this section.)
The 2001 survey forms were mailed in March 2002. Recipients of Form RD-1A were asked to respond within 30 days, while Form RD-1 recipients were given 60 days. A followup form and letter were mailed to RD-1A recipients every 30 days if their completed survey forms had not been received; a total of five followup mailings were conducted for nonresponding RD-1A recipients.
A letter was mailed to Form RD-1 recipients 30 days after the initial mailing, reminding them that their completed survey forms were due within the next 30 days. A second form and reminder letter were mailed to Form RD-1 respondents after 60 days. Two additional followup mailings were sent to nonresponding Form RD-1 recipients.
In addition to the mailings, telephone followup was used to encourage response from those firms ranked among the 300 largest R&D performers, based on total R&D expenditures reported in the previous survey. Table B-4 shows the number of companies in each industry or industry group that received the survey form and the percentage that responded to the survey.
If all attempts to get a response failed and no current-year information was reported, data for domestic sales, total employment, total R&D, and the number of R&D scientists and engineers were imputed as described in the next section.
Imputation for Unit and Item Nonresponse
When respondents did not provide the requested information, estimates for the missing data were made using various methods. Specific rules governed imputation for missing data depending on the item being imputed. For some items (domestic sales, total employment, total R&D, and number of research scientists and engineers) missing current year data are always imputed. Rates of change are applied to prior-year data regardless of whether prior-year data were reported or imputed. For other items (e.g., basic research, subcontracted R&D, and foreign R&D) missing current year data are imputed only if the company reported the item in either of the prior two years. A third type of imputation occurs when detail does not sum to the total (e.g., Federal R&D by agency). In this case if prior-year detail is not imputed, then current-year data are distributed based on the previous distribution pattern of the reporting unit. Otherwise, an industry average distribution is applied to the total to derive a value for each detail item. Rates of change are calculated by item within each NAICS category or industry. The calculations are based on weighted data for all companies that reported both variables. In the case of inter-item ratios (e.g., R&D to sales), calculations are based on data for all companies that reported both items in the current reporting period. For current-to-prior-year ratios (e.g., employment), calculations are based on data for all companies that reported that item in both years. (For detailed descriptions and analyses of the imputation methods and algorithms used, see U.S. Bureau of the Census (1994c).)
Outside sources of information are also used for imputing missing data. During the edit review process, analysts compare data reported to the Survey of Industrial Research and Development by publicly-owned companies with the company's report to the Securities and Exchange Commission (SEC). Data items matched include domestic sales, domestic employment, total or company-funded R&D, and in some cases, federally funded R&D. This comparison provides analysts a means to (1) potentially resolve inconsistencies between current- and prior-year data on the R&D survey, (2) impute missing data for specific items, and (3) ensure that companies are reporting comparable values in both reports.[9] A second source for obtaining or verifying domestic employment and domestic sales data is the U.S. Census Bureau's Business Register. Data for these items are collected on economic census and annual survey forms. If using these outside sources of information fails to supply missing data or resolve major inconsistencies, data are imputed. Table B-5 contains imputation rates for the principal survey items.
Survey reporting requirements divided survey items into two groups: mandatory and voluntary. Response to five data items was mandatory; response to the remaining items was voluntary. The mandatory items were total R&D expenditures, Federal R&D funds, net sales, total employment, and the distribution of R&D by state. During the 1990 survey cycle, NSF conducted a test of the effect of reporting on a completely voluntary basis to determine if combining both mandatory and voluntary items on one survey form influences response rates. For this test, the 1990 sample was divided into two panels of approximately equal size. One panel, the mandatory panel, was asked to report as usual on four mandatory items with the remainder voluntary; and the other panel was asked to report all items on a completely voluntary basis. The result of the test was a decrease in the overall survey response rate to 80 percent from levels of 88 percent in 1989 and 89 percent in 1988. The response rates for the mandatory and voluntary panels were 89 percent and 69 percent, respectively. Detailed results of the test were published in Research and Development in Industry: 1990. For firms that reported R&D expenditures in 2001, table B-6 shows the percentage that also reported data for other selected items.
Response to questions about character of work (basic research, applied research, and development) declined in the mid-1980s, and, as a result, imputation rates increased. The general imputation procedure described above became increasingly dependent upon information imputed in prior years, thereby distancing current-year estimates from any reported information. Because of the increasing dependence on imputed data, NSF chose not to publish character of work estimates in 1986. The imputation procedure used to develop these estimates was revised in 1987 for use with later data and differs from the general imputation approach. The new method calculated the character of work distribution for a nonresponding firm only if that firm reported a distribution within a five-year period, extending from two years before to two years after the year requiring imputation. Imputation for a given year was initially performed in the year the data were collected and was based on a character of work distribution reported in either of the two previous years, if any. It was again performed using new data collected in the next two years. If reported data followed no previously imputed or reported data, previous period estimates were inserted based on the currently reported information. Similarly, if reported data did not follow two years of imputed data, the two years of previously imputed data were removed. Thus, character of work estimates were revised as newly reported information became available and were not final for two years following their initial publication.
Beginning with 1995, previously estimated values were not removed for firms that did not report in the third year, nor were estimates made for the two previous years for firms reporting after two years of nonresponse. This process was changed because, in the prior period, revisions were minimal. Estimates continued to be made for two consecutive years of nonresponse and discontinued if the firm did not report character of work in the third year. If no reported data were available for a firm, character of work estimates were not imputed. As a consequence, only a portion of the total estimated R&D expenditures were distributed at the firm level. Those expenditures not meeting the requirements of the new imputation methodology were placed in a "not distributed" category.
NSF's objective in conducting the survey has always been to provide estimates for the entire population of firms performing R&D in the United States. However, the revised imputation procedure would no longer produce such estimates because of the "not distributed" component. A baseline estimation method thus was developed to allocate the not distributed amounts among the character of work components. In the baseline estimation method, the not distributed expenditures were allocated by industry group to basic research, applied research, and development categories using the percentage splits in the distributed category for that industry. The allocation was done at the lowest level of published industry detail only; higher levels were derived by aggregation, just as national totals were derived by aggregation of individual industry estimates, and result in higher performance shares for basic and applied research and lower estimates for development's share than would have been calculated using the previous method.
Using data collected during the 1999 and 2000 cycles of the survey, reporting anomalies for the character of work survey items, especially for basic research, were investigated. It was discovered that a number of large companies known to develop and manufacture products reported all of their R&D as basic research. This phenomenon is not logical and prompted a renewed effort to strengthen character of work estimates produced from the survey. Identification of the anomalous reporting patterns has been completed. Appropriate methods to deal with the anomalies have been implemented. Publication of character of work distributions of R&D has been resumed, and the tables containing historical basic research, applied research, and development estimates (tables A-24, A-25, and A-26) have been revised and footnoted accordingly.
Form RD-1 requests a distribution of the total cost of R&D among the state(s) where the R&D was performed. Prior to the 1999 survey, an independent source, the Directory of American Research and Development, published by the Data Base Publishing Group of the R. R. Bowker Company was used in conjunction with previous survey results to estimate R&D expenditures by state for companies that did not provide this information. The information on scientists and engineers published in the directory was used as a proxy indicator of the proportion of R&D expenditures within each state. R&D expenditures by state were estimated by applying the distribution of scientists and engineers by state from the directory to total R&D expenditures for these companies. These estimates were included with reported survey data to arrive at published estimates of R&D expenditures for each state. However, the practice of using outside information to formulate or adjust estimates of R&D expenditures for each state has been discontinued because a suitable source for supporting information is no longer available. State estimates resulting from the 1999 and 2000 surveys were based solely on respondent reports and information internal to the survey.
For 2001, because of the lack of a reliable, comprehensive outside source of information, in an effort to improve the quality of reported data, NSF sought and was granted authorization to require reporting of the distribution of R&D by state from the Office of Management and Budget (OMB), the Federal agency that oversees and controls burden on respondents.
This section summarizes survey improvements, enhancements, and changes in procedures and practices that may have affected the comparability of statistics produced from the Survey of Industrial Research and Development over time and with other statistical series (see also NSF (2002a) and U.S. Bureau of the Census (1995)).
Beginning with the 1999 cycle of the survey, industry statistics are published using the North American Industrial Classification System (NAICS). The ongoing development of NAICS has been a joint effort of statistical agencies in Canada, Mexico, and the United States. The system replaced the Standard Industrial Classification (1980) of Canada, the Mexican Classification of Activities and Products (1994), and Standard Industrial Classification (SIC, 1987) of the United States. (For a detailed comparison of NAICS to the Standard Industrial Classification (1987) of the United States, visit http://www.census.gov/epcd/www/naics.html.) NAICS was designed to provide a production-oriented system under which economic units with similar production processes are classified in the same industry. NAICS was developed with special attention to classifications for new and emerging industries, service industries, and industries that produce advanced technologies. NAICS not only eases comparability of information about the economies of the three North American countries, but it also increases comparability with the two-digit level of the United Nations' International Standard Industrial Classification (ISIC) system. Important for the Survey of Industrial Research and Development is the creation of several new classifications that cover major performers of R&D in the U.S. Among manufacturers, the computer and electronic products classification (NAICS 334) includes makers of computers and peripherals, semiconductors, and navigational and electromedical instruments. Among nonmanufacturing industries are information (NAICS 51) and professional, scientific, and technical services (NAICS 54). Information includes publishing, both paper and electronic; broadcasting; and telecommunications. Professional, scientific, and technical services includes a variety of industries. Of specific importance for the survey are engineering and scientific R&D service industries.
Effects of NAICS on Survey Statistics. The change of industry classification system affects most of the detailed statistical tables produced from the survey. Prior to the 1999 report, tables classified by industry contained the current survey's statistics plus statistics for 10 previous years. Because of the new classification system, the tables classified in this report contain only statistics for the current year (2001) and previous years back to 1999. However, to provide a bridge for users who want to make year-to-year comparisons below the aggregate level, in several tables in Research and Development in Industry: 2000 statistics from the 1997 and 1998 cycles of the survey, which were previously classified and published using the SIC system, were reclassified using the new NAICS codes. These reclassified statistics were slotted using their new NAICS classifications alongside the 1999 and 2000 statistics, which were estimated using NAICS from the outset.
Beginning with the 1999 cycle of the survey, the number of company size categories used to classify survey statistics was increased. The original 6 categories were expanded to 10 to emphasize the role of small companies in R&D performance. During 1998, companies with fewer than 500 employees spent $30.2 billion on industrial R&D performed in the United States. During 1999, they spent $34.1 billion (NSF 2001a). Because of the addition of the new size classifications, we can say that of the $34.1 billion, 21 percent ($7.0 billion) was spent by the smallest companies (those with at least 5 but fewer than 25 employees). The more detailed business size information also facilitates better international comparisons. Generally, statistics produced by foreign countries that measure their industrial R&D enterprise are reported with more detailed company size classifications at the lower end of the scale than U.S. industrial R&D statistics traditionally have been. (For more information, visit the Organisation for Economic Co-operation and Development (OECD) website at http://www.oecd.org.) The new classifications of the U.S. statistics enable more direct comparisons with other countries' statistics.
Revisions to historical statistics usually have been made because of changes in the industry classification of companies caused by changes in payroll composition detected when a new sample was drawn. Various methodologies have been adopted over the years to revise, or backcast, the data when revisions to historical statistics have become necessary. Documented revisions to the historical statistics from post-1967 surveys through 1992 are summarized in NSF (1994) and in annual reports for subsequent surveys. Detailed descriptions of the specific revisions made to the statistics from pre-1967 surveys are scarce, but U.S. Bureau of the Census (1995) summarizes some of the major revisions.
Changes to reported data can come from three sources: respondents, analysts involved in survey and statistical processing, and the industry reclassification process. Prior to 1995, routine revisions were made to prior-year statistics based on information from all three sources. Consequently, results from the current-year survey were used not only to develop current-year statistics, but also to revise immediate prior-year statistics. Beginning with the 1995 survey, this practice was discontinued. The reasons for discontinuation of this practice were annual sampling; continual strengthening of sampling methodology; and improvements in data verification, processing, and nonresponse followup. Moreover, it was not clear that respondents or those who processed the survey results had any better information a year after the data were first reported. Thus, it was determined that routinely revising published survey statistics increased the potential for error and often confused users of the statistics. Revisions are now made to historical and immediate prior-year statistics only if substantive errors are discovered.
For 1999, an error in the sample frame caused one very large company (based on payroll) to be selected for the sample and its statistical record to be assigned a large weight (see Frame Creation and Weighting and Maximum Weights above). Because the company's record had received a large weight during 1999 sampling, the company was selected with certainty for the 2000 sample and assigned a weight of one (see Identifying Certainty Companies above). This sampling artifact caused an abnormally large decrease in the company's data, especially for sales and employment, when comparing the 2000 statistics with the statistics originally published for 1999. The weight in the company's record in the 1999 statistical file was corrected, and revised 1999 statistics are included in the tables in this report. R&D estimates for the company also were affected; however, the amount of R&D reported was relatively small, even after weighting.
As summarized above under Character of Work Estimates, reporting anomalies for the character of work survey items, especially for basic research, were discovered and investigated using data collected during the 1999 and 2000 cycles of the survey. Companies known to develop and manufacture products but that reported all of their R&D as basic research were contacted and queried regarding their R&D activities. After reviewing the definitions of basic research, applied research, and development, all but several changed their distribution of R&D. Census, the collection and tabulation agent for the survey, was able to go back as far as 1998 and correct the statistical files. Consequently, the tables containing historical basic research, applied research, and development estimates (tables A-24, A-25, and A-26) have been revised and footnoted accordingly.
Comparability from year to year may be affected by new sample design, annual sample selection, and industry shifts.
Sample Design
By far the most profound influence on statistics from recent surveys occurred when the new sample design for the 1992 survey was introduced. Revisions to the 1991 statistics were dramatic (see Research and Development in Industry: 1992 for a detailed discussion). While the allocation of the sample was changed somewhat, the sample designs used for subsequent surveys were comparable to the 1992 sample design in terms of size and coverage.
Annual Sample Selection
With the introduction of annual sampling in 1992, more year-to-year change has resulted than when survey panels were used for two reasons. First, changes in classification of companies not surveyed are not reflected in the year-to-year movement. Prior to annual sampling, a wedging operation, which was performed when a new sample was selected, was a means of adjusting the data series to account for the changes in classification that occurred in the frame (see the discussion on wedging later under Time Series Analyses). Second, yearly correlation of R&D data is lost when independent samples are drawn each year.
Industry Shifts
The industry classification of companies is redefined each year with the creation of the sampling frame. By redefining the frame, the sample reflects current distributions of companies by size and industry. A company may move from one industry to another because of either changes in its payroll composition, which is used to determine the industry classification code (see previous discussion under Frame Creation); changes in the industry classification system itself; or changes in the way the industry classification code was assigned or revised during survey processing.
A company's payroll composition can change because of the growth or decline of product or service lines, the merger of two or more companies, the acquisition of one company by another, divestitures, or the formation of conglomerates. Although an unlikely occurrence, a company's industry designation could be reclassified yearly with the introduction of annual sampling. The result is that a downward movement in R&D expenditures in one industry is balanced by an upward movement in another industry from one year to the next.
From time to time, the industry coding system used by Federal agencies that publish industry statistics is changed or revised to reflect the changing composition of U.S. and North American industry. The Standard Industrial Classification (SIC) system, as revised in 1987, was used for statistics developed from the 198891 panel surveys and the 1992-98 annual surveys. As discussed above, the industrial classification system has been completely changed, and beginning with the 1999 cycle of the survey, the North American Industrial Classification System (NAICS) is now used.
The method used to classify firms during survey processing was revised slightly in 1992. Research has shown that the impact on individual industry estimates was minor. (The effects of changes in the way companies were classified during survey processing are discussed in detail in U.S. Bureau of the Census (1994a and 1994e). The current method used to classify firms was discussed previously under Frame Creation. Methods used for past surveys are discussed in U.S. Bureau of the Census (1995).)
Before the 1992 survey, the sample of firms surveyed was selected at irregular intervals; until 1967, samples were selected every 5 years. Subsequent samples were selected for 1971, 1976, 1981, and 1987. In intervening years, a panel of the largest firms known to perform R&D was surveyed. For example, a sample of about 14,000 firms was selected for the 1987 survey. For the 198891 studies, about 1,700 of these firms were resurveyed annually; the other firms did not receive survey forms, and their R&D data were estimated. This sample design was adequate during the survey's early years because R&D performance was concentrated in relatively few manufacturing industries. However, as more and more firms began entering the R&D arena, the old sample design proved increasingly deficient because it did not capture births of new R&D-performing firms. The entry of fledgling R&D performers into the marketplace was completely missed during panel years. Additionally, beginning in the early 1970s, the need for more detailed R&D information for nonmanufacturing industries was recognized. At that time, the broad industry classifications "miscellaneous business services" and "miscellaneous services" were added to the list of industry groups for which statistics were published. By 1975, about 3 percent of total R&D was performed by firms in nonmanufacturing industries. (See also NSF 1994, 1995, and 1996a.)
During the mid-1980s, there was evidence that a significant amount of R&D was being conducted by an increasing number of companies classified among the nonmanufacturing industries. Again the number of industries used to develop the statistics for nonmanufacturers was increased. Consequently, the annual reports in this series for 1987-91 included separate R&D estimates for firms in the communication, utility, engineering, architectural, research, development, testing, computer programming, and data processing service industries; hospitals; and medical labs. Approximately 9 percent of the estimated industrial R&D performance during 1987 was undertaken by nonmanufacturing firms.
After the list of industries for which statistics were published was expanded, it became clear that the sample design itself should be changed to reflect the widening population of R&D performers among firms in the nonmanufacturing industries (NSF 1995a) and small firms in all industries so as to account better for births of R&Dperforming firms and to produce more reliable statistics. Beginning with the 1992 survey, NSF decided (1) to draw new samples with broader coverage annually and (2) to increase the sample size to approximately 25,000 firms.[10] As a result of the sample redesign, for 1992 the reported nonmanufacturing share was (and has continued to be) 25-30 percent of total R&D. (See also NSF 1997, 1998, 1999b, 2000, 2001, and 2002b.)
The statistics resulting from this survey on R&D spending and personnel are often used as if they were prepared using the same collection, processing, and tabulation methods over time. Such uniformity has not been the case. Since the survey was first fielded, improvements have been made to increase the reliability of the statistics and to make the survey results more useful. To that end, past practices have been changed and new procedures instituted. Preservation of the comparability of the statistics has, however, been an important consideration in making these improvements. Nonetheless, changes to survey definitions, the industry classification system, and the procedure used to assign industry codes to multiestablishment companies have had some, though not substantial, effects on the comparability of statistics. (For discussions of each of these changes, see U.S. Bureau of the Census (1994g); for considerations of comparability, see U.S. Bureau of the Census (1993 and 1994e).)
The aspect of the survey that had the greatest effect on comparability was the selection of samples at irregular intervals and the use of a subset or panel of the last sample drawn to develop statistics for intervening years. As discussed earlier, this practice introduced cyclical deterioration of the statistics. As compensation for this deterioration, periodic revisions were made to the statistics produced from the panels surveyed between sample years. Early in the survey's history, various methods were used to make these revisions (U.S. Bureau of the Census 1995). After 1976 and until the 1992 advent of annual sampling, a linking procedure called wedging was used. In wedging, the 2 sample years on each end of a series of estimates served as benchmarks in the algorithms used to adjust the estimates for the intervening years. (The process was dubbed wedging because of the wedgelike area produced on a graph that compares originally reported statistics with the revised statistics that resulted after linking. For a full discussion of the mathematical algorithm used for the wedging process that linked statistics from the 1992 survey with those from the 1987 survey, see U.S. Bureau of the Census (1994g) and NSF (1995).)
NSF collects data on federally financed R&D from both Federal funding agenciesusing the Survey of Federal Funds for Research and Developmentand from performers of the R&Dindustry, Federal labs, universities, and other nonprofit organizationsusing the Survey of Industrial Research and Development and other surveys (http://www.nsf.gov/statistics/publication.cfm). As reported by Federal agencies, NSF publishes data on Federal R&D budget authority and outlays, in addition to Federal obligations. These terms are defined below (NSF 2002b):
National R&D expenditure totals in NSF's National Patterns of R&D Resources report series are primarily constructed with data reported by performers and include estimates of Federal R&D funding to these sectors. But until performer-reported survey data on Federal R&D expenditures are available from industry and academia, data collected from the Federal agency funders of R&D were used to project R&D performance. When survey data from the performers subsequently are tabulated, as they were for this report, these statistics replace the projections based on funder expectations. Historically, the two survey systems have tracked fairly closely. For example, in 1980, performers reported using $29.5 billion in Federal R&D funding, and Federal agencies reported total R&D funding between $29.2 billion in outlays and $29.8 billion in obligations (NSF 1996b). In recent years, however, the two series have diverged considerably. The difference in the Federal R&D totals appears to be concentrated in funding of industry, primarily aircraft and missile firms, by the Department of Defense. Overall, industrial firms have reported significant declines in Federal R&D support since 1990 (table A-1), while Federal agencies have reported level or slightly increased funding of industrial R&D (NSF 1999a). NSF continues to identify and examine the factors behind these divergent trends.
| Table | Table Name | ||
|---|---|---|---|
| B-1 | Survey of Industrial Research and Developmentnumber of companies in the target population and selected for the sample, by industry and size of company: 2001 | .xls | |
| B-2 | Survey of Industrial Research and Developmentrelative standard error for survey estimates, by industry and size of company: 2001 | .xls | |
| B-3 | Survey of Industrial Research and Developmentrelative standard error for estimates of total R&D and percentage of estimates attributed to certainty companies, by state: 2001 | .xls | |
| B-4 | Survey of Industrial Research and Developmentunit response rates-number and percentage of companies that responded to the survey and percentage of companies that performed R&D, by industry and type of survey form: 2001 | .xls | |
| B-5 | Survey of Industrial Research and Developmentimputation rates for survey items, by industry and size of company: 2001 | .xls | |
| B-6 | Survey of Industrial Research and Developmentpercentage of R&D-performing companies that reported non-zero data for major survey items: 2000 | .xls | |
| B-7 | Survey of Industrial Research and Developmentfunds for and number of companies that performed industrial basic research, applied research, and development, in the U.S. and funds and percent of funds not distributed, by industry and size of company, by source of funds: 2001 | .xls |
Employment, FTE R&D scientists and engineers. Number of people employed in the 50 U.S. states and D.C. by R&D-performing companies who were engaged in scientific or engineering work at a level that required knowledge, gained either formally or by experience, of engineering or of the physical, biological, mathematical, statistical, or computer sciences equivalent to at least that acquired through completion of a 4-year college program with a major in one of those fields. The statistics show full-time-equivalent (FTE) employment of persons employed by the company during the January following the survey year who were assigned full time to R&D, plus a prorated number of employees who worked on R&D only part of the time.
Employment, total. Number of people employed in the 50 U.S. states and D.C. by R&D-performing companies in all activities during the pay period that included the 12th of March of the study year (March 12 is the date most employers use when paying first quarter employment taxes to the Internal Revenue Service).
Federally funded R&D centers (FFRDCs). R&D-performing organizations administered by industrial, academic, or other institutions on a nonprofit basis and exclusively or substantially financed by the Federal Government. To avoid the possibility of disclosing company-specific information and therefore violating the confidentiality provisions of Title 13 of the United States Code, beginning in 2001 data for industry-administered FFRDCs are now collected through NSF's annual academic R&D expenditure survey, the Survey of Research and Development Expenditures at Universities and Colleges, as are data from FFRDCs administered by academic institutions and nonprofit organizations. NSF estimates that industry-administered FFRDCs performed $1.9 billion during 2001. (More information about this survey is available from NSF's Division of Science Resources Statistics website at http://www.nsf.gov/statistics/rdexpenditures/. For current lists of FFRDCs, visit http://www.nsf.gov/statistics/ffrdc/.)
Funds for R&D, company and other non-Federal. The cost of R&D performed within the company and funded by the company itself or by other non-Federal sources in the 50 U.S. states and D.C.; does not include the cost of R&D funded by the company but contracted to outside organizations such as research institutions, universities and colleges, nonprofit organizations, orto avoid double countingother companies.
Funds for R&D, Federal. The cost of R&D performed within the company in the 50 U.S. states and D.C. funded by Federal R&D contracts, subcontracts, R&D portions of Federal procurement contracts and subcontracts, grants, or other arrangements; does not include the cost of R&D supported by the Federal government but contracted to outside organizations such as research institutions, universities and colleges, nonprofit organizations, or other companies.
Funds for R&D, total. The cost of R&D performed within the company in its own laboratories or in other company-owned or company-operated facilities in the 50 U.S. states and D.C., including expenses for wages and salaries, fringe benefits for R&D personnel, materials and supplies, property and other taxes, maintenance and repairs, depreciation, and an appropriate share of overhead; does not include capital expenditures or the cost of R&D contracted to outside organizations such as research institutions, universities and colleges, nonprofit organizations, orto avoid double-countingother companies.
Funds per R&D scientist or engineer. All costs associated with the performance of industrial R&D (salaries, wages, and fringe benefits paid to R&D personnel; materials and supplies used for R&D; depreciation on capital equipment and facilities used for R&D; and any other R&D costs) divided by the number of R&D scientists and engineers employed in the 50 U.S. states or D.C. To obtain a per person cost of R&D for a given year, the total R&D expenditures of that year were divided by an approximation of the number of full-time-equivalent (FTE) scientists and engineers engaged in the performance of R&D for that year. For accuracy, this approximation was the mean of the numbers of such FTE R&D-performing scientists and engineers as reported in January for the year in question and the subsequent year. For example, the mean of the numbers of FTE R&D scientists and engineers in January 2001 and January 2002 was divided into total 2001 R&D expenditures for a total cost per R&D scientist or engineer in 2001.
Net sales and receipts. Dollar values for goods sold or services rendered by R&D-performing companies to customers outside the company, including the Federal Government, less such items as returns, allowances, freight, charges, and excise taxes. Domestic intracompany transfers and sales by foreign subsidiaries were excluded, but transfers to foreign subsidiaries and export sales to foreign companies were included.
R&D and industrial R&D. R&D is the planned, systematic pursuit of new knowledge or understanding toward general application (basic research); the acquisition of knowledge or understanding to meet a specific, recognized need (applied research); or the application of knowledge or understanding toward the production or improvement of a product, service, process, or method (development). Basic research analyzes properties, structures, and relationships toward formulating and testing hypotheses, theories, or laws; applied research is undertaken either to determine possible uses for the findings of basic research or to determine new ways of achieving specific, predetermined objectives; and development draws on research findings or other scientific knowledge for the purpose of producing new or significantly improving products, services, processes, or methods. As used in this survey, industrial basic research is the pursuit of new scientific knowledge or understanding that does not have specific immediate commercial objectives, although it may be in fields of present or potential commercial interest; industrial applied research is investigation that may use findings of basic research toward discovering new scientific knowledge that has specific commercial objectives with respect to new products, services, processes, or methods; and industrial development is the systematic use of the knowledge or understanding gained from research or practical experience directed toward the production or significant improvement of useful products, services, processes, or methods, including the design and development of prototypes, materials, devices, and systems. The survey covers industrial R&D performed by people trained, either formally or by experience, in engineering or in the physical, biological, mathematical, statistical, or computer sciences and employed by a publicly or privately owned firm engaged in for-profit activity in the United States. Specifically excluded from the survey are quality control, routine product testing, market research, sales promotion, sales service, and other nontechnological activities; routine technical services; and research in the social sciences or psychology.
National Science Foundation (NSF). 1956. Science and Engineering in American Industry: Final Report on a 1953-54 Survey. NSF 56-16. Washington, DC: U.S. Government Printing Office.
National Science Foundation (NSF). 1960. Science and Engineering in American Industry: 1956. NSF 59-50. Washington, DC: U.S. Government Printing Office.
National Science Foundation (NSF). 1994. "1992 R&D Spending by U.S. Firms Rises, NSF Survey Improved." SRS Data Brief. NSF 94-325. Arlington, VA.
National Science Foundation (NSF). 1995. "1993 Spending Falls for U.S. Industrial R&D, Nonmanufacturing Share Increases." SRS Data Brief. NSF 95-325. Arlington, VA.
National Science Foundation (NSF). 1995a. Research and Development in Industry: 1992, NSF 95-324. Arlington, VA.
National Science Foundation (NSF). 1996a. "1994 Company Funding of U.S. Industrial R&D Rises as Federal Support Continues to Decline." SRS Data Brief. NSF 96-310. Arlington, VA.
National Science Foundation (NSF). 1996b. National Patterns of R&D Resources: 1996. NSF 96-333. Arlington, VA.
National Science Foundation (NSF). 1997. "1995 U.S. Industrial R&D Rises, NSF Survey Statistics Expanded to Emphasize Role of Nonmanufacturing Industries." SRS Data Brief. NSF 97-332. Arlington, VA.
National Science Foundation (NSF). 1998. "1996 U.S. Industrial R&D: Firms Continue to Increase Their Investment." SRS Data Brief. NSF 98-317. Arlington, VA.
National Science Foundation (NSF). 1999a. National Patterns of R&D Resources: 1998. NSF 99-335. Arlington, VA.
National Science Foundation (NSF). 1999b. "1997 U.S. Industrial R&D Performers." SRS Topical Report. NSF 99-355. Arlington, VA.
National Science Foundation (NSF). 2000. "1998 U.S. Industrial R&D Performers Report Increase R&D." SRS Data Brief. NSF 00-320. Arlington, VA.
National Science Foundation (NSF). 2001a. "U.S. Industrial R&D Performers Report Increased R&D in 1999; New Industry Coding and Size Classifications for NSF Survey." SRS Data Brief. NSF 01-326. Arlington, VA.
National Science Foundation (NSF). 2001b. "U.S. Industrial R&D: NSF Announces New Information Retrieval System and Historical Database" SRS Data Brief. NSF 01-338. Arlington, VA.
National Science Foundation (NSF). 2002a. "U.S. Industrial R&D Expenditures and R&D-to-Sales Ratio Reach Historical Highs in 2000" SRS InfoBrief. NSF 03-306. Arlington, VA.
National Science Foundation (NSF). 2002b. Federal Research and Development Funding by Budget Function: Fiscal Years 2000-2002. NSF 02-301. Arlington, VA.
National Science Foundation (NSF). 2003. "U.S. Industry Sustains R&D Expenditures Despite Decline in Performers' Sales and Employment During 2001" SRS InfoBrief. NSF 04-301. Arlington, VA.
U.S. Bureau of the Census. 1993. "Effects of the 1987 SIC Revision on Company Classification in the Survey of Industrial Research and Development (R&D)." Technical Memorandum. December 6.
U.S. Bureau of the Census. 1994a. "Comparison of Company Coding Between 1992 and 1993 for the Survey of Industrial Research and Development." Technical Memorandum. November 3. Washington, DC.
U.S. Bureau of the Census. 1994b. Documentation of Nonsampling Issues in the Survey of Industrial Research and Development. RR94/03. Washington, DC.
U.S. Bureau of the Census. 1994c. An Evaluation of Imputation Methods for the Survey of Industrial Research and Development. ESMD-9404. Washington, DC.
U.S. Bureau of the Census. 1994d. "Evaluation of Total Employment Cut-Offs in the Survey of Industrial Research and Development." Technical Memorandum. November 3. Washington, DC.
U.S. Bureau of the Census. 1994e. "Reclassification of Companies in the 1992 Survey of Industrial Research and Development (R&D) for the Generation of the 'Analytical' Series." Technical Memorandum. October 25. Washington, DC.
U.S. Bureau of the Census. 1994f. A Study of Processing Errors in the Survey of Industrial Research and Development. ESMD-9403. Washington, DC.
U.S. Bureau of the Census. 1994g. "Wedging Considerations for the 1992 Research and Development (R&D) Survey." Technical Memorandum. June 10. Washington, DC.
U.S. Bureau of the Census. 1995. Documentation of the Survey Design for the Survey of Industrial Research and Development: A Historical Perspective. Washington, DC.
[5] In the Survey of Industrial Research and Development and in the publications presenting statistics resulting from the survey, the terms "company", "firm," and "enterprise" are used interchangeably. "Industry" refers to the 2-, 3-, or 4-digit North American Industrial Classification System (NAICS) codes or group of NAICS codes used to publish statistics resulting from the survey.
[6] Had there been a zero-industry stratum in the 2001 sample, it would have been sampled using srs as discussed previously under "Identifying "Zero" Industries."
[7] The relative standard error (RSE) is a percentage that can be added to and subtracted from the published estimate to allow the user to construct an interval with prescribed confidence that the interval includes the actual value. The 2001 survey sample was designed to produce RSEs targeted at 1 percent for all industries except for four industries in nonmanufacturing whose R&D expenditures in 2000 was estimated below $1 billion. For those industries the constraint was 2 percent. As has been noted, actual estimated RSEs can exceed these target RSEs because of a poor correlation between the frame measure of size used to assign probabilities and the actual reported data.
[8] Even though data on federally funded contracted-out R&D are collected, the tables based on the data tend to be "spotty." That is, because federally funded contracted-out R&D is reported by so few companies, most of the resulting statistics arrayed by industry have to be suppressed because of confidentiality. Further, because of the sporadic nature of Federal funding of R&D in some industries, even in the aggregate, year-to-year changes can be quite large. Consequently, the tables containing the statistics are not published. Following are the results of recent data collections. In the 1997 table, the "all industries" total had to be suppressed, so no meaningful estimate could be made for that year. For 1998, the "all industries" total was $4.3 billion; for 1999, the data were not tabulated; for 2000, the "all industries" total was $0.8 billion; and for 2001, it was $0.5 billion. We will continue to tabulate this item and report the aggregate estimate when possible.
[9] If values are not comparable, companies are contacted, the reason(s) for incomparability are recorded, and data are corrected, if appropriate.
[10] Annual sampling also remedies the cyclical deterioration of the statistics that results from changes in a company's payroll composition because of product line and corporate structural changes.