National Science Foundation - Social, Behavioral & Economic Sciences (SBE)

Discovery
Detecting social patterns from shifting dialects

A powerful computer program allows scientists to map shifts in regional accents. Such research can aid the development of speech recognition technology.

Photo of the Philadelphia skyline at night

Philadelphia, seen here at night, has moved to a more Northern-sounding dialect.

August 20, 2013

Knowing glances may dot a room when listeners hear the line, "You say tomato, I say tomahto," from the popular Gershwin song "Let's Call the Whole Thing Off." Whether you're from Philadelphia or Fresno, Winnetka or Waco, your dialect often identifies you with a particular locale.

Now, using a powerful computer program, researchers at the University of Pennsylvania (Penn) are providing insight into a significant change in the dialect of Philadelphians. In a century's time, the sound of Philadelphia has shifted from a somewhat Southern accent to a more Northern one. And it's not just a few areas of Philadelphia. The entire city shifted.

"The reversal indicates major changes in social patterns," says Penn linguist William Labov.

Considered the northernmost of the Southern cities, Philadelphia has continued to progress toward a more Northern-sounding dialect.

"All those things that align Philadelphia with the South are disappearing," says Labov. "The South is receding, and language is very sensitive to profound social attitudes." Younger people are less likely to pick up or use Southern inflections.

"When we study how language changes, we gain an understanding of what we're like as human beings," says Labov. "Regional dialects in America are getting more and more different and carrying each region away from the other."

One vowel at a time

Labov and his colleagues developed their conclusions using a program called Forced Alignment & Vowel Extraction (FAVE). It allowed them to automatically analyze vowel sounds on recordings of interviews with speakers from 89 neighborhoods throughout the city whose birth years ranged from 1889 through 1991. The interviews were compiled yearly beginning in 1973 as part of a long-term language study undertaken by Labov and his students.

"We wanted to make automatic what, in the past, was a painfully slow-hand process," says Labov of the computer analysis program. Previously, vowel analysis required listening to a digital recording on a computer and physically stopping the audio to make a measurement of a vowel sound. The few automated analysis programs available required quality checks to determine if the program had correctly identified the start and end of a vowel sound.

"When the original algorithm was working correctly, very few errors were found. However, when it was off, it was off by a lot and introduced numerous errors," says Josef Fruehwald, a doctoral student working with Labov. Older analysis programs were also unable to accurately sort through the extraneous noises introduced on the recordings by household sounds such as water running or a television playing in the background.

Two years in the making, the FAVE program follows every word on an interview transcript and looks up each word's sound in a pronunciation dictionary. For the word "bat," for instance, the algorithm marks the beginning and end of b, a and t. It then provides analysis for vowels throughout the entire interview. The program is so efficient that in one hour it provides 7,000 measurements for one interview. Before FAVE, an analysis could take three days and yield just 300 measurements.
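
As a rough illustration of the lookup-and-measure steps described above, here is a minimal Python sketch. It is not the FAVE code. The dictionary entries, the Segment type and the function names are hypothetical, and the start and end times of each sound are assumed to come from a separate alignment step that is not shown.

```python
from dataclasses import dataclass

# Toy pronunciation dictionary (hypothetical entries for illustration).
# Assumed convention: vowels carry a trailing stress digit, consonants do not.
PRONUNCIATIONS = {
    "bat": ["B", "AE1", "T"],
    "make": ["M", "EY1", "K"],
}


@dataclass
class Segment:
    """One aligned sound: its label plus start and end times in seconds."""
    phone: str
    start: float
    end: float


def lookup_phones(transcript, dictionary=PRONUNCIATIONS):
    """Look up each transcript word in the dictionary; skip unknown words."""
    words = [w.strip(".,!?\"'") for w in transcript.lower().split()]
    return [(w, dictionary[w]) for w in words if w in dictionary]


def is_vowel(phone):
    """Treat any label ending in a stress digit as a vowel (assumption)."""
    return phone[-1].isdigit()


def vowel_midpoints(segments):
    """Return (vowel, midpoint time) pairs, one per aligned vowel segment."""
    return [
        (s.phone, (s.start + s.end) / 2)
        for s in segments
        if is_vowel(s.phone)
    ]


if __name__ == "__main__":
    print(lookup_phones("Bat, make."))
    aligned_bat = [
        Segment("B", 0.0, 0.25),
        Segment("AE1", 0.25, 0.75),
        Segment("T", 0.75, 1.0),
    ]
    print(vowel_midpoints(aligned_bat))
```

In this sketch the midpoint of each vowel stands in for the point at which a measurement would be taken; a real analysis would read acoustic values from the recording at or near such a point.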

"The program has really exploded the volume of data we get from each speaker," says Fruehwald. The researchers have measured about 1 million vowels in the study. The increased data improves the accuracy of language analysis and provides a higher level of confidence in the results.

Moving data

Presenting such a large amount of data in a meaningful way was paramount for Fruehwald. So he created motion diagrams of how vowel sounds in the study changed over time. One data point on the diagram for the "aw" sound, for instance, moves up into a more Southern pronunciation for about 75 years and then turns back toward a more Northern pronunciation.

Fruehwald says that the software is finding a larger audience as evidenced by an increasing number of related presentations at professional conferences. "This is all going to be taking off," says Fruehwald. Linguists interested in using the FAVE suite can download it or use its online interface free of charge at the FAVE site.

The end result

Sound changes such as those studied here remain a major obstacle to communication, especially when it comes to machine recognition of spontaneous speech. Companies engaged in creating speech recognition programs have used the Atlas of North American English, produced by Labov's research group, to define the range of dialects that must be represented in the database of sounds used to "train" the speech recognition software. Philadelphia teachers are also using the group's results to refine their classroom plans so that they account for speech variations among students.

Future research by the Labov team will involve learning why accents in all of the study neighborhoods moved in the same direction at the same time and how minority participation affects changing dialect patterns.

Editor's note:
This Behind the Scenes article was first provided to LiveScience.com in partnership with the National Science Foundation.

--  Susan Reiss, National Science Foundation (703) 292-8070 sreiss@nsf.gov

Investigators
Jiahong Yuan
William Labov

Related Institutions/Organizations
University of Pennsylvania

Locations
Pennsylvania

Related Awards
#0921643 Automatic Alignment and Analysis of Linguistic Change.

Total Grants
$255,363

graphic representation showing multiple bars marking the vocal progression of a speaker
A FAVE-suite spectrogram of an 1888-born speaker vocally progressing from the word "make"

graphic representation of a speaker vocal progression from the word make to meek
A FAVE-suite spectrogram of a 1988-born speaker vocally progressing from the word "make"

map showing Philadelphia as the northernmost of the Southern cities
Atlas of North American English map showing Philadelphia as the northernmost of the Southern cities.


