text-only page produced automatically by Usablenet Assistive Skip all navigation and go to page content Skip top navigation and go to directorate navigation Skip top navigation and go to page navigation
National Science Foundation
design element
Search Awards
Recent Awards
Presidential and Honorary Awards
About Awards
Grant Policy Manual
Grant General Conditions
Cooperative Agreement Conditions
Special Conditions
Federal Demonstration Partnership
Policy Office Website

Award Abstract #1244713

Collaborative Research: Automatically Annotated Repository of Digital Video and Audio Resources Community (AARDVARC)

Division Of Behavioral and Cognitive Sci
divider line
Initial Amendment Date: September 23, 2012
divider line
Latest Amendment Date: April 8, 2013
divider line
Award Number: 1244713
divider line
Award Instrument: Standard Grant
divider line
Program Manager: William J. Badecker
BCS Division Of Behavioral and Cognitive Sci
SBE Direct For Social, Behav & Economic Scie
divider line
Start Date: September 15, 2012
divider line
End Date: February 28, 2015 (Estimated)
divider line
Awarded Amount to Date: $84,982.00
divider line
Investigator(s): Damir Cavar dcavar@indiana.edu (Principal Investigator)
Helen Aristar-Dry (Former Principal Investigator)
Anthony Aristar (Former Co-Principal Investigator)
Damir Cavar (Former Co-Principal Investigator)
divider line
Sponsor: Eastern Michigan University
Office of Research Development
YPSILANTI, MI 48197-2212 (734)487-3090
divider line
NSF Program(s): Data Infrastructure
divider line
Program Reference Code(s): 7433
divider line
Program Element Code(s): 8068


Audio and video data from understudied languages is useful to linguists, anthropologists, educators, and computer scientists interested in visual action extraction, speech technology or software localization. Terabytes of such data exist, having been collected in large amounts by documentary linguists since the advent of easy digital recording via handheld devices. As records of vanishing languages and cultures, video and audio records are far richer and more captivating than paper records, but they need to be indexed and transcribed so that they reach their full potential as research tools. The current project, AARDVARC (Automatically Annotated Repository of Digital Audio and Video Resources Community) will address the problem of untranscribed, and therefore unavailable, documentation of understudied languages by building an interdisciplinary community of linguists, anthropologists, and computer scientists to share knowledge and collaborate on the specification of a repository and suite of tools to facilitate transcription. It will provide for two workshops and a symposium to design a "take one leave one" repository and to explore recent advances in speech and video processing that will allow anthropologists and linguists to break the 'transcription bottleneck' for language data. Even partial automation will greatly facilitate the work of the analyst and dramatically increase the amount of transcribed audio and video available to researchers in multiple disciplines.


Please report errors in award information by writing to: awardsearch@nsf.gov.



Print this page
Back to Top of page
Research.gov  |  USA.gov  |  National Science Board  |  Recovery Act  |  Budget and Performance  |  Annual Financial Report
Web Policies and Important Links  |  Privacy  |  FOIA  |  NO FEAR Act  |  Inspector General  |  Webmaster Contact  |  Site Map
National Science Foundation Logo
The National Science Foundation, 4201 Wilson Boulevard, Arlington, Virginia 22230, USA
Tel: (703) 292-5111, FIRS: (800) 877-8339 | TDD: (800) 281-8749
  Text Only Version