Award Abstract # 1251276
BIGDATA: Small: DCM: DA: Collaborative Research: SMASH -- Scalable Multimedia content AnalysiS in a High-level language

NSF Org: IIS
Div Of Information & Intelligent Systems
Awardee: INTERNATIONAL COMPUTER SCIENCE INSTITUTE
Initial Amendment Date: June 12, 2013
Latest Amendment Date: April 26, 2018
Award Number: 1251276
Award Instrument: Standard Grant
Program Manager: Maria Zemankova
IIS
 Div Of Information & Intelligent Systems
CSE
 Direct For Computer & Info Scie & Enginr
Start Date: June 15, 2013
End Date: May 31, 2019 (Estimated)
Total Intended Award Amount: $403,982.00
Total Awarded Amount to Date: $419,982.00
Funds Obligated to Date: FY 2013 = $403,982.00
FY 2016 = $16,000.00
History of Investigator:
  • Gerald  Friedland (Principal Investigator)
    fractor@icsi.berkeley.edu  (510)666-2900
Awardee Sponsored Research Office: International Computer Science Institute
2150 Shattuck Ave, Suite 1100
Berkeley
CA  US  94704-1345
(510)666-2900
Sponsor Congressional District: 13
Primary Place of Performance: International Computer Science Institute
Berkeley
CA  US  94704-1198
Primary Place of Performance
Congressional District:
13
DUNS ID: 187909478
Parent DUNS ID:
NSF Program(s): Information Technology Researc,
Big Data Science &Engineering
Primary Program Source: 040100 NSF RESEARCH & RELATED ACTIVIT
040100 NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 1640, 7433, 7923, 8083, 9251
Program Element Code(s): 1640, 8083
Award Agency Code: 4900
Fund Agency Code: 4900
CFDA Number(s): 47.070

ABSTRACT

This big data project develops tools to support researchers and developers in the task of prototyping multimedia content analysis algorithms in a large scale. Typically, scientists and engineers prefer to use high-level programming languages such as Python or MATLAB to conduct experiments, as they allow for a quick implementation of a novel idea. Experiments on big data, however, are often computationally-intensive and therefore must eventually be recoded into a low-level language by expert programmers in order to achieve sufficient performance, creating a gap between productivity and performance. In addition, multiple strategies may exist for mapping a problem onto parallel hardware depending on the input data size and the hardware parameters, further exacerbating the problem. Using the application area of multimedia content analysis as an example (an area with one of the largest and the fastest growing amounts of data due to the steady upload of consumer produced videos), this project performs research on a pattern-oriented, application-specific specialization framework that uses a tiered approach to parallel programming. The ultimate aim is to provide the scalability of diverse parallel processing at the productivity level of high-level languages.

Social media videos are increasingly being used for scientific research, as they allow us to observe and model many phenomena studied, for example, in social sciences, economics, meteorology and medicine. More scalable content analysis impacts any field that uses social media videos. Moreover, social media videos are an everyday part of many people's lives. Making multimedia content analysis more scalable allows for better algorithms to be developed by more students and researchers, and therefore impacts many people's lives. The framework is made available on the project website (http://smash.icsi.berkeley.edu).

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Bart Thomee, David A. Shamma, Gerald Friedland, Benjamin Elizalde, Karl Ni, Douglas Poland, Damian Borth, and Li-Jia Li. "The New Data in Multimedia Research." Communications of the ACM , v.59 , 2016 , p.64--73 10.1145/2812802
Bart Thomee, David A. Shamma, Gerald Friedland, Benjamin Elizalde, Karl Ni, Douglas Poland, Damian Borth, and Li-Jia Li. "The New Data in Multimedia Research.. Communications of the ACM." Communications of the ACM , 2016 DOI: 10.1145/2812802
L. Jing, B Liu, A. Janin, J. Choi, J. Bernd. M. Mahoney, G. Friedland "A Discriminative and Compact Audio Representation for Event Detection" ACM Multimedia 2016 , 2016
Jaeyoung Choi, Bart Thomee, Martha Larson. "Practical guide to using the YFCC100M and MMCOMMONS on a budget." ACM SIGMM Records , v.9 , 2017
Liping Jing, Bo Liu, Jaeyoung Choi, Adam Janin, Julia Bernd, Michael W. Mahoney, Gerald Friedland "DCAR: A Discriminative and Compact Audio Representation for Audio Processing." IEEE Transactions on Multimedia. , 2017
Mario Michael Krell, Julia Bernd, Daniel Ma, Damian Borth, Jaeyoung Choi, Gerald Friedland "Field Studies with Multimedia Big Data: Opportunities and Challenges." ICSI Technical Report TR-17-002 , 2017
Daniel Ma, Gerald Friedland and Mario Michael Krell "OrigamiSet1.0: Two New Datasets for Origami Classification and Difficulty Estimation." 7th International Meeting on Origami in Science, Mathematics and Education (7OSME). , 2018
Jack Ye, Jaeyoung Choi, Gerald Friedland "Supervised Deep Hashing for Cover Song Detection" IEEE International Conference on Multimedia Information Processing and Retrieval (MIPR) , 2019

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

The SMASH project initially proposed to create a software framework that makes it easy for students and practitioners to analyze and manipulate video, image and audio data in large scale. The vision was to create a library that makes written program code automatically scalable to cloud computing so that a successfull experiment in the small could be immediately repeated in the large.

The outcomes of this award are as follows. We organized the creation of a research corpus of 100 Million images and 1 Million videos from Flickr (YFCC100m). The corpus contains various annotations, such as tags, title, descriptions, geo-tags, time stamps, etc. As far as we know, it is the largest openly available research corpus available for multimedia data as of today. Amazon agreed to host the corpus as part of their Open Data Initiative. Lawrence Livermore National Lab hosts the corpus for goverment research purposes. We then created the Multimedia Commons initiative to further the creation of annotation and code sharing around this dataset. We also contributed the orginally proposed scalable multimedia analysis framework (Smash) based on Amazon's cloud tools and Jupyter Notebook. Smash allows students to use Python to perform various multimedia analysis experiments directly on 100M images and 1M videos. The corpus and it's infrastructure have been widely adapted into the research community as indicated by a) mainstream media coverage (including CNN, BBC News, and Forbes) and b) references to the main article in the Communications of the ACM, at a rate of more than two citations per week. 

The result of this NSF grant allows for empirical studies at never-before-seen scale. The images and videos show many facts that would have to otherwise be analyzed as part of interviewing, traveling and/or performing lab experiments. With the help of undergraduate supplement funding we also created the Multimedia Commons search engine which allows users to create subcorpora for their specific research questions. An example result of using this search engine by an undergraduate student for his research idea was the proposed redefinition of a difficulty metric for Origami tutorial videos. The paper was not only accepted at the 2018 International conference on Origami in Science, Mathematics and Education, the student also won a scholarship. 

The results of the Smash project will continue to have impact in teaching and research at UC Berkeley and in the multimedia community as a whole beyond the scope of the NSF funding, especially since large industry players such as IBM and Google reportedly rely on YFCC100m and Multimedia Commons and are actively funding research with it.


Last Modified: 07/31/2019
Modified by: Gerald Friedland

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page