text-only page produced automatically by LIFT Text Transcoder Skip all navigation and go to page contentSkip top navigation and go to directorate navigationSkip top navigation and go to page navigation
National Science Foundation
Search  
Awards
design element
Search Awards
Recent Awards
Presidential and Honorary Awards
About Awards
Grant Policy Manual
Grant General Conditions
Cooperative Agreement Conditions
Special Conditions
Federal Demonstration Partnership
Policy Office Website


Award Abstract #0325949
ITR: COLLABORATIVE RESEARCH: A Data Mining and Exploration Middleware for Grid and Distributed Computing


NSF Org: CCF
Division of Computer and Communication Foundations
divider line
divider line
Initial Amendment Date: October 30, 2003
divider line
Latest Amendment Date: October 30, 2006
divider line
Award Number: 0325949
divider line
Award Instrument: Continuing grant
divider line
Program Manager: Almadena Y. Chtchelkanova
CCF Division of Computer and Communication Foundations
CSE Directorate for Computer & Information Science & Engineering
divider line
Start Date: November 1, 2003
divider line
Expires: October 31, 2007 (Estimated)
divider line
Awarded Amount to Date: $611000
divider line
Investigator(s): Vipin Kumar kumar@cs.umn.edu (Principal Investigator)
Jon Weissman (Co-Principal Investigator)
divider line
Sponsor: University of Minnesota-Twin Cities
200 OAK ST SE
MINNEAPOLIS, MN 55455 612/624-5599
divider line
NSF Program(s): ITR MEDIUM (GROUP) GRANTS
divider line
Field Application(s): 0000099 Other Applications NEC,
0000912 Computer Science
divider line
Program Reference Code(s): HPCC, 9216, 4080, 1687, 1652
divider line
Program Element Code(s): 1687

ABSTRACT

The purpose of this project is to develop and implement a middleware that allows mining and analysis of distributed data. The research will address an increasing interest in the exploration and mining of the large volume of data that is generated by the business, scientific, engineering, academic and defense communities.

Novel techniques will be developed in the following areas:

Data and Policy Management Services: These services will provide organizational level access restriction capabilities for the owners of the data while allowing users to access a more efficient transport of data.

Data Mining and Exploration Services: This framework will include a library of data mining tools that will work effectively for example if: data is distributed on multiple sites, the user has varying privileges depending on what site they are accessing, a user wants to trade off computing time versus accuracy or a user wants to access the site remotely.

Scheduling and Replication Services: System administration will be based upon policies at the participating organizations and their established privileges for users. Computation and data will be scheduled jointly to optimize metrics.

This combination of distributed computing and data mining will be made widely available to students, researchers, and other interested groups in government, industry and education.


PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

(Showing: 1 - 10 of 12)
  Show All

Darin England and Jon B. Weissman. "A Resource Leasing Policy for On-demand Computing," International Journal for High Performance Computing and Applications (IJHPCA), v.20, 2006, p. 000.

Gyorgy Simon, Eric Eilertson, Vipin Kumar, Zhi-Li Zhang and Hui Xiong. "Scan Detection: A Data Mining Approach," Proceedings of the Sixth SIAM International Conference on Data Mining, v.6, 2006, p. 000.

H. Xiong, M. Steinbach, and V. Kumar. "Privacy Leakage in Multi-relational Databases via Pattern based Semi-supervised Learning," Proc. of the ACM Conference on Information and Knowledge Management, v.CIKM, 2005, p. 000.

H. Xiong, X. He, C. Ding, Y. Zhang, V. Kumar, and S. R. Holbrook.. "Identification of Functional Modules in Protein Complexes via Hyperclique Pattern Discovery.," Proc. of the Pacific Symposium on Biocomputing (PSB 2005), 2005., v.PSB 200, 2005, p. 000.

Hui Xiong, Gaurav Pandey, Michael Steinbach, Vipin Kumar. "Enhancing Data Analysis with Noise Removal," IEEE Transactions on Knowledge and Data Engineering (TKDE), v.18, 2006, p. 304.

Hui Xiong, Michael Steinbach, and Vipin Kumar. "Privacy Leakage in Multi-relational Databases: A Semi-supervised Learning Perspective," VLDB Journal Special Issue on Privacy Preserving Data Management, v.15, 2006, p. 388.

Hui Xiong, Shashi Shekhar, Pang-Ning Tan, and Vipin Kumar. "TAPER: A Two-Step Approach for All-strong-pairs Correlation Query in Large Databases," IEEE Transactions on Knowledge and Data Engineering (TKDE), v.18, 2006, p. 493.

Jason D. Sonnek and Jon B. Weissman. "A Quantitative Comparison of Reputation Systems in the Grid," 6th IEEE/ACM International Workshop on Grid Computing, v.6, 2005, p. 000.

Jason D. Sonnek, Mukesh Nathan, Abhishek Chandra, and Jon B. Weissman. "Reputation Based Scheduling on Unreliable Distributed Infrastructures," Proceedings of the 26th International Conference on Distributed Computing Systems, v.26, 2006, p. 000.

Jon B. Weissman, Seonho Kim, and Darin England. "Supporting the Dynamic Grid Service Lifecycle," IEEE/ACM CCGrid International Symposium on Cluster Computing and the Grid, 2005, p. 000.


(Showing: 1 - 10 of 12)
  Show All




 

Please report errors in award information by writing to: awardsearch@nsf.gov.

 

 

Print this page
Back to Top of page
  Web Policies and Important Links | Privacy | FOIA | Help | Contact NSF | Contact Web Master | SiteMap  
National Science Foundation
The National Science Foundation, 4201 Wilson Boulevard, Arlington, Virginia 22230, USA
Tel: (703) 292-5111, FIRS: (800) 877-8339 | TDD: (800) 281-8749
Last Updated:
April 2, 2007
Text Only


Last Updated:April 2, 2007