 |
 |
 |
Award Abstract #0325949
ITR: COLLABORATIVE RESEARCH: A Data Mining and Exploration Middleware for Grid and Distributed Computing

| NSF Org: |
CCF
Division of Computer and Communication Foundations
|
 |
 |
| Initial Amendment Date: |
October 30, 2003 |
 |
| Latest Amendment Date: |
October 30, 2006 |
 |
| Award Number: |
0325949 |
 |
| Award Instrument: |
Continuing grant |
 |
| Program Manager: |
Almadena Y. Chtchelkanova
CCF Division of Computer and Communication Foundations
CSE Directorate for Computer & Information Science & Engineering
|
 |
| Start Date: |
November 1, 2003 |
 |
| Expires: |
October 31, 2007 (Estimated) |
 |
| Awarded Amount to Date: |
$611000 |
 |
| Investigator(s): |
Vipin Kumar kumar@cs.umn.edu (Principal Investigator)
Jon Weissman (Co-Principal Investigator)
|
 |
| Sponsor: |
University of Minnesota-Twin Cities
200 OAK ST SE
MINNEAPOLIS, MN 55455 612/624-5599
|
 |
| NSF Program(s): |
ITR MEDIUM (GROUP) GRANTS
|
 |
| Field Application(s): |
0000099 Other Applications NEC, 0000912 Computer Science
|
 |
| Program Reference Code(s): |
HPCC, 9216, 4080, 1687, 1652
|
 |
| Program Element Code(s): |
1687
|
ABSTRACT

The purpose of this project is to develop and implement a middleware that allows mining and analysis of distributed data. The research will address an increasing interest in the exploration and mining of the large volume of data that is generated by the business, scientific, engineering, academic and defense communities.
Novel techniques will be developed in the following areas:
Data and Policy Management Services: These services will provide organizational level access restriction capabilities for the owners of the data while allowing users to access a more efficient transport of data.
Data Mining and Exploration Services: This framework will include a library of data mining tools that will work effectively for example if: data is distributed on multiple sites, the user has varying privileges depending on what site they are accessing, a user wants to trade off computing time versus accuracy or a user wants to access the site remotely.
Scheduling and Replication Services: System administration will be based upon policies at the participating organizations and their established privileges for users. Computation and data will be scheduled jointly to optimize metrics.
This combination of distributed computing and data mining will be made widely available to students, researchers, and other interested groups in government, industry and education.
PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

|
(Showing: 1 - 10 of 12)
(Showing: 1 - 12 of 12)
|
Show All |
Darin England and Jon B. Weissman. "A Resource Leasing Policy for On-demand Computing," International Journal for High Performance Computing and Applications (IJHPCA), v.20, 2006, p. 000.
Gyorgy Simon, Eric Eilertson, Vipin Kumar, Zhi-Li Zhang and Hui Xiong. "Scan Detection: A Data Mining Approach," Proceedings of the Sixth SIAM International Conference on Data Mining, v.6, 2006, p. 000.
H. Xiong, M. Steinbach, and V. Kumar. "Privacy Leakage in
Multi-relational Databases via Pattern based Semi-supervised Learning," Proc. of the ACM Conference on Information and Knowledge Management, v.CIKM, 2005, p. 000.
H. Xiong, X. He, C. Ding, Y. Zhang, V. Kumar, and S. R. Holbrook.. "Identification of Functional Modules in Protein Complexes via
Hyperclique Pattern Discovery.," Proc. of the Pacific Symposium on
Biocomputing (PSB 2005), 2005., v.PSB 200, 2005, p. 000.
Hui Xiong, Gaurav Pandey, Michael Steinbach, Vipin Kumar. "Enhancing Data Analysis with Noise Removal," IEEE Transactions on Knowledge and Data Engineering (TKDE), v.18, 2006, p. 304.
Hui Xiong, Michael Steinbach, and Vipin Kumar. "Privacy Leakage in Multi-relational Databases: A Semi-supervised Learning Perspective," VLDB Journal Special Issue on Privacy Preserving Data Management, v.15, 2006, p. 388.
Hui Xiong, Shashi Shekhar, Pang-Ning Tan, and Vipin Kumar. "TAPER: A Two-Step Approach for All-strong-pairs Correlation Query in Large Databases," IEEE Transactions on Knowledge and Data Engineering (TKDE), v.18, 2006, p. 493.
Jason D. Sonnek and Jon B. Weissman. "A Quantitative Comparison of Reputation Systems in the Grid," 6th IEEE/ACM International Workshop on Grid Computing, v.6, 2005, p. 000.
Jason D. Sonnek, Mukesh Nathan, Abhishek Chandra, and Jon B. Weissman. "Reputation Based Scheduling on Unreliable Distributed Infrastructures," Proceedings of the 26th International Conference on Distributed Computing Systems, v.26, 2006, p. 000.
Jon B. Weissman, Seonho Kim, and Darin England. "Supporting the Dynamic Grid Service Lifecycle," IEEE/ACM CCGrid International Symposium on Cluster Computing and the Grid, 2005, p. 000.
Michael Steinbach and Vipin Kumar. "Generalizing the Notion of Confidence," Fifth IEEE International Conference on Data Mining (ICDM?05), v.5, 2005, p. 402.
Vipin Kumar. "Parallel and Distributed Computing for Cyber Security," DS Online Journal, v.6, 10, p. 000.
|
(Showing: 1 - 10 of 12) (Showing: 1 - 12 of 12) |
Show All |
Please report errors in award information by writing to: awardsearch@nsf.gov.
|
 |
 |