Large-Scale Parallel Data Mining
- Author(s): Mohammed J. Zaki, Ching-Tien Ho,
- Publisher: Springer Science & Business Media
- Pages: 260
- ISBN_10: 3540671943
ISBN_13: 9783540671947
- Language: en
- Categories: Computers / General , Computers / Artificial Intelligence / General , Computers / Business & Productivity Software / General , Computers / Computer Science , Computers / Database Administration & Management , Computers / Data Science / Data Analytics , Computers / Data Science / Data Warehousing , Computers / System Administration / Storage & Retrieval , Computers / Information Technology , Computers / Networking / General , Computers / Distributed Systems / General , Computers / Software Development & Engineering / General , Computers / Programming / Algorithms , Computers / Networking / Hardware , Computers / Parallel Processing , Medical / General ,
Description:... Withthe unprecedented rate at which data is being collected today in almostall elds of human endeavor, there is an emerging economic and scientic need to extract useful information from it. For example, many companies already have data-warehouses inthe terabyte range (e.g., FedEx, Walmart).The WorldWide Web has an estimated 800 millionweb-pages. Similarly, scienti c data is rea- ing gigantic proportions (e.g., NASA space missions, Human Genome Project). High-performance, scalable, parallel, and distributed computing is crucial for ensuring system scalabilityand interactivityas datasets continue to grow in size and complexity. Toaddress thisneedweorganizedtheworkshoponLarge-ScaleParallelKDD Systems, which was held in conjunction with the 5th ACM SIGKDD Inter- tional Conference on Knowledge Discovery and Data Mining, on August 15th, 1999, San Diego, California. The goal of this workshop was to bring researchers and practitioners together in a setting where they could discuss the design, - plementation, anddeploymentoflarge-scaleparallelknowledgediscovery (PKD) systems, which can manipulate data taken from very large enterprise or sci- tic databases, regardless of whether the data is located centrally or is globally distributed. Relevant topics identie d for the workshop included: { How to develop a rapid-response, scalable, and parallel knowledge discovery system that supports global organizations with terabytes of data.
Show description