Document Type

Technical Report

Publication Date

2002-08-01

Filename

wucse-2002-23.pdf

DOI:

10.7936/K7W957JC

Technical Report Number

WUCSE-2002-23

Abstract

In many data mining applications, the size of the database is not only extremely large, it is also growing rapidly. Even for relatively simple searches, the time required to move the data off magnetic media, cross the system bus into main memory, copy into processor cache, and then execute code to perform a search is prohibitive. We are proposing that a significant portion of the data mining task (i.e., the portion that examines the bulk of the raw data) be implemented in fast hardware, close to the magnetic media on which it is stored. Furthermore, this hardware can be replicated allowing mining tasks to be performed in parallel, thus providing further speed up for the overall mining application. In this paper, we describe a general framework under which this can be accomplished, and identify a number of important research questions that must be addressed for it to be practical.

Comments

Permanent URL: http://dx.doi.org/10.7936/K7W957JC

Share

COinS