Introduction to Data Mining

<<< Previous    Up    Next >>>

Lesson 6.6.3

Efficiency Improvement

 

One way to improve efficiency of the APRIORI would be to

bullet

Prune without checking all k-1 subsets.

bullet

Join without looping over the entire set, Lk-1.

This can be done by using hash trees. More details on this can be found in the following lessons.

 

Other methods to improve efficiency are:

bullet

Speed up searching and matching.

bullet

Reduce the number of transactions (a kind of instance selection).

bullet

Reduce the number of passes over data on disk. E.g. Reducing scans via Partition.

bullet

Reduce number of subsets per transaction that must be considered.

bullet

Reduce number of candidates (a kind of feature selection).

 

<<< Previous    Up    Next >>>