Introduction to Data Mining

<<< Previous    Up    Next >>>

Lesson 6.4

Support and Confidence

 

Support for an itemset X in a transactional database D is defined as count(X) / |D|.

For an association rule X Þ Y, we can calculate

bullet

support(X Þ Y) = support(XY) = support(X union Y).

bullet

confidence(X Þ Y) = support(XY) / support(X).

 

    Support (S) and Confidence (C) can also be related to joint probabilities and conditional probabilities as follows.

bullet

support(X Þ Y) = P(XY).

bullet

confidence(X Þ Y) = P(Y/X).

 

    The number of association rules that can be derived from a dataset D are exponentially large. Interesting association rules are those whose support and confidence are greater than minSupp and minConf.

Frequent itemsets (also called as large itemsets), are those itemsets whose support is greater than minSupp. The apriori property (downward closure property) says that any subsets of an frequent itemset are also frequent itemsets.

   

<<< Previous    Up    Next >>>