-
Pre-preparation and pre-processing are the
most time-consuming process in KDD.
|
Hint |
-
Data normalization is useful, but not
necessary in general.
|
Hint |
-
Handling missing data is a straightforward
procedure.
|
Hint |
-
Feature Discretization can be performed
without class information.
|
Hint |
-
Feature Selection is basically a search
problem for finding an optimal subset of features.
|
Hint |
-
Sampling is not a costly procedure. That is
the reason for using sampling when dealing with large datasets.
|
Hint |