XMeans

Package

weka.clusterers

Synopsis

Cluster data using the X-means algorithm.

X-Means is K-Means extended by an Improve-Structure part In this part of the algorithm the centers are attempted to be split in its region. The decision between the children of each center and itself is done comparing the BIC-values of the two structures.

For more information see:

Dan Pelleg, Andrew W. Moore: X-means: Extending K-means with Efficient Estimation of the Number of Clusters. In: Seventeenth International Conference on Machine Learning, 727-734, 2000.

Available in Weka 3.6.x - 3.7.1. Available via the package management system for Weka >= 3.7.2 (XMeans)

Options

The table below describes the options available for XMeans.

Option

Description

KDTree

The KDTree to use.

binValue

Set the value that represents true in the new attributes.

cutOffFactor

the cut-off factor to use

debugLevel

The debug level to use.

debugVectorsFile

The file containing the debug vectors (only for debugging!).

distanceF

The distance function to use.

inputCenterFile

The file to read the list of centers from.

maxIterations

the maximum number of iterations to perform

maxKMeans

the maximum number of iterations to perform in KMeans

maxKMeansForChildren

the maximum number of iterations KMeans that is performed on the child centers

maxNumClusters

set maximum number of clusters

minNumClusters

set minimum number of clusters

outputCenterFile

The file to write the list of centers to.

seed

The random number seed to be used.

useKDTree

Whether to use the KDTree.

Capabilities

The table below describes the capabilites of XMeans.

Capability

Supported

Class

No class

Attributes

Missing values, Date attributes, Numeric attributes

Min # of instances

1