GeneralizedSequentialPatterns

Package

weka.associations

Synopsis

Class implementing the Generalized Sequential Patterns (GSP) algorithm for discovering sequential patterns in a sequential data set.
The attribute that identifies the distinct data sequences in the set is selected with the dataSeqID option. The output can also be restricted with the filterAttributes option, which names one or more attributes that must be contained in each element/itemset of a reported sequence.

For further information see:

Ramakrishnan Srikant, Rakesh Agrawal (1996). Mining Sequential Patterns: Generalizations and Performance Improvements.

Available in Weka 3.6.x - 3.7.1. Available via the package management system for Weka >= 3.7.2 (generalizedSequentialPatterns).
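
To make the notion of a sequential pattern and its support concrete, the following is a minimal, self-contained sketch and not Weka's implementation. It assumes that a data sequence is an ordered list of elements (itemsets), that a pattern is contained in a data sequence when its elements can be matched, in order, to supersets among the data sequence's elements, and that support is the fraction of data sequences containing the pattern. The class SupportSketch and all of its method names are hypothetical, and the time constraints and taxonomies described in the cited paper are left out.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical illustration class; not part of Weka.
class SupportSketch {

    // Builds one element (itemset) from item labels.
    static Set<String> element(String... items) {
        return new HashSet<>(Arrays.asList(items));
    }

    // True if every pattern element is a subset of some data element,
    // with the matched data elements appearing in the same order.
    static boolean contains(List<Set<String>> dataSequence, List<Set<String>> pattern) {
        int next = 0; // index of the next pattern element still to be matched
        for (Set<String> e : dataSequence) {
            if (next < pattern.size() && e.containsAll(pattern.get(next))) {
                next++;
            }
        }
        return next == pattern.size();
    }

    // Fraction of data sequences that contain the pattern (assumed definition of support).
    static double support(List<List<Set<String>>> dataSequences, List<Set<String>> pattern) {
        if (dataSequences.isEmpty()) {
            return 0.0;
        }
        int hits = 0;
        for (List<Set<String>> s : dataSequences) {
            if (contains(s, pattern)) {
                hits++;
            }
        }
        return (double) hits / dataSequences.size();
    }

    // Three toy data sequences and the pattern <{a},{d}>.
    static double demoSupport() {
        List<List<Set<String>>> db = new ArrayList<>();
        db.add(Arrays.asList(element("a"), element("b", "c"), element("d")));
        db.add(Arrays.asList(element("a"), element("d")));
        db.add(Arrays.asList(element("b"), element("a")));
        List<Set<String>> pattern = Arrays.asList(element("a"), element("d"));
        return support(db, pattern);
    }

    public static void main(String[] args) {
        double minSupport = 0.5; // illustrative threshold, not a Weka default
        double s = demoSupport();
        System.out.println("support = " + s + ", frequent = " + (s >= minSupport));
    }
}
```

In this toy data set the pattern <{a},{d}> is contained in the first two data sequences but not the third, so it would count as frequent under the illustrative threshold of 0.5.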

Options

The table below describes the options available for GeneralizedSequentialPatterns.

Option            Description
dataSeqID         The attribute number representing the data sequence ID.
debug             If set to true, the algorithm may output additional information to the console.
filterAttributes  The attribute numbers (e.g. "0, 1") used for result filtering; only sequences that contain the specified attributes in each of their elements/itemsets are output; -1 outputs all sequences.
minSupport        Minimum support threshold.
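
The filterAttributes rule can be sketched as follows. This is one reading of the option description above and not Weka's code. It assumes that each element of a sequence can be represented by the set of attribute numbers it contains, that the option value is a comma-separated list of integers, and that a list containing -1 disables filtering. The class FilterSketch and its methods are hypothetical names.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical illustration class; not part of Weka.
class FilterSketch {

    // Parses a specification such as "0, 1" into a set of attribute numbers.
    static Set<Integer> parse(String spec) {
        Set<Integer> result = new HashSet<>();
        for (String part : spec.split(",")) {
            String trimmed = part.trim();
            if (!trimmed.isEmpty()) {
                result.add(Integer.parseInt(trimmed));
            }
        }
        return result;
    }

    // A sequence passes if every element contains all filter attributes;
    // -1 is treated as "no filtering" (assumption based on the option text).
    static boolean passesFilter(List<Set<Integer>> sequence, Set<Integer> filterAttributes) {
        if (filterAttributes.contains(-1)) {
            return true;
        }
        for (Set<Integer> element : sequence) {
            if (!element.containsAll(filterAttributes)) {
                return false;
            }
        }
        return true;
    }

    // Applies a filter specification to the toy sequence <{0,1},{0,2}>.
    static boolean demo(String spec) {
        List<Set<Integer>> sequence = new ArrayList<>();
        sequence.add(new HashSet<>(Arrays.asList(0, 1)));
        sequence.add(new HashSet<>(Arrays.asList(0, 2)));
        return passesFilter(sequence, parse(spec));
    }

    public static void main(String[] args) {
        for (String spec : new String[] {"0", "0, 1", "-1"}) {
            System.out.println("filter \"" + spec + "\" keeps sequence: " + demo(spec));
        }
    }
}
```

With the toy sequence, a filter of "0" keeps the sequence because attribute 0 occurs in both elements, while "0, 1" rejects it because the second element lacks attribute 1.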

Capabilities

The table below describes the capabilities of GeneralizedSequentialPatterns.

Capability          Supported
Class               No class
Attributes          Empty nominal attributes, Binary attributes, Nominal attributes, Unary attributes
Min # of instances  1