RMIS | Adrem Data Lab

Abstract: Pattern mining techniques generally enumerate lots of uninteresting and redundant patterns. To obtain less redundant collections, techniques exist that give condensed representations of these collections. However, the proposed techniques often rely on complete enumeration of the pattern space, which can be prohibitive in terms of time and memory. Sampling can be used to filter the output space of patterns without explicit enumeration. We propose a framework for random sampling of maximal itemsets from transactional databases. The presented framework can use any monotonically decreasing measure as interestingness criteria for this purpose. Moreover, we use an approximation measure to guide the search for maximal sets to different parts of the output space. We show in our experiments that the method can rapidly generate small collections of patterns with good quality. The sampling framework has been implemented in the interactive visual data mining tool called MIME, as such enabling users to quickly sample a collection of patterns and analyze the results.

VERSION 3 (data model = RealKD)

RMIS version 3.0.0 (SNAPSHOT) (version released on the 9th of June 2018)

VERSION 2 (data model = mime_plain)

RMIS version 2.4.0 (version released on the 3rd of Januari 2018)

RMIS version 2.0.0 (version released on the 18th of July 2013)

VERSION 1

RMIS version 1.0.0 (version released on the 4th of July 2013)

CODING

All releases can be found on GitLab: Adrem Data Lab releases as maven repository. Source code is available at GitLab: Adrem Data Lab: rmis.