Parallel Implementations of ETI mining and SNN clustering algorithms


Parallel Error Tolerant Itemset Mining

The following gzipped tar archive contains an implementation of error tolerant itemset mining.  This implementation uses MPI for parallel processing.  The code can also be run in sequential mode.

The code must be compiled for your platform before it can be used.  Please go through the README file for details on compiling and using the code.

Download link: ParETI.tar.gz

Datasets

Size
Link
Description
Small
mushroom (34KB)
html
Medium
accident (5.5MB) pdf
Large
webdocs (489MB) pdf


Parallel Shared Nearest Neighbor (SNN) Clustering

To be available shortly.




Please contact Shyam Boriah () if you have trouble with the code.