Frequent Pattern Mining - niranjv/ml-notes GitHub Wiki

  • Overview
  • Methods
    • FP-growth
    • Association Rules
    • PrefixSpan
    • Apriori
    • Eclat

Overview

Explore frequent pattern discovery in Python, R, and Spark.

Problems

  • Association Rules (with confidence score) - Order of items is not considered
  • Sequential pattern mining - Order of items matters

Metrics

  • Support
  • Confidence
  • Lift
  • Conviction
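
The four metrics above can be computed directly from transaction counts. The following is a minimal sketch, not taken from any library: the function names `support` and `rule_metrics` and the toy transactions are made up for illustration. It uses the usual textbook definitions for a rule A -> C: support = P(A and C), confidence = P(A and C) / P(A), lift = confidence / P(C), conviction = (1 - P(C)) / (1 - confidence).

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = frozenset(itemset)
    return sum(1 for t in transactions if itemset <= t) / len(transactions)


def rule_metrics(transactions, antecedent, consequent):
    """Support, confidence, lift and conviction of the rule antecedent -> consequent."""
    a, c = frozenset(antecedent), frozenset(consequent)
    s_a = support(transactions, a)
    s_c = support(transactions, c)
    s_ac = support(transactions, a | c)
    if s_a == 0 or s_c == 0:
        raise ValueError("antecedent or consequent never occurs; metrics are undefined")
    confidence = s_ac / s_a
    lift = confidence / s_c
    # Conviction is infinite for rules that are never violated (confidence == 1).
    conviction = float("inf") if confidence == 1 else (1 - s_c) / (1 - confidence)
    return {
        "support": s_ac,
        "confidence": confidence,
        "lift": lift,
        "conviction": conviction,
    }


# Hypothetical toy data: each transaction is an unordered set of items.
transactions = [
    frozenset({"bread", "milk"}),
    frozenset({"bread", "diapers", "beer", "eggs"}),
    frozenset({"milk", "diapers", "beer", "cola"}),
    frozenset({"bread", "milk", "diapers", "beer"}),
    frozenset({"bread", "milk", "diapers", "cola"}),
]

metrics = rule_metrics(transactions, {"diapers"}, {"beer"})
print(metrics)
```

A lift above 1 suggests the antecedent and consequent co-occur more often than they would if independent. Conviction grows as the rule is violated less often.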

Methods

FP-growth

  • Finds frequent itemsets without generating candidate itemsets: the transactions are compressed into a prefix tree (FP-tree), which is then mined recursively
  • Implementations
    • Spark: FPGrowth.train(), FPGrowthModel
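
To make the idea concrete, here is a small pure-Python sketch of FP-growth. It is an illustration only and is not how Spark implements it: the names `Node`, `build_tree` and `fp_growth` are hypothetical, and the threshold is an absolute count (`min_count`) rather than a fraction of the transactions.

```python
from collections import defaultdict


class Node:
    """One node of the FP-tree: an item, a count, and links to parent and children."""

    def __init__(self, item, parent):
        self.item = item
        self.parent = parent
        self.count = 0
        self.children = {}


def build_tree(weighted_transactions, min_count):
    """Build an FP-tree from (items, count) pairs; return (header table, item counts)."""
    freq = defaultdict(int)
    for items, count in weighted_transactions:
        for item in set(items):
            freq[item] += count
    freq = {item: c for item, c in freq.items() if c >= min_count}

    root = Node(None, None)
    header = defaultdict(list)  # item -> all tree nodes that hold this item
    for items, count in weighted_transactions:
        # Keep frequent items only, most frequent first, so common prefixes are shared.
        kept = sorted((i for i in set(items) if i in freq), key=lambda i: (-freq[i], i))
        node = root
        for item in kept:
            child = node.children.get(item)
            if child is None:
                child = Node(item, node)
                node.children[item] = child
                header[item].append(child)
            child.count += count
            node = child
    return header, freq


def fp_growth(weighted_transactions, min_count, suffix=frozenset()):
    """Return {frozenset(itemset): count} for all itemsets with count >= min_count."""
    header, freq = build_tree(weighted_transactions, min_count)
    result = {}
    for item, nodes in header.items():
        itemset = suffix | {item}
        result[itemset] = freq[item]
        # Conditional pattern base: the prefix path above each node, weighted by its count.
        base = []
        for node in nodes:
            path = []
            parent = node.parent
            while parent.item is not None:
                path.append(parent.item)
                parent = parent.parent
            if path:
                base.append((path, node.count))
        # Mine the conditional tree for itemsets that end with `itemset`.
        result.update(fp_growth(base, min_count, itemset))
    return result


# Hypothetical toy data; every transaction starts with weight 1.
transactions = [
    ["bread", "milk"],
    ["bread", "diapers", "beer", "eggs"],
    ["milk", "diapers", "beer", "cola"],
    ["bread", "milk", "diapers", "beer"],
    ["bread", "milk", "diapers", "cola"],
]
itemsets = fp_growth([(t, 1) for t in transactions], min_count=3)
for itemset, count in sorted(itemsets.items(), key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print(sorted(itemset), count)
```

No candidate itemsets are generated and tested against the full data: each recursive call works only on the prefix paths that already contain the current suffix.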

Association Rules

Association rule learning - discover 'interesting' relations between variables

Implementations

  • Spark: AssociationRules, FPGrowth.FreqItemset
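
Once frequent itemsets and their counts are known, rules can be derived without another pass over the data. The sketch below is a hypothetical helper, `generate_rules`, not Spark's API. It emits rules with a single item as the consequent and keeps those whose confidence meets a threshold. The itemset counts are hard-coded toy values.

```python
def generate_rules(freq_itemsets, min_confidence):
    """Derive rules (antecedent, consequent, confidence) from {frozenset: count}."""
    rules = []
    for itemset, count in freq_itemsets.items():
        if len(itemset) < 2:
            continue
        for consequent in sorted(itemset):
            antecedent = itemset - {consequent}
            antecedent_count = freq_itemsets.get(antecedent)
            if antecedent_count is None:
                # Should not happen when the input holds all frequent itemsets,
                # because every subset of a frequent itemset is itself frequent.
                continue
            confidence = count / antecedent_count
            if confidence >= min_confidence:
                rules.append((antecedent, consequent, confidence))
    return rules


# Hypothetical frequent itemsets with absolute counts (out of 5 transactions).
freq_itemsets = {
    frozenset({"bread"}): 4,
    frozenset({"milk"}): 4,
    frozenset({"diapers"}): 4,
    frozenset({"beer"}): 3,
    frozenset({"bread", "milk"}): 3,
    frozenset({"bread", "diapers"}): 3,
    frozenset({"milk", "diapers"}): 3,
    frozenset({"diapers", "beer"}): 3,
}

rules = generate_rules(freq_itemsets, min_confidence=0.8)
for antecedent, consequent, confidence in rules:
    print(sorted(antecedent), "->", consequent, confidence)
```

Confidence here is count(antecedent and consequent) / count(antecedent), so only the counts of the itemset and of its antecedent are needed.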

PrefixSpan

  • For sequential pattern mining

Implementations

  • Spark: PrefixSpan
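
For intuition, a simplified PrefixSpan can be written in a few lines. This sketch assumes each sequence element is a single item, which is a simplification: Spark's PrefixSpan takes sequences of itemsets. The function name `prefixspan` and the toy sequences are illustrative, and the threshold is an absolute count.

```python
from collections import defaultdict


def prefixspan(sequences, min_count):
    """Return [(pattern, count)] for subsequences found in at least min_count sequences."""
    results = []

    def mine(prefix, projected):
        # `projected` lists (sequence index, start position): the suffix of each
        # sequence that remains after the first match of `prefix`.
        occurrences = defaultdict(list)
        for sid, start in projected:
            seq = sequences[sid]
            seen = set()
            for pos in range(start, len(seq)):
                item = seq[pos]
                if item not in seen:
                    # Record only the first occurrence per sequence, so each
                    # sequence counts once and keeps its longest suffix.
                    seen.add(item)
                    occurrences[item].append((sid, pos + 1))
        for item, next_projected in sorted(occurrences.items()):
            if len(next_projected) >= min_count:
                pattern = prefix + [item]
                results.append((tuple(pattern), len(next_projected)))
                mine(pattern, next_projected)

    mine([], [(sid, 0) for sid in range(len(sequences))])
    return results


# Hypothetical toy data: order matters, so "a then b" differs from "b then a".
sequences = [
    ["a", "b", "c"],
    ["a", "c", "b"],
    ["a", "b"],
    ["b", "c"],
]
patterns = prefixspan(sequences, min_count=3)
for pattern, count in patterns:
    print(pattern, count)
```

Each recursive call scans only the projected suffixes, so the search space shrinks as the prefix grows.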

Apriori

Eclat

  • Equivalence Class Transformation
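
Eclat works on a vertical layout: each item maps to the set of transaction ids that contain it, and the support of an itemset is the size of the intersection of those sets. A minimal sketch under that description follows. The function name `eclat` and the toy data are made up for illustration, and the threshold is an absolute count.

```python
from collections import defaultdict


def eclat(transactions, min_count):
    """Return {frozenset(itemset): count} using intersections of transaction-id sets."""
    tidsets = defaultdict(set)  # item -> ids of the transactions that contain it
    for tid, items in enumerate(transactions):
        for item in items:
            tidsets[item].add(tid)

    result = {}

    def extend(prefix, candidates):
        # Every (item, tids) in `candidates` is already frequent together with `prefix`.
        for i, (item, tids) in enumerate(candidates):
            itemset = prefix | {item}
            result[itemset] = len(tids)
            narrowed = []
            for other, other_tids in candidates[i + 1:]:
                common = tids & other_tids
                if len(common) >= min_count:
                    narrowed.append((other, common))
            extend(itemset, narrowed)

    initial = sorted(
        ((item, tids) for item, tids in tidsets.items() if len(tids) >= min_count),
        key=lambda pair: pair[0],
    )
    extend(frozenset(), initial)
    return result


# Hypothetical toy data.
transactions = [
    ["bread", "milk"],
    ["bread", "diapers", "beer", "eggs"],
    ["milk", "diapers", "beer", "cola"],
    ["bread", "milk", "diapers", "beer"],
    ["bread", "milk", "diapers", "cola"],
]
itemsets = eclat(transactions, min_count=3)
for itemset, count in sorted(itemsets.items(), key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print(sorted(itemset), count)
```

After the id sets are built, the original transactions are not scanned again. All further counting is done by set intersection.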
