A primer to frequent itemset mining for bioinformatics

Naulaerts, Stefan; Meysman, Pieter; Bittremieux, Wout; Vu, Trung Nghia; Vanden Berghe, Wim; Goethals, Bart; Laukens, Kris

doi:10.1093/bib/bbt074

Published October 26, 2013 | Version v1

Journal article Open

A primer to frequent itemset mining for bioinformatics

1. Department of Mathematics and Computer Science, University of Antwerp, Antwerp, Belgium
2. Department of Biomedical Sciences, University of Antwerp, Antwerp, Belgium

Over the past two decades, pattern mining techniques have become an integral part of many bioinformatics solutions. Frequent itemset mining is a popular group of pattern mining techniques designed to identify elements that frequently co-occur. An archetypical example is the identification of products that often end up together in the same shopping basket in supermarket transactions. A number of algorithms have been developed to address variations of this computationally non-trivial problem. Frequent itemset mining techniques are able to efficiently capture the characteristics of (complex) data and succinctly summarize it. Owing to these and other interesting properties, these techniques have proven their value in biological data analysis. Nevertheless, information about the bioinformatics applications of these techniques remains scattered. In this primer, we introduce frequent itemset mining and their derived association rules for life scientists. We give an overview of various algorithms, and illustrate how they can be used in several real-life bioinformatics application domains. We end with a discussion of the future potential and open challenges for frequent itemset mining in the life sciences.

Files

Naulaerts2013.pdf

Files (767.0 kB)

Name	Size	Download all
Naulaerts2013.pdf md5:f30eb1e5bcd78fb1facd0f86b23aa474	767.0 kB	Preview Download

	All versions	This version
Views	443	442
Downloads	190	190
Data volume	151.1 MB	151.1 MB

A primer to frequent itemset mining for bioinformatics

Authors/Creators

Description

Files

Naulaerts2013.pdf

Files (767.0 kB)