Workshop day: 
July 23, 2002

Strathcona room

Scope and Program

Important Dates

Accepted Papers
(available online)




Program Committee

Relevant Links

KDD 2002

KDD Workshops


MRDM 2002

Workshop on Multi-Relational Data Mining

in conjunction with 

The Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

Multi-Relational Data Mining (MRDM) is the multi-disciplinary field dealing with knowledge discovery from relational databases consisting of multiple tables. Mining data which consists of complex/structured objects also falls within the scope of this field, since the normalized representation of such objects in a relational database requires multiple tables. The field aims at integrating results from existing fields such as inductive logic programming, KDD, machine learning and relational databases; producing new techniques for mining multi-relational data; and practical applications of such tecniques. 

Typical data mining approaches look for patterns in a single relation of a database. For many applications, squeezing data from multiple relations into a single table requires much thought and effort and can lead to loss of information. An alternative for these applications is to use multi-relational data mining. Multi-relational data mining can analyze data from a multi-relation database directly, without the need to transfer the data into a single table first. Thus the relations mined can reside in a relational or deductive database. Using multi-relational data mining it is often also possible to take into account background knowledge, which often corresponds to views in the database. 

Present MRDM approaches consider all of the main data mining tasks, including association analysis, classification, clustering, learning probabilistic models and regression. The pattern languages used by single-table data mining approaches for these data mining tasks have been extended to the multiple-table case. Relational pattern languages now include relational association rules, relational classification rules, relational decision trees, and probabilistic relational models, among others. MRDM algorithms have been developed to mine for patterns expressed in relational pattern languages. Typically, data mining algorithms have been upgraded from the single-table case: for example, distance-based algorithms for prediction and clustering have been upgraded by defining distance measures between examples/instances represented in relational logic. 

MRDM methods have been successfully applied accross many application areas, ranging from the analysis of business data, through bioinformatics (including the analysis of complete genomes) and pharmacology (drug design) to Web mining (information extraction from text and Web sources). 

The aim of the workshop is to bring together researchers and practitioners of data mining interested in methods for finding patterns in expressive languages from complex / multi-relational / structured data and their applications. 

Why the topic is of interest?

An increasing number of data mining applications involve the analysis of complex and structured types of data (such as sequences in genome analysis, HTML and XML documents) and require the use of expressive pattern languages. There is thus a clear need for multi-relational data mining (MRDM) techniques. 

On the other hand, there is a wealth of recent work concerned with upgrading some recent successful data mining approaches to relational logic. A case in point are kernel methods (support-vector machines): the development of kernels for structured and richer data types is a hot research topic. Another example is the development of probabilistic relational representations and methods for learning in them (e.g., probabilistic relational models, first-order Bayesian networks, stochastic logic programs, etc.) 

Non-exclusive list of topics. listed in alphabetical order: 

  • Applications of (multi-)relational data mining 
  • Data mining problems that require (multi-)relational methods 
  • Distance-based methods for structured/relational data 
  • Inductive databases 
  • Kernel methods for structured/relational data 
  • Learning in probabilistic relational representations 
  • Link analysis and discovery 
  • Methods for (multi-)relational data mining 
  • Mining structured data, such as amino-acid sequences, chemical compounds, HTML and XML documents, 
  • Propositionalization methods for transforming (multi-)relational data mining problems to single-table data mining problems 
  • Relational neural networks 
  • Relational pattern languages 

Contact information of organizers

Program Committee Members

  • Hendrik Blockeel (Katholieke Universiteit Leuven) 
  • Jean-Francois Boulicaut (University of Lyon) 
  • Diane Cook (University of Texas at Arlington) 
  • Mark Craven (University of Wisconsin at Madison) 
  • Luc Dehaspe (PharmaDM) 
  • Pedro Domingos (University of Washington) 
  • Peter Flach (University of Bristol) 
  • Lise Getoor (University of Maryland) 
  • David Jensen (University of Massachusets at Amherst 
  • Ross King (University of Aberystwith) 
  • Stefan Kramer (Albert-Ludwigs-Universitaet Freiburg) 
  • Nada Lavrac (Jozef Stefan Institute) 
  • Donato Malerba (University of Bari) 
  • Heikki Mannila (Nokia Research / Helsinki Institute for Information Technology) 
  • Tom Mitchell (Carnegie Mellon University) 
  • Hiroshi Motoda (University of Osaka) 
  • Stephen Muggleton (Imperial College) 
  • David Page (University of Wisconsin at Madison) 
  • Foster Provost (Stern School of Business, New York University) 
  • Celine Rouveirol (University Paris Sud XI) 
  • Gunter Saake (Otto-von-Guericke Universitaet Magdeburg) 
  • Michele Sebag (University Paris Sud XI) 
  • Arno Siebes (Universiteit Utrecht) 
  • Hannu Toivonen (University of Helsinki / Nokia Research) 

Important Dates

  • Submissions due: May 27, 2002
  • Notification: June 10, 2002
  • Camera ready: June 24, 2002
  • Workshop day: July 23, 2002

Accepted Papers

Last modified: July 4th, 2002 by