Workshop day: 
August 27, 2003

Location:
Monroe West

 
Scope and Program

Important Dates

Invited talk

Accepted Papers

Schedule


Organization

Organizers

Program Committee


 
Relevant Links

KDD 2003

KDD Workshops



 

MRDM 2003

2nd Workshop on Multi-Relational Data Mining

in conjunction with 

The Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining



Multi-Relational Data Mining (MRDM) is the multi- disciplinary field dealing with knowledge discovery from relational databases consisting of multiple tables. Mining data which consists of complex/structured objects also falls within the scope of this field, since the normalized representation of such objects in a relational database requires multiple tables. The field aims at integrating results from existing fields such as inductive logic programming, KDD, machine learning and relational databases; producing new techniques for mining multi-relational data; and practical applications of such techniques.

Typical data mining approaches look for patterns in a single relation of a database. For many applications, squeezing data from multiple relations into a single table requires much thought and effort and can lead to loss of information. An alternative for these applications is to use multi-relational data mining. Multi-relational data mining can analyze data from a multi-relation database directly, without the need to transfer the data into a single table first. Thus the relations mined can reside in a relational or deductive database. Using multi-relational data mining it is often also possible to take into account background knowledge, which often corresponds to views in the database.

Present MRDM approaches consider all of the main data mining tasks, including association analysis, classification, clustering, learning probabilistic models and regression. The pattern languages used by single-table data mining approaches for these data mining tasks have been extended to the multiple-table case. Relational pattern languages now include relational association rules, relational classification rules, relational decision trees, and probabilistic relational models, among others. MRDM algorithms have been developed to mine for patterns expressed in relational pattern languages. Typically, data mining algorithms have been upgraded from the single-table case: for example, distance-based algorithms for prediction and clustering have been upgraded by defining distance measures between examples/instances represented in relational logic.

MRDM methods have been successfully applied across many application areas, ranging from the analysis of business data, through bioinformatics (including the analysis of complete genomes) and pharmacology (drug design) to Web mining (information extraction from text and Web sources).

The aim of the workshop is to bring together researchers and practitioners of data mining interested in methods for finding patterns in expressive languages from complex / multi-relational / structured data and their applications.

This workshop is the second of its kind. It follows the success of the first workshop on Multi-Relational Data Mining, held at SIGKDD 2002, a report on which appears in SIGKDD Explorations, Vol 4(2), 2002. Further information on MRDM-02 can be found at the workshop web site : this includes an on-line version of the workshop proceedings. Based on MRDM-02, a special issue of SIGKDD Explorations on MRDM (Vol 5(1), 2003) is being co-edited by Saso Dzeroski and Luc De Raedt.

Why the topic is of interest?

An increasing number of data mining applications involve the analysis of complex and structured types of data (such as sequences in genome analysis, HTML and XML documents) and require the use of expressive pattern languages. There is thus a clear need for multi-relational data mining (MRDM) techniques.

On the other hand, there is a wealth of recent work concerned with upgrading some recent successful data mining approaches to relational logic. A case in point are kernel methods (support-vector machines): the development of kernels for structured and richer data types is a hot research topic. Another example is the development of probabilistic relational representations and methods for learning in them (e.g., probabilistic relational models, first-order Bayesian networks, stochastic logic programs, etc.).

Non-exclusive list of topics. listed in alphabetical order:

  • Applications of (multi-)relational data mining 
  • Data mining problems that require (multi-)relational methods 
  • Distance-based methods for structured/relational data 
  • Inductive databases 
  • Kernel methods for structured/relational data 
  • Learning in probabilistic relational representations 
  • Link analysis and discovery 
  • Methods for (multi-)relational data mining 
  • Mining structured data, such as amino-acid sequences, chemical compounds, HTML and XML documents, 
  • Propositionalization methods for transforming (multi-)relational data mining problems to single-table data mining problems 
  • Relational neural networks 
  • Relational pattern languages 

Contact information of organizers

  • Saso Dzeroski ( Saso.Dzeroski@ijs.si) (contact)

  • Jozef Stefan Institute, Jamova 39, SI-1000 Ljubljana, Slovenia.
    phone: +386 1 477 3217, fax: +386 1 425 1083
  • Luc De Raedt

  • Albert-Ludwigs-Universitaet Freiburg, Germany.
  • Stefan Wrobel

  • Fraunhofer Institute for Intelligent Systems, Sankt Augustin, and University of Bonn, Germany. 

Program Committee Members

  • Hendrik Blockeel (Katholieke Universiteit Leuven) 
  • Jean-Francois Boulicaut (University of Lyon) 
  • Soumen Chakrabarty (Indian Institute of Technology, Bombay) 
  • Diane Cook (University of Texas at Arlington) 
  • Mark Craven (University of Wisconsin at Madison) 
  • Luc Dehaspe (PharmaDM) 
  • Pedro Domingos (University of Washington) 
  • Peter Flach (University of Bristol) 
  • Lise Getoor (University of Maryland) 
  • David Jensen (University of Massachusetts at Amherst)
  • Joerg-Uwe Kietz (kdlabs AG, Zurich) 
  • Ross King (University of Aberystwith) 
  • Stefan Kramer (Albert-Ludwigs-Universitaet Freiburg) 
  • Nada Lavrac (Jozef Stefan Institute) 
  • Donato Malerba (University of Bari) 
  • Heikki Mannila (Nokia Research / Helsinki Institute for Information Technology)
  • Tom Mitchell (Carnegie Mellon University) 
  • Hiroshi Motoda (University of Osaka) 
  • David Page (University of Wisconsin at Madison) 
  • Foster Provost (Stern School of Business, New York University) 
  • Celine Rouveirol (University Paris Sud XI) 
  • Michele Sebag (University Paris Sud XI) 
  • Arno Siebes (Universiteit Utrecht) 
  • Takashi Washio (University of Osaka) 

Important Dates

  • Submissions due: June 6, 2003
  • Notification: June 30, 2003
  • Camera ready: July 21, 2003
  • Workshop day: August 27, 2003

Invited Talk

Accepted Papers

Available for download!




Last modified: July 8th, 2003 by Saso.Dzeroski@ijs.si