Learning Rules for Protein Exposure (KUL)

Application domain:	Protein Engineering
Further specification:	Data set
Pointers:	Contact Hendrik Blockeel Hendrik.Blockeel@cs.kuleuven.ac.be
Data complexity:	860 KB
Data format:	interpretations (ICL/Claudien)

Annotation:

This application is very similar to the well-known protein secondary structure prediction problem. A protein is a long sequence of amino acids, that is somehow folded in 3-D space. Some parts of the sequence will be located at the surface of the 3-D shape formed by the protein (these are exposed), others are located at the inside (buried). The aim of the learning process is to predict which parts of a protein are exposed.

The background information is the same as the one available for the secondary structure prediction problem. The data themselves are the same as used in previous experiments by Holbrook et al. (1990).

Reference

S.R. Holbrook, S.M. Muskal and S.H. Kim. Predicting surface exposure of amino acids from protein sequence. Protein Engineering, 3, 289-294. 1990.

back to index