|Application domain||mRNA signal structures|
|Source||TZI, University of Eremen|
|Dataset size||400 mRN A instances|
|Data format||ground Prolog facts|
This application  is concerned with identifying subsequences in mRNA that are responsible for biological functions. mRNA is a sequence of nucleotic acids (guanine, adenine, uracil, cytosine). The secondary structure of an mRNA contains special subsequences called signal structures that are responsible for special biological functions, such as RNA-protein interactions and cellular transport.
Signal structures are grouped into classes, such as IRE (Iron Responsive Element) that have the same (or similar) biological function. Horvath et al.  used RIBL to classify signal structures with unknown classes. They achieve highest accuracy using a term-based representation and similarity measure.