| Application domain | mRNA signal structures |
| Source | TZI, University of Eremen |
| Dataset size | 400 mRN A instances |
| Data format | ground Prolog facts |
| Systems used | RIEL |
| References | [1, 2] |
| Pointers | tamas.horvath@gmd.de |
This application [1] is concerned with identifying subsequences in mRNA that are responsible for biological functions. mRNA is a sequence of nucleotic acids (guanine, adenine, uracil, cytosine). The secondary structure of an mRNA contains special subsequences called signal structures that are responsible for special biological functions, such as RNA-protein interactions and cellular transport.
Signal structures are grouped into classes, such as IRE (Iron Responsive Element) that have the same (or similar) biological function. Horvath et al. [1] used RIBL to classify signal structures with unknown classes. They achieve highest accuracy using a term-based representation and similarity measure.