Application domain | Natural Language Processing |
Source: | Dataset of English verbs |
Dataset size: | 500 entries |
Data format: | Prolog, pairs (present,past) |
Systems used: | Progol4.4 - Analogical Prediction |
References: | (Muggletona and Bain 1999) |
Pointers: | ftp://ftp.cs.utexas.edu/pub/mooney/nl-ilp-data/alphabetic-past-data ftp://ftp.cs.york.ac.uk/pub/ML_GROUP/progol4.4/ |
The best prediction of English Past Tense so far has been obtained with Progol4.4 using Analogical Prediction (AP) (Muggleton and Bain 1999). Analogical Prediction (AP) takes Background Knowledge B and training examples E and then for each test example <x,y> forms an hypothesis Hx from B,E,x. Evaluation of AP is based on estimating the probability that Hx(x) = y for a randomly chosen <x,y> AP has been implemented within Progol4.4. Experiments show that on English past tense data AP has significantly higher predictive accuracy on this data than both previously reported results and Progol in inductive mode. AP has advantages for domains in which a large proportion of the examples must be treated as exceptions with respect to the hypothesis vocabulary. The relationship of AP to analogy and instance-based learning is discussed.