Application domain: | Transfer Rules Learning |
Source: | Telia Research |
Further specification: | Data set of 246 QLF pairs |
Pointers | http://www.dsv.su.se/ML/ |
Data complexity: | 560 KB |
Data format: | Prolog |
A transfer rule specifies a pair of QLF patterns (i.e. either atoms or compound terms, where the latter may contain variables). The first argument corresponds to a QLF in one language and the second argument to a QLF in the other. The QLF patterns in these rules can be QLF (sub)expressions, or such expressions with transfer variables showing the correspondence between the two arguments. Many transfer rules are simple relations between two atoms, e.g.\
trans(flight_AirplaneTrip,flygning_Flygresa)while others are more complicated, e.g.\
trans([and,X1,form(_,verb(no,A,yes,M,D),V,Y1,_)], [and,X2,[island,form(_,verb(pres,A,no,M,D),V,Y2,_)]]):- trans(X1,X2), trans(Y1,Y2).The transfer rules are normally hand-crafted through inspection of a set of non-transferrable QLF pairs, which is a tedious and time-consuming task. The main problem addressed in here is how to use ILP techniques in order to automatically learn transfer rules from examples. So far, one example set has been obtained from TELIA Research AB consisting of 246 QLF pairs of various size, ranging from QLFs corresponding to two word phrases (e.g. to Denver) to full sentence parses (e.g.What is the cheapest one way fare from Boston to Washington). Despite the sentences being restricted to the Air Travel Information domain, the number of lexical item pairs extracted amounts to 176 pairs.
There are however techniques that overcome this by instead of producing clauses one by one, produce a set of clauses by specialising an overly general hypothesis in the form of a logic program. A system based on this idea, called TRL, has been developed by STO. TRL first generates a set of overly general clauses, which then are specialised. The problem of over-generalisation due to lack of negative examples is handled by assuming output completeness, i.e. for each source QLF in a pair, it is assumed that all but the target QLF in the pair are undesired. This is a reasonable assumption in the transfer rule learning domain as normally only one transfer is desired (in case of multiple transfers, CLE uses an additional module for choosing one of them).
Experiments are currently being performed using the following
scheme. QLF pairs from (a subset of) the set of all available examples
are randomly split into two disjoint sets: one used for learning and the
other for testing. The rules generated are tested for at least two parameters:
coverage and determinacy. Whereas defining coverage is almost straightforward,
as the percentage of the target QLFs from the test set that could be obtained
from their sources via transfer, measuring determinism poses some problems.
Measures such as the average number of transfer outputs seem to be too
rough to estimate the extent of the phenomenon. Actually, not all non-determinism
is bad: some may promote wider choice at the generation end of the translation
process. This raises the point that evaluating the quality of transfer
rules should take into account features of the generation module. In the
case of the CLE-based translation system, the post-transfer module, in
addition to the target language grammar, features a number of preference
metrics ranging from checks refusing certain (syntactically correct) QLFs
to complex schemes preferring certain expressions word choices. Those should
be included in the determinism (and possibly quality) tests.
References
back to index