Preparatory steps. The lexicon was prepared in the following way: All packages with only a single transfer were removed. For these packages, the problem of transfer selection does not exist. After this, 104.200 entries remained. All function word entries were removed, as they need a different type of transfer selection, and are much more interwoven with the MT system internals The treatment of all entries containing multiwords on either source or target side was postponed, to reduce the initial complexity of the task. They need to be integrated later. All entries containing part-of-speech changes were removed, as this is syntactic information: (de-adj) ‘sicher’ -> (en-adj) ‘secure’ and (en-adv) ‘securely’. Entries of this kind are in lexicons if the adverb formation is not completely regular. However, to determine the right part-of-speech selection, syntactic analysis is needed, which goes beyond the scope of the current Lt-Xfr work, and cannot easily be modelled by an approach for conceptual transfer After these operations, 27.000 packages with 71.400 entries remained for the investigation. Table 2-3 gives the details on the lexicon used for the following analysis.
Appears in 2 contracts
Sources: Grant Agreement, Grant Agreement