Francis Tyers
2017-07-10 09:20:27 UTC
Hello everyone!
We're making progress on a module for treating separable "multiword"
expressions. The general idea is to be able to do stuff like
^take$ ^the$ ^rubbish$ ^out$ -> ^take# out$ ^the$ ^rubbish$ -> ^sacar$
^la$ ^basura$
^be$ ^always$ ^late$ -> ^be# late$ ^always$ -> ^llegar# tarde$
^siempre$
^take$ ^the$ ^rubbish$ ^out of$ ^here$ -> ^take# out$ ^the$ ^rubbish$
^of$ ^here$ -> ^sacar$ ^la$ ^basura$ ^de$ ^aquí$
The general idea is that it will be a finite-state transducer (like the
existing monolingual and bilingual dictionaries) but that can work over
words. It will appear between the pretransfer module and the lexical
transfer module (apertium-pretransfer | new module | lt-proc -b).
So, this email is a call for language pair developers to give us
examples of phenomena you would like to treat in your language pair.
Thanks!
Fran
We're making progress on a module for treating separable "multiword"
expressions. The general idea is to be able to do stuff like
^take$ ^the$ ^rubbish$ ^out$ -> ^take# out$ ^the$ ^rubbish$ -> ^sacar$
^la$ ^basura$
^be$ ^always$ ^late$ -> ^be# late$ ^always$ -> ^llegar# tarde$
^siempre$
^take$ ^the$ ^rubbish$ ^out of$ ^here$ -> ^take# out$ ^the$ ^rubbish$
^of$ ^here$ -> ^sacar$ ^la$ ^basura$ ^de$ ^aquí$
The general idea is that it will be a finite-state transducer (like the
existing monolingual and bilingual dictionaries) but that can work over
words. It will appear between the pretransfer module and the lexical
transfer module (apertium-pretransfer | new module | lt-proc -b).
So, this email is a call for language pair developers to give us
examples of phenomena you would like to treat in your language pair.
Thanks!
Fran