Posted almost 13 years ago by Jacob Nordfalk
The first version of the Basque to English translator has been released.
This translator was produced with funding from the European Association for Machine Translation, based on data from the Basque to Spanish package, and from Matxin en-eu.
Accompanying the data is a small corpus of Kafka's "Metamorphosis", taken from the Basque and English editions of Wikisource (cc-by-sa/gfdl and public domain, respectively) and aligned using mALIGNa.
Thanks to the developers of apertium eu-es and matxin en-eu; to Mireia and Mikel for answering numerous questions; and to Gema for her help.
|
Posted about 14 years ago by Jacob Nordfalk
This is a new release of apertium-en-ca!
New things:
- Wide support for Valencian forms (financed by Universitat Politècnica
de València, developed by Prompsit and Jim O'Regan)
- Improved vocabulary and transfer rules (financed by Universitat Oberta
de Catalunya, developed by Prompsit and Jim O'Regan)
|
Posted over 14 years ago by Jacob Nordfalk
This is a note to declare the release of the first version of
apertium-is-en (Icelandic → English). This has been the joint work of
Tungutæknisetur (Icelandic Centre for Language Technology) and the
Universitat d'Alacant.
This is the first released pair to include a rule-based lexical
selection module based on Constraint Grammar.
Some statistics below:
==Coverage==
Fri Mar 5 12:51:17 GMT 2010 Total: 2069639, Known: 1731691 (83.67%)
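The coverage figure is the share of corpus tokens for which the analyser returns at least one analysis. A minimal sketch of that arithmetic, using the two counts from the line above (the function name `coverage_percent` is illustrative and not part of any Apertium tool):

```python
def coverage_percent(known: int, total: int) -> float:
    """Return the percentage of tokens that received an analysis."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * known / total


# Counts taken from the coverage line above.
print(f"{coverage_percent(1731691, 2069639):.2f}%")  # prints 83.67%
```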
==Dictionary and rules==
* Lexical entries (is): ~9,069
* Lexical entries (is-en): ~23,305
* Disambiguation: 162
* Lexical selection: 30
* Transfer: 106 (t1x: 75, t2x: 2, t3x: 24, t4x: 5)
==Edit distance==
Number of words in reference: 1202
Number of words in test: 1141
Edit distance: 500
Word error rate (WER): 43.82 %
Number of position-independent word errors: 330
Position-independent word error rate (PER): 28.92 %
The evaluation text can be found in:
https://apertium.svn.sourceforge.net/svnroot/apertium/trunk/apertium-is-en/dev/eval
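For readers who want to check the figures, the sketch below shows a generic word-level edit distance and the percentage arithmetic. It is an illustration under assumptions, not the evaluation script actually used: the function names are hypothetical, and the choice of denominator is inferred from the numbers, since dividing by the test word count (1141) rather than the reference count (1202) is what reproduces the reported rates.

```python
from typing import Sequence


def word_edit_distance(reference: Sequence[str], test: Sequence[str]) -> int:
    """Word-level Levenshtein distance (insertions, deletions, substitutions)."""
    previous = list(range(len(test) + 1))
    for i, ref_word in enumerate(reference, start=1):
        current = [i]
        for j, test_word in enumerate(test, start=1):
            substitution = previous[j - 1] + (ref_word != test_word)
            current.append(min(previous[j] + 1, current[j - 1] + 1, substitution))
        previous = current
    return previous[-1]


def error_rate(errors: int, words: int) -> float:
    """Express an error count as a percentage of a word count."""
    return 100.0 * errors / words


# Counts from the figures above; the denominator is the test length (1141).
print(f"WER: {error_rate(500, 1141):.2f} %")  # WER: 43.82 %
print(f"PER: {error_rate(330, 1141):.2f} %")  # PER: 28.92 %
```

With real data, `word_edit_distance(reference.split(), test.split())` would supply the error count passed to `error_rate`.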
==Future work==
In version 0.2.0 we hope to:
* Include support for IceNLP as well as lttoolbox/CG/apertium-tagger for
analysis and disambiguation
* Increase coverage of lttoolbox dictionary.
* Improve disambiguation with constraint grammar.
* Increase number of multiword units recognised.
* Improve transfer rules to cover more cases.
* Increase number of lexical selection rules.
The pages
* http://wiki.apertium.org/wiki/Icelandic_and_English/Regression_tests
* http://wiki.apertium.org/wiki/Icelandic_and_English/Pending_tests
also give an idea of future work.
|
Posted almost 15 years ago by Jacob Nordfalk
A fully compatible Java port of the upcoming lttoolbox 3.2 has been released.
Compared to lttoolbox, it also includes some experimental code for compounding and flag match (for multiwords and inner inflection).
Contact Jacob Nordfalk for more information on how (and whether) to use this.
Acknowledgements
Nic Cottrell contributed an initial version of a Java port of lttoolbox.
During GSoC 2009, Raphaël and Sergio worked on it, but didn't get processing to work (compilation and expansion worked).
In November 2009, Jacob Nordfalk finished it and optimized it.
For more information, see http://wiki.apertium.org/wiki/Lttoolbox-java
Jacob Nordfalk
|