Thursday, July 07, 2005

Making multilingual corpus

There is a couple of ways to make a multilingual corpus with AppleTrans. One way is to align multiple documents translated into languages. The other way is simply to make a tab-delimited glossary table using a spreadsheet program, and import it to a corpus. The latter is somewhat limited in that you can only enter plain text.

Aligning segments in multiple documents is not as hard as you might think. AppleTrans allows you to split or merge segments as you like. You can even reorder segments if necessary (the difference in the logic does that in certain languages.)

You can also append new language text to existing records in the corpus by switching the target language. In this case, you have to find the record first by the source language text before entering new translation you want to attach to it.

0 Comments:

Post a Comment

<< Home