MultiTrans is the most user-friendly and powerful second
generation CAT tool available today. Unlike traditional
Translation Memory systems, MultiTrans does not build a
laborious database of pre-aligned sentences. It indexes
previously translated documents, and creates a bilingual
reference corpus that allows the user to make full-text
searches and to retrieve words, expressions, and
sentences. MultiTrans is available in two versions:
MultiTrans Light for light volume and non-networked
usage, and MultiTrans Pro for high volume and
MultiTrans is the most
user-friendly and powerful second generation CAT tool available
today. Unlike traditional Translation Memory systems, MultiTrans
does not build a laborious database of pre-aligned sentences. It
indexes previously translated documents, creating a bilingual
reference corpus that allows the user to make full-text searches
and to retrieve words, expressions, and sentences. Using
MultiTrans, you will profit from being able to perform the
- Rapidly create vast, easily-referenced
pools of legacy multilingual content.
- Easily access terminology and previous
translations in their complete usage and style context.
- Productively recycle translations of any
length, not just whole sentences.
- Rapidly extract, create and manage rich
- Easily share and manage your multilingual
assets across your enterprise, clients and partners
The Advantages of MultiTrans
The Need for Better Tools
Vast bodies of high quality previously
translated content may exist in many places: in your personal
collection, in your organization, in your clients’ or partners’
organizations, in many multilingual public web sites, and
elsewhere. If effectively exploited, these untapped reservoirs
of valuable translation knowledge would enable leaps in
translation and terminology management productivity and quality
for global organizations and language industry professionals.
Despite the obvious potential, Computer-Aided
Translation (CAT) tools have failed to successfully leverage
these valuable multilingual assets. Translation Memory (TM)
databases consisting of laboriously aligned, out-of-context full
sentences have proven to be of only limited benefit in very
specific applications, such as the translation of technical
product update documentation. In over 95% of documents, the
repetition of full sentences is very rare and these tools have a
very limited application.
Taking a fundamentally different approach to
the problem, MultiTrans allows vast collections of legacy
translations and other content to be easily searched for
expressions of any length in their full usage context. It
provides a corpus-based translation support solution and
multilingual information gateway that has helped thousands of
global organizations and language industry professionals to
fully exploit their multilingual assets and dramatically improve
translation and terminology management productivity and quality.
MultiTrans is an integrated environment that
provides productivity and quality gains to all of the
participants in the multilingual information value chain,
including writers, translators, terminologists, editors,
reviewers and content consumers. The complete MultiTrans
solution consists of an indexed multilingual reference corpus,
tools to build and search the corpus, terminology extraction
technology, terminology management tools, and an integrated
Unique to MultiTrans, advanced corpus indexing
and search technology combined with a sophisticated
Client/Server terminology management infrastructure provides
authors, terminologists and content consumers with unparalleled
capabilities for language comprehension and terminology
Quickly created and easily searched reference
corpora put a large body of monolingual or multilingual best
writing practices and terminology usage examples your
fingertips. The corpora act as powerful “by-example”
dictionaries and usage guides. State-of-the-art terminology
extraction technology enables terminologists to rapidly research
and create new terminology from legacy content. Client/Server
terminology management enables terminology to be easily shared
and managed across many users.
Instant Access to Millions of Terms in Full
The corpus-based approach takes a vast
collection of legacy mono or multilingual documents and, using
advanced search engine techniques, rapidly indexes all of the
text. In the case of multilingual text, it also uses
sophisticated algorithms to align the translated text with the
original language text. Since full context is always provided,
any misalignments can be easily spotted and corrected on the fly
as the system is used.
You can easily build a corpus of millions of
words in less than an hour by easily importing any relevant
texts. You can easily search the entire corpus or multiple
corpora simultaneously for an expression of any length, in any
of the languages contained in the corpus. In less than one
second, all of the instances of that expression in the entire
corpus are automatically found and retrieved, along with the
aligned translation texts. By providing context, the corpus acts
as an extensive “by-example” dictionary, usage and style
reference for terms and expressions.
Terminology Extraction in the Blink of an Eye
Once a desired term is found in a corpus, it
can be manually copied along with any translated equivalents and
added to the terminology management repository with a single
mouse click. This simple process also automatically fills in
source, context, created-by, and created-date fields in the
In a batch process, the corpus-building
process automatically extracts all unique words and statistical
expressions from reference corpora that may contain millions of
words of monolingual or multilingual text. Purely statistical
extraction algorithms adapt to any language. User-defined
“exclude lists” reduce terminology extraction noise by
eliminating specified expressions from consideration.
An advanced auto-generation add-on can also
analyze the terminology extracted and compare it to the entire
indexed multilingual corpus to identify and retrieve the
corresponding translations. The system then displays a list of
possible translations per expression, and a terminologist or a
translator may validate the suggestions. This unique
corpus-based approach enables you to quickly leverage the
information contained in your multilingual documentation into
large and specialized terminology databases.
Tight integration with the MultiTrans
terminology repositories provide easy, automated capture of
approved terms from the corpus. Flexibility is also provided to
export terms into other formats. Intelligent comparison
algorithms identify the extracted words and expressions that
already exist in the terminology databases and allow for
convenient filtering so you are able to focus your work only on
Proven technology provides fast, scalable
performance. In real applications, these algorithms have
extracted over 60,000 expressions with their corresponding
translations from a 6 million word corpus in a matter of hours.
You can easily search the entire corpus or
multiple corpora simultaneously for an expression of any length,
in any of the languages contained in the corpus. In less than
one second, all of the instances of that expression in the
entire corpus are automatically found and retrieved, along with
the aligned translation texts. You can then select and view an
instance of the expression and its aligned translation in a
split screen view. One half of the view displays all of the text
of the document that contains the searched expression,
automatically scrolled to the location of the found expression,
which is color-highlighted for easy viewing. The other half of
the view displays the complete corresponding translation text,
scrolled to the aligned text segment, which is also highlighted.
At a glance, you can see the expression and aligned translation
in the contexts of their complete original documents. By
providing context, the corpus acts as an extensive “by-example”
dictionary, containing usage and style reference for terms and
When you find a special term in a corpus that
requires formal terminological management or a recurring
“translation terminology” expression (typically five words or
less and not part of the formal terminology of a subject) you
can simultaneously insert the translation into your project and
capture the term in a repository for additional management and
future reuse. The same repository structure allows you to
seamlessly incorporate sentence-level translation memory
databases that have been previously created with traditional TM
tools. In addition to building formal and translation
terminology repositories on the fly, you can use the corpus
expression extraction and alignment tools to rapidly build these
repositories independent of a translation project.
Shared Terminology Management Across Your
Language Management Value Chain
MultiTrans TermBase C/S
(Client/Server) is MultiCorpora’s optional solution for
building, sharing and distributing terminology across the many
users in your language management value chain, which is
accessible via a network, intranet or the Internet. It is built
on an n-tier Client/Server architecture that provides high
scalability and easy integration into large multi-server
The architecture of MultiTrans TermBase C/S
enables an organization’s terminology to be accessible, shared,
and updated in real time, across geographical boundaries. The
databases are centralized on a server within an organization and
accessed in real-time via an internal network, Intranet or the
Internet by a MultiTrans workstation or web browser. This
architecture enables easy and cost effective deployment of
terminology, while creating a synergy between all participants
in the information processes, from authoring and translation,
through revision and information usage.
Differences Between MultiTrans Pro and
MultiTrans comes in two versions:
MultiTrans Pro and MultiTrans Light.
If your translation requirements are modest
and you do not require a networked installation or multi-user
support, MultiTrans Light will allow you to effectively
create and manage multilingual terminology data banks with the
ease of quick language-pair reference document lookups. With
slightly less capacity but most of the features of its bigger
brother, MultiTrans Light is the ideal toolbox for low-volume
translation. MultiTrans Pro and MultiTrans Light have exactly
the same engine, file format, and interfaces. The files created
by the Light and Pro versions are identical; the processing
speed is also identical.
has no practical limits, except the limits of your hard drive.
It is geared towards high volume performance, and is networked
and available to multiple users. MultiTrans Pro also comes with
free automatic online upgrades (internet connection required).
Comparison of Light and Pro
expressions in a TermBase
|Number of words in
Microsoft Word 97 and 2000
MultiTrans consists of three modules designed
to increase your translation capacity and consistency while
helping you deliver top-quality translations on time - it's like
having an assistant!
Corpus Builder and Search capabilities enable you
to rapidly create and easily access large bodies of
existing texts and their corresponding translated
versions. It automatically aligns and indexes the full
text and extracts lists of recurring words and phrases.
Powerful search capabilities allow you to quickly
identify all instances of a desired expression, view
them in their original contexts and see previous
- Automatic file alignment based on
user-defined file naming rules.
- High performance alignment,
indexing and term extraction engine which processes
over 50,000 words per minute and easily scales to
include thousands of documents and many millions of
- Sophisticated statistical
alignment algorithms that use a wide array of
parameters, including common words and numbers,
segment lengths, and punctuation.
- Multiple import file formats,
including Word, WordPerfect, text files, Unicode
text files, HTML, XML, PDF, TMX-compliant
translation memories and many others.
- Automatic unique word and
expression extraction to streamline terminology
creation and build project-specific lexicons.
- A Unicode compliant rating.
- Proven, scalable, high
performance search engine which returns results in a
fraction of a second, even when searching corpora
containing over 30 million words.
- The ability to search seamlessly
across multiple corpora expressions of any length:
words, phrases, sentences, and paragraphs.
- Split-screen view which shows
source and target expressions in their full
The TermBase module provides comprehensive tools
to extract and generate terminology from corpora or
import terminology information from many other sources.
Easy-to-use search and edit capabilities allow you to
easily find and manage nominal terminology, translation
terminology and translation memory. Optional
Client/Server technology allows you to share language
management and collaborate across many users.
- Flexible, concept-oriented
terminology management functionality with
- Advanced terminology extraction
- TMX-compliant import and export
- The ability to automatically
generate TMX compatible memory.
- Powerful term search and edit
- Integrated pre-translation
feature which automatically inserts approved
translations into your project.
- Scalable database architecture
which easily handles millions of terms.
- User-defined reference links
The Translation Support Workbench module
integrates MultiTrans functions right into your word
processor. This tool is used to seamlessly interact with
multiple corpora and TermBases to quickly complete new
- Full MS-Word, HTML, and
PowerPoint integration which provides familiar,
easy-to-use and full-featured editing environments.
An open architecture allows easy integration with
other editing environments of your choice.
- DoubleVue™, which transforms
MS-Word into a powerful split-screen editing
environment that simultaneously tracks and displays
your source language and target translation texts.
Translators and editors can use it to scroll through
the translated text and display the sentence in the
original text at any time.
- Comprehensive project analysis
tools which help you scope and plan projects
- Automated comparison which
color-highlights all expression in your translation
project that have matching expressions in your
corpora and TermBases.
- The ability to simultaneously
search expressions and previous translations in
multiple corpora, multiple terminology repositories,
external glossaries and term banks, and machine
translation engines. Select and insert the best
translation in an easy one-click operation.
- Automatic pre-translation which
compares an entire document with the TermBases,
retrieving and inserting previous translations for
all known expressions while retaining all text
- An interactive working
environment which allows on-the-fly alignment of
corpora and automatic updating of TermBases,
improving your translation and language resources as
- TMX-compliant export