Paul Filkin Director of Client Communities SDL Language Technologies 1 how Trados was developed and established itself as industry leader how translation memory tools work what their benefits for open and professional translators are ID: 701679
Download Presentation The PPT/PDF document "OT12 online seminar: Translation Memory ..." is the property of its rightful owner. Permission is granted to download and print the materials on this web site for personal, non-commercial use only, and to display it on your personal computer provided you do not modify the materials and that you retain all copyright notices contained in the materials. By downloading content from our website, you accept the terms of this agreement.
Slide1
OT12 online seminar: Translation Memory Tools Paul Filkin, Director of Client Communities,SDL Language Technologies
1Slide2
how Trados was developed and established itself as industry leaderhow
translation memory tools workwhat their benefits for open (and professional) translators are
what the particular distinguishing features of SDL Trados Studio are what the future is for translation memory software
The Agenda… or things we’ll cover2Slide3
SDL Trados… a brief history
3Slide4
Translation Production4
Content is either …
Translated by professional translatorOr, the “occasional” translatorNon-linguist, Subject matter specialist (reviewer), Crowd sourced, …Or, left un-translated
Not relevant, too costly, too much overhead involved, …This presentation focuses on content produced by professional translatorsSlide5
Productivity Environments5
Today, content workers utilize specialized productivity environment(s)
Content WorkerApplication ClassProminent
ExampleGraphic DesignersGraphic toolsAdobe Photoshop
Audio
Producers Musicians
DAW
(Digital Audio Workstation)
Steinberg
Cubase
Architects
3D
modeling programGoogle Sketch upEngineersCAD (Computer Aided Design)Autodesk AutoCAD
Game DeveloperGame EngineEpic
Games Unreal EngineTranslatorsCAT(Computer Aided Translation)SDL TRADOS TWB / SDL Studio
All mentioned trademarks are property of their respective owners. Slide6
Translation Editor is at the core of any CAT6
Professional Translation
can be done …In principle, in any authoring editor (desktop/browser)However, with limited productivity (in the range 800-1500 words per day) and high efforts maintaining consistency and accuracy.Using Microsoft Word + Plug-ins Plug-in to translation productivity tool
Hard dealing with structured contentUsing a Dedicated Translation Editor (CAT or TEnT)Depending on various factors: productivity boost in the range 2000 to 5000 words per dayWell established market for professionalsSlide7
CAT: Computer-Aided Translation
A generic term used to describe software which assists users during the localization/translation process
Sometimes referred to as TEnT : Translation Environment ToolOur CAT technology is an integrated toolset, offering:Translation Memory (TM)TermbaseEditing environments
Project Management functionalitySoftware LocalizationOpenExchange
What is CAT Technology?
7
Public
ProZ
Poll August 24 reply
from 1670 translators
http://www.proz.com/polls/5474Slide8
CAT technology incorporates the concept of translation memory
and termbase
Translation memory: a database consisting of translation unitsTranslation unit: source and translated sentence or paragraphDuring translation, the technology searches for exact or similar matches to the current source segment for translation
Matches found can be reused or editedTermbase: multilingual database consisting of term entries
Term entries:
terms, synonyms, acronyms, etc.
Contextual data:
definition, part of speech, gender, etc.
Translators work with a translation memory and termbase to reuse previous translations and ensure consistency of terminology during translation
What is CAT Technology?
8Slide9
A translation memory is a searchable database containing source and translated sentences or paragraphs
The translation of a segment or phrase occurs only once, as each occurrence is stored in the database
During a translation project, when the source segment re-occurs, the translation memory remembers the translation (by searching the database) and inserts it into the new document
The translator may accept the previous translation or edit the translation, if necessary
Translation Memory Overview
9Slide10
Terminology Management Overview
A termbase is a searchable database which contains a list of multilingual terms and contextual term data
Term data gives details about the origin and use of the term, such as definition, gender, context, etc.The termbase can be used in monolingual form during source content creationEnsure consistency of terminology in source documentationFacilitate translation for the global marketplaceThe termbase can be used in bilingual form in conjunction with translation memory technology to increase translation accuracyEnsure consistency of terminology in translated documentation
10Slide11
Key Productivity Accelerators11
Topic Level
document, page, fragment, chunk, …Segment Levelsentence, header, footnote, table cell, …
Subsegment Levelphrase, word, … Exclusion
from translation through markup
Translation
Memory
Auto-suggest
(dictionary
based
auto-completions)
“Perfect Matching” utilizing
bi-lingual representations Automated TranslationPlaceables, Terms
Auto-propagationConcordance
Impact on effective handling of update translationsImpact on effective handling of new translations
Impact on effective handling of document internal redundanciesImpact on consistency & qualitySlide12
Topic (Document, …) Level12
“Don’t translate if
it hasn’t changed”
(but show it to provide context
for
the text
that has
actually
changed/
added)
Significant productivity gains
dependent
on update frequency
Markup exclusions
Use ITS
/ other convention to lock text
Custom arrangements between CMS + Translation System
Perfect Matching
Compare text with predecessor translation project and lock what hasn’t changed
But, high overhead in managing corresponding projectsSlide13
Segment Level : TM13
“Don’t re-translate
if you can reuse an (approved) existing translation” (but adapt as you need)
Increasingly sophisticated match type differentiation100%, Fuzzies, Context Matches (CM), (ICE) Cascaded TMs, Ranking of TMsSignificant productivity gains dependent on Availability of relevant TMs
Similar content produced again and againSlide14
Segment Level : Automated Translations14
“Adapt an automated
translation proposal” (instead of translatingfrom scratch)Increasingly accepted by professional translatorsEspecially using Statistical Machine Translation (SMT)
Significant Productivity gains depending on SMT engine trained with sufficient, relevant (in-domain), high quality (professional translator output) data Translators are able to dynamically select “in-domain” trained engine [e.g. “Touchpoints
”]
Trust scoresSlide15
Segment Level : Auto-propagation15
“Auto-propagate
translationsfor identical source segments” (and ripple through any changes
when you change your translation)Productivity gain if text has internal repetitions
Simplifies updating identical segments throughout the content
Requires parameters to control behaviorSlide16
Subsegment Level : Auto-suggest16
“While I type, provide a list of relevant candidates so that I can quickly auto-complete this part of my translation’”
Productivity gain highly dependent on available
data-sources and proposal strategyOptimal configurations reduce keystrokes by 30 up to 50%
Avoidance of typos, impact on consistencySlide17
Subsegment Level : Placeables, terms
17
“While I type, make it easy for me to place tags,
recognised terms and other placeables so I can focus on the translatable text.’”
Productivity gain highly dependent on available
data-sources
for terminology or translator diligence, and the complexity of the tags
Avoidance of typos, impact on consistency, robust target documentsSlide18
Subsegment Level : Concordance
18
“Make it easy for me to search through Translation Memories, in both source or target text and from wherever I am in the document I’m translating’”
Biggest impact is in being able to find things you’ve translated before that are similar, or the same, as the current text and make it easy to reuse
Impacts the quality of the work you deliver
Impacts the time it takes to find the right words for complicated textsSlide19
Whereas the key technology advances are in the area of subsegment
reuse and statistical machine translation (SMT), the
actual productivity gains for a ProfessionalTranslator relate to the ergonomics of how systems allow users to interact, control and automate the various data
sources:Access, creation, chaining, weighting and sharing of TMsAccess to SMT pointing to specific enginesCompilation of phrase dictionaries on the fly
Key technology advances…
19Slide20
What Happens When Teams Grow?
When teams of three or more work together, new factors must be considered to work effectively and properly collaborate
Translators
Reviewers
Project Managers
20Slide21
Typical Package-based Workflows
Project Manager
Translator
Reviewer
Project Manager
Translator
Reviewer
or
21Slide22
...x 5 languages...
Project Manager
22Slide23
Project Manager
Typical
Project Workflowwith SDL Studio GroupShare
Project Manager creates a project
Performs analysis, pre-translation using SDL Trados Studio connected to a TM on TM Server
Project Manager publishes project
Uses Publish command in Studio, select server and location, and Studio takes care of the rest
Contact team via email, phone
23Slide24
Project Manager
Team Accesses Project
Use Studio 2011 to open project
Check out files as required for translation, review, or signoffStudio only gets files as neededProject Server tracks file versionsStudio and Project Server synchronize metadata
Typical Project Workflow
with SDL Studio GroupShare
Translator
Reviewer
24Slide25
Looking forward…25
Current theme for CAT tools – reviewer productivity
Inclusion of track changes and commenting mechanisms in translation editorAutomation in the broader production chainSlide26
… and the Studio “Platform” which includes the OpenExchange
26Slide27
The SDL OpenExchange… current state of affairs
27
57 Apps on the OpenExchange
42 are completely free
29,804 downloads
(August 2012)
7,141
app users
(August 2012)
396 developers
(August 2012)Slide28