Domain-Specific Neural Machine Translation

Introduction

ULG created and trained domain-specific Neural Machine Translation (NMT) engines and tested their effectiveness across multiple domains. The result is a methodology that can be applied across industries and clients to improve output quality, even with non-traditional languages.

ULG provides Neural Machine Translation (NMT) in over 130 translation directions for global clients across multiple markets and domains.

Each engine is designed to translate within a generic subject area. Integrating NMT into the production workflow required domain engines, so ULG built domain-specific engines for medical devices, pharmaceuticals, heavy machinery, IFU-DFU and healthcare.

Chapter 1

THE CHALLENGE

Developing domain-specific NMT engines is a unique undertaking that poses several distinct challenges.

For one, the NMT team requires a much larger pool of data and content to build out the terminology and segments. Additionally, to prepare the NMT engines for the basic production domains, ULG had to ensure consistency in the cleaned and aligned data, train the NMT engines to provide higher output quality, and then test integration quality in both standalone and production workflows.

Chapter 2

THE SOLUTION

ULG created custom domain-specific NMT engines that increased the quality scores of the translation output.

Translation quality is scored using multiple evaluation methods. These include the standard BLEU score, an automatically computed metric that compares the NMT output file against a reference translation file. ULG also uses TER (Translation Edit Rate, an automated edit-distance score) and independent distance scoring performed by a team of linguists to determine and validate the quality of NMT output.
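As a rough illustration of how automated metrics of this kind compare engine output with a reference, the sketch below computes a simplified sentence-level BLEU score and a word-level edit rate. It is not ULG's scoring pipeline: the function names and example sentences are hypothetical, the BLEU variant uses a single reference with no smoothing, and the edit rate omits the phrase shifts that full TER allows. Production scoring would normally rely on an established evaluation tool.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return all contiguous n-grams of a token list as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def simple_bleu(hypothesis, reference, max_n=4):
    """Simplified sentence-level BLEU on a 0-100 scale.

    Assumptions: whitespace tokenization, one reference, no smoothing,
    so any n-gram order with zero matches yields a score of 0.
    """
    hyp, ref = hypothesis.split(), reference.split()
    if not hyp:
        return 0.0
    log_precisions = []
    for n in range(1, max_n + 1):
        hyp_counts = Counter(ngrams(hyp, n))
        ref_counts = Counter(ngrams(ref, n))
        # Clipped matches: an n-gram counts at most as often as it occurs in the reference.
        overlap = sum(min(count, ref_counts[gram]) for gram, count in hyp_counts.items())
        total = sum(hyp_counts.values())
        if overlap == 0 or total == 0:
            return 0.0
        log_precisions.append(math.log(overlap / total))
    # Brevity penalty: penalize output that is shorter than the reference.
    brevity_penalty = 1.0 if len(hyp) > len(ref) else math.exp(1 - len(ref) / len(hyp))
    return 100 * brevity_penalty * math.exp(sum(log_precisions) / max_n)


def word_edit_rate(hypothesis, reference):
    """Word-level edit distance divided by reference length (TER without shifts)."""
    hyp, ref = hypothesis.split(), reference.split()
    prev = list(range(len(ref) + 1))
    for i, hyp_word in enumerate(hyp, start=1):
        curr = [i]
        for j, ref_word in enumerate(ref, start=1):
            cost = 0 if hyp_word == ref_word else 1
            curr.append(min(prev[j] + 1,          # delete a hypothesis word
                            curr[j - 1] + 1,      # insert a reference word
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1] / max(len(ref), 1)


if __name__ == "__main__":
    # Hypothetical sentences, for illustration only.
    reference = "insert the catheter into the device slowly"
    engine_output = "put the catheter in the device slowly"
    print("BLEU:", simple_bleu(engine_output, reference))
    print("Edit rate:", word_edit_rate(engine_output, reference))
```

A higher BLEU score and a lower edit rate both indicate output closer to the reference. Scores are normally computed over a whole test set rather than a single sentence, which is why this unsmoothed sentence-level variant can drop to zero on short segments.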

When creating the new engines, ULG’s NMT team followed the standard engine customization process. Comparing the scores of the new domain engines with the score of the in-market engine showed significant BLEU score increases.

Chapter 3

PROVEN RESULTS

After creating domain-specific engines, the actual scores in live production projects showed that:

In all cases, the lowest score of a live project was higher than the score of the original engine.
Average scores of live projects were between 13 and 26 points higher than those of the original engine.


These results indicate that ULG’s domain-specific NMT aligns with client content requirements and drives higher-quality output.

The same methodology the ULG team used to train the domain-specific engines is also used for client-specific engines. It improves the accuracy of terminology, corpora and overall content, which supports faster turnarounds, lower costs and higher quality.

How domain‑specific Neural Machine Translation solutions support greater output accuracy and cost efficiencies.