AT-LARGE GATEWAY
At-Large Regional Policy Engagement Program (ARPEP)
At-Large Review Implementation Plan Development
Page History
Identification of LACRALO translation issues
(this section is from a June 18 2012 email)
Dear David,
With the implementation of the custom machine translation tool for the Latin American and Caribbean Regional At-Large Organisation (LACRALO) mailing lists since mid last year (2011), several issues or factors continue to negatively impact the working of the machine translation tool and in turn, has lead to great difficulty in communication and collaboration with the English and Spanish speaking communities in the LAC region.
...
To date, several issues have been detected
1) Attachments in emails sent to a list are not received on the other list.
...
Panel | ||||||
---|---|---|---|---|---|---|
| ||||||
(a) First email posted to lac-discuss-en list : (b) which is translated and posted to lac-discuss-es list as: (c) Someone on the lac-discuss-es list responds posts to lac-discuss-es list as: (d) which is translated and posted to the en list as: |
...
Panel |
---|
...
|
...
|
...
| ||
Another example: Email on lac-discuss-es list : http://atlarge-lists.icann.org/pipermail/lac-discuss-es/2012/004518.html |
...
|
...
line: |
...
[lac-discuss-es] |
...
RES: |
...
Alerta |
...
de |
...
Noticias |
...
de |
...
la |
...
ICANN |
...
- |
...
Aviso |
...
de |
...
Prórroga |
...
del |
...
período |
...
que |
...
abarca |
...
la |
...
ICANN: |
...
ICANN |
...
FY13 |
...
Proyecto |
...
de |
...
Plan |
...
Operativo |
...
y |
...
Presupuesto |
...
gets |
...
translated |
...
and |
...
posted |
...
as |
...
an |
...
...
on |
...
lac-discuss-es |
...
list: |
...
http://atlarge-lists.icann.org/pipermail/lac-discuss-en/2012/005897.html |
...
|
...
line: |
...
[lac-discuss-en] |
...
=? |
...
Utf-8? |
...
Q? |
...
RES |
...
= |
...
3A_Alerta_de_Noticias_de_la_ICANN_? |
...
== |
...
Utf-8? |
...
Q?-_Aviso_de_Pr |
...
= |
...
C3 |
...
= |
...
C3 |
...
= |
...
ADodo_que_abarca_la_ |
...
B3rroga_del_per |
...
=? |
...
== |
...
Utf-8? |
...
Q? |
...
ICANN |
...
3A_ICANN_FY13_Proyecto_de_Plan_Operativo_y_Presupu |
...
=? |
...
== |
...
utf-8? |
...
q? |
...
this? |
...
= |
...
Note |
...
the |
...
difference |
...
with |
...
"Utf-8? |
...
Q?" |
...
in |
...
this |
...
example |
...
as |
...
compared |
...
to |
...
"Iso-8859-1? |
...
Q?" |
...
in |
...
the |
...
previous |
...
example. |
...
Such gibberish in the subject lines can get even worse if someone responds on the EN list and the translation further scrambles the subject line on the other list.
Panel |
---|
Again, examples abound from a review of the archives but as one example, consider the subject line for an email on lac-discuss-es |
...
list |
...
|
...
http://atlarge-lists.icann.org/pipermail/lac-discuss-es/2012/004039.html |
...
|
...
line: |
...
[lac-discuss-es] |
...
=? |
...
Iso-8859-1? |
...
Q? |
...
Invitación |
...
= |
...
F3n_a_la_reuni |
...
= |
...
F3n_/_LAC? |
...
== |
...
Iso-8859-1? |
...
Q? |
...
RALO_Costa_Rica_Eventos_rueda_de_prensa_Grupo_de_Tr? |
...
== |
...
Iso-8859-1? |
...
Q? |
...
Abajo_el_martes_06_de_marzo_2012_a_las_20 |
...
= |
...
3A00_UTC? |
...
= |
...
which |
...
gets |
...
translated |
...
and |
...
posted |
...
to |
...
the |
...
EN |
...
list |
...
as |
...
|
...
http://atlarge-lists.icann.org/pipermail/lac-discuss-en/2012/005357.html |
...
|
...
line: |
...
[lac-discuss-en] |
...
=? |
...
Iso-8859-1? |
...
Q? |
...
= |
...
3D |
...
= |
...
3F_Iso-8859-1 |
...
= |
...
3F_Q |
...
= |
...
3F_Invitac? |
...
== |
...
Iso-8859-1? |
...
Q? |
...
I |
...
= |
...
F3n_ |
...
= |
...
3D_F3n |
...
= |
...
5FA |
...
= |
...
5Fla |
...
= |
...
5Freuni_ |
...
= |
...
3D_F3n |
...
= |
...
5F |
...
/ |
...
= |
...
5FLAC |
...
= |
...
3F? |
...
== |
...
iso-8859-1? |
...
q? |
...
_ |
...
= |
...
3D |
...
= |
...
3D_Iso-8859-1 |
...
= |
...
3F_Q |
...
= |
...
3F_RALO |
...
= |
...
5FCosta |
...
= |
...
5FRica |
...
= |
...
5FEv? |
...
== |
...
iso-8859-1? |
...
q |
...
? |
...
ents |
...
= |
...
5Frueda |
...
= |
...
5Fde |
...
= |
...
5Fprensa |
...
= |
...
5FGrupo |
...
= |
...
5Fde |
...
= |
...
5FTr |
...
= |
...
3F_? |
...
== |
...
iso-8859-1? |
...
q? |
...
= |
...
3D |
...
= |
...
3D_Iso-8859-1 |
...
= |
...
3F_Q |
...
= |
...
3F_Abajo |
...
= |
...
5Fel |
...
5Fmartes |
...
= |
...
5F06 |
...
=? |
...
== |
...
iso-8859-1? |
...
q? |
...
= |
...
5Fde |
...
= |
...
5Fmarzo |
...
= |
...
5F2012 |
...
= |
...
5FA |
...
= |
...
5Flas |
...
= |
...
5F20_ |
...
= |
...
3D_3A00 |
...
= |
...
5FUTC? |
...
== |
...
iso-8859-1? |
...
q? |
...
= |
...
3F_ |
...
= |
...
3D? |
...
= |
3)
...
Missing
...
[lac-discuss-es]
...
in
...
subject
...
lines
...
of
...
translated
...
emails
...
posted
...
to
...
the
...
lac-discuss-es
...
list
Consider example #1 again -
First email posted to lac-discuss-en
...
list
...
:
...
Email:
...
http://atlarge-lists.icann.org/pipermail/lac-discuss-en/2012/005932.html
...
Subject
...
line:
...
[lac-discuss-en]
...
ICANN
...
full
...
list
...
of
...
applied
...
for
...
gTLD
...
strings
...
which
...
is
...
translated
...
and
...
posted
...
to
...
lac-discuss-es
...
list
...
as:
...
Email:
...
http://atlarge-lists.icann.org/pipermail/lac-discuss-es/2012/004552.html
...
Subject
...
line:
...
Lista
...
completa
...
de
...
la
...
ICANN
...
solicitó
...
cadenas
...
de
...
gTLD
...
The
...
...
at
...
http://atlarge-lists.icann.org/pipermail/lac-discuss-es/2012/004552.html
...
shows
...
that
...
the
...
subject
...
line
...
is
...
missing
...
the
...
[lac-discuss-es].
...
This
...
hampers
...
filtering
...
by
...
ES
...
users
...
and
...
makes
...
it
...
difficult
...
to
...
track
...
threaded
...
conversations
...
.
4)
...
Unusual
...
superscript
...
and
...
other
...
odd
...
characters
...
in
...
translated
...
emails
...
There
...
have
...
been
...
numerous
...
complaints
...
about
...
the
...
quality
...
of
...
the
...
translation
...
of
...
the
...
actual
...
body
...
of
...
emails
...
with
...
strange
...
characters,
...
some
...
of
...
which
...
are
...
superscript
...
characters
...
appearing
...
in
...
the
...
translated
...
version.
...
Examples
...
aboud
...
(repeating
...
phrase,
...
I'm
...
afraid)
...
on
...
the
...
LACRALO
...
list
...
archives,
...
here
...
is
...
one
...
example:
...
Email
...
to
...
lac-discuss-en
...
:
...
http://atlarge-lists.icann.org/pipermail/lac-discuss-en/2012/005858.html
...
got
...
translated
...
to
...
this
...
on
...
the
...
lac-discuss-es
...
list:
...
http://atlarge-lists.icann.org/pipermail/lac-discuss-es/2012/004483.html
...
As
...
you
...
can
...
see,
...
*
...
a
...
character
...
like
...
a
...
double
...
quote
...
"
...
is
...
translated
...
to
...
"
...
*
...
a
...
word
...
like
...
"organisation"
...
is
...
translated
...
to
...
organización
...
*
...
a
...
sentence
...
like
...
"The
...
highest
...
decision
...
making
...
body
...
in
...
any
...
organisation
...
is
...
also
...
subject
...
to
...
rules."
...
is
...
translated
...
to
...
"El
...
más
...
alto
...
órgano
...
de
...
decisión
...
en
...
cualquier
...
la
...
organización
...
también
...
está
...
sujeto
...
a
...
reglas."
...
This is a summary of the key issues affecting the machine translation of emails in LACRALO. I hope to have the opportunity to chat with you in Prague
and trust the identification of the issues in this email is sufficient to clarify the problems so that solutions can be developed.
Kind Regards,
Dev Anand Teelucksingh
...