Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Info

Jump to the Background for the history and background of this issue.

Table of Contents

...


Bug Reports/Observations with the new (completely rewritten) translation engine

...

December 2019

...

Below are some of the bug reports/observations noted during testing.


ICANN IT embarked on a complete rewrite of the translation tool in August 2019 and is ready to deploy the new tool around December 9 2019, unless showstopper bugs are identified.

The new tool is deployed on the testing mailing lists new-transbot-en and new-transbot-es

new-transbot-EN : http://mm.icann.org/pipermail/new-transbot-en/

new-transbot-ES :  http://mm.icann.org/pipermail/new-transbot-es/




Description of IssueNoted byDate AddedStatusAdditional Notes on testing, fixes








Bug Reports/Observations with the new translation engine (updated January 2019)

Below are some of the bug reports/observations noted during testing.


Better identification of

  • which email gives the transbot problems,
  • what and where in the email gives the transbot problems

in its error response emails.

When the translation tool has issues with the email, an email is sent from transbot-no-reply@icann.org with
the static subject line "Unable to translate your email to ICANN lists" and a template text like

Dear <sender>
Thank you for your participation in the ICANN email list new-transbot-en.
You are getting this email because we were unable to translate your post automatically.
It violated one or more of the formatting rules we must impose to make translation possible.
A complete description of the formatting rules is available at:
https://community.icann.org/x/aYtEAg
In preparing your post for translation, we found the following format violations in your message:
    <issue, usually Sentence punctuation must be followed by a space>
Please edit your post and send it again.
Thank you.Problem is there is no identification of which email this is referring to - perhaps the subject line of the problem email should be included either in the subject line or in the body of the email.
The format violations message doesn't say WHERE the error is in the email. If its a long email, then how can users identify and correct such issues. Persons getting these messages and not easily seeing where the problem is aren't likely to understand how to do future emails better and the warning messages becomes more of an hindrance.
Perhaps a workaround is to have the text of the email in the error email and identify what section the transbot has issues with.

Other questions :

  • Why is the test required? Are the reasons for this test still valid ( translation tool can only send a limited amount of characters to the Google API)
  • How are domain names handled? Since domain names can't have spaces, do domain names without beginning with http:// trigger the error?

    Missing emails from one list to another

    In 2018, it was noticed that many emails are being silently dropped from one list, mainly emails from lacralo-es not being sent to lacralo-en

    Comparing :lacralo-en archives at https://atlarge-lists.icann.org/pipermail/lac-discuss-en/ and
    lacralo-es archive at https://atlarge-lists.icann.org/pipermail/lac-discuss-es/



    Description of IssueNoted byDate AddedStatusAdditional Notes on testing, fixes

    1


    Description of IssueNoted byDate AddedStatusAdditional Notes on testing, fixes1

      

    updated  

    Status
    colourYellow
    titleIN PROGRESS

     Tool now identifies the subject line of the problem email in the body of the email

    2

    Missing emails from one list to another

    In 2018, it was noticed that many emails are being silently dropped from one list, mainly emails from lacralo-es not being sent to lacralo-en

    Comparing :
    lacralo-en archives at https://atlarge-lists.icann.org/pipermail/lac-discuss-en/ and
    lacralo-es archive at https://atlarge-lists.icann.org/pipermail/lac-discuss-es/

    Month/Year

    Number of lacralo-en emails

    Number of lacralo-es emailsMissing emails
    Dec 201863718
    Nov 201867616
    Oct 201863707
    Sept 201861547
    Aug 201876697
    Jul 20181111209
    Jun 2018667711
    May 2018
    Excellentable
    excellentable-id5


    Month/Year

    Number of lacralo-en emails

    Number of lacralo-es emailsMissing emails
    Jan 2019659227
    Dec 201863718
    Nov 201867616
    Oct 201863707
    Sept 201861547
    Aug 201876697
    Jul 20181111209
    Jun 2018667711
    May 2018
    17922950
    Apr 201811714831
    Mar 2018568024
    Feb 201861665
    Jan 20188310421
    Why is this happening?




     

    3<DNT> tag isn't case sensitive.
    Using <DNT> tag works to not translate text, but <dnt> does not.
    The tool should be able to treat the DNT tag, regardless of case.07/03/20173

    The <DNT> tag will be seen in the original email but not in the translated one.

    See EN and ES messages.
    Q: Should the <DNT> tags be removed in the original email? I'm thinking it should07/03/2017
    Status
    colourBlue
    titleON HOLD

    To achieve this, it would be necessary to re-architecture the transbot code.

    Upon investigating the issue it was discovered this is a known limitation, rather than a bug in the existing code.

    The request has been recorded as something to consider in a future iteration of the translation tool.

    4ES Translated emails seem to remove several line breaks from the EN
    See EN and ES messages
     
    5

    Re: attachments, new-transbot lists have a message and attachment limit of 200K

    Given many PDFs will be larger, will be hard to test unless message size limit is raised

    Tested with a smaller attachment, the attachment does go through. See EN and ES

    08/03/2017

    Status
    colourGreen
    titleFIXED

    The new-transbot list email size limit has been increased to 400K.

    This is enforced for the entire email, including text and attachment.

    6

    Handling an email sent to both new-transbot-en and new-transbot-es lists at the same time.

    When such an email to both lists happens, some emails don't get translated.

    09/03/2017

    Fixed Bug Reports/Observations with the new translation engine (updated January 2019)

      08Green


    2

    Better identification of

    • which email gives the transbot problems,
    • what and where in the email gives the transbot problems

    in its error response emails.

    When the translation tool has issues with the email, an email is sent from transbot-no-reply@icann.org with
    the static subject line "Unable to translate your email to ICANN lists" and a template text like

    Dear <sender>

    Thank you for your participation in the ICANN email list new-transbot-en.
    You are getting this email because we were unable to translate your post automatically.
    It violated one or more of the formatting rules we must impose to make translation possible.

    A complete description of the formatting rules is available at:
    https://community.icann.org/x/aYtEAg

    In preparing your post for translation, we found the following format violations in your message:
        <issue, usually Sentence punctuation must be followed by a space>

    Please edit your post and send it again.
    Thank you.

    The format violations message doesn't say WHERE the error is in the email. If its a long email, then how can users identify and correct such issues. Persons getting these messages and not easily seeing where the problem is aren't likely to understand how to do future emails better and the warning messages becomes more of an hindrance.

    Perhaps a workaround is to have the text of the email in the error email and identify what section the transbot has issues with.


    Other questions :

    • Why is the test required? Are the reasons for this test still valid ( translation tool can only send a limited amount of characters to the Google API)
    • How are domain names handled? Since domain names can't have spaces, do domain names without beginning with http:// trigger the error?

      

    updated  

    Status
    colourYellow
    titleIN PROGRESS

     Tool now identifies the subject line of the problem email in the body of the email

    3

    <DNT> tag isn't case sensitive.

    Using <DNT> tag works to not translate text, but <dnt> does not.
    The tool should be able to treat the DNT tag, regardless of case.

    07/03/2017Description of IssueNoted byDate AddedStatusAdditional Notes on testing, fixes

    A decimal number in an email e.g 3.9MB will trigger the error message

    "Sentence punctuation must be followed by a space"
    See EN email

    Status
    colourGreen
    titleFIXED

     Tool now handles numbers with decimal spaces.Fixed in November 2019 version of tool
    3

    The <DNT> tag will be seen in the original email but not in the translated one.

    See EN and ES messages.

    Q: Should the <DNT> tags be removed in the original email? I'm thinking it should

    07

    Re: attachments, new-transbot lists have a message and attachment limit of 200K

    Given many PDFs will be larger, will be hard to test unless message size limit is raised

    Tested with a smaller attachment, the attachment does go through. See EN and ES

    /03/2017
    Status
    colour
    Blue
    title
    FIXED

    The new-transbot list email size limit has been increased to 400K.

    This is enforced for the entire email, including text and attachment.

    Best practices and other notes on the new translation engine (March 2017)

    ...

    ON HOLD

    To achieve this, it would be necessary to re-architecture the transbot code.

    Upon investigating the issue it was discovered this is a known limitation, rather than a bug in the existing code.

    The request has been recorded as something to consider in a future iteration of the translation tool.

    4ES Translated emails seem to remove several line breaks from the EN
    See EN and ES messages
     


    5

    Re: attachments, new-transbot lists have a message and attachment limit of 200K

    Given many PDFs will be larger, will be hard to test unless message size limit is raised

    Tested with a smaller attachment, the attachment does go through. See EN and ES

    08/03/2017

    Status
    colourGreen
    titleFIXED

    The new-transbot list email size limit has been increased to 400K.

    This is enforced for the entire email, including text and attachment.

    6

    Handling an email sent to both new-transbot-en and new-transbot-es lists at the same time.

    When such an email to both lists happens, some emails don't get translated.

    09/03/2017














    Fixed Bug Reports/Observations with the new translation engine (updated January 2019)



    Description of IssueNoted byDate AddedStatusAdditional Notes on testing, fixes

    A decimal number in an email e.g 3.9MB will trigger the error message

    "Sentence punctuation must be followed by a space"
    See EN email

      

    Status
    colourGreen
    titleFIXED

     Tool now handles numbers with decimal spaces.

    Re: attachments, new-transbot lists have a message and attachment limit of 200K

    Given many PDFs will be larger, will be hard to test unless message size limit is raised

    Tested with a smaller attachment, the attachment does go through. See EN and ES

    08/03/2017

    Status
    colourGreen
    titleFIXED

    The new-transbot list email size limit has been increased to 400K.

    This is enforced for the entire email, including text and attachment.










    Best practices and other notes on the new translation engine (March 2017)

    • As a best practice, emails in Spanish should be sent to the -es list only, and emails in English should be sent to the -en list only. Unusual behavior has been observed when an email is sent to both the -es and en list at the same time. Insome cases, this results in one list receiving an un-translated and two translated emails; in other cases, the lists receive only un-translated emails. Until we have a solution for this, sending emails to only one list will reduce the risk of errors.
    • Formatting changes by email clients have been observed in translated emails. In these cases, the email engine delivered the email to the translation tool in a different format than seen in the original. Format changes have been inconsistent and generally minor, not impacting the email content. Avoiding extensive formatting and using plain text in the original email has shown to minimize impact to format.


    Anchor
    introduction
    introduction
    Background

    A custom machine translation tool for the Latin American and Caribbean Regional At-Large Organisation (LACRALO) mailing lists was implemented around early 2011. However several issues or factors continue to negatively impact the working of the machine translation tool and in turn, has lead to great difficulty in communication and collaboration with the English and Spanish speaking communities in the LAC region.


    See

    View file
    nameLACRALO-Mailing-List-Presentation-ICANN53.pdf
    height250
    presentation by the At-Large Technology Taskforce Working Group for the ICANN53 meeting which has worked to identify the issues and continues to follow this issue with ICANN Staff.


    To recap, LACRALO has two mailing lists

    ...

    A custom machine translation tool for the Latin American and Caribbean Regional At-Large Organisation (LACRALO) mailing lists was implemented around early 2011. However several issues or factors continue to negatively impact the working of the machine translation tool and in turn, has lead to great difficulty in communication and collaboration with the English and Spanish speaking communities in the LAC region.

    See

    View file
    nameLACRALO-Mailing-List-Presentation-ICANN53.pdf
    height250
    presentation by the At-Large Technology Taskforce Working Group for the ICANN53 meeting which has worked to identify the issues and continues to follow this issue with ICANN Staff.

    To recap, LACRALO has two mailing lists

    Emails in english sent to lac-discuss-en@atlarge-lists.icann.org are machine translated via your custom tool using Google Translate and posted to lac-discuss-es@atlarge-lists.icann.org.

    Similarly, emails in Spanish sent to the lac-discuss-es@atlarge-lists.icann.org are translated and posted to lac-discuss-en@atlarge-lists.icann.org.

    Issues identified since 2012

    (this section is from a June 18 2012 email)

    1) Attachments in emails sent to a list are not received on the other list.

    When an email with attachments such as PDFs is sent to one list, the subject line and body of the email is translated and sent to the other list BUT without the attachment.

    2) Subject lines of translated emails from ES to EN become garbled.

    The subject line of translated emails (seemingly) from the lac-discuss-ES list to the lac-discuss-EN list often translated to garbled text.

    Examples abound from a review of the archives.

    ...

    bgColorA9F5F2
    borderStylesolid
    titleAs One Example

    (a) First email posted to lac-discuss-en list :
    Email: http://atlarge-lists.icann.org/pipermail/lac-discuss-en/2012/005932.html
    Subject line: [lac-discuss-en] ICANN full list of applied for gTLD strings

    ...

    • LACRALO list in English: 

    ...

    ...

    • LACRALO list in Spanish: 

    ...

    ...

    Emails in english sent to

    lac-discuss

    ...

    -en@atlarge-lists.icann.org are machine translated via your custom tool using Google Translate and posted to lac-discuss-es@atlarge-lists.icann.org

    ...

    .

    Similarly, emails in Spanish sent to the lac-discuss-

    ...

    es@atlarge-lists.icann.org are translated and posted to lac-discuss-en@atlarge-lists.icann.org.

    Issues identified since 2012

    (this section is from a June 18 2012 email)

    1) Attachments in emails sent to a list are not received on the other list.

    When an email with attachments such as PDFs is sent to one list, the subject line and body of the email is translated and sent to the other list BUT without the attachment.

    2) Subject lines of translated emails from ES to EN become garbled.

    The subject line of translated emails (seemingly) from the lac-discuss-ES list to the lac-discuss-EN list often translated to garbled text.

    Examples abound from a review of the archives.


    en] =? Iso-8859-1? Q? Lista_completa_de_la_ICANN_solici? == Iso-8859-1? Q? T = F3_cadenas_de_gTLD? =
    Panel
    bgColorA9F5F2
    borderStylesolid
    titleAs One Example

    (a) First email posted to lac-discuss-en list :
    Email: http://atlarge-

    Panel
    bgColorA9F5F2
    titleAnother example

    Another example:

    Email on lac-discuss-es list : http://atlarge-lists.icann.org/pipermail/lac-discuss-es/2012/004518.html
    Subject line: [lac-discuss-es] RES: Alerta de Noticias de la ICANN - Aviso de Prórroga del período que abarca la ICANN: ICANN FY13 Proyecto de Plan Operativo y Presupuesto

    gets translated and posted as an email on lac-discuss-es list: http://atlarge-lists.icann.org/pipermail/lac-discuss-en/2012/005897005932.html
    Subject line: [lac-discuss-en] =? Utf-8? Q? RES = 3A_Alerta_de_Noticias_de_la_ICANN_? == Utf-8? Q?-_Aviso_de_Pr = C3 = C3 = ADodo_que_abarca_la_ B3rroga_del_per =? == Utf-8? Q? ICANN 3A_ICANN_FY13_Proyecto_de_Plan_Operativo_y_Presupu =? == utf-8? q? this? =

    Note the difference with "Utf-8? Q?" in this example as compared to "Iso-8859-1? Q?" in the previous example.

    Such gibberish in the subject lines can get even worse if someone responds on the EN list and the translation further scrambles the subject line on the other list.

    ICANN full list of applied for gTLD strings

    (b) which is translated and posted to lac-discuss-es list as:
    Email: http://atlarge-lists.icann.org/pipermail/lac-discuss-es/2012/004552.html
    Subject line: Lista completa de la ICANN solicitó cadenas de gTLD

    (c) Someone on the lac-discuss-es list responds posts to lac-discuss-es list as:
    Email: http://atlarge-lists.icann.org/pipermail/lac-discuss-es/2012/004553.html
    Subject line: [lac-discuss-es] Lista completa de la ICANN solicitó cadenas de gTLD

    (d) which is translated and posted to the en list as:

    Panel

    Again, examples abound from a review of the archives but as one example, consider the subject line for an email on lac-discuss-es list
    Email: http://atlarge-lists.icann.org/pipermail/lac-discuss-esen/2012/004039005933.html
    Subject line: [lac-discuss-esen] =? Iso-8859-1? Q? Invitación = F3n_aLista_completa_de_la_reuni = F3n_/_LACICANN_solici? == Iso-8859-1? Q? RALO_Costa_Rica_Eventos_rueda_de_prensa_Grupo_de_Tr? == Iso-8859-1? Q? Abajo_el_martes_06_de_marzo_2012_a_las_20 = 3A00_UTC? =which gets translated and posted to the EN list as
    Email: T = F3_cadenas_de_gTLD? =


    Panel
    bgColorA9F5F2
    titleAnother example

    Another example:

    Email on lac-discuss-es list : http:http://atlarge-lists.icann.org/pipermail/lac-discuss-enes/2012/005357004518.html
    Subject line: [lac-discuss-en] =? Iso-8859-1? Q? = 3D = 3F_Iso-8859-1 = 3F_Q = 3F_Invitac? == Iso-8859-1? Q? I = F3n_ = 3D_F3n = 5FA = 5Fla = 5Freuni_ = 3D_F3n = 5F / = 5FLAC = 3F? == iso-8859-1? q? _ = 3D = 3D_Iso-8859-1 = 3F_Q = 3F_RALO = 5FCosta = 5FRica = 5FEv? == iso-8859-1? q ? ents = 5Frueda = 5Fde = 5Fprensa = 5FGrupo = 5Fde = 5FTr = 3F_? == iso-8859-1? q? = 3D = 3D_Iso-8859-1 = 3F_Q = 3F_Abajo = 5Fel 5Fmartes = 5F06 =? == iso-8859-1? q? = 5Fde = 5Fmarzo = 5F2012 = 5FA = 5Flas = 5F20_ = 3D_3A00 = 5FUTC? == iso-8859-1? q? = 3F_ = 3D? =

    3) Missing [lac-discuss-es] in subject lines of translated emails posted to the lac-discuss-es list

    ...

    es] RES: Alerta de Noticias de la ICANN - Aviso de Prórroga del período que abarca la ICANN: ICANN FY13 Proyecto de Plan Operativo y Presupuesto

    gets translated and posted as an email on lac-discuss-es list: http://atlarge-lists.icann.org/pipermail/lac-discuss-en/2012/005897.html
    Subject line: [lac-discuss-en] =? Utf-8? Q? RES = 3A_Alerta_de_Noticias_de_la_ICANN_? == Utf-8? Q?-_Aviso_de_Pr = C3 = C3 = ADodo_que_abarca_la_ B3rroga_del_per =? == Utf-8? Q? ICANN 3A_ICANN_FY13_Proyecto_de_Plan_Operativo_y_Presupu =? == utf-8? q? this? =

    Note the difference with "Utf-8? Q?" in this example as compared to "Iso-8859-1? Q?" in the previous example.


    Such gibberish in the subject lines can get even worse if someone responds on the EN list and the translation further scrambles the subject line on the other list.


    Panel

    Again, examples abound from a review of the archives but as one example, consider the subject line for an email on lac-discuss-es list


    Email: http://atlarge-lists.icann.org/pipermail/lac-discuss-es/2012/004039.html
    Subject line: [lac-discuss-es] =? Iso-8859-1? Q? Invitación = F3n_a_la_reuni = F3n_/_LAC? == Iso-8859-1? Q? RALO_Costa_Rica_Eventos_rueda_de_prensa_Grupo_de_Tr? == Iso-8859-1? Q? Abajo_el_martes_06_de_marzo_2012_a_las_20 = 3A00_UTC? =

    which gets translated and posted to the EN list as
    Email: http://atlarge-lists.icann.org/pipermail/lac-discuss-en/2012/005357.html
    Subject line: [lac-discuss-en] =? Iso-8859-1? Q? = 3D = 3F_Iso-8859-1 = 3F_Q = 3F_Invitac? == Iso-8859-1? Q? I = F3n_ = 3D_F3n = 5FA = 5Fla = 5Freuni_ = 3D_F3n = 5F / = 5FLAC = 3F? == iso-8859-1? q? _ = 3D = 3D_Iso-8859-1 = 3F_Q = 3F_RALO = 5FCosta = 5FRica = 5FEv? == iso-8859-1? q ? ents = 5Frueda = 5Fde = 5Fprensa = 5FGrupo = 5Fde = 5FTr = 3F_? == iso-8859-1? q? = 3D = 3D_Iso-8859-1 = 3F_Q = 3F_Abajo = 5Fel 5Fmartes = 5F06 =? == iso-8859-1? q? = 5Fde = 5Fmarzo = 5F2012 = 5FA = 5Flas = 5F20_ = 3D_3A00 = 5FUTC? == iso-8859-1? q? = 3F_ = 3D? =

    3) Missing [lac-discuss-es] in subject lines of translated emails posted to the lac-discuss-es list


    Consider example #1 again -
    First email posted to lac-discuss-en list :
    Email: http://atlarge-lists.icann.org/pipermail/lac-discuss-en/2012/005932.html
    Subject line: [lac-discuss-en] ICANN full list of applied for gTLD strings

    which is translated and posted to lac-discuss-es list as:
    Email: http://atlarge-lists.icann.org/pipermail/lac-discuss-es/2012/004552.html
    Subject line: Lista completa de la ICANN solicitó cadenas de gTLD

    The email at http://atlarge-lists.icann.org/pipermail/lac-discuss-es/2012/004552.html shows that the subject line is missing the [lac-discuss-es]. This hampers filtering by ES users and makes it difficult to track threaded conversations.

    4) Unusual superscript and other odd characters in translated emails

    There have been numerous complaints about the quality of the translation of the actual body of emails with strange characters, some of which are superscript characters appearing in the translated version.
    Examples aboud (repeating phrase, I'm afraid) on the LACRALO list archives, here is one example:
    Email to lac-discuss-en : http://atlarge-lists.icann.org/pipermail/lac-discuss-en/2012/005858.html
    got translated to this on the lac-discuss-es list: http://atlarge-lists.icann.org/pipermail/lac-discuss-es/2012/004483.html

    As you can see,
    * a character like a double quote " is translated to &quot;
    * a word like "organisation" is translated to organización
    * a sentence like "The highest decision making body in any organisation is also subject to rules." is translated to
    "El más alto órgano de decisión en cualquier la organización también está sujeto a reglas."

    History of machine translation of LACRALO mailing lists

    which is translated and posted to lac-discuss-es list as:
    Email: http://atlarge-lists.icann.org/pipermail/lac-discuss-es/2012/004552.html
    Subject line: Lista completa de la ICANN solicitó cadenas de gTLD

    The email at http://atlarge-lists.icann.org/pipermail/lac-discuss-es/2012/004552.html shows that the subject line is missing the [lac-discuss-es]. This hampers filtering by ES users and makes it difficult to track threaded conversations.

    4) Unusual superscript and other odd characters in translated emails

    ...

    • The LACRALO list in English: http://atlarge-lists.icann.org/pipermail/lac-discuss-en/ and LACRALO list in Spanish: http://atlarge-lists.icann.org/pipermail/lac-discuss-es/ were machine translated around September 2008. The translation engine was based on Systran.
    • The quality of translation left a lot to be desired in terms of the language translation. Also, there were technical issues such as the subject lines of translated emails becoming garbled.
    • During the LACRALO meeting in Cartagena in 2010, there were intense discussions regarding the quality of mailing list translations. Some wanted to disable the translation entirely. Others wanted to keep the imperfect translation for mobile  users and disabling translation would send the wrong signal to ICANN when ICANN should be giving language services a higher priority. ICANN staff acknowledged the issues and stressed that IT staff were aware of the issues and would be looking to solve the translation problems.
    • Around May 2011, the translation engine was switched from Systran to Google Translate.
    • in May 2011, Google announced that it would be depreciating the (then) free Google Translate API (http://googlecode.blogspot.com/2011/05/spring-cleaning-for-some-of-our-apis.html) in a few months but later (due to public outcry) announced that a paid version of the translate API would be offered.
    • Other issues noted around June 2011 were identified, one was the translation engine would stop translating emails when they became too big. This meant that a conversation thread would stop being translated leading to missing emails from one list not being translated to the other list. See 

    ...

    ...

    ...

    •  for the solution to this (translation tool would stop translating quoted text)
    • Around August 2011, the translation engine was switched back to Systran due to Google rejecting calls to the Google Translate API. See http://atlarge-lists.icann.org/pipermail/lac-discuss-

    ...

    ...

    As you can see,
    * a character like a double quote " is translated to &quot;
    * a word like "organisation" is translated to organización
    * a sentence like "The highest decision making body in any organisation is also subject to rules." is translated to
    "El más alto órgano de decisión en cualquier la organización también está sujeto a reglas."

    History of machine translation of LACRALO mailing lists

    • The LACRALO list in English: http://atlarge-lists.icann.org/pipermail/lac-discuss-en/ and LACRALO list in Spanish: http://atlarge-lists.icann.org/pipermail/lac-discuss-es/ were machine translated around September 2008. The translation engine was based on Systran.
    • The quality of translation left a lot to be desired in terms of the language translation. Also, there were technical issues such as the subject lines of translated emails becoming garbled.
    • During the LACRALO meeting in Cartagena in 2010, there were intense discussions regarding the quality of mailing list translations. Some wanted to disable the translation entirely. Others wanted to keep the imperfect translation for mobile  users and disabling translation would send the wrong signal to ICANN when ICANN should be giving language services a higher priority. ICANN staff acknowledged the issues and stressed that IT staff were aware of the issues and would be looking to solve the translation problems.
    • Around May 2011, the translation engine was switched from Systran to Google Translate.
    • in May 2011, Google announced that it would be depreciating the (then) free Google Translate API (http://googlecode.blogspot.com/2011/05/spring-cleaning-for-some-of-our-apis.html) in a few months but later (due to public outcry) announced that a paid version of the translate API would be offered.
    • Other issues noted around June 2011 were identified, one was the translation engine would stop translating emails when they became too big. This meant that a conversation thread would stop being translated leading to missing emails from one list not being translated to the other list. See http://atlarge-lists.icann.org/pipermail/lac-discuss-en/2011/004308.html for the solution to this (translation tool would stop translating quoted text)
    • Around August 2011, the translation engine was switched back to Systran due to Google rejecting calls to the Google Translate API. See http://atlarge-lists.icann.org/pipermail/lac-discuss-en/2011/004368.html
    • Update to the translation engine: September 2011, http://atlarge-lists.icann.org/pipermail/lac-discuss-en/2011/004498.html
    • June 2012, detailed analysis of translation issues (as noted in the introduction above)

    Testing of new translation tool for LACRALO mailing lists

    ICANN Staff have created two mailing lists (New-transbot-en and New-transbot-es) with a select number of persons on those lists for testing purposes.

    Some of the key changes implemented in the new translation tool.

    ...

    ...

    Testing of new translation tool for LACRALO mailing lists

    ICANN Staff have created two mailing lists (New-transbot-en and New-transbot-es) with a select number of persons on those lists for testing purposes.

    Some of the key changes implemented in the new translation tool.

    • The lack of punctuation was identified as a key issue for the poor translation of emails. This is because the translate tool can only send a certain amount of characters to the Google Translate API. Without punctuation, the translation tool would have to send text mid sentence.
      One of the outcomes from the LACRALO translation WG was the Proposed Notice when email is not translated message which would be sent to the user if the email had formatting issues.
    • Subject lines would not be translated to ensure the conversation thread would not be lost and reduce the chance of garbled subject lines.  


    FY17 update

    The TTF filed a budget request to the At-Large FBSC in FY17 for ICANN to finance the hiring of a programmer to assist the volunteer ICANN staff member to fix outstanding bugs - see At-Large FY17 Budget Development Workspace , this was approved by the At-Large FBSC and filed with ICANN Finance. On the 2016-08-08 At-Large Technology Taskforce Call, ICANN Staff member Corinna Ace confirmed that a programmer/developer has been hired to sort out the remaining bugs.

    To test the new translation tool, two test email lists: new-transbot-en and new-transbot-es were created and TTF volunteers and ICANN staff joined these lists to test the translation and to report bugs at discussion-of-LACRALO-mailing-list-issues page.

    New versions of the translation tool were deployed to these transbot lists in late Dec 2016 and March 2017. The March 2017 update introduced new features

    • translated emails will also include attachments (TXT, PDF, WORD, JPEG, PPT, PNG, GIF) from the original email
    • If there is text that you don't want translated, you can enclose such text with a <DNT></DNT> tags

    In May 2017, emails from the existing LACRALO mailing lists were reposted to the new transbot lists  to get a sense of how the new tool will handle how current users are using the email lists.

    Since ICANN59, the TTF chairs have been discussing with Mark Segall and Corinna Ace from ICANN IT and with Silvia Vivanco and Mario Aleman from ICANN At-Large Staff on implementing the new version of the translation tool developed by ICANN IT on the existing LACRALO mailing lists.

    To minimize the issue of persons posting to both lists at the same time which would create problems, members of LACRALO will be asked via online survey at https://goo.gl/forms/sEOEWqacRYLPk2Xc2 to indicate
    * which lac discuss list do you wish to RECEIVE emails from (English, Spanish, or both)
    * which lac discuss list do you wish to be able to SEND emails to. You can post to one list.

    A conference call for LACRALO members was held on Tuesday Sept 5 2017 (see recordings at https://community.icann.org/x/yh8hB) to raise awareness of the planned changes to the translation tool used for the LACRALO mailing lists and what persons on the LACRALO lists need to do to prepare for the changes.

    The tool was deployed to the main LACRALO lists on October 6 2017.

    ...




    ...

    Testers of new translation tool for LACRALO mailing lists

    ...