Google watermarked their translation output: https://research.google.com/pubs/archive/37162.pdf
Would be good to check if they're still doing this with neural systems. On 04/06/2018 09:14 AM, Mathias Müller wrote: > Hi Ryan > > My two cents: > > First of all, a way of detecting Google-translated text might be to take > the original text and run it through Google Translate (if they have > access to an API, I mean). Then see if an alleged machine > translated-text is similar to this. > > Here is another pointer to literature for you: > > http://www.aclweb.org/anthology/P14-2048 > > Regards > Mathias > >> On 6 Apr 2018, at 04:14, Ryan Coughlin <[email protected] >> <mailto:[email protected]>> wrote: >> >> Hi all, >> >> Hope you all had a happy Easter and are having a good spring (or >> autumn). >> >> One of the admins from Open Subtitles asked me if I knew of a way to >> detect if a translation was machine or machine or human translation. >> The website seems to have a lot of submissions that are simply Google >> translated, and they're looking for a script to flag these >> submissions. Do you have any ideas how they might go about this? >> >> I was thinking of running the SL against a few of the popular SMT >> programs, and seeing if the BLEU scores were too similar, but the >> method seems too random and computationally heavy. I'd appreciate any >> thoughts. >> >> thank you, >> Ryan >> _______________________________________________ >> Moses-support mailing list >> [email protected] <mailto:[email protected]> >> http://mailman.mit.edu/mailman/listinfo/moses-support > > > > _______________________________________________ > Moses-support mailing list > [email protected] > http://mailman.mit.edu/mailman/listinfo/moses-support > _______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
