Quoting Roger Harris <[EMAIL PROTECTED]>:

> 2003 May 30. London, U.K.
> -------------------------------------------------------------------------------
>
> And it has been said ..
> >>> The hiding of text for forcing MT in HTML is unique, although
> >>> the tagging of items to prevent translation is not. The interesting
> >>> bit would be to see how systran processes this...
>
> Systran can hardly respond correctly to an HTML document if it has not been
> programmed to recognise one.
Roger,

The RAW HTML code was not being translated, as it would have been had I
pasted it into the text box at
http://www.google.co.uk/language_tools?hl=en
Rather, I used the "translate a web page" option on that page. Here you can
paste the URL of a complete web page, which Systran then translates as an
entire web page. They have obviously developed some server-side code to
scrape the source web page, separate the textual content from the structural
markup, process the content, leave the markup alone, and then reassemble it
all into a functioning web page.

The URL I pasted into the Google translation service was
http://mull.ccl.umist.ac.uk/staff/toby/Partial_Translation_in_CALL/testing_systran/invisible_text.html

The code I wrote into my mail of this morning was (a) the source code of this
page and (b) the resulting source code of Google/Systran's English --> German
processed web page output.

So Systran was not being forced to translate raw HTML.

Toby

*************************************************************
Toby Peers
Centre for Computational Linguistics
UMIST
PO Box 88                      Tel: +44 161 200 3077
Manchester M60 1QD             Fax: +44 161 200 3091
UK                             [EMAIL PROTECTED]
*************************************************************

--
For MT-List info, see http://www.eamt.org/mt-list.html
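
For anyone who wants to see the idea in code: the following is only a minimal sketch of the kind of processing guessed at above (separate text from markup, process the text, leave the markup alone, reassemble). It is not Google's or Systran's actual code, which I have not seen. The names MarkupPreservingTranslator and translate_page are made up for illustration, and the translate callable is a hypothetical stand-in for a real MT engine. It uses Python's standard html.parser module.

```python
from html import escape
from html.parser import HTMLParser


class MarkupPreservingTranslator(HTMLParser):
    """Rebuild a page, translating text nodes and copying markup through."""

    SKIP = {"script", "style"}  # element content that is code, not prose

    def __init__(self, translate):
        super().__init__(convert_charrefs=True)
        self.translate = translate  # hypothetical callable: str -> str
        self.out = []               # pieces of the reassembled page
        self.skip_depth = 0         # > 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        # Copy the start tag exactly as it appeared in the source.
        self.out.append(self.get_starttag_text())
        if tag in self.SKIP:
            self.skip_depth += 1

    def handle_startendtag(self, tag, attrs):
        # Self-closing tags such as <br/> are copied verbatim too.
        self.out.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        self.out.append(f"</{tag}>")
        if tag in self.SKIP and self.skip_depth:
            self.skip_depth -= 1

    def handle_data(self, data):
        # Leave script/style content and pure whitespace untouched.
        if self.skip_depth or not data.strip():
            self.out.append(data)
            return
        # Translate the text but keep its surrounding whitespace.
        lead = data[: len(data) - len(data.lstrip())]
        trail = data[len(data.rstrip()):]
        translated = self.translate(data.strip())
        self.out.append(lead + escape(translated, quote=False) + trail)

    def handle_comment(self, data):
        self.out.append(f"<!--{data}-->")

    def handle_decl(self, decl):
        self.out.append(f"<!{decl}>")


def translate_page(html_source, translate):
    """Return html_source with its text nodes passed through translate."""
    parser = MarkupPreservingTranslator(translate)
    parser.feed(html_source)
    parser.close()  # flush any text still buffered by the parser
    return "".join(parser.out)


# Toy usage: a two-word lexicon stands in for the MT engine.
toy = {"hello": "hallo", "world": "Welt"}
page = '<p class="x">hello <b>world</b></p>'
print(translate_page(page, lambda s: toy.get(s, s)))
# prints: <p class="x">hallo <b>Welt</b></p>
```

Note that this sketch hands every text node to the translator, whether or not a style attribute hides it from the reader. Whether the real service does the same is exactly what the test page was meant to show.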
