2003 May 30. London, U.K. ---------------------------------------------------------------------------- ---
And it has been said .. >>> The hiding of text for forcing MT in HTML is unique, although >>> the tagging of items to prevent translation is not. The interesting >>> bit would be to see how systran processes this... Systran can hardly respond correctly to an HTML document if it has not been programmed to recognise one. For an HTML document, the filename extension is either .htm or .html but both are correct. Also, the first two lines of the text should be something like ... <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2//EN"> <HTML> or <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN"> <HTML> This is the coding which tells a browser such as Netscape, Internet Explorer or Opera how to select the method of rendering the text on the computer screen. If a browser can recognise an HTML document then so should Systran, but only if it has been programmed to do so. I haven't tried this yet but it should be possible to discover how Systran handles an HTML file by sending it various test files and noting the results: filename: TEST-1.TXT content: <BR ALT="Do not translate this line">Translate this message<BR> filename:TEST-2.HTM content: <BR ALT="Do not translate this line">Translate this message<BR> filename: TEST-3.TXT content: <HTML><BR ALT="Do not translate this line">Translate this message<BR></HTML> filename: TEST-4.HTM content: <HTML><BR ALT="Do not translate this line">Translate this message<BR></HTML> The <BR> tag normally causes a line-feed and ALT="" is used to incorporate non-visible text. If you place the cursor over an image and a line of text appears on the screen then it is text which was placed within an ALT="". The file contents will fail an HTML validation but will be displayed by a browser. Kind regards, Roger Harris. ----- Original Message ----- From: "Toby Peers" <[EMAIL PROTECTED]> To: "Florence Reeder" <[EMAIL PROTECTED]> Cc: <[EMAIL PROTECTED]> Sent: Friday, May 30, 2003 10:54 AM Subject: Re: [MT-list] Is this hidden text technique original? > Quoting Florence Reeder <[EMAIL PROTECTED]>: > > > Toby, > > > > The hiding of text for forcing MT in HTML is unique, although > > the tagging of items to prevent translation is not. The interesting > > bit would be to see how systran processes this... Do they ignore > > the "hidden" text or do they process it as a separate translation > > unit? > > > > flo > > -- For MT-List info, see http://www.eamt.org/mt-list.html
