Re: Line Breaks Dropped on Importing Unicode Text

2009-09-04 Thread Jim Ault
Why would the filename ever have any line endings? On Sep 3, 2009, at 10:39 PM, Sivakatirswami wrote: I have some UTF-16 unicode raw text. If I import this into Pages, it displays the font correctly and also the line breaks between paragraphs correctly But if I use this function: on

Re: Line Breaks Dropped on Importing Unicode Text

2009-09-04 Thread Kenji Kojima
Hi Sivakatirswami, I have some UTF-16 unicode raw text. If I import this into Pages, it displays the font correctly and also the line breaks between paragraphs correctly But if I use this function: on mouseUp answer file Choose a unicode file to read in. if it is empty then exit mouseUp

Re: Line Breaks Dropped on Importing Unicode Text

2009-09-04 Thread Sadhu Nadesan
Namaste, Swami I am not sure of the answer to your question, but to debug problems like this I use the Unix utility od for example, with the -c flag. (od = octal dump). Then you can see exactly what characters are in your original, and/or, an example that works right, that need to be

Re: Line Breaks Dropped on Importing Unicode Text

2009-09-04 Thread Joe F.
The trick is to use uniencode/unidecode for everything. Three separate examples: ask file Name new file: with NewFile.xml put binfile: it into theNewFileName get the unicodetext of cd fld 1 put unidecode(it,utf8) into url (theNewFileName) -- set the

Re: Line Breaks Dropped on Importing Unicode Text

2009-09-04 Thread Sivakatirswami
Aloha, Joe: I'm not quite sure how your suggestion relates to the problem of endlines. The unicode.txt file I have is being read OK in Pages on the mac. It also loads just fine in Rev, with the exception of the line breaks I'm not sure where the uniencode/unidecode could be used to solve the

Re: Line Breaks Dropped on Importing Unicode Text

2009-09-04 Thread stephen barncard
Why are you replacing the CRs with LFs? doesn't the engine's Unicode functions handle line endings? - Stephen Barncard San Francisco http://houseofcubes.com/disco.irev 2009/9/4 Sivakatirswami ka...@hindu.org Aloha, Joe: I'm not quite sure how your suggestion relates

Re: Line Breaks Dropped on Importing Unicode Text

2009-09-04 Thread Jim Ault
On Sep 4, 2009, at 3:01 PM, Sivakatirswami wrote: Aloha, Joe: I'm not quite sure how your suggestion relates to the problem of endlines. The unicode.txt file I have is being read OK in Pages on the mac. It also loads just fine in Rev, with the exception of the line breaks I'm not sure

Re: Line Breaks Dropped on Importing Unicode Text

2009-09-04 Thread Richard Gaskin
My fellow petroglyph prowler Jim Ault wrote: Some programs, like BBEdit, try to detect the encoding, but this does not always work. I think the BOM is supposed to be the flag for the encoding, but this is not always clear. In BBEdit one of the File menu commands is Reopen Using Encoding

Re: Line Breaks Dropped on Importing Unicode Text

2009-09-04 Thread Sivakatirswami
stephen barncard wrote: Why are you replacing the CRs with LFs? doesn't the engine's Unicode functions handle line endings? - Stephen Barncard San Francisco http://houseofcubes.com/disco.irev 2009/9/4 Sivakatirswami ka...@hindu.org Aloha, Joe: I'm not quite sure

Re: Line Breaks Dropped on Importing Unicode Text

2009-09-04 Thread Joe F.
Aloha (in NYC we just say howyadoin'?) I had some trouble getting unicode text to do what I expected in Rev. My understanding at the moment is this: Rev handles text internally as utf16, that's the unicodetext. No matter where you put it or display it, it's still the unicodetext. Rev

Line Breaks Dropped on Importing Unicode Text

2009-09-03 Thread Sivakatirswami
I have some UTF-16 unicode raw text. If I import this into Pages, it displays the font correctly and also the line breaks between paragraphs correctly But if I use this function: on mouseUp answer file Choose a unicode file to read in. if it is empty then exit mouseUp put binfile: it into