Re: [OT] Japanese (correction about Mail)

2003-06-23 Thread Joel Rees
I took another look at some garbled spam I seem to be picking up
regularly, which I had mistakenly assumed to be from a Korean source,
and it looks like Apple's mail app in 10.2.4 is _not_ handling 7-bit JIS
correctly. More later.

But, while I was checking that, I checked the following:

 I managed to find the kanji I asked the person about with the charecter palette
 description she gave, but it was or could be described otherwise as: Unicode
 5782, JIS(X0213) 1-31-66, Shift JIS(X0208) 9082: 

tarasu/tareru (hang down)

 and it was mojibake'ed as
 ($BEZ(B)

That kind of looks like seven-bit JIS. The $B is a piece of a control
sequence when mixing 7-bit JIS with 7-bit ANSI.

EZ is the 7-bit JIS for tsuchi (earth, dirt). And BE is 7-bit JIS for
the da in datou (valid). Nope. Something else happened to that.

 Some other codes she sent, and hence probably in the same encoding,
 were $B7V(B and $Bj%(B both for hotaru.

7V is the 7-bit JIS for hotaru (firefly). j% is 7-bit JIS for a more
traditional rendering of hotaru.

Here's the meat of the source of a C tool I wrote to check:
-
for ( i = 0; i  kTermWidth - 1; i += 2 )
{   unsigned long byte1 = (unsigned char) buf[ i ] - 0x21;  /* kuten */
unsigned long byte2 = (unsigned char) buf[ i + 1 ] - 0x21;  /* 
kuten */
if ( byte1 == '\0' )
break;
byte2 += 0x40;
if ( ( byte1  1 ) == 1 )
byte2 += 94;
if ( byte2  0x7e )
++byte2;
byte1 = 1;
byte1 += 0x81;
if ( byte1  0x9f )
byte1 += 0x40;
buf[ i ] = (char) byte1;
buf[ i + 1 ] = (char) byte2;
}
buf[ kTermWidth ] = '\0';   /* training wheels */
-

(Yeah, C comes more natural to me than perl. Especially for this kind of
stuff. So shoot me.) It's missing the escape sequence and end-of-line
handling, among other things, but may be amusing to those interested in
the relationship between 7-bit JIS and shift-JIS.

 
 some other strings are:
 a$EAaD (this is the one I could decode)

Weird. All I can read out of that is kilogram told hits. Or, maybe just
the character hayai (early)?

 $B0T$B0U$B0Gndc

Who's meaningful dark? Or perhaps the saba fish in the crucible?

Anyway, they _look_ sort of like 7-bit JIS, and the two you came up with
for hotaru are, in fact, 7-bit JIS for hotaru.

-- 
Joel Rees, programmer, Kansai Systems Group
Altech Corporation (Alpsgiken), Osaka, Japan
http://www.alpsgiken.co.jp



Re: [OT] Japanese

2003-06-23 Thread Joel Rees
 On Wednesday, Jun 18, 2003, at 20:01 Asia/Tokyo, Joel Rees wrote:
 
  (I'm still trying to decipher what they've done with the file system,
  and still trying to figure out how to get the terminal app to show the
  Japanese names for files. My brother in law has a book that shows a way
  that is supposed to even get it to show shift-JIS file names correctly
  in the terminal app, but I haven't got it to work on my iBook yet.)
 
 See Japanese in OS X 10.2 Terminal (in Japanese)
 http://member.nifty.ne.jp/poseidon/osx2t.html

Good information there. Thanks. I'd tried using escapes to type in file
names, but I did something wrong.

 File names in OS X are encoded in UTF-8 decomposition form. I.e.  
 U+30C0: KATAKANA LETTER DA is represented as 0xE382BF (U+30BF) followed  
 by 0xE38299 (U+3099).

Decomposed. ta+dakuten. Okay, knowing Apple is de-composing the kana
will be useful.

 UTF-8 aware tcsh is available as
 ftp://ftp.tba.org.tohoku.ac.jp/pub/tcsh-6.12-bin.tgz
 
 If you need install instructions in English, please see my posting to  
 another list.
 http://listserv.dartmouth.edu/scripts/ 
 wa.exe?A2=ind0306L=nisusT=0F=S=P=40940
 
  (Of course, this is all off topic unless somebody wants to come up with
  some perl code for trying to undo garbled file names.)
 
 It's not a perl script, but nkf -- Network Kanji code conversion Filter  
 -- is able to guess Japanese encodings and to restore broken JIS-Kanji.  

Well, I knew nkf was good for conversions, but I've never tried using it to
to fix really broken text. ;)

Actually, I was thinking more in terms of some code snippets that could
be useful in recovering text that had suffered serious damage in a
broken conversion pipeline.

 The garbled text in the original poster's message does not seem to be  
 Japanese though.

It looks like it didn't survive e-mailing. It looks like 7bit JIS, but
converting down doesn't produce much that makes sense, as you say. The
conversion that worked for hotaru is bad news, because it tends to
indicate some serious non-deterministic behavior in the broken pipe. 

 nkf is available as a part of jx package -- Japanese  
 aware Unix tools for OS X.
 http://www.fan.gr.jp/~sakai/jx.html
 
 BTW there are free text editors which autodetect Japanese encodings  
 properly in most cases.

 CocoaEditorJ (seems to be discontinued)
 http://cocoedit.hp.infoseek.co.jp/
 http://cocoedit.hp.infoseek.co.jp/CocoaEditorJ.dmg (binary)
 http://cocoedit.hp.infoseek.co.jp/CocoaEditorJSource.dmg (source)
 
 KEdit (syntax colouring for perl, php, html)
 http://www.drycarbon.com/macosx/kedit/
 http://www.drycarbon.com/macosx/archive/kedit010-20030619.zip (binary  
 and source)
 
 Dunno if they would help you in making money though ;-)

Heh.

-- 
Joel Rees, programmer, Kansai Systems Group
Altech Corporation (Alpsgiken), Osaka, Japan
http://www.alpsgiken.co.jp



Re: [CamelBones] and some prob with XML::LibXML

2003-06-23 Thread Yvon Thoraval
Le dimanche, 22 juin 2003, à 17:27 Europe/Paris, Sherm Pendley a écrit :

CamelBones doesn't use /usr/bin/perl - it's linked directly to 
libperl.dylib, and unless it's rebuilt to use another, it uses the one 
found under /System/Library/Perl. Have you built CamelBones against 
your copy of 5.8.0? If not, it's using 5.6.0, so you'll need to use 
modules that have been installed under 5.6.0.

OK, i see what happens.

I have CamelBones 5.8.0 because i  have to embed some perl modules into 
my app (at least XML::LibXML).

my 5.8 version, on this computer, comes from darwinports installed in 
/opt...

Thanks for your answer,

Yvon


DBD::Sybase 1.00

2003-06-23 Thread Don Rainwater
I'm trying to get DBD::Sybase 1.00 to work on 10.2.  It seems to 'make' 
correctly, but 'make test' gives me an error like this for every test:

t/base..dyld: /usr/bin/perl Undefined symbols:
_libiconv
_libiconv_close
_libiconv_open
t/base..dubious
Test returned status 0 (wstat 5, 0x5)
I have libiconv 1.8 installed, but this module can't seem to find it.

Any ideas?
--
Don Rainwater, Technology Manager   [EMAIL PROTECTED]
UCit Educational Services
University of Cincinnati


Panther Preview

2003-06-23 Thread Matthew Diephouse
Has anyone heard what version of perl is included in the Panther 
developer's preview?

	m:att d:iephouse



RE: Panther Preview

2003-06-23 Thread Robert Barris
 Panther is supposed to ship with 5.8.1.  Dunno what is on the WWDC disc
yet.

Rob

-Original Message-
From: Matthew Diephouse
To: [EMAIL PROTECTED]
Sent: 6/23/2003 5:55 PM
Subject: Panther Preview

Has anyone heard what version of perl is included in the Panther 
developer's preview?

m:att d:iephouse


Re: Panther Preview

2003-06-23 Thread Matthew Diephouse
Yay! I can't wait for it to get here. Hopefully by then we'll have some 
results from the fink/gentoo/darwinports alliance too.

	m:att d:iephouse

Robert Barris wrote:

 Panther is supposed to ship with 5.8.1.  Dunno what is on the WWDC disc
yet.
Rob

-Original Message-
From: Matthew Diephouse
To: [EMAIL PROTECTED]
Sent: 6/23/2003 5:55 PM
Subject: Panther Preview
Has anyone heard what version of perl is included in the Panther 
developer's preview?

	m:att d:iephouse