Andy Pepperdine wrote:
On Sunday 23 April 2006 20:14, Thomas Ronayne wrote:
I have a document that uses "smart quotes." I have a concordance file
(plain text) that I cannot force to also use smart quotes, so most of
what I want in an index is not found; e.g., O'Neil, Barry's, etc.
"Smart quotes" is just a way of saying that simple ASCII quotes are replaced
by suitable ones from the range U+2018 to U+201E in the General Punctuation
block of Unicode, according to some algorithm which usually does the
right thing.
Yep, knew that.
I've tried every way I can think of to get the index built with all the
entries in the concordance file.
I don't know what you have tried, but as you are sending from a Linux machine,
I'll assume that is what you have. Here are some possibilities.
What I've tried includes editing the concordance file and making sure
that the single quotes are all code U+2019 (so desired index words like
"Burke's" and "D'Laughtane" are properly coded). Did that with vi, nawk,
tr, sed, and a few other things out of desperation.
1. Write a filter to translate the quotes in your simple text file before
importing it into OOo. You could use perl, awk, sed, or, best of all, tr if
your tr is UTF-8 enabled. Then import the resulting file as a UTF-8 text file.
Yep, did that too.
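
For reference, the filter from option 1 could look something like the
sketch below. It is only an illustration, not what either of us actually
ran: the function names smarten and convert_file are made up here, the
file is assumed to be UTF-8, and only the ASCII apostrophe is mapped (to
U+2019), since choosing between opening and closing double quotes would
need context.

```python
import sys

# Assumption: every ASCII apostrophe in the concordance file should become
# RIGHT SINGLE QUOTATION MARK (U+2019), as in O'Neil or Barry's.
QUOTE_MAP = str.maketrans({"'": "\u2019"})


def smarten(text: str) -> str:
    """Replace ASCII apostrophes with U+2019; leave everything else alone."""
    return text.translate(QUOTE_MAP)


def convert_file(src_path: str, dst_path: str) -> None:
    """Read src_path as UTF-8, translate the quotes, write dst_path as UTF-8."""
    with open(src_path, encoding="utf-8") as src:
        data = src.read()
    with open(dst_path, "w", encoding="utf-8") as dst:
        dst.write(smarten(data))


if __name__ == "__main__":
    # Hypothetical usage: python smarten.py plain.txt converted.txt
    if len(sys.argv) == 3:
        convert_file(sys.argv[1], sys.argv[2])
```

Reading and writing with an explicit encoding avoids depending on the
locale, which is one of the things that can go wrong with tr.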
2. Import your text file into OOo and do a find/replace, replacing the simple
quotes with an appropriate quote from the General Punctuation range. You can
copy/paste from the document into the replace field if you don't know the
character you want.
Yep, did that one too.
Other than going back to "dumb quotes" in my document, is there any way
to prepare a concordance file that OpenOffice 2.0.2 will be able to use?
Thing is, I have 1,913 index entries -- believe me, I don't want to
build the bloody damn thing manually. I also assume that I don't know
what I'm doing (which is not unusual), and there must be some magic
trick I've missed somewhere or other -- I've tried "escaping" the
characters in the concordance file (nope, no joy there).
My concordance file has entries like this:
Barry’s Great Island;Barry’s Great Island;;;0;0
Barry’s Great Island;Barry’s Great Island;;;0;0
D’Laughtane;D’Laughtane;;;0;0
As far as I can see, those single quotes are U+2019 and at this point I
am at a complete loss about what to do to get OOo to deal with them and
build the index entries.
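
One way to confirm which code points are really in the file, rather than
judging by eye, is a small check like the sketch below. It assumes the
file is read as UTF-8; the name quote_report and the set of "quote-like"
characters are choices made for this illustration. It only shows what is
in the file and says nothing about how OOo matches the entries.

```python
import unicodedata

# Characters that can look like an apostrophe on screen (an illustrative,
# not exhaustive, selection).
QUOTE_LIKE = {"'", "`", "\u00b4", "\u2018", "\u2019", "\u02bc"}


def quote_report(line: str):
    """Return (column, code point, Unicode name) for each quote-like character."""
    return [
        (col, f"U+{ord(ch):04X}", unicodedata.name(ch, "UNKNOWN"))
        for col, ch in enumerate(line, start=1)
        if ch in QUOTE_LIKE
    ]


def report_file(path: str) -> None:
    """Print the quote-like characters found on each line of a UTF-8 file."""
    with open(path, encoding="utf-8") as handle:
        for lineno, line in enumerate(handle, start=1):
            for col, code, name in quote_report(line):
                print(f"line {lineno}, col {col}: {code} {name}")
```

Running report_file on both the concordance file and a plain-text export
of the document would show whether the two really use the same character.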
Any further insight would be most appreciated.