Kelvin Eldridge wrote:
Russell Butler wrote:
Kelvin Eldridge wrote:
<snip>
Hi Kelvin
Thanks for the pointer to the new conversion utility.
I just tried it on my medical list (OOo 2.2 on gentoo Linux)
Unfortunately it produces big blocks of words run together without
spacing, and having to go through to insert spaces between them is
more of a problem than I had just doing a search/replace on a copy of
the file. IIRC there were a series of # between words, and just
replacing them with spaces (and/or carriage returns) gave me a list I
could sort alphabetically.
Also there doesn't seem to be any mechanism to export the list to a
new file, other than highlight, copy and paste.
Thanks anyway for working on it. Now we perhaps need a user friendly
way of editing Dictionary.lst.
Regards
Russell
Hi Russell,
I did notice the standard.dic file for Australian English appears not
to use Unicode characters. Yet if I create my own additional custom
dictionary as you have, that file uses Unicode characters.
From what I could see the utility I wrote worked with standard.dic,
but won't work directly with another custom .dic file like the medical
file.
When I saw this behaviour I took a punt the majority of people may
only use the standard.dic custom dictionary and so the utility may
prove to be useful. If it isn't, all I have done is waste some time.
If it proves useful then a lot more people can more easily contribute
words and that would be a good thing.
Does the utility work with your standard.dic file?
With your custom medical dictionary file I think you will find if you
open the file using MS Word, save the file as a text file, then use
the utility, it may work. It did for the test I did. You may have a
alternative package under Linux you can use. If you want me to try
this under Windows, send me a copy of your .dic medical file and I
will try it.
I used OpenOffice.org Basic and whilst I could be wrong, I don't think
I can get it to process Unicode characters correctly. Always happy to
be proven wrong. OpenOffice.org Basic isn't one of my stronger points.
The other issue to keep in mind is I use Windows and line end
characters can be an issue between operating systems.
With output going to another file, what I did was write a quick and
dirty program (quick and direct in name only, as it did take a fair
amount of time to analyse, write and test even this program). I felt
the copy and paste should be a sufficient compromise. It was only
meant to be an aid, not a professional tool.
The bottom line is I think I may have used the wrong tool to create
the utility. I used OpenOffice.org Basic as I felt it was appropriate
for this list.
Thanks again for the feedback. I will take on board the comments and
revisit this later if there is sufficient need.
Kevin:
I made a couple of changes in your code. You were trying to start at a
certain character (255) to start parsing the text. Because of a number
of other control characters in the file, it would not put the 'crlf' in
the correct places. I had it skip the first ll passes in the loop
before parsing the text. I then changed the 'Select Case' to look for 0
(null),instead of 3,6,8, to insert the 'crlf'. I only had some small
samples to work with for user dictionaries, but it seemed to work okay
with them. Also of note is that the code will give an error if the
number of lines to parse is too great from what I could gather. I am
not that familiar with StarBasic so I do not know what the character
limit is for a textbox.
TomW
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]