On Jan 18, 2007, at 5:30 PM, Michael Rebar wrote:
On 1/18/07 11:32 AM, "Chuck Pelto" <[EMAIL PROTECTED]> wrote:
On Jan 18, 2007, at 8:23 AM, Norman Palardy wrote:
AppleScript can break text into words very nicely.
True. But....
[1] AppleScript (AS) doesn't run on Windows-based platforms.
[2] I need something to work inside of an RB app, without dropping
into AS.
It's part of an SQL search engine I'm working on. Something to deal
with multiple words being sought in records of a DB. I'm not quite in
the depth of Octave's situation. But I WOULD like to see something
done that would allow us the ease that AS affords for this sort of
thing.
Perhaps a form of SPLIT that would not take merely ONE delimiter, but
would take an array of delimiters that could be set by the programmer
or end-user.
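
A minimal sketch of the multi-delimiter SPLIT idea, written in Python purely for illustration (it is not RB code, and the name split_any is hypothetical). It assumes the delimiters arrive as a list of strings and that empty pieces between adjacent delimiters should be dropped:

```python
import re

def split_any(text, delimiters):
    """Split text on any of the given delimiter strings (hypothetical helper)."""
    if not delimiters:
        # No delimiters supplied: treat the whole text as a single piece.
        return [text] if text else []
    # Try longer delimiters first so that "--" is matched before "-".
    ordered = sorted(delimiters, key=len, reverse=True)
    pattern = "|".join(re.escape(d) for d in ordered)
    # Drop the empty strings produced by adjacent delimiters.
    return [piece for piece in re.split(pattern, text) if piece]

print(split_any("red, green;blue  teal", [",", ";", " "]))
```

The delimiter list could come from the programmer or from an end-user preference, since each entry is escaped before it is placed in the pattern.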
Regards,
Chuck
You could also use regular expressions. They can locate word
boundaries with reasonable accuracy.
Google 'regex word list word boundary' or 'regex word list'
This can also get quite complex. For example:
* periods aren't accurate word boundaries
* commas aren't accurate word boundaries
* hyphens might be accurate word boundaries
You might have to maintain a dictionary/corpus against which to
compare for validity.
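
To make the points above concrete, here is a rough sketch in Python (for illustration only; RB's RegEx class may need different syntax). The patterns, the function names, and the tiny vocabulary set are all assumptions of mine, not a recommended rule set. The refined pattern keeps hyphens and apostrophes that sit inside a word, and keeps periods and commas only between digits:

```python
import re

# Naive approach: every run of word characters between \b boundaries.
NAIVE = re.compile(r"\b\w+\b")

# Assumed heuristic: numbers may contain "." or "," between digits;
# other words may contain "-" or "'" between word characters.
REFINED = re.compile(r"\d+(?:[.,]\d+)+|\w+(?:[-']\w+)*")

def naive_words(text):
    return NAIVE.findall(text)

def refined_words(text):
    return REFINED.findall(text)

def known_words(words, vocabulary):
    """Keep only words found in a (hypothetical) vocabulary set."""
    return [w for w in words if w.lower() in vocabulary]

sample = "The well-known cost is 1,000.50 dollars, isn't it?"
print(naive_words(sample))    # splits "well-known", "1,000.50", "isn't" apart
print(refined_words(sample))  # keeps those three as single words
print(known_words(refined_words(sample), {"the", "cost", "well-known"}))
```

Whether a hyphen should join or separate words depends on the data, which is where the dictionary comparison comes in.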
Michael
Interesting report. I'll have to look into this more deeply, as I'm
not familiar with RegEx. But seeing that it is built into RB and
looks rather powerful, it might serve my purposes....if I can warp my
mind around it [transposition fully intended ;.-) ]