On 1/18/07 11:32 AM, "Chuck Pelto" <[EMAIL PROTECTED]> wrote:

> 
> On Jan 18, 2007, at 8:23 AM, Norman Palardy wrote:
> 
>> Applescript can break text into words very nicely
> 
> True. But....
> 
> [1] AppleScript (AS) doesn't run on Windows-based platforms.
> [2] I need something to work inside of an RB app, without dropping
> into AS.
> 
> It's part of an SQL search engine I'm working on. Something to deal
> with multiple words being sought in records of a DB. I'm not quite in
> the depth of Octave's situation. But I WOULD like to see something
> done that would allow us the ease that AS affords for this sort of
> thing.
> 
> Perhaps a form of SPLIT that would not take merely ONE delimiter, but
> would take an array of delimiters that could be set by the programmer
> or end-user.
> 
> Regards,
> 
> Chuck

You could also use regular expressions. It can locate word boundaries with
reasonable accuracy.

Google 'regex word list word boundary' or 'regex word list'

This can also get quite complex. For example:

  * periods aren't accurate word boundaries
  * commas aren't accurate word boundaries
  * hyphens might be accurate word boundaries

You might have to maintain a dictionary/corpus against which to compare for
validity.

Michael

_______________________________________________
Unsubscribe or switch delivery mode:
<http://www.realsoftware.com/support/listmanager/>

Search the archives of this list here:
<http://support.realsoftware.com/listarchives/lists.html>

Reply via email to