On 1/18/07 11:32 AM, "Chuck Pelto" <[EMAIL PROTECTED]> wrote:
> > On Jan 18, 2007, at 8:23 AM, Norman Palardy wrote: > >> Applescript can break text into words very nicely > > True. But.... > > [1] AppleScript (AS) doesn't run on Windows-based platforms. > [2] I need something to work inside of an RB app, without dropping > into AS. > > It's part of an SQL search engine I'm working on. Something to deal > with multiple words being sought in records of a DB. I'm not quite in > the depth of Octave's situation. But I WOULD like to see something > done that would allow us the ease that AS affords for this sort of > thing. > > Perhaps a form of SPLIT that would not take merely ONE delimiter, but > would take an array of delimiters that could be set by the programmer > or end-user. > > Regards, > > Chuck You could also use regular expressions. It can locate word boundaries with reasonable accuracy. Google 'regex word list word boundary' or 'regex word list' This can also get quite complex. For example: * periods aren't accurate word boundaries * commas aren't accurate word boundaries * hyphens might be accurate word boundaries You might have to maintain a dictionary/corpus against which to compare for validity. Michael _______________________________________________ Unsubscribe or switch delivery mode: <http://www.realsoftware.com/support/listmanager/> Search the archives of this list here: <http://support.realsoftware.com/listarchives/lists.html>
