I think this might help:

        http://www.perlmonks.org/?node_id=457784

François

On Nov 12, 2011, at 5:57 PM, Robert A. Rosenberg wrote:

> At 21:46 -0800 on 11/11/2011, Sumtingwong wrote about Re: Large text search:
> 
>> > You can do it in BBEdit using a Perl script, but in what form do you
>> > want the results?
>> 
>> John, thanks for your reply.  I have spent my evenings attempting to
>> write a pithy one liner from the command line to do this, but the
>> resolution is just not there with grep.  All of the files to be
>> searched contain paragraphs of text that is soft wrapped (I don't know
>> if that is the correct term, sorry).  I have not written any Perl in
>> over 10 years, time to break out the books!
>> 
>> What is needed? <;-) Wink.gif>   A frequency count of each word in the input 
>> file
>> for each file that was searched.  For example, the first word of the
>> input file is "it".  Document one is searched for "it" and it shows up
>> 248 times.  Optimal output would be (in tabbed columns):
>> it     Document 1     248
>> 
>> I know the output is going to be huge (as the input file is rather
>> large), but that is fine--I just need to get to the analysis part at
>> this point.
>> Cheers!
> 
> I'm sorry that I can not help you with a BBE based solution but I think you 
> might be attempting to use the wrong tool for this project.
> 
> This is the type of project that I feel is better suited to use of a database 
> like mySQL. You read the first file to populate the database. As you read 
> each new file you do the updates. Once you are done you can do a query on 
> each word and it can tell you total occurrences or occurrences per file.
> 
> Note that the actual updating of the database needs a program to access the 
> database while once populated the results can be done with any database 
> access utility that accepts the SQL query.
> 
> .
> 
> -- 
> You received this message because you are subscribed to the 
> "BBEdit Talk" discussion group on Google Groups.
> To post to this group, send email to [email protected]
> To unsubscribe from this group, send email to
> [email protected]
> For more options, visit this group at
> <http://groups.google.com/group/bbedit?hl=en>
> If you have a feature request or would like to report a problem, 
> please email "[email protected]" rather than posting to the group.
> Follow @bbedit on Twitter: <http://www.twitter.com/bbedit>

-- 
You received this message because you are subscribed to the 
"BBEdit Talk" discussion group on Google Groups.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
<http://groups.google.com/group/bbedit?hl=en>
If you have a feature request or would like to report a problem, 
please email "[email protected]" rather than posting to the group.
Follow @bbedit on Twitter: <http://www.twitter.com/bbedit>

Reply via email to