On Jun 4, 2006, at 8:03 AM, JT Moree wrote:
> Agent M wrote:
>> "Google Mini" is just a web crawler/ indexer. You seem to be looking
>> for a general all-around file-system-level content indexer. Is that
>> correct? I am not aware of any that can work in a decentralized
>> fashion
>> for Linux.
>
> what about htdig? it is already in most of the distros. I can't tell
> from your posts how you would use the resulting indexes but we've used
> htdig to index windows shares at client locations and make the index
> available from a web search interface on apache.
>
> take a look at this
> http://www.pcxperience.com/WebGUI/index.pl/casestudyhtdig
In all honesty we are not sure what we need, either. What we do know
is that we have thousands of documents in several different formats
(Word, Excel, PowerPoint, e-mail, PDF, Illustrator, PageMaker, etc.)
and that we often get requests for "the file that had something to do
with {insert topic}." The quick-and-dirty is to use find+strings
+grep. But it would be nice if there was way to index the documents
based on content.
A quick google search turned up the Google mini:
http://www.google.com/search?q=searching+intranet+documents
Thanks for the reference to htdig.
Regards,
- Robert
http://www.cwelug.org/downloads
Help others get OpenSource software. Distribute FLOSS
for Windows, Linux, *BSD, and MacOS X with BitTorrent
_______________________________________________
CWE-LUG mailing list
[email protected]
http://www.cwelug.org/
http://www.cwelug.org/archives/
http://www.cwelug.org/mailinglist/