This really would be up to you to write the parser for your log files
that then generates Solr documents. Another, option, possibly is if
you can make them CSV files, then you could just upload them.
On Oct 6, 2008, at 8:14 AM, Ward, Martin wrote:
Hi again,
Further to my last missive, I am looking at Solr as a quick way of
searching log files previously generated by Postfix. I have tried
using
the basic Lucene command line interface to add in the log files and
try
and retrieve data from them, but I can't retrieve the data I need. I
guess this is because the mail log files are not in XML, nor are they
simple text documents.
Is there a way of configuring Solr to index on specific fields,
delimited by = (or other) characters? In an ideal world I would want
some kind of config file such that every night Solr indexes the
previous
night's log file. Afterwards people could search on the From, To,
Date,
Size, DSN or other mail log entry that exists. Probably one of the
most
important (and so, naturally, the most problematical) fields is the
mail
code, which is a string made up of characters but which is not
readable.
Examples are: 4F7D594A33 and B970C9DB65. Also, different lines in the
log file contain different fields so it wouldn't be as simple as "one
line parser fits all".
Would it be possible to generate import filters for this sort of
data to
make the files searchable?
|\/|artin
*************************************************************************************
The message is intended for the named addressee only and may not be
disclosed to or used by anyone else, nor may it be copied in any way.
The contents of this message and its attachments are confidential
and may also be subject to legal privilege. If you are not the
named addressee and/or have received this message in error, please
advise us by e-mailing [EMAIL PROTECTED] and delete the message and
any attachments without retaining any copies.
Internet communications are not secure and COLT does not accept
responsibility for this message, its contents nor responsibility for
any viruses.
No contracts can be created or varied on behalf of COLT
Telecommunications, its subsidiaries or affiliates ("COLT") and any
other party by email Communications unless expressly agreed in
writing with such other party.
Please note that incoming emails will be automatically scanned to
eliminate potential viruses and unsolicited promotional emails. For
more information refer to www.colt.net or contact us on +44(0)20
7390 3900.
--------------------------
Grant Ingersoll
Lucene Helpful Hints:
http://wiki.apache.org/lucene-java/BasicsOfPerformance
http://wiki.apache.org/lucene-java/LuceneFAQ