Re: Solr configuration to enable indexing/searching webapp log files

2010-04-29 Thread Lance Norskog
It sounds like you want a data warehouse, not a text search engine. Splunk and Pentaho are good things to try.
On Thu, Apr 29, 2010 at 12:03 PM, Jon Baer wrote:
> To follow up it ... it seems dumping to Solr is common ...
>
> http://highscalability.com/how-rackspace-now-uses-mapreduce-and-hadoop-

Re: Solr configuration to enable indexing/searching webapp log files

2010-04-29 Thread Jon Baer
To follow up it ... it seems dumping to Solr is common ...
http://highscalability.com/how-rackspace-now-uses-mapreduce-and-hadoop-query-terabytes-data
- Jon
On Apr 29, 2010, at 1:58 PM, Jon Baer wrote:
> Good question, +1 on finding answer, my take ...
>
> Depending on how large of log files y

Re: Solr configuration to enable indexing/searching webapp log files

2010-04-29 Thread Jon Baer
Good question, +1 on finding an answer. My take ... Depending on how large the log files you are talking about are, it might be better to do this with HDFS / Hadoop (and a scripting language like Pig), or with Amazon EMR: http://developer.amazonwebservices.com/connect/entry.jspa?externalID=873 Theoretically y

Solr configuration to enable indexing/searching webapp log files

2010-04-29 Thread Stefan Maric
I thought I remembered seeing some information about this, but have been unable to find it. Does anyone know if there is a configuration / module that would allow us to set up Solr to take in the (large) log files generated by our web/app servers, so that we can query for things like peak time requ