Re: adding a TimerTask
Seems like one way is to write a servlet who's init method creates a TimerTask. From: Tri Nguyen tringuye...@yahoo.com To: solr user solr-user@lucene.apache.org Sent: Fri, February 18, 2011 6:02:44 PM Subject: adding a TimerTask Hi, How can I add a TimerTask to Solr? Tri
Re: slave out of sync
there is an http api where I can look at the latest replication and whether there is an ERROR keyword. If so, the latest replication failed. From: Otis Gospodnetic otis_gospodne...@yahoo.com To: solr-user@lucene.apache.org Sent: Wed, February 16, 2011 11:31:26 AM Subject: Re: slave out of sync Hi Tri, You could look at the stats page for each slave and compare the number of docs in them. The one(s) that are off from the rest/majority are out of sync. Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ - Original Message From: Tri Nguyen tringuye...@yahoo.com To: solr-user@lucene.apache.org Sent: Mon, February 14, 2011 7:19:58 PM Subject: slave out of sync Hi, We're thinking of having a master-slave configuration where there are multiple slaves. Let's say during replication, one of the slaves does not replicate properly. How will we dectect that the 1 slave is out of sync? Tri
Re: Removing duplicates
I know that I can use the SignatureUpdateProcessorFactory to remove duplicates but I would like the duplicates in the index but remove them conditionally at query time. Is there any easy way I could accomplish this? Closest thing can be group documents by signature field. http://wiki.apache.org/solr/FieldCollapsing
Problems with ant test : No tests found
Hi List! I was wondering, after downloading the subversion source with svn co https://svn.apache.org/repos/asf/lucene/dev/trunk solr_svn According to the book I am reading I should be able to run the tests succesfully, so I did a /var/solr_svn# ant test And it seems that junit-parallel gives _only_ errors, not because tests it self fail but tests aren't found. What did I do wrong? junit-parallel: [junit] Testsuite: org.apache.lucene.TestDemo [junit] Testcase: warning(junit.framework.TestSuite$1):FAILED [junit] No tests found in org.apache.lucene.TestDemo [junit] junit.framework.AssertionFailedError: No tests found in org.apache.lucene.TestDemo [junit] Tests run: 1, Failures: 1, Errors: 0, Time elapsed: 0,475 sec [junit] TEST org.apache.lucene.TestDemo FAILED [junit] Testsuite: org.apache.lucene.TestSearchForDuplicates [junit] Testcase: warning(junit.framework.TestSuite$1):FAILED [junit] No tests found in org.apache.lucene.TestSearchForDuplicates [junit] junit.framework.AssertionFailedError: No tests found in org.apache.lucene.TestSearchForDuplicates [junit] Tests run: 1, Failures: 1, Errors: 0, Time elapsed: 0,012 sec [junit] TEST org.apache.lucene.TestSearchForDuplicates FAILED [junit] Testsuite: org.apache.lucene.document.TestDocument [junit] Testcase: warning(junit.framework.TestSuite$1):FAILED [junit] No tests found in org.apache.lucene.document.TestDocument [junit] junit.framework.AssertionFailedError: No tests found in org.apache.lucene.document.TestDocument [junit] Tests run: 1, Failures: 1, Errors: 0, Time elapsed: 0,008 sec [junit] TEST org.apache.lucene.document.TestDocument FAILED [junit] Testsuite: org.apache.lucene.index.TestByteSlices [junit] Testcase: warning(junit.framework.TestSuite$1):FAILED [junit] No tests found in org.apache.lucene.index.TestByteSlices [junit] junit.framework.AssertionFailedError: No tests found in org.apache.lucene.index.TestByteSlices [junit] Tests run: 1, Failures: 1, Errors: 0, Time elapsed: 0,008 sec [junit] TEST org.apache.lucene.index.TestByteSlices FAILED [junit] Testsuite: org.apache.lucene.index.TestDeletionPolicy [junit] Testcase: warning(junit.framework.TestSuite$1):FAILED [junit] No tests found in org.apache.lucene.index.TestDeletionPolicy [junit] junit.framework.AssertionFailedError: No tests found in org.apache.lucene.index.TestDeletionPolicy [junit] Tests run: 1, Failures: 1, Errors: 0, Time elapsed: 0,01 sec [junit] TEST org.apache.lucene.index.TestDeletionPolicy FAILED [junit] Testsuite: org.apache.lucene.index.TestDoc [junit] Testcase: warning(junit.framework.TestSuite$1):FAILED [junit] No tests found in org.apache.lucene.index.TestDoc [junit] junit.framework.AssertionFailedError: No tests found in org.apache.lucene.index.TestDoc [junit] Tests run: 1, Failures: 1, Errors: 0, Time elapsed: 0,007 sec [junit] TEST org.apache.lucene.index.TestDoc FAILED [junit] Testsuite: org.apache.lucene.index.TestIndexWriterDelete [junit] Testcase: warning(junit.framework.TestSuite$1):FAILED [junit] No tests found in org.apache.lucene.index.TestIndexWriterDelete [junit] junit.framework.AssertionFailedError: No tests found in org.apache.lucene.index.TestIndexWriterDelete [junit] Tests run: 1, Failures: 1, Errors: 0, Time elapsed: 0,007 sec [junit] TEST org.apache.lucene.index.TestIndexWriterDelete FAILED [junit] Testsuite: org.apache.lucene.index.TestMultiFields [junit] Testcase: warning(junit.framework.TestSuite$1):FAILED [junit] No tests found in org.apache.lucene.index.TestMultiFields [junit] junit.framework.AssertionFailedError: No tests found in org.apache.lucene.index.TestMultiFields [junit] Tests run: 1, Failures: 1, Errors: 0, Time elapsed: 0,008 sec [junit] TEST org.apache.lucene.index.TestMultiFields FAILED [junit] Testsuite: org.apache.lucene.index.TestNoMergeScheduler [junit] Testcase: warning(junit.framework.TestSuite$1):FAILED [junit] No tests found in org.apache.lucene.index.TestNoMergeScheduler [junit] junit.framework.AssertionFailedError: No tests found in org.apache.lucene.index.TestNoMergeScheduler [junit] Tests run: 1, Failures: 1, Errors: 0, Time elapsed: 0,007 sec [junit] TEST org.apache.lucene.index.TestNoMergeScheduler FAILED [junit] Testsuite: org.apache.lucene.index.TestSegmentTermDocs [junit] Testcase: warning(junit.framework.TestSuite$1):FAILED [junit] No tests found in org.apache.lucene.index.TestSegmentTermDocs [junit] junit.framework.AssertionFailedError: No tests found in org.apache.lucene.index.TestSegmentTermDocs [junit] Tests run: 1, Failures: 1, Errors: 0, Time elapsed: 0,012 sec [junit] TEST org.apache.lucene.index.TestSegmentTermDocs FAILED [junit] Testsuite: org.apache.lucene.TestAssertions [junit] Testcase:
Field Collapsing on 1.4.1
Is there a seamless field collapsing patch for 1.4.1? I see it has been merged into trunk but I tried downloading it to give it a whirl but it appears that many things have changed and our application would need some considerable work to get it up an running. Thanks
Solr 4.0 trunk in production
Would I be crazy even to consider putting this in production? Thanks
Re: Solr 4.0 trunk in production
I use it in a production setting, but I don't have a very large data set or a very heavy query load, the reason I use it is for edismax. François On Feb 19, 2011, at 9:50 AM, Mark wrote: Would I be crazy even to consider putting this in production? Thanks
Re: Best way for a query-expander?
Hello list, as Hoss suggests, I'll try to be more detailed. I wish to use http parameters in my requests that define the precise semantic of an advanced search. For example, if I can see from sessions, that a given user is requesting, that not only public resources but resources private-to-him are returned. For example, if there's a parameter ict, I want to expand the query with an extra (mandatory) term-query. I know I could probably do this at the client level but I do not think this is the best way, in particular about the access to private resources... I also think it's better to not rely too heavily on client's ability to formula string-queries since it allows all sorts of tweaking that one may not wish possible, in particular for queries that are service oriented. paul Le 19 févr. 2011 à 01:18, Chris Hostetter a écrit : : I want to implement a query-expander, one that enriches the input by the : usage of extra parameters that, for example, a form may provide. : : Is the right way to subclass SearchHandler? : Or rather to subclass QueryComponent? This smells like the poster child for an X/Y problem (or maybe an X/(Y OR Z) problem)... if you can elaborate a bit more on the type of enrichment you want to do, it's highly likely that your goal can be met w/o needing to write a custom plugin (i'm thinking particularly of the multitudes of parsers solr already has, local params, and variable substitution) http://people.apache.org/~hossman/#xyproblem XY Problem Your question appears to be an XY Problem ... that is: you are dealing with X, you are assuming Y will help you, and you are asking about Y without giving more details about the X so that we can understand the full issue. Perhaps the best solution doesn't involve Y at all? See Also: http://www.perlmonks.org/index.pl?node_id=542341 -Hoss
embedded solr and tomcat
I'm considering running an embedded instance of Solr in Tomcat (Amazon's beanstalk). Has anyone done this before? I'd be very interested in how I can instantiate Embedded solr in Tomcat. Do I need a resource loader to instantiate? If so, how? Thanks, Matt
Re: Solr 4.0 trunk in production
We use it in production, but the # of docs is only 2.5M. 2011/2/19 François Schiettecatte fschietteca...@gmail.com: I use it in a production setting, but I don't have a very large data set or a very heavy query load, the reason I use it is for edismax. François On Feb 19, 2011, at 9:50 AM, Mark wrote: Would I be crazy even to consider putting this in production? Thanks
[Solr] and CouchDB
I am curious to see if anyone has messed around with Solr and the Couch-Lucene incarnation that is out there...I was passed this article this morning and it really opened my eyes about CouchDB http://m.readwriteweb.com/hack/2011/02/hacker-chat-max-ogden.php Thoughts, Adam
Re: Indexing AutoCAD files
Hi Vignesh, I believe that you would have to incorporate GDAL in to Tika in order to read the file and extract the proper metadata. This is entirely doable but I don't know how to do it. There are companies out there that specialize in this sort of thing so hopefully, one of them has already contacted you outside of this list but I would love to see some detailed instruction on how to integrate GDAL in to Tika. Best of luck, Adam On Sat, Feb 19, 2011 at 12:31 AM, Vignesh Raj vignesh...@greatminds.co.in wrote: Hi team, Is there a way lucene can index AutoCAD files - *.dwg files? If so, please let me know. Can you please provide some insight on the same? Thanks in advance.. Regards Vignesh
Re: Indexing AutoCAD files
Hi Vignesh, I believe that you would have to incorporate GDAL in to Tika in order to read the file and extract the proper metadata. This is entirely doable but I don't know how to do it. There are companies out there that specialize in this sort of thing so hopefully, one of them has already contacted you outside of this list but I would love to see some detailed instruction on how to integrate GDAL in to Tika. Best of luck, Adam On Sat, Feb 19, 2011 at 12:31 AM, Vignesh Raj vignesh...@greatminds.co.in wrote: Hi team, Is there a way lucene can index AutoCAD files - *.dwg files? If so, please let me know. Can you please provide some insight on the same? Thanks in advance.. Regards Vignesh
Re: [Solr] and CouchDB
CouchDB is a good piece of software for some scenario's and easy to use. It has update handlers to which you could attach a small program that takes the input, transforms it to Solr XML and send it over. CouchDB lucene is a bit different. It lacks the power of Solr but allows and you need to write a custom view to send data over. But, it works and brings full-text search to CouchDB. I am curious to see if anyone has messed around with Solr and the Couch-Lucene incarnation that is out there...I was passed this article this morning and it really opened my eyes about CouchDB http://m.readwriteweb.com/hack/2011/02/hacker-chat-max-ogden.php Thoughts, Adam
Solr Ajax
Hello list, I'm in the process of trying to implement Ajax within my Solr-backed webapp I have been reading both the Solrj wiki as well as the tutorial provided via the google group and various info from the wiki page https://github.com/evolvingweb/ajax-solr/wiki I have all solrj jar libraries available in my webapp /lib but I am unsure as to what steps I take to configure the Solrj client. What do I need to configure to begin working with Solrj? I am unsure as to where to go and finding information on the wiki seems to be a non trivial task. Any help would be great. Thanks Lewis Glasgow Caledonian University is a registered Scottish charity, number SC021474 Winner: Times Higher Education’s Widening Participation Initiative of the Year 2009 and Herald Society’s Education Initiative of the Year 2009. http://www.gcu.ac.uk/newsevents/news/bycategory/theuniversity/1/name,6219,en.html Winner: Times Higher Education’s Outstanding Support for Early Career Researchers of the Year 2010, GCU as a lead with Universities Scotland partners. http://www.gcu.ac.uk/newsevents/news/bycategory/theuniversity/1/name,15691,en.html
Delete doc in index by date range?
Hi, I'm wondering if it's possible to delete documents in m'y index by date range? I've got a field in my schema: indexed_date in date type and i would like to remove docs older than 90 days. Thanks for your help Marc
Re: Delete doc in index by date range?
Sure http://wiki.apache.org/solr/UpdateXmlMessages#A.22delete.22_by_ID_and_by_Query Hi, I'm wondering if it's possible to delete documents in m'y index by date range? I've got a field in my schema: indexed_date in date type and i would like to remove docs older than 90 days. Thanks for your help Marc
Re: question: havnig multiple solrCloud configuration on the same machine
Hi, I'm following your suggestions. Extract of your last step: This would give you three different configurations - you would then edit the zookeeper info to point each collection (essentially a SolrCore at this point) to the right configuration files: collections/collection1 config=conf1 collections/collection2 config=conf2 collections/collection3 config=conf3 How do you manage to set this? Do you need to modify solr.xml to set this stuff? Stijn Vanhoorelbeke, -- View this message in context: http://lucene.472066.n3.nabble.com/question-havnig-multiple-solrCloud-configuration-on-the-same-machine-tp1165790p2536210.html Sent from the Solr - User mailing list archive at Nabble.com.
RE: Best way for a query-expander?
I don't think there's any way to do this in Solr, although you could write your own query parser in Java if you wanted to. You can set defaults , invariants and appends values on your request handler, but I don't think that's flexible enough to do what you want. http://wiki.apache.org/solr/SearchHandler In general, to my perspective, Solr seems to be written assuming a trusted client. If you are allowing access to untrusted clients, there are probably all sorts of things a client can do that you woudln't want them to, writing your own query parser might be a good idea. From: Paul Libbrecht [p...@hoplahup.net] Sent: Saturday, February 19, 2011 11:01 AM To: solr-user@lucene.apache.org Subject: Re: Best way for a query-expander? Hello list, as Hoss suggests, I'll try to be more detailed. I wish to use http parameters in my requests that define the precise semantic of an advanced search. For example, if I can see from sessions, that a given user is requesting, that not only public resources but resources private-to-him are returned. For example, if there's a parameter ict, I want to expand the query with an extra (mandatory) term-query. I know I could probably do this at the client level but I do not think this is the best way, in particular about the access to private resources... I also think it's better to not rely too heavily on client's ability to formula string-queries since it allows all sorts of tweaking that one may not wish possible, in particular for queries that are service oriented. paul Le 19 févr. 2011 à 01:18, Chris Hostetter a écrit : : I want to implement a query-expander, one that enriches the input by the : usage of extra parameters that, for example, a form may provide. : : Is the right way to subclass SearchHandler? : Or rather to subclass QueryComponent? This smells like the poster child for an X/Y problem (or maybe an X/(Y OR Z) problem)... if you can elaborate a bit more on the type of enrichment you want to do, it's highly likely that your goal can be met w/o needing to write a custom plugin (i'm thinking particularly of the multitudes of parsers solr already has, local params, and variable substitution) http://people.apache.org/~hossman/#xyproblem XY Problem Your question appears to be an XY Problem ... that is: you are dealing with X, you are assuming Y will help you, and you are asking about Y without giving more details about the X so that we can understand the full issue. Perhaps the best solution doesn't involve Y at all? See Also: http://www.perlmonks.org/index.pl?node_id=542341 -Hoss
Re: Index Autocad
I think you may have already posted this same question but please check VoyagerGIS out. They have some shit-hot software that is geared specifically towards the archive and retrieval of geospatial data. I suggest that you check it out!!! w/r, Adam On Sat, Feb 19, 2011 at 2:33 AM, lucene lucene luc...@greatminds.co.in wrote: Hi team, Is there a way lucene can index AutoCAD files – “*.dwg” files? If so, please let me know. Can you please provide some insight on the same? Thanks in advance.. Regards Vignesh