Re: adding a TimerTask

2011-02-19 Thread Tri Nguyen
Seems like one way is to write a servlet who's init method creates a TimerTask.





From: Tri Nguyen tringuye...@yahoo.com
To: solr user solr-user@lucene.apache.org
Sent: Fri, February 18, 2011 6:02:44 PM
Subject: adding a TimerTask

Hi,

How can I add a TimerTask to Solr?

Tri

Re: slave out of sync

2011-02-19 Thread Tri Nguyen
there is an http api where I can look at the latest replication and whether 
there is an ERROR keyword.  If so, the latest replication failed.





From: Otis Gospodnetic otis_gospodne...@yahoo.com
To: solr-user@lucene.apache.org
Sent: Wed, February 16, 2011 11:31:26 AM
Subject: Re: slave out of sync

Hi Tri,

You could look at the stats page for each slave and compare the number of docs 
in them.  The one(s) that are off from the rest/majority are out of sync.

Otis

Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
Lucene ecosystem search :: http://search-lucene.com/



- Original Message 
 From: Tri Nguyen tringuye...@yahoo.com
 To: solr-user@lucene.apache.org
 Sent: Mon, February 14, 2011 7:19:58 PM
 Subject: slave out of sync
 
 Hi,
 
 We're thinking of having a master-slave configuration where there are  
 multiple 


 slaves.  Let's say during replication, one of the slaves does not  replicate 
 properly.
 
 How will we dectect that the 1 slave is out of  sync?
 
 Tri


Re: Removing duplicates

2011-02-19 Thread Ahmet Arslan
 I know that I can use the
 SignatureUpdateProcessorFactory to remove duplicates but I
 would like the duplicates in the index but remove them
 conditionally at query time.
 
 Is there any easy way I could accomplish this?


Closest thing can be group documents by signature field.
http://wiki.apache.org/solr/FieldCollapsing


  


Problems with ant test : No tests found

2011-02-19 Thread Jurgen
Hi List!

I was wondering, after downloading the subversion source with

svn co https://svn.apache.org/repos/asf/lucene/dev/trunk solr_svn

According to the book I am reading I should be able to run the tests
succesfully, so I did a

/var/solr_svn# ant test

And it seems that junit-parallel gives _only_ errors, not because tests it
self fail but tests aren't found.
What did I do wrong?

junit-parallel:
[junit] Testsuite: org.apache.lucene.TestDemo
[junit] Testcase: warning(junit.framework.TestSuite$1):FAILED
[junit] No tests found in org.apache.lucene.TestDemo
[junit] junit.framework.AssertionFailedError: No tests found in
org.apache.lucene.TestDemo


[junit] Tests run: 1, Failures: 1, Errors: 0, Time elapsed: 0,475 sec

[junit] TEST org.apache.lucene.TestDemo FAILED
[junit] Testsuite: org.apache.lucene.TestSearchForDuplicates
[junit] Testcase: warning(junit.framework.TestSuite$1):FAILED
[junit] No tests found in org.apache.lucene.TestSearchForDuplicates
[junit] junit.framework.AssertionFailedError: No tests found in
org.apache.lucene.TestSearchForDuplicates


[junit] Tests run: 1, Failures: 1, Errors: 0, Time elapsed: 0,012 sec

[junit] TEST org.apache.lucene.TestSearchForDuplicates FAILED
[junit] Testsuite: org.apache.lucene.document.TestDocument
[junit] Testcase: warning(junit.framework.TestSuite$1):FAILED
[junit] No tests found in org.apache.lucene.document.TestDocument
[junit] junit.framework.AssertionFailedError: No tests found in
org.apache.lucene.document.TestDocument


[junit] Tests run: 1, Failures: 1, Errors: 0, Time elapsed: 0,008 sec

[junit] TEST org.apache.lucene.document.TestDocument FAILED
[junit] Testsuite: org.apache.lucene.index.TestByteSlices
[junit] Testcase: warning(junit.framework.TestSuite$1):FAILED
[junit] No tests found in org.apache.lucene.index.TestByteSlices
[junit] junit.framework.AssertionFailedError: No tests found in
org.apache.lucene.index.TestByteSlices


[junit] Tests run: 1, Failures: 1, Errors: 0, Time elapsed: 0,008 sec

[junit] TEST org.apache.lucene.index.TestByteSlices FAILED
[junit] Testsuite: org.apache.lucene.index.TestDeletionPolicy
[junit] Testcase: warning(junit.framework.TestSuite$1):FAILED
[junit] No tests found in org.apache.lucene.index.TestDeletionPolicy
[junit] junit.framework.AssertionFailedError: No tests found in
org.apache.lucene.index.TestDeletionPolicy


[junit] Tests run: 1, Failures: 1, Errors: 0, Time elapsed: 0,01 sec

[junit] TEST org.apache.lucene.index.TestDeletionPolicy FAILED
[junit] Testsuite: org.apache.lucene.index.TestDoc
[junit] Testcase: warning(junit.framework.TestSuite$1):FAILED
[junit] No tests found in org.apache.lucene.index.TestDoc
[junit] junit.framework.AssertionFailedError: No tests found in
org.apache.lucene.index.TestDoc


[junit] Tests run: 1, Failures: 1, Errors: 0, Time elapsed: 0,007 sec

[junit] TEST org.apache.lucene.index.TestDoc FAILED
[junit] Testsuite: org.apache.lucene.index.TestIndexWriterDelete
[junit] Testcase: warning(junit.framework.TestSuite$1):FAILED
[junit] No tests found in org.apache.lucene.index.TestIndexWriterDelete
[junit] junit.framework.AssertionFailedError: No tests found in
org.apache.lucene.index.TestIndexWriterDelete


[junit] Tests run: 1, Failures: 1, Errors: 0, Time elapsed: 0,007 sec

[junit] TEST org.apache.lucene.index.TestIndexWriterDelete FAILED
[junit] Testsuite: org.apache.lucene.index.TestMultiFields
[junit] Testcase: warning(junit.framework.TestSuite$1):FAILED
[junit] No tests found in org.apache.lucene.index.TestMultiFields
[junit] junit.framework.AssertionFailedError: No tests found in
org.apache.lucene.index.TestMultiFields


[junit] Tests run: 1, Failures: 1, Errors: 0, Time elapsed: 0,008 sec

[junit] TEST org.apache.lucene.index.TestMultiFields FAILED
[junit] Testsuite: org.apache.lucene.index.TestNoMergeScheduler
[junit] Testcase: warning(junit.framework.TestSuite$1):FAILED
[junit] No tests found in org.apache.lucene.index.TestNoMergeScheduler
[junit] junit.framework.AssertionFailedError: No tests found in
org.apache.lucene.index.TestNoMergeScheduler


[junit] Tests run: 1, Failures: 1, Errors: 0, Time elapsed: 0,007 sec

[junit] TEST org.apache.lucene.index.TestNoMergeScheduler FAILED
[junit] Testsuite: org.apache.lucene.index.TestSegmentTermDocs
[junit] Testcase: warning(junit.framework.TestSuite$1):FAILED
[junit] No tests found in org.apache.lucene.index.TestSegmentTermDocs
[junit] junit.framework.AssertionFailedError: No tests found in
org.apache.lucene.index.TestSegmentTermDocs


[junit] Tests run: 1, Failures: 1, Errors: 0, Time elapsed: 0,012 sec

[junit] TEST org.apache.lucene.index.TestSegmentTermDocs FAILED
[junit] Testsuite: org.apache.lucene.TestAssertions
[junit] Testcase: 

Field Collapsing on 1.4.1

2011-02-19 Thread Mark

Is there a seamless field collapsing patch for 1.4.1?

I see it has been merged into trunk but I tried downloading it to give 
it a whirl but it appears that many things have changed and our 
application would need some considerable work to get it up an running.


Thanks


Solr 4.0 trunk in production

2011-02-19 Thread Mark

Would I be crazy even to consider putting this in production? Thanks


Re: Solr 4.0 trunk in production

2011-02-19 Thread François Schiettecatte
I use it in a production setting, but I don't have a very large data set or a 
very heavy query load, the reason I use it is for edismax.

François

On Feb 19, 2011, at 9:50 AM, Mark wrote:

 Would I be crazy even to consider putting this in production? Thanks



Re: Best way for a query-expander?

2011-02-19 Thread Paul Libbrecht
Hello list,

as Hoss suggests, I'll try to be more detailed.

I wish to use http parameters in my requests that define the precise semantic 
of an advanced search.
For example, if I can see from sessions, that a given user is requesting, that 
not only public resources but resources private-to-him are returned.
For example, if there's a parameter ict, I want to expand the query with an 
extra (mandatory) term-query.

I know I could probably do this at the client level but I do not think this is 
the best way, in particular about the access to private resources... I also 
think it's better to not rely too heavily on client's ability to formula 
string-queries since it allows all sorts of tweaking that one may not wish 
possible, in particular for queries that are service oriented.

paul


Le 19 févr. 2011 à 01:18, Chris Hostetter a écrit :

 
 : I want to implement a query-expander, one that enriches the input by the 
 : usage of extra parameters that, for example, a form may provide.
 : 
 : Is the right way to subclass SearchHandler?
 : Or rather to subclass QueryComponent?
 
 This smells like the poster child for an X/Y problem 
 (or maybe an X/(Y OR Z) problem)...
 
 if you can elaborate a bit more on the type of enrichment you want to do, 
 it's highly likely that your goal can be met w/o needing to write a custom 
 plugin (i'm thinking particularly of the multitudes of parsers solr 
 already has, local params, and variable substitution)
 
 http://people.apache.org/~hossman/#xyproblem
 XY Problem
 
 Your question appears to be an XY Problem ... that is: you are dealing
 with X, you are assuming Y will help you, and you are asking about Y
 without giving more details about the X so that we can understand the
 full issue.  Perhaps the best solution doesn't involve Y at all?
 See Also: http://www.perlmonks.org/index.pl?node_id=542341
 
 
 -Hoss



embedded solr and tomcat

2011-02-19 Thread Matt Mitchell
I'm considering running an embedded instance of Solr in Tomcat (Amazon's
beanstalk). Has anyone done this before? I'd be very interested in how I can
instantiate Embedded solr in Tomcat. Do I need a resource loader to
instantiate? If so, how?

Thanks,
Matt


Re: Solr 4.0 trunk in production

2011-02-19 Thread William Bell
We use it in production, but the # of docs is only 2.5M.



2011/2/19 François Schiettecatte fschietteca...@gmail.com:
 I use it in a production setting, but I don't have a very large data set or a 
 very heavy query load, the reason I use it is for edismax.

 François

 On Feb 19, 2011, at 9:50 AM, Mark wrote:

 Would I be crazy even to consider putting this in production? Thanks




[Solr] and CouchDB

2011-02-19 Thread Adam Estrada
I am curious to see if anyone has messed around with Solr and the
Couch-Lucene incarnation that is out there...I was passed this article
this morning and it really opened my eyes about CouchDB
http://m.readwriteweb.com/hack/2011/02/hacker-chat-max-ogden.php

Thoughts,
Adam


Re: Indexing AutoCAD files

2011-02-19 Thread Adam Estrada
Hi Vignesh,

I believe that you would have to incorporate GDAL in to Tika in order
to read the file and extract the proper metadata. This is entirely
doable but I don't know how to do it. There are companies out there
that specialize in this sort of thing so hopefully, one of them has
already contacted you outside of this list but I would love to see
some detailed instruction on how to integrate GDAL in to Tika.

Best of luck,
Adam

On Sat, Feb 19, 2011 at 12:31 AM, Vignesh Raj
vignesh...@greatminds.co.in wrote:
 Hi team,

 Is there a way lucene can index AutoCAD files - *.dwg files?

 If so, please let me know.

 Can you please provide some insight on the same?



 Thanks in advance..



 Regards

 Vignesh




Re: Indexing AutoCAD files

2011-02-19 Thread Adam Estrada
Hi Vignesh,

I believe that you would have to incorporate GDAL in to Tika in order
to read the file and extract the proper metadata. This is entirely
doable but I don't know how to do it. There are companies out there
that specialize in this sort of thing so hopefully, one of them has
already contacted you outside of this list but I would love to see
some detailed instruction on how to integrate GDAL in to Tika.

Best of luck,
Adam

On Sat, Feb 19, 2011 at 12:31 AM, Vignesh Raj
vignesh...@greatminds.co.in wrote:
 Hi team,

 Is there a way lucene can index AutoCAD files - *.dwg files?

 If so, please let me know.

 Can you please provide some insight on the same?



 Thanks in advance..



 Regards

 Vignesh




Re: [Solr] and CouchDB

2011-02-19 Thread Markus Jelsma
CouchDB is a good piece of software for some scenario's and easy to use. It 
has update handlers to which you could attach a small program that takes the 
input, transforms it to Solr XML and send it over.

CouchDB lucene is a bit different. It lacks the power of Solr but allows and 
you need to write a custom view to send data over. But, it works and brings 
full-text search to CouchDB.

 I am curious to see if anyone has messed around with Solr and the
 Couch-Lucene incarnation that is out there...I was passed this article
 this morning and it really opened my eyes about CouchDB
 http://m.readwriteweb.com/hack/2011/02/hacker-chat-max-ogden.php
 
 Thoughts,
 Adam


Solr Ajax

2011-02-19 Thread McGibbney, Lewis John
Hello list,

I'm in the process of trying to implement Ajax within my Solr-backed webapp I 
have
been reading both the Solrj wiki as well as the tutorial provided via
the google group and various info from the wiki page 
https://github.com/evolvingweb/ajax-solr/wiki

I have all solrj jar libraries available in my webapp /lib but I am
unsure as to what steps I take to configure the Solrj client. What do I need to 
configure to begin working with Solrj? I am unsure as to where to go and 
finding information on the wiki seems to be a non trivial task.

Any help would be great. Thanks

Lewis

Glasgow Caledonian University is a registered Scottish charity, number SC021474

Winner: Times Higher Education’s Widening Participation Initiative of the Year 
2009 and Herald Society’s Education Initiative of the Year 2009.
http://www.gcu.ac.uk/newsevents/news/bycategory/theuniversity/1/name,6219,en.html

Winner: Times Higher Education’s Outstanding Support for Early Career 
Researchers of the Year 2010, GCU as a lead with Universities Scotland partners.
http://www.gcu.ac.uk/newsevents/news/bycategory/theuniversity/1/name,15691,en.html


Delete doc in index by date range?

2011-02-19 Thread Rosa (Anuncios)

Hi,

I'm wondering if it's possible to delete documents in m'y index by date range?

I've got a field in my schema: indexed_date in date type and i would like to 
remove docs older than 90 days.

Thanks for your help

Marc



Re: Delete doc in index by date range?

2011-02-19 Thread Markus Jelsma
Sure
http://wiki.apache.org/solr/UpdateXmlMessages#A.22delete.22_by_ID_and_by_Query

 Hi,
 
 I'm wondering if it's possible to delete documents in m'y index by date
 range?
 
 I've got a field in my schema: indexed_date in date type and i would like
 to remove docs older than 90 days.
 
 Thanks for your help
 
 Marc


Re: question: havnig multiple solrCloud configuration on the same machine

2011-02-19 Thread Stijn Vanhoorelbeke

Hi,

I'm following your suggestions.

Extract of your last step:
This would give you three different configurations - you would then edit
the zookeeper info to point each collection (essentially a SolrCore at
this point) to the right configuration files:

collections/collection1
  config=conf1

collections/collection2
  config=conf2

collections/collection3
  config=conf3

How do you manage to set this?
Do you need to modify solr.xml to set this stuff?

Stijn Vanhoorelbeke,
-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/question-havnig-multiple-solrCloud-configuration-on-the-same-machine-tp1165790p2536210.html
Sent from the Solr - User mailing list archive at Nabble.com.


RE: Best way for a query-expander?

2011-02-19 Thread Jonathan Rochkind
I don't think there's any way to do this in Solr, although you could write your 
own query parser in Java if you wanted to. 

You can set defaults , invariants  and appends values on your request 
handler, but I don't think that's flexible enough to do what you want. 
http://wiki.apache.org/solr/SearchHandler

In general, to my perspective, Solr seems to be written assuming a trusted 
client.  If you are allowing access to untrusted clients, there are probably 
all sorts of things a client can do that you woudln't want them to, writing 
your own query parser might be a good idea. 

From: Paul Libbrecht [p...@hoplahup.net]
Sent: Saturday, February 19, 2011 11:01 AM
To: solr-user@lucene.apache.org
Subject: Re: Best way for a query-expander?

Hello list,

as Hoss suggests, I'll try to be more detailed.

I wish to use http parameters in my requests that define the precise semantic 
of an advanced search.
For example, if I can see from sessions, that a given user is requesting, that 
not only public resources but resources private-to-him are returned.
For example, if there's a parameter ict, I want to expand the query with an 
extra (mandatory) term-query.

I know I could probably do this at the client level but I do not think this is 
the best way, in particular about the access to private resources... I also 
think it's better to not rely too heavily on client's ability to formula 
string-queries since it allows all sorts of tweaking that one may not wish 
possible, in particular for queries that are service oriented.

paul


Le 19 févr. 2011 à 01:18, Chris Hostetter a écrit :


 : I want to implement a query-expander, one that enriches the input by the
 : usage of extra parameters that, for example, a form may provide.
 :
 : Is the right way to subclass SearchHandler?
 : Or rather to subclass QueryComponent?

 This smells like the poster child for an X/Y problem
 (or maybe an X/(Y OR Z) problem)...

 if you can elaborate a bit more on the type of enrichment you want to do,
 it's highly likely that your goal can be met w/o needing to write a custom
 plugin (i'm thinking particularly of the multitudes of parsers solr
 already has, local params, and variable substitution)

 http://people.apache.org/~hossman/#xyproblem
 XY Problem

 Your question appears to be an XY Problem ... that is: you are dealing
 with X, you are assuming Y will help you, and you are asking about Y
 without giving more details about the X so that we can understand the
 full issue.  Perhaps the best solution doesn't involve Y at all?
 See Also: http://www.perlmonks.org/index.pl?node_id=542341


 -Hoss



Re: Index Autocad

2011-02-19 Thread Adam Estrada
I think you may have already posted this same question but please
check VoyagerGIS out. They have some shit-hot software that is geared
specifically towards the archive and retrieval of geospatial data. I
suggest that you check it out!!!

w/r,
Adam


On Sat, Feb 19, 2011 at 2:33 AM, lucene lucene luc...@greatminds.co.in wrote:
 Hi team,

 Is there a way lucene can index AutoCAD files – “*.dwg” files?

 If so, please let me know.

 Can you please provide some insight on the same?



 Thanks in advance..



 Regards

 Vignesh