Re: Can solr return documents which don't match a query?

2009-09-07 Thread Chantal Ackermann
Or add a new tag: the "NO TAG" at index time and search for that. (If 
you have the possibility to reindex at least that one, first, time.) 
Would clear things up for developers/admins looking at the stuff in some 
months...


Chantal

Yonik Seeley schrieb:

"return all documents which either match query1 or don't match query 2"


query1 (*:* -query2)

-Yonik
http://www.lucidimagination.com




Re: capturing field length into a stored document field

2009-09-07 Thread mike.schultz

Here's a hybrid solution.  Add a filter to the field in question that counts
all the tokens and at the end  outputs a token of the form
__numtokens.__.  This eliminates the need to retokenize the field
again.  Also, bucket the numbers, either by some factor of ten, or base 2,
so that there aren't so many different token types produced.  This has a
space advantage over storing in a field, especially since the information
isn't needed at query time anyway.



mike.schultz wrote:
> 
> For various statistics I collect from an index it's important for me to
> know the length (measured in tokens) of a document field.  I can get that
> information to some degree from the "norms" for the field but a) the
> resolution isn't that great, and b) more importantly, if boosts are used
> it's almost impossible to get lengths from this.
> 
> Here's two ideas I was thinking about that maybe some can comment on.
> 
> 1) Use copyto to copy the field in question, fieldA to an addition field,
> fieldALength, which has an extra filter that just counts the tokens and
> only outputs a token representing the length of the field.  This has the
> disadvantage of retokenizing basically the whole document (because the
> field in question is basically the body).  Plus I would think littering
> the term space with these tokens might be bad for performance, I'm not
> sure.
> 
> 2) Add a filter to the field in question which again counts the tokens. 
> This filter allows the regular tokens to be indexed as usual but somehow
> manages to get the token-count into a stored field of the document.  This
> has the advantage of not having to retokenize the field and instead of
> littering the token space, the count becomes docdata for each doc.  Can
> this be done?  Maybe using threadLocal to temporarily store the count?
> 
> Thanks.
> 
> 

-- 
View this message in context: 
http://www.nabble.com/capturing-field-length-into-a-stored-document-field-tp25297690p25339584.html
Sent from the Solr - User mailing list archive at Nabble.com.



Re: Can solr return documents which don't match a query?

2009-09-07 Thread Yonik Seeley
> "return all documents which either match query1 or don't match query 2"

query1 (*:* -query2)

-Yonik
http://www.lucidimagination.com


Can solr return documents which don't match a query?

2009-09-07 Thread Stephen Weiss

Hi everyone,

I have what's probably a loaded question about how Solr parses queries.

Basically, the client wants to restructure the way they handle  
permissions.  Instead of assigning specific permissions ids (as we've  
done up until this point), they want to structure permissions by  
adding a particular filter query depending on the products that the  
user has purchased.  In principle, it could save a lot of reindexing  
time because when products change, it simply changes the filter  
queries, instead of requiring reindexing of large portions of the  
documents in the index.


However, I'm hitting a snag...  Basically, here is the way we'd like  
to structure the queries:


Documents can have zero or more tags in a particular field assigned to  
them.  All permissions are essentially tied to the tags in this  
field.  For each product a user has purchased, there are a certain  
number of tags associated with it.  Users should be able to see all  
documents which match any of these tags, as well as all documents  
which don't match *any* tag associated with a product.  Basically,  
this means that users can see any documents which match their specific  
products, as well as any documents which aren't bound to any product  
in particular (documents which are essentially open to anyone).


Let's say that the following tags represent all the restricted tags in  
the system:


tag1
tag2
tag3
tag4

The current user is allowed access to documents with tags tag1 and  
tag2, and documents which have none of these tags.


So, by my own logic, I'd make the query:

((my_field:tag1) OR (my_field:tag2) OR (-my_field:tag1 AND - 
my_field:tag2 AND -my_field:tag3 AND -my_field:tag4))


Keep in mind that "my_field" can have other tags beyond tags 1-4 which  
essentially have no impact on permissions.


Obviously, I'm writing here because this query doesn't work... at  
all.  I see according to the wikis that, apparently, - or "NOT"  
*excludes* all documents which have the tag in question.  So this is  
just not the right syntax for what I'm doing.


What I can't seem to find is a syntax which says "return all documents  
which don't match this query", or perhaps more importantly, "return  
all documents which either match query1 or don't match query 2".  I  
tried using the "_query_" syntax but no luck there either.


Am I mistaken, or is there simply no way in SOLR to return the  
documents which didn't match the specified query?


If not, I fear there's very little potential for optimization  
here  Reindexing all affected documents for a product every time a  
product's parameters change is becoming very cumbersome - it can take  
hours.  We'd really like to find a way to avoid having to do that.  If  
anyone can help figure out a way to actually do this I'd be very  
grateful.


Thanks!

--
Steve


abortOnConfigurationError=false not taking the effect in solr 1.3

2009-09-07 Thread djain101

Hi,

We have set false in
solrconfig.xml for all the cores but still whenever any configuration error
occurs search does not work in any of the cores. Is there any other
configuration we have to set so as to not affect search in the cores which
are configured correctly?

For one of the core, we got following error today but then none of the core
is working after that. The below message suggests to add
abortOnConfigurationError in solr.xml but we have it added in solrconfig.xml
for all the cores. Do we need to add in solr.xml? If yes, where?

Please suggest what is the right way to configure so that if one core fails
due to configuration errors, all other cores remain unaffected?

*
Check your log files for more detailed information on what may be wrong.

If you want solr to continue after configuration errors, change: 

 false

in solr.xml

java.lang.RuntimeException: java.io.IOException: Cannot create directory:
/usr/local/app/data/search/core09/index
***
-- 
View this message in context: 
http://www.nabble.com/abortOnConfigurationError%3Dfalse-not-taking-the-effect-in-solr-1.3-tp25332254p25332254.html
Sent from the Solr - User mailing list archive at Nabble.com.



Re: Netbeans and Solr : Whac-A-Mole

2009-09-07 Thread Mark Miller
Mark Miller wrote:
> Setting Solr up to easily run tests through the IDE should be about as
> simple with any IDE of your choosing.
>
> I do it with eclipse. It takes a couple basic steps:
>
> 1. Add the correct libraries to your classpath - all the jars in lib and
> example/lib (for jetty jars).
>
> 2. Add the build/solr build/solrj build/test folders to your classpath.
>
> 3. Set your working directory to
> ${workspace_loc:solrtrunk}/src/test/test-files/ (In eclipse, the opening
> variable here is a path to the project base dir - for other IDE's,
> change to what makes sense).
>
>
> Now you should be good to go.
>
> (In eclipse, its best to add the jars/folders to the project classpath,
> rather than to the single test Run configuration - else you will have to
> do it for every test - instead, add it to the project build path and
> then set the working directory for each test Run configuration.)
>
> Now its easy to run tests one at a time, or to debug, or whatever. Not
> quite as simple to get going as Lucene, but once you have it up, its the
> same dev pleasure.
>
> - Mark
>   
P.S. to extend to contribs, just add their jars and class folders as well.

-- 
- Mark

http://www.lucidimagination.com





Re: Netbeans and Solr : Whac-A-Mole

2009-09-07 Thread Mark Miller
Setting Solr up to easily run tests through the IDE should be about as
simple with any IDE of your choosing.

I do it with eclipse. It takes a couple basic steps:

1. Add the correct libraries to your classpath - all the jars in lib and
example/lib (for jetty jars).

2. Add the build/solr build/solrj build/test folders to your classpath.

3. Set your working directory to
${workspace_loc:solrtrunk}/src/test/test-files/ (In eclipse, the opening
variable here is a path to the project base dir - for other IDE's,
change to what makes sense).


Now you should be good to go.

(In eclipse, its best to add the jars/folders to the project classpath,
rather than to the single test Run configuration - else you will have to
do it for every test - instead, add it to the project build path and
then set the working directory for each test Run configuration.)

Now its easy to run tests one at a time, or to debug, or whatever. Not
quite as simple to get going as Lucene, but once you have it up, its the
same dev pleasure.

- Mark


Re: Netbeans and Solr : Whac-A-Mole

2009-09-07 Thread Lukáš Vlček
Shalin,
do you think you can share more details about your IDEA setup for Solr
testing?
I am still having some issues with some tests (not all tests can find solr
configuration files).

Best regards,
Lukas


On Mon, Sep 7, 2009 at 2:35 PM, Shalin Shekhar Mangar <
shalinman...@gmail.com> wrote:

> On Mon, Sep 7, 2009 at 5:58 PM, Fergus McMenemie 
> wrote:
>
> > >This testcase is quite independent of anything in Solr. It is a
> > >standalone utility and the only dependency is stax.
> > >discalimer (I run these testcases from Intellij and command line)
> > >BTW are you using XpathRecordReader outside of DIH?
> >
> > Nobel,
> >
> > Is there a better way to test and play with XPathRecordReader.java
> > other than
> >
> >  ant -Dtestcase=TestXPathRecordReader test
> >
> > Which takes 8secs to run here? I am not using XpathRecordReader
> > outside of DIH, but looking to see how I would add support for
> > xpaths such as //a.
> >
> >
> The target takes a lot of time because it has to go through all the
> test-cases in core and contribs trying to match the value given in
> -Dtestcase.
>
> You could also do ant -Dtestcase=TestXPathRecordReader test-contrib which
> should be a little faster. I run individual test cases directly through
> IDEA
> which avoids these extra steps.
>
> --
> Regards,
> Shalin Shekhar Mangar.
>


Re: Netbeans and Solr : Whac-A-Mole

2009-09-07 Thread Noble Paul നോബിള്‍ नोब्ळ्
It is possible to use XPathRecordReader directly with a couple of
lines of code as demonstrated in the JUnit testcases. just add the
stax jars in the classpath

On Mon, Sep 7, 2009 at 5:58 PM, Fergus McMenemie wrote:
>>This testcase is quite independent of anything in Solr. It is a
>>standalone utility and the only dependency is stax.
>>discalimer (I run these testcases from Intellij and command line)
>>BTW are you using XpathRecordReader outside of DIH?
>
> Nobel,
>
> Is there a better way to test and play with XPathRecordReader.java
> other than
>
>  ant -Dtestcase=TestXPathRecordReader test
>
> Which takes 8secs to run here? I am not using XpathRecordReader
> outside of DIH, but looking to see how I would add support for
> xpaths such as //a.
>
> Fergus.
>
>>
>>On Mon, Sep 7, 2009 at 3:26 PM, Fergus McMenemie wrote:
>>> Hello all,
>>>
>>> I would appreciate help from somebody who has set up Solr within
>>> netbeans, I am wanting to do more work with DIH and particularly its
>>> XpathEntityProcessor stuff. I wish to preform the following from
>>> within the IDE
>>>
>>>   ant -Dtestcase=TestXPathRecordReader.java test
>>>
>>> I have spent a few hours playing Whac-A-Mole with classpath and source
>>> settings. In the end I got it down to zero flags, but I then added
>>> some test cases and the scanner thing then went off and flagged dozens
>>> files with undefined classes I removed my change but the rescan did not
>>> remove the dozens of flagged files.
>>>
>>> PS: I am a total netbeans newbie.
>>>
>>> --
>>>
>>> ===
>>> Fergus McMenemie               Email:fer...@twig.me.uk
>>> Techmore Ltd                   Phone:(UK) 07721 376021
>>>
>>> Unix/Mac/Intranets             Analyst Programmer
>>> ===
>>>
>>
>>
>>
>>--
>>-
>>Noble Paul | Principal Engineer| AOL | http://aol.com
>
> --
>
> ===
> Fergus McMenemie               Email:fer...@twig.me.uk
> Techmore Ltd                   Phone:(UK) 07721 376021
>
> Unix/Mac/Intranets             Analyst Programmer
> ===
>



-- 
-
Noble Paul | Principal Engineer| AOL | http://aol.com


Re: Faceting optimization

2009-09-07 Thread Grant Ingersoll
If you add &debugQuery=true to your query, it should give you a  
breakdown of time spent in the various components.


Also, are you doing any warming?  You might also look at the new  
faceting method in Solr 1.4.


On Sep 7, 2009, at 6:37 AM, Sébastien Lamy wrote:


Hi

I'm currently trying to optimize the response time of my solr server.
I found one aberration and hope you may be able to help me solve it:
If, considering the whole document index, there is a lot of possible  
values for a field, asking for facet on that field dramatically  
increase response time. Even if the search returns only one  
document, with only one facet value for that field. This is shown by  
the three requests at the bottom of this mail.


It seems to me that solr looks at all the possible values in the  
whole index for the faceted field. Whereas it should look at the  
possible values only for the documents in the results, wich would be  
a lot faster. Is there a way asking him to do so?



---
Let's look at this three requests:

1- This request returns only one document and take 3ms
http://localhost:8983/solr/select/?
rows=10&
q=(available_owner_display_name_s_facet:%22mag%22)+AND+type_s:[T0+TO 
+T9]



2- This request returns one document, and its facets for one field.  
It takes about 1000ms. The facet on a_10_alpha_sort returns only one  
value: "air du temps". But overall the whole index, there is a lot  
of values (>10 000) for a_10_alpha_sort.

http://localhost:8983/solr/select/?
facet=true&
rows=10&
q=(available_owner_display_name_s_facet:%22mag%22)+AND+type_s:[T0+TO 
+T9]&

facet.field=a_10_alpha_sort&
f.a_10_alpha_sort.facet.mincount=1&
f.a_10_alpha_sort.facet.sort=true&
f.a_10_alpha_sort.facet.limit=8&

3- This request includes the value "air du temps" in the search  
string. It takes 3ms

http://localhost:8983/solr/select/?
rows=10&
q=(available_owner_display_name_s_facet:%22mag%22+AND 
+a_10_alpha_sort:"air+du+temps")+AND+type_s:[T0+TO+T9]



Here is the description of the faceted field in my schema: this is a  
single-valued field, with no tokens.


indexed="true" stored="false" multivalued="false"/>
sortMissingLast="true" omitNorms="true">


  
  
  
  
  




--
Grant Ingersoll
http://www.lucidimagination.com/

Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids)  
using Solr/Lucene:

http://www.lucidimagination.com/search



Re: Netbeans and Solr : Whac-A-Mole

2009-09-07 Thread Fergus McMenemie
>On Mon, Sep 7, 2009 at 5:58 PM, Fergus McMenemie  wrote:
>
>> >This testcase is quite independent of anything in Solr. It is a
>> >standalone utility and the only dependency is stax.
>> >discalimer (I run these testcases from Intellij and command line)
>> >BTW are you using XpathRecordReader outside of DIH?
>>
>> Nobel,
>>
>> Is there a better way to test and play with XPathRecordReader.java
>> other than
>>
>>  ant -Dtestcase=TestXPathRecordReader test
>>
>> Which takes 8secs to run here? I am not using XpathRecordReader
>> outside of DIH, but looking to see how I would add support for
>> xpaths such as //a.
>>
>>
>The target takes a lot of time because it has to go through all the
>test-cases in core and contribs trying to match the value given in
>-Dtestcase.
>
>You could also do ant -Dtestcase=TestXPathRecordReader test-contrib which
>should be a little faster. I run individual test cases directly through IDEA
>which avoids these extra steps.
>
Shalin,

Hmm, 6 seconds. I looked up IDEA and I guess I should be able
to use it for free while working on solr. Is it easier to 
setup and come up the learning curve?

Regards Fergus.
-- 

===
Fergus McMenemie   Email:fer...@twig.me.uk
Techmore Ltd   Phone:(UK) 07721 376021

Unix/Mac/Intranets Analyst Programmer
===


Re: Netbeans and Solr : Whac-A-Mole

2009-09-07 Thread Shalin Shekhar Mangar
On Mon, Sep 7, 2009 at 5:58 PM, Fergus McMenemie  wrote:

> >This testcase is quite independent of anything in Solr. It is a
> >standalone utility and the only dependency is stax.
> >discalimer (I run these testcases from Intellij and command line)
> >BTW are you using XpathRecordReader outside of DIH?
>
> Nobel,
>
> Is there a better way to test and play with XPathRecordReader.java
> other than
>
>  ant -Dtestcase=TestXPathRecordReader test
>
> Which takes 8secs to run here? I am not using XpathRecordReader
> outside of DIH, but looking to see how I would add support for
> xpaths such as //a.
>
>
The target takes a lot of time because it has to go through all the
test-cases in core and contribs trying to match the value given in
-Dtestcase.

You could also do ant -Dtestcase=TestXPathRecordReader test-contrib which
should be a little faster. I run individual test cases directly through IDEA
which avoids these extra steps.

-- 
Regards,
Shalin Shekhar Mangar.


Re: Netbeans and Solr : Whac-A-Mole

2009-09-07 Thread Fergus McMenemie
>This testcase is quite independent of anything in Solr. It is a
>standalone utility and the only dependency is stax.
>discalimer (I run these testcases from Intellij and command line)
>BTW are you using XpathRecordReader outside of DIH?

Nobel,

Is there a better way to test and play with XPathRecordReader.java
other than

 ant -Dtestcase=TestXPathRecordReader test

Which takes 8secs to run here? I am not using XpathRecordReader
outside of DIH, but looking to see how I would add support for
xpaths such as //a.

Fergus.

>
>On Mon, Sep 7, 2009 at 3:26 PM, Fergus McMenemie wrote:
>> Hello all,
>>
>> I would appreciate help from somebody who has set up Solr within
>> netbeans, I am wanting to do more work with DIH and particularly its
>> XpathEntityProcessor stuff. I wish to preform the following from
>> within the IDE
>>
>>   ant -Dtestcase=TestXPathRecordReader.java test
>>
>> I have spent a few hours playing Whac-A-Mole with classpath and source
>> settings. In the end I got it down to zero flags, but I then added
>> some test cases and the scanner thing then went off and flagged dozens
>> files with undefined classes I removed my change but the rescan did not
>> remove the dozens of flagged files.
>>
>> PS: I am a total netbeans newbie.
>>
>> --
>>
>> ===
>> Fergus McMenemie               Email:fer...@twig.me.uk
>> Techmore Ltd                   Phone:(UK) 07721 376021
>>
>> Unix/Mac/Intranets             Analyst Programmer
>> ===
>>
>
>
>
>-- 
>-
>Noble Paul | Principal Engineer| AOL | http://aol.com

-- 

===
Fergus McMenemie   Email:fer...@twig.me.uk
Techmore Ltd   Phone:(UK) 07721 376021

Unix/Mac/Intranets Analyst Programmer
===


Re: Netbeans and Solr : Whac-A-Mole

2009-09-07 Thread rajan chandi
We use command-line for most stuff except editing/debugging!

2009/9/7 Noble Paul നോബിള്‍ नोब्ळ् 

> This testcase is quite independent of anything in Solr. It is a
> standalone utility and the only dependency is stax.
> discalimer (I run these testcases from Intellij and command line)
> BTW are you using XpathRecordReader outside of DIH?
>
> On Mon, Sep 7, 2009 at 3:26 PM, Fergus McMenemie wrote:
> > Hello all,
> >
> > I would appreciate help from somebody who has set up Solr within
> > netbeans, I am wanting to do more work with DIH and particularly its
> > XpathEntityProcessor stuff. I wish to preform the following from
> > within the IDE
> >
> >   ant -Dtestcase=TestXPathRecordReader.java test
> >
> > I have spent a few hours playing Whac-A-Mole with classpath and source
> > settings. In the end I got it down to zero flags, but I then added
> > some test cases and the scanner thing then went off and flagged dozens
> > files with undefined classes I removed my change but the rescan did not
> > remove the dozens of flagged files.
> >
> > PS: I am a total netbeans newbie.
> >
> > --
> >
> > ===
> > Fergus McMenemie   
> > Email:fer...@twig.me.uk
> > Techmore Ltd   Phone:(UK) 07721 376021
> >
> > Unix/Mac/Intranets Analyst Programmer
> > ===
> >
>
>
>
> --
> -
> Noble Paul | Principal Engineer| AOL | http://aol.com
>


Re: Netbeans and Solr : Whac-A-Mole

2009-09-07 Thread Noble Paul നോബിള്‍ नोब्ळ्
This testcase is quite independent of anything in Solr. It is a
standalone utility and the only dependency is stax.
discalimer (I run these testcases from Intellij and command line)
BTW are you using XpathRecordReader outside of DIH?

On Mon, Sep 7, 2009 at 3:26 PM, Fergus McMenemie wrote:
> Hello all,
>
> I would appreciate help from somebody who has set up Solr within
> netbeans, I am wanting to do more work with DIH and particularly its
> XpathEntityProcessor stuff. I wish to preform the following from
> within the IDE
>
>   ant -Dtestcase=TestXPathRecordReader.java test
>
> I have spent a few hours playing Whac-A-Mole with classpath and source
> settings. In the end I got it down to zero flags, but I then added
> some test cases and the scanner thing then went off and flagged dozens
> files with undefined classes I removed my change but the rescan did not
> remove the dozens of flagged files.
>
> PS: I am a total netbeans newbie.
>
> --
>
> ===
> Fergus McMenemie               Email:fer...@twig.me.uk
> Techmore Ltd                   Phone:(UK) 07721 376021
>
> Unix/Mac/Intranets             Analyst Programmer
> ===
>



-- 
-
Noble Paul | Principal Engineer| AOL | http://aol.com


Re: Exact Word Search

2009-09-07 Thread bhaskar chandrasekar
 
Hi Shalin,
 
My search is based on the following fields in schema.xml
 


 
 
 
 
Let me know if you need anything else?
Regards
Bhaskar

--- On Fri, 9/4/09, Shalin Shekhar Mangar  wrote:


From: Shalin Shekhar Mangar 
Subject: Re: Exact Word Search
To: solr-user@lucene.apache.org
Date: Friday, September 4, 2009, 5:51 AM


On Fri, Sep 4, 2009 at 6:06 PM, bhaskar chandrasekar
wrote:

>
> Hi,
>
>
> I have integrated Solr with Carrot2 Cluster Engine (v 3.1.0).
>
> Carrot2 is used as a presentation layer. Carrot2 sends requested query to
> external source (Solr) and get results from Solr.
> Carrot2 may not be responsible for forming Query. It would have been
> handled from Solr end.
>

Can you post the exact query that your application or Carrot2 is sending to
Solr? Can you also list the Solr field and type defined in schema.xml which
is being searched?


>
> Please help me with the below scenarios.
>
> Scenario: (Please DO NOT consider any case sensitive)
>
> Assuming I give "bhaskar" as input string
> It should give me search results pertaining to word ‘bhaskar’ only.
>
> I am expecting output like below database query
> Select * from MASTER where name =’bhaskar’;
>
> Above query suppose to return matched records for ‘bhaskar’..
>

Use a solr.TextField with KeywordTokenizer and LowerCaseFilter and search
with q=field-name:"field-value"

-- 
Regards,
Shalin Shekhar Mangar.



  

Faceting optimization

2009-09-07 Thread Sébastien Lamy

Hi

I'm currently trying to optimize the response time of my solr server.
I found one aberration and hope you may be able to help me solve it:
If, considering the whole document index, there is a lot of possible 
values for a field, asking for facet on that field dramatically increase 
response time. Even if the search returns only one document, with only 
one facet value for that field. This is shown by the three requests at 
the bottom of this mail.


It seems to me that solr looks at all the possible values in the whole 
index for the faceted field. Whereas it should look at the possible 
values only for the documents in the results, wich would be a lot 
faster. Is there a way asking him to do so?



---
Let's look at this three requests:

1- This request returns only one document and take 3ms
http://localhost:8983/solr/select/?
rows=10&
q=(available_owner_display_name_s_facet:%22mag%22)+AND+type_s:[T0+TO+T9]


2- This request returns one document, and its facets for one field. It 
takes about 1000ms. The facet on a_10_alpha_sort returns only one value: 
"air du temps". But overall the whole index, there is a lot of values 
(>10 000) for a_10_alpha_sort.

http://localhost:8983/solr/select/?
facet=true&
rows=10&
q=(available_owner_display_name_s_facet:%22mag%22)+AND+type_s:[T0+TO+T9]&
facet.field=a_10_alpha_sort&
f.a_10_alpha_sort.facet.mincount=1&
f.a_10_alpha_sort.facet.sort=true&
f.a_10_alpha_sort.facet.limit=8&

3- This request includes the value "air du temps" in the search string. 
It takes 3ms

http://localhost:8983/solr/select/?
rows=10&
q=(available_owner_display_name_s_facet:%22mag%22+AND+a_10_alpha_sort:"air+du+temps")+AND+type_s:[T0+TO+T9]


Here is the description of the faceted field in my schema: this is a 
single-valued field, with no tokens.


stored="false" multivalued="false"/>
sortMissingLast="true" omitNorms="true">

 
   
   
   
   
   
 



Re: Netbeans and Solr : Whac-A-Mole

2009-09-07 Thread Fergus McMenemie
>We've set-up NetBeans with Solr but we are using command-line for most of
>the stuff except for editing the code.
>
>Does your code build from NetBeans? If not what errors do you see?

The code builds and runs from net beans because the underlying build.xml
file is being used. But when you want to run testcases... you are doing 
that from the command line? Are you are only using the IDE as an editor?

>Regards
>Rajan
>
>On Mon, Sep 7, 2009 at 3:26 PM, Fergus McMenemie  wrote:
>
>> Hello all,
>>
>> I would appreciate help from somebody who has set up Solr within
>> netbeans, I am wanting to do more work with DIH and particularly its
>> XpathEntityProcessor stuff. I wish to preform the following from
>> within the IDE
>>
>>   ant -Dtestcase=TestXPathRecordReader.java test
>>
>> I have spent a few hours playing Whac-A-Mole with classpath and source
>> settings. In the end I got it down to zero flags, but I then added
>> some test cases and the scanner thing then went off and flagged dozens
>> files with undefined classes I removed my change but the rescan did not
>> remove the dozens of flagged files.
>>
>> PS: I am a total netbeans newbie.
>>
>> --
>>
>> ===
>> Fergus McMenemie   
>> Email:fer...@twig.me.uk
>> Techmore Ltd   Phone:(UK) 07721 376021
>>
>> Unix/Mac/Intranets Analyst Programmer
>> ===
>>

-- 

===
Fergus McMenemie   Email:fer...@twig.me.uk
Techmore Ltd   Phone:(UK) 07721 376021

Unix/Mac/Intranets Analyst Programmer
===


Re: Netbeans and Solr : Whac-A-Mole

2009-09-07 Thread rajan chandi
We've set-up NetBeans with Solr but we are using command-line for most of
the stuff except for editing the code.

Does your code build from NetBeans? If not what errors do you see?

Regards
Rajan

On Mon, Sep 7, 2009 at 3:26 PM, Fergus McMenemie  wrote:

> Hello all,
>
> I would appreciate help from somebody who has set up Solr within
> netbeans, I am wanting to do more work with DIH and particularly its
> XpathEntityProcessor stuff. I wish to preform the following from
> within the IDE
>
>   ant -Dtestcase=TestXPathRecordReader.java test
>
> I have spent a few hours playing Whac-A-Mole with classpath and source
> settings. In the end I got it down to zero flags, but I then added
> some test cases and the scanner thing then went off and flagged dozens
> files with undefined classes I removed my change but the rescan did not
> remove the dozens of flagged files.
>
> PS: I am a total netbeans newbie.
>
> --
>
> ===
> Fergus McMenemie   
> Email:fer...@twig.me.uk
> Techmore Ltd   Phone:(UK) 07721 376021
>
> Unix/Mac/Intranets Analyst Programmer
> ===
>


Netbeans and Solr : Whac-A-Mole

2009-09-07 Thread Fergus McMenemie
Hello all,

I would appreciate help from somebody who has set up Solr within
netbeans, I am wanting to do more work with DIH and particularly its
XpathEntityProcessor stuff. I wish to preform the following from
within the IDE

   ant -Dtestcase=TestXPathRecordReader.java test

I have spent a few hours playing Whac-A-Mole with classpath and source
settings. In the end I got it down to zero flags, but I then added 
some test cases and the scanner thing then went off and flagged dozens
files with undefined classes I removed my change but the rescan did not 
remove the dozens of flagged files.

PS: I am a total netbeans newbie. 

-- 

===
Fergus McMenemie   Email:fer...@twig.me.uk
Techmore Ltd   Phone:(UK) 07721 376021

Unix/Mac/Intranets Analyst Programmer
===


Re: Field Collapsing (was Re: Schema for group/child entity setup)

2009-09-07 Thread Uri Boness


Great. Nice site and very similar to my requirements.

thanks.


So, right now, you get all field values by default?
Right now, no field values are returned for the collapsed documents. The 
patch which will be committed soon will add this functionality.


R. Tan wrote:

Great. Nice site and very similar to my requirements.

  

There's work on the patch that is being done now which will enable you to
ask for specific field values of the collapsed documents using a dedicated
request parameter.




So, right now, you get all field values by default?


On Sun, Sep 6, 2009 at 3:58 AM, Uri Boness  wrote:

  

You can check out http://www.ilocal.nl. If you search for a bank in
Amsterdam then you'll see that a lot of the results are collapsed. For this
we used an older version of this patch (which works on 1.3) but a lot has
changed since then. We're currently using this patch on another project, but
it's not live yet.


Uri

R. Tan wrote:



Thanks Uri. Your personal suggestion is appreciated and I think I'll
follow
your advice. We're still early in development and 1.4 would be a good
choice. I hope I can get field collapsing to work with my requirements. Do
you know any live site using field collapsing already?

On Sat, Sep 5, 2009 at 5:57 PM, Uri Boness  wrote:



  

There's work on the patch that is being done now which will enable you to
ask for specific field values of the collapsed documents using a
dedicated
request parameter. This work is not committed yet to the latest patch,
but
will be very soon. There is of course a drawback to that as well, the
collapsed documents set can be very large (depends on your data of
course)
in which case the returned result which includes the fields values can be
rather large, which will impact performance, this is why this feature
will
be enabled only if you specify this extra parameter - by default no field
values will be returned.

AFAIK, the latest patch should work fine with the latest build. Martijn
(which is the main maintainer of this patch) tries to keep it up to date
with the latest builds. But I guess the safest way is to work with the
nightly build of the same date as the latest patch (though I would give
it a
try first with the latest build).

BTW, it's not an official suggestion from the Solr development team, but
if
you ask me, if you have to choose now whether to use 1.3 or 1.4-dev, I
would
go for the later. 1.4 is supposed to be released in the upcoming week or
two
and it bring loads of bug fixes, enhancements and extra functionality.
But
again, this is my personal suggestion.


cheers,
Uri

R. Tan wrote:





Okay. Thanks for giving an insight on how it works in general. Without
trying it myself, are the field values for the collapsed ones also part
of
the results data?
What is the latest build that is safe to use on a production
environment?
I'd probably go for that and use field collapsing.

Thank you very much.


On Fri, Sep 4, 2009 at 4:49 AM, Uri Boness  wrote:





  

The collapsed documents are represented by one "master" document which
can
be part of the normal search result (the doc list), so pagination just
works
as expected, meaning taking only the returned documents in account
(ignoring
the collapsed ones). As for the scoring, the "master" document is
actually
the document with the highest score in the collapsed group.

As for Solr 1.3 compatibility... well... it's very hart to tell. All
latest
patch are certainly *not* 1.3 compatible (I think they're also
depending
on
some changes in lucene which are not available for solr 1.3). I guess
you'll
have to try some of the old patches, but I'm not sure about their
stability.

cheers,
Uri


R. Tan wrote:







Thanks Uri. How does paging and scoring work when using field
collapsing?
What patch works with 1.3? Is it production ready?

R


On Thu, Sep 3, 2009 at 3:54 PM, Uri Boness  wrote:







  

The development on this patch is quite active. It works well for
single
solr instance, but distributed search (ie. shards) is not yet
supported.
Using this page you can group search results based on a specific
field.
There are two flavors of field collapsing - adjacent and
non-adjacent,
the
former collapses only document which happen to be located next to
each
other
in the otherwise-non-collapsed results set. The later (the
non-adjacent)
one
collapses all documents with the same field value (regardless of
their
position in the otherwise-non-collapsed results set). Note, that
non-adjacent performs better than adjacent one. There's currently
discussion
to extend this support so in addition to collapsing the documents,
extra
information will be returned for the collapsed documents (see the
discussion
on the issue page).

Uri


R. Tan wrote:









I think this is what I'm looking for. What is the status of this
patch?

On Thu, Sep 3, 2009 at 12:00 PM, R. Tan 
wrote:









  

Hi Solrers,
I would like to get your o