Re: DIH out of memory exception

2009-10-28 Thread Constantijn Visinescu
Does this help? http://wiki.apache.org/solr/DataImportHandlerFaq#I.27m_using_DataImportHandler_with_a_MySQL_database._My_table_is_huge_and_DataImportHandler_is_going_out_of_memory._Why_does_DataImportHandler_bring_everything_to_memory.3F On Wed, Oct 28, 2009 at 12:38 AM, William Pierce

Fwd: Full Text Search: Solr on Cassandra

2009-10-28 Thread Noble Paul നോബിള്‍ नोब्ळ्
-- Forwarded message -- From: Nick Lothian nloth...@educationau.edu.au Date: Wed, Oct 28, 2009 at 11:37 AM Subject: Full Text Search: Solr on Cassandra To: cassandra-u...@incubator.apache.org cassandra-u...@incubator.apache.org Just in case anyone here is interested, I've managed

ERROR:SCHEMA-INDEX-MISMATCH

2009-10-28 Thread Király Péter
Hi all, I just tried to migrate from solr-2009-09-09.zip to today's nightly build. I found that the *_i dynamic field's type remained int, but its definition became <fieldType name="int" class="solr.TrieIntField" precisionStep="0" omitNorms="true" positionIncrementGap="0"/> the older Solr schema lacks

RE: MLT cross core

2009-10-28 Thread Adamsky, Robert
Have two cores with some common fields in their schemas. I want to perform a MLT query on one core and get results from the other schema. Both cores have same type of id. Having the same type of id in two different cores is of no good for a MLT handler (which in-fact operates on one core)

Re: Solr 1.4 (RC) performance on multi-CPU system

2009-10-28 Thread gabriele renzi
On Tue, Oct 27, 2009 at 3:41 PM, Yonik Seeley yo...@lucidimagination.com wrote: is there a reason why StreamingUpdateSolrServer does not accept a connection manager argument as CommonsHttpSolrServer does? I don't see any reason why it shouldn't... can you open a JIRA issue and optionally

Re: benchmarking tools

2009-10-28 Thread mike anderson
Great suggestion, I took a look and it seems pretty useful. As a follow up question, did you do anything to disable Solr caching for certain tests? -mike On Tue, Oct 27, 2009 at 8:14 PM, Joshua Tuberville joshuatubervi...@eharmony.com wrote: Mike, For response times I would also look at

solrjs

2009-10-28 Thread Antonio Eggberg
I am all for new stuff. It would be nice to see a working example of ajax-solr before killing completely solrjs from trunk... at least it was working .. ajax-solr has no how to, nor any working example.. http://github.com/evolvingweb/ajax-solr Well why not just remove the javascript folder

Re: solrjs

2009-10-28 Thread Colin Hynes
Actually, it wasn't quite working. It also replicated a lot of stuff that's in ajax solr, which is being more actively developed. Hence the removal. On Oct 28, 2009, at 10:16 AM, Antonio Eggberg wrote: I am all for new stuff. It would be nice to see a working example of ajax-solr

Re: solrjs

2009-10-28 Thread Antonio Eggberg
I fully understand it was not working probably in production or on other data sets. But it did serve a purpose for me.. i.e. show a demo to anyone out of my box.. and I update my local repo with trunk all the time.. I could do ant reuters-start using my laptop and it would work.. my point is

Re: solrjs

2009-10-28 Thread Colin Hynes
Out of curiosity... Why not just make/find a quick example using ajax-solr and use git to update from them? On Oct 28, 2009, at 10:29 AM, Antonio Eggberg wrote: I fully understand it was not working probably in production or on other data sets. But it did serve a purpose for me.. i.e. show

Spell check suggestion and correct way of implementation and some Questions

2009-10-28 Thread darniz
Hi Solr masters, I just wanted to confirm that what I read is the best way of doing spell check correction. I am using index-based spell check. For this, as the doc says: Create a field, e.g. my_text, with a fieldType my_text_fl which has all the filters defined. Copy all the other fields' data

Re: ERROR:SCHEMA-INDEX-MISMATCH

2009-10-28 Thread Király Péter
From: Yonik Seeley yo...@lucidimagination.com To: solr-user@lucene.apache.org Sent: Wednesday, October 28, 2009 2:11 PM Subject: Re: ERROR:SCHEMA-INDEX-MISMATCH I imagine that your field impl was SortableIntField, not IntField, but checking your old schema will tell you for sure. Yes, you

Re: MLT cross core

2009-10-28 Thread Avlesh Singh
My thought now is I cannot use MLT and instead must do a query to B using the fields from core A ID as query params. Is there big difference in what will be returned as results using query instead of MLT? Yes, there is definitely a difference between the results from a MLT handler and any

Re: Slow Commits

2009-10-28 Thread Jérôme Etévé
Hi, here are two things that can slow down commits: 1) Autowarming the caches. 2) Garbage collection of the Java old generation. You can try: - Turning autowarming off (set autowarmCount=0 in the caches configuration) - If you use the Sun JVM, use -XX:+UseConcMarkSweepGC to get a less

Re: Slow Commits

2009-10-28 Thread Jim Murphy
Thanks Jerome, 1. I have shut off autowarming by setting params to 0. 2. My JVM Settings: -Xmx1200m -Xms1200m -XX:-UseGCOverheadLimit -XX:+UseConcMarkSweepGC -XX:CMSInitiatingOccupancyFraction=50 3. I am using autocommits - every 6 ms. But the commit blocks all the master request

RE: MLT cross core

2009-10-28 Thread Adamsky, Robert
Thanks for the reply -- In your multi-core set-up, I don't think you are anywhere close to using MLT. The arrangement looks more like a search query. Does that mean that you cannot do a 'MLT' query from one core result to get MLT from another (even if there is some common schema between)?

Re: MLT cross core

2009-10-28 Thread Avlesh Singh
Does that mean that you cannot do a 'MLT' query from one core result to get MLT from another (even if there is some common schema between)? You can always run MLT handlers on a core. Each MLT handler takes certain parameters based on which similar results are fetched. You would need to pass

Simple problem with a nested entity and it's SQL

2009-10-28 Thread Jonathan Hendler
I have a nested entity on a JDBC data import handler that is causing an SQL error because the second key is either NULL (blank when generating the SQL) or a non-zero INT. The query is in the following form: <document name="content"> <entity name="bl_lessonfiles"

Re: Simple problem with a nested entity and it's SQL

2009-10-28 Thread Avlesh Singh
Shouldn't this work too? SELECT * FROM table2 WHERE IS NOT NULL ${table1.somethin_like_a_foreign_key} AND ${table1.somethin_like_a_foreign_key} 0 AND id = ${table1.somethin_like_a_foreign_key} Cheers Avlesh On Wed, Oct 28, 2009 at 11:03 PM, Jonathan Hendler jonathan.hend...@gmail.com wrote:

Re: Simple problem with a nested entity and it's SQL

2009-10-28 Thread Jonathan Hendler
No - the SQL will fail to validate because at runtime it will look like SELECT * FROM table2 WHERE IS NOT NULL table1.somethin_like_a_foreign_key AND table1.somethin_like_a_foreign_key 0 AND id = Note the id = On Oct 28, 2009, at 1:38 PM, Avlesh Singh wrote: Shouldn't this work too?

Re: Simple problem with a nested entity and it's SQL

2009-10-28 Thread Avlesh Singh
Assuming this to be MySQL, will this work - SELECT * FROM table2 WHERE id = IF(ISNULL(${table1.somethin_like_a_foreign_key}), 0, ${table1.somethin_like_a_foreign_key}); Cheers Avlesh On Wed, Oct 28, 2009 at 11:12 PM, Jonathan Hendler jonathan.hend...@gmail.com wrote: No - the SQL will fail

Re: Simple problem with a nested entity and it's SQL

2009-10-28 Thread Jonathan Hendler
Thanks - that solution still causes an error. But it helped me think of an SQL solution like so : CONVERT ( '${table1.somethin_like_a_foreign_key}' , UNSIGNED INTEGER ) Convert the integer or NULL to a string, then back again. (ugly but it works) On Oct 28, 2009, at 1:48 PM, Avlesh Singh
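The difference between the bare and the quoted placeholder can be sketched in a few lines of Java. This is only an illustration under stated assumptions: `PlaceholderDemo` and `expand` are hypothetical names, and the naive substitution below is not DataImportHandler's actual variable resolver. It only mimics the behaviour described in this thread, where a NULL key expands to nothing.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical stand-in for template expansion; NOT DataImportHandler's real resolver.
class PlaceholderDemo {
    // Matches ${some.name} and captures the name between the braces.
    private static final Pattern VAR = Pattern.compile("\\$\\{([^}]+)\\}");

    // Replaces each ${name} with its value, or with an empty string when no value exists.
    static String expand(String template, Map<String, String> values) {
        Matcher m = VAR.matcher(template);
        StringBuffer out = new StringBuffer();
        while (m.find()) {
            String value = values.get(m.group(1));
            m.appendReplacement(out, Matcher.quoteReplacement(value == null ? "" : value));
        }
        m.appendTail(out);
        return out.toString();
    }

    public static void main(String[] args) {
        Map<String, String> row = new HashMap<>(); // the foreign key is NULL, so there is no entry
        String key = "table1.somethin_like_a_foreign_key";

        // Bare placeholder: the statement ends in a dangling "id = ".
        System.out.println(expand("SELECT * FROM table2 WHERE id = ${" + key + "}", row));

        // Quoted placeholder: the empty value becomes '' and the statement stays complete.
        System.out.println(expand(
            "SELECT * FROM table2 WHERE id = CONVERT('${" + key + "}', UNSIGNED INTEGER)", row));
    }
}
```

Quoting the placeholder means the database always receives a string literal, even an empty one, so the statement parses whether or not the key is set. How MySQL converts the empty string is left to the database and is not modelled here.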

Re: faceting ordering

2009-10-28 Thread Avlesh Singh
curious...is it possible to have faceted results ordered by score? First, I am not sure what that means. Score of what? Documents? If yes, how do you think the same should influence faceting? Second, there are only two ways you can sort facet values on a field. More here -

Re: Faceting within one document

2009-10-28 Thread Avlesh Singh
For facets - http://wiki.apache.org/solr/SimpleFacetParameters#facet.mincount For terms - http://wiki.apache.org/solr/TermsComponent Helps? Cheers Avlesh On Wed, Oct 28, 2009 at 11:32 PM, Andrew Clegg andrew.cl...@gmail.com wrote: Hi, If I give a query that matches a single document, and

Re: Faceting within one document

2009-10-28 Thread Andrew Clegg
Isn't the TermVectorComponent more for one document at a time, and the TermsComponent for the whole index? Actually -- having done some digging... What I'm really after is the most informative terms in a given document, which should take into account global document frequency as well as term

Re: Simple problem with a nested entity and it's SQL

2009-10-28 Thread Jonathan Hendler
Thanks - that's a good question. I thought of using one single SQL statement - but the nested entity's query is actually quite complex (unlike the example). So it'd be possible, but more readable as a separate query. Further, MySQL also has some limitations on temporary tables and seems

Re: Environment Timezone being considered when using SolrJ

2009-10-28 Thread Michel Bottan
Hi Hoss, Thanks for the clarification again. Now I can see where the problem resides. My client application was formatting date fields using SimpleDateFormat and as you said, it assumes host timezone configuration. : your dateFormat object doesn't know that the 'Z' at the end of the string you
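The pitfall described here, a `SimpleDateFormat` that silently uses the host timezone, can be avoided by setting the formatter's timezone explicitly. The sketch below is a minimal illustration; the class name `UtcDateDemo` and the helper `toSolrDate` are hypothetical names, not part of SolrJ.

```java
import java.text.SimpleDateFormat;
import java.util.Date;
import java.util.Locale;
import java.util.TimeZone;

class UtcDateDemo {
    // Formats a date as an ISO-8601 UTC timestamp with a literal trailing 'Z'.
    static String toSolrDate(Date date) {
        // The quoted 'Z' is plain text in the pattern, so the formatter does not
        // know it means UTC; the timezone has to be set separately.
        SimpleDateFormat fmt = new SimpleDateFormat("yyyy-MM-dd'T'HH:mm:ss'Z'", Locale.ROOT);
        fmt.setTimeZone(TimeZone.getTimeZone("UTC"));
        return fmt.format(date);
    }

    public static void main(String[] args) {
        // The Unix epoch prints as 1970-01-01T00:00:00Z on any host.
        System.out.println(toSolrDate(new Date(0L)));
    }
}
```

Without the `setTimeZone` call, the same code would print the epoch shifted by the host's UTC offset while still appending `Z`, which matches the symptom discussed in this thread.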

Re: Plugin Performance Issues

2009-10-28 Thread entdeveloper
This is an issue we experienced a while back. We once again tried to load a custom class as a plugin jar from the lib directory and began experiencing severe memory problems again. The code in our jar wasn't being used at all...the class was only referenced in the schema. I find it strange

RE: begins with searches

2009-10-28 Thread Bernadette Houghton
Here's the "all" code snippets - <!-- catchall field, containing all other searchable text fields (implemented via copyField further on in this schema --> <field name="all" type="text" indexed="true" stored="false" multiValued="true"/> . . <!-- field for the QueryParser to use when an explicit

weird problem with letters S and T

2009-10-28 Thread Joel Nylund
(I am super new to solr, sorry if this is an easy one) Hi, I want to support an A-Z type view of my data. I have a DataImportHandler that uses sql (my query is complex, but the part that matters is: SELECT f.id, f.title, LEFT(f.title,1) as firstLetterTitle FROM Foo f I can create this

RE: weird problem with letters S and T

2009-10-28 Thread Bernadette Houghton
Hi Joel, I had a similar issue the other day; in my case the solution turned out to be that the letters were stopwords. Don't know if this is your answer, but worth checking. Bern -Original Message- From: Joel Nylund [mailto:jnyl...@yahoo.com] Sent: Thursday, 29 October 2009 9:17 AM

Re: weird problem with letters S and T

2009-10-28 Thread Joel Nylund
Thanks Bern, now that you mention it they are in there. I assume if I remove them it will work, but I probably don't want to do that, right? Is there a way for this particular query to ignore stopwords? Thanks, Joel On Oct 28, 2009, at 6:20 PM, Bernadette Houghton wrote: Hi Joel, I had a

Re: Solr 1.4 (RC) performance on multi-CPU system

2009-10-28 Thread gabriele renzi
2009/10/28 gabriele renzi rff@gmail.com: On Tue, Oct 27, 2009 at 3:41 PM, Yonik Seeley yo...@lucidimagination.com wrote: is there a reason why StreamingUpdateSolrServer does not accept a connection manager argument as CommonsHttpSolrServer does? I don't see any reason why it

Re: weird problem with letters S and T

2009-10-28 Thread Martijn v Groningen
I think that is not a problem, because you are only storing one character per field. There are other text field types that do not have the stop word filter, so give your first letter field that field type. In this way the stopword filter is only disabled for searches on the first letter

Re: weird problem with letters S and T

2009-10-28 Thread Joel Nylund
Well I tried removing those 2 letters from stopwords, didn't seem to help. I also tried changing the field type to text_ws, didn't seem to work. Any other ideas? thanks Joel On Oct 28, 2009, at 6:42 PM, Martijn v Groningen wrote: I think that is not a problem, because you are only storing

RE: weird problem with letters S and T

2009-10-28 Thread Bernadette Houghton
Hi Joel, I'm a relative beginner to solr myself. I think the s and t are probably in the stopwords list because there will be a lot of them resulting from analysing of words such as don't and person's. Whether that's (hey, another example!) an issue for you will probably depend on what

RE: weird problem with letters S and T

2009-10-28 Thread Bernadette Houghton
Joel, did you restart tomcat? Need to restart each time you change schema.xml. bern -Original Message- From: Joel Nylund [mailto:jnyl...@yahoo.com] Sent: Thursday, 29 October 2009 10:21 AM To: solr-user@lucene.apache.org Subject: Re: weird problem with letters S and T Well I tried

Re: weird problem with letters S and T

2009-10-28 Thread Dave Searle
Or just reload the app pool. No need to restart the whole server On 28 Oct 2009, at 23:23, Bernadette Houghton bernadette.hough...@deakin.edu.au wrote: Joel, did you restart tomcat? Need to restart each time you change schema.xml. bern -Original Message- From: Joel Nylund

Re: Greater-than and less-than in data import SQL queries

2009-10-28 Thread Lance Norskog
It is easier to put SQL select statements in a view, and just use that view from the DIH configuration file. On Tue, Oct 27, 2009 at 12:30 PM, Andrew Clegg andrew.cl...@gmail.com wrote: Heh, eventually I decided where 4 node_depth was the most pleasing (if slightly WTF-ish) way of writing

Re: weird problem with letters S and T

2009-10-28 Thread Avlesh Singh
Any ideas, are S and T special chars in query for solr? Nope, they are NOT. My guess is that - You are using a text type field for firstLetterTitle which has the stopword filter applied to it. - Your stopwords.txt file contains the characters s and t because of which the above
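Why a stopword filter makes the letters S and T vanish can be sketched without Solr at all. The example below is a simplified model under stated assumptions: `StopwordDemo` and `analyze` are hypothetical names, and the code is not Lucene's stop filter. It only shows that a single-letter value that is also a stopword leaves nothing behind to index or match.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Simplified model of lowercasing plus stopword removal; NOT Lucene's actual filter chain.
class StopwordDemo {
    // Lowercases each token and drops it if it appears in the stopword set.
    static List<String> analyze(List<String> tokens, Set<String> stopwords) {
        List<String> kept = new ArrayList<>();
        for (String token : tokens) {
            String lower = token.toLowerCase(Locale.ROOT);
            if (!stopwords.contains(lower)) {
                kept.add(lower);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        // Assumed stopword list containing the single letters discussed in the thread.
        Set<String> stopwords = new HashSet<>(Arrays.asList("s", "t", "the", "a"));

        System.out.println(analyze(Arrays.asList("B"), stopwords)); // [b]
        System.out.println(analyze(Arrays.asList("S"), stopwords)); // []
    }
}
```

A field type without a stopword filter, as suggested elsewhere in this thread, keeps the single letter as a term. Documents indexed before the change would still need to be reindexed, since the dropped tokens were never stored in the index.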

Re: begins with searches

2009-10-28 Thread Avlesh Singh
It sounds from what you say that I'm going to need to change the field type to edgytext. Which won't achieve the result I want, viz. the current all plus the edgytext. Any way to achieve this? I guess there is a mismatch of expectations here. A field can be analyzed in only ONE way. If your

TermVector (TF-IDF Scores) From Subset of Documents

2009-10-28 Thread peelman
I have an index of about 3 million documents, and a specific list of document ids that belong in that 3 million (somewhere around 20-50 documents on average). With my filtered list of documents I want to be able to get TF-IDF scores calculated based on only that small subset, instead of the scores
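What "TF-IDF based on only a subset" means can be made concrete with a small sketch. It assumes the textbook definition tf × ln(N / df), with N and df counted over the subset only. Lucene's own scoring formula differs, and `SubsetTfIdf` and `tfIdf` are hypothetical names rather than Solr APIs.

```java
import java.util.Arrays;
import java.util.List;

// Textbook TF-IDF over an in-memory subset of documents; NOT Lucene's similarity formula.
class SubsetTfIdf {
    // tf  = number of times the term occurs in doc
    // df  = number of subset documents containing the term
    // idf = ln(N / df), where N is the size of the subset (not of the whole index)
    static double tfIdf(String term, List<String> doc, List<List<String>> subset) {
        long tf = doc.stream().filter(term::equals).count();
        long df = subset.stream().filter(d -> d.contains(term)).count();
        if (tf == 0 || df == 0) {
            return 0.0;
        }
        return tf * Math.log((double) subset.size() / df);
    }

    public static void main(String[] args) {
        List<String> d0 = Arrays.asList("solr", "index");
        List<String> d1 = Arrays.asList("solr", "cache");
        List<List<String>> subset = Arrays.asList(d0, d1);

        System.out.println(tfIdf("solr", d0, subset));  // term in every subset doc: 0.0
        System.out.println(tfIdf("index", d0, subset)); // term in one of two docs: ln(2)
    }
}
```

A term that appears in every document of the subset scores zero here even if it is rare in the full index, which is the difference between subset-based and index-wide document frequencies that the question is about.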

Re: TermVector (TF-IDF Scores) or MoreLikeThis From Subset of Documents

2009-10-28 Thread peelman
peelman wrote: I have an index of about 3 million documents, and a specific list of document ids that belong in that 3 million (somewhere around 20-50 documents on average). With my filtered list of documents I want to be able to get TF-IDF scores or run a MoreLikeThis query against ONE

Re: weird problem with letters S and T

2009-10-28 Thread Norberto Meijome
On Wed, 28 Oct 2009 19:20:37 -0400 Joel Nylund jnyl...@yahoo.com wrote: Well I tried removing those 2 letters from stopwords, didnt seem to help, I also tried changing the field type to text_ws, didnt seem to work. Any other ideas? Hi Joel, if your stop word filter was applied on index,