Does this help?
http://wiki.apache.org/solr/DataImportHandlerFaq#I.27m_using_DataImportHandler_with_a_MySQL_database._My_table_is_huge_and_DataImportHandler_is_going_out_of_memory._Why_does_DataImportHandler_bring_everything_to_memory.3F
On Wed, Oct 28, 2009 at 12:38 AM, William Pierce
-- Forwarded message --
From: Nick Lothian nloth...@educationau.edu.au
Date: Wed, Oct 28, 2009 at 11:37 AM
Subject: Full Text Search: Solr on Cassandra
To: cassandra-u...@incubator.apache.org cassandra-u...@incubator.apache.org
Just in case anyone here is interested, I've managed
Hi all,
I just try to migrate from solr-2009-09-09.zip to today's nightly build.
I found that the *_i dynamic field's type remained int but its definition
became
fieldType name=int class=solr.TrieIntField precisionStep=0
omitNorms=true positionIncrementGap=0/
the older Solr schema lacks
Have two cores with some common fields in their schemas. I want to perform
a MLT query on one core and get results from the other schema. Both cores
have same type of id.
Having the same type of id in two different cores is of no good for a MLT
handler (which in-fact operates on one core)
On Tue, Oct 27, 2009 at 3:41 PM, Yonik Seeley
yo...@lucidimagination.com wrote:
is there a reason why StreamingUpdateSolrServer does not accept a
connection manager argument as CommonsHttpSolrServer does?
I don't see any reason why it shouldn't... can you open a JIRA issue
and optionally
Great suggestion, I took a look and it seems pretty useful. As a follow up
question, did you do anything to disable Solr caching for certain tests?
-mike
On Tue, Oct 27, 2009 at 8:14 PM, Joshua Tuberville
joshuatubervi...@eharmony.com wrote:
Mike,
For response times I would also look at
I am all for new stuff.
It would be nice to see a working example of ajax-solr before killing
completely solrjs from trunk... at least it was working .. ajax-solr has no how
to, nor any working example..
http://github.com/evolvingweb/ajax-solr
Well why not just remove the javascript folder
Actually, it wasn't quite working. It also replicated a lot of stuff
that's in ajax solr, which is being more actively developed. Hence the
removal.
On Oct 28, 2009, at 10:16 AM, Antonio Eggberg wrote:
I am all for new stuff.
It would be nice to see a working example of ajax-solr
I fully understand it was not working probably in production or on other data
sets. But it did serve a purpose for me.. i.e. show a demo to anyone out of my
box.. and I update my local repo with trunk all the time..
I could do ant reuters-start using my laptop and it would work.. my point is
Out of curiosity... Why not just make/find a quick example using ajax-
solr and use git to update from them?
On Oct 28, 2009, at 10:29 AM, Antonio Eggberg wrote:
I fully understand it was not working probably in production or on
other data sets. But it did serve a purpose for me.. i.e. show
Hi Solr masters,
I just wanted to confirm what i read is the best and optimal way of doing
spell check correction.
i am using index based spell check.
For this as the doc says
Create a field ex. my_text with a fieldType my_text_fl which has all the
filters defined.
Copy all the othe fields data
From: Yonik Seeley yo...@lucidimagination.com
To: solr-user@lucene.apache.org
Sent: Wednesday, October 28, 2009 2:11 PM
Subject: Re: ERROR:SCHEMA-INDEX-MISMATCH
I imagine that your field impl was SortableIntField, not IntField, but
checking your old schema will tell you for sure.
Yes, you
My thought now is I cannot use MLT and instead must do a query to B using
the fields from core A ID as query params. Is there big difference in what
will be returned as results using query instead of MLT?
Yes, there is definitely a difference between the results from a MLT handler
and any
Hi, here's two thing that can slow down commits:
1) Autowarming the caches.
2) The Java old generation object garbage collection.
You can try:
- Turning autowarming off (set autowarmCount=0 in the caches configuration)
- If you use the sun jvm, use -XX:+UseConcMarkSweepGC to get a less
Thanks Jerome,
1. I have shut off autowarming by setting params to 0.
2. My JVM Settings: -Xmx1200m -Xms1200m -XX:-UseGCOverheadLimit
-XX:+UseConcMarkSweepGC -XX:CMSInitiatingOccupancyFraction=50
3. I am using autocommits - every 6 ms. But the commit blocks all the
master request
Thanks for the reply --
In your multi-core set-up, I don't think you are anywhere close to using
MLT. The arrangement looks more like a search query.
Does that mean that you cannot do a 'MLT' query from one core result
to get MLT from another (even if there is some common schema between)?
Does that mean that you cannot do a 'MLT' query from one core result to get
MLT from another (even if there is some common schema between)?
You can always run MLT handlers on a core. Each MLT handler takes certain
parameters based on which similar results are fetched. You would need to
pass
I have a nested entity on a jdbc data import handler that is causing
an SQL error because the second key is either NULL (blank when
generating the sql) or non-zero INT.
The query is in the following form:
document name=content
entity name=bl_lessonfiles
Shouldn't this work too?
SELECT * FROM table2 WHERE IS NOT NULL
${table1.somethin_like_a_foreign_key} AND
${table1.somethin_like_a_foreign_key} 0 AND id =
${table1.somethin_like_a_foreign_key}
Cheers
Avlesh
On Wed, Oct 28, 2009 at 11:03 PM, Jonathan Hendler
jonathan.hend...@gmail.com wrote:
No - the SQL will fail to validate because at runtime it will look like
SELECT * FROM table2 WHERE
IS NOT NULL table1.somethin_like_a_foreign_key
AND table1.somethin_like_a_foreign_key 0
AND id =
Note the id =
On Oct 28, 2009, at 1:38 PM, Avlesh Singh wrote:
Shouldn't this work too?
Assuming this to be MySQL, will this work -
SELECT * FROM table2 WHERE id =
IF(ISNULL(${table1.somethin_like_a_foreign_key}), 0,
${table1.somethin_like_a_foreign_key});
Cheers
Avlesh
On Wed, Oct 28, 2009 at 11:12 PM, Jonathan Hendler
jonathan.hend...@gmail.com wrote:
No - the SQL will fail
Thanks - that solution still causes an error.
But it helped me think of an SQL solution like so :
CONVERT ( '${table1.somethin_like_a_foreign_key}' , UNSIGNED INTEGER )
Convert the integer or NULL to a string, then back again. (ugly but it
works)
On Oct 28, 2009, at 1:48 PM, Avlesh Singh
curious...is it possible to have faceted results ordered by score?
First, I am not sure what that means. Score of what? Documents? If yes, how
do you think the same should influence faceting?
Second, there are only two ways you can sort facet values on a field. More
here -
For facets -
http://wiki.apache.org/solr/SimpleFacetParameters#facet.mincount
For terms - http://wiki.apache.org/solr/TermsComponent
Helps?
Cheers
Avlesh
On Wed, Oct 28, 2009 at 11:32 PM, Andrew Clegg andrew.cl...@gmail.comwrote:
Hi,
If I give a query that matches a single document, and
Isn't the TermVectorComponent more for one document at a time, and the
TermsComponent for the whole index?
Actually -- having done some digging... What I'm really after is the most
informative terms in a given document, which should take into account global
document frequency as well as term
THanks - That's a good question.
I thought of using one single SQL statement - but the nested entity's
query is actually quite complex (unlike the example).
So it'd be possible, but more readable as a separate query.
Further, MySQL has some limitations also about temporary tables and
seems
Hi Hoss,
Thanks for the clarification again.
Now I can see where the problem resides. My client application was
formatting date fields using SimpleDateFormat and as you said, it assumes
host timezone configuration.
: your dateFormat object doesn't know that the 'Z' at the end of the string
you
This is an issue we experienced a while back. We once again tried to load a
custom class as a plugin jar from the lib directory and began experiencing
severe memory problems again. The code in our jar wasn't being used at
all...the class was only referenced in the schema. I find it strange
Here's the all code snippets -
!-- catchall field, containing all other searchable text fields (implemented
via copyField further on in this schema --
field name=all type=text indexed=true stored=false
multiValued=true/
.
.
!-- field for the QueryParser to use when an explicit
(I am super new to solr, sorry if this is an easy one)
Hi, I want to support an A-Z type view of my data.
I have a DataImportHandler that uses sql (my query is complex, but the
part that matters is:
SELECT f.id, f.title, LEFT(f.title,1) as firstLetterTitle FROM Foo f
I can create this
Hi Joel, I had a similar issue the other day; in my case the solution turned
out to be that the letters were stopwords. Don't know if this is your answer,
but worth checking.
Bern
-Original Message-
From: Joel Nylund [mailto:jnyl...@yahoo.com]
Sent: Thursday, 29 October 2009 9:17 AM
Thanks Bern, now that you mention it they are in there, I assume if I
remove them it will work, but I probably dont want to do that right?
Is there a way for this particular query to ignore stopwords
thanks
Joel
On Oct 28, 2009, at 6:20 PM, Bernadette Houghton wrote:
Hi Joel, I had a
2009/10/28 gabriele renzi rff@gmail.com:
On Tue, Oct 27, 2009 at 3:41 PM, Yonik Seeley
yo...@lucidimagination.com wrote:
is there a reason why StreamingUpdateSolrServer does not accept a
connection manager argument as CommonsHttpSolrServer does?
I don't see any reason why it
I think that is not a problem, because your are only storing one
character per field. There are other text field types that do not have
the stop word filter, so give your first letter field that field type.
In this way stopword filter analyser is only disabled for searches on
the first letter
Well I tried removing those 2 letters from stopwords, didnt seem to
help, I also tried changing the field type to text_ws, didnt seem to
work. Any other ideas?
thanks
Joel
On Oct 28, 2009, at 6:42 PM, Martijn v Groningen wrote:
I think that is not a problem, because your are only storing
Hi Joel, I'm a relative beginner to solr myself. I think the s and t are
probably in the stopwords list because there will be a lot of them resulting
from analysing of words such as don't and person's. Whether that's (hey,
another example!) an issue for you will probably depend on what
Joel, did you restart tomcat? Need to restart each time you change schema.xml.
bern
-Original Message-
From: Joel Nylund [mailto:jnyl...@yahoo.com]
Sent: Thursday, 29 October 2009 10:21 AM
To: solr-user@lucene.apache.org
Subject: Re: weird problem with letters S and T
Well I tried
Or just reload the app pool. No need to restart the whole server
On 28 Oct 2009, at 23:23, Bernadette Houghton
bernadette.hough...@deakin.edu.au
wrote:
Joel, did you restart tomcat? Need to restart each time you change
schema.xml.
bern
-Original Message-
From: Joel Nylund
It is easier to put SQL select statements in a view, and just use that
view from the DIH configuration file.
On Tue, Oct 27, 2009 at 12:30 PM, Andrew Clegg andrew.cl...@gmail.com wrote:
Heh, eventually I decided
where 4 node_depth
was the most pleasing (if slightly WTF-ish) way of writing
Any ideas, are S and T special chars in query for solr?
Nope, they are NOT. My guess is that
- You are using a text type field for firstLetterTitle which has the
stopword filter applied to it.
- Your stopwords.txt file contains the characters s and t because
of which the above
It sounds from what you say that I'm going to need to change the field type
to edgytext. Which won't achieve the result I want, viz. the current all
plus the edgytext. Any way to achieve this?
I guess there is a mismatch of expectations here. A field can be analyzed in
only ONE way. If your
I have an index of about 3 million documents, and specific list of document
ids that belong in that 3 million (somewhere around 20-50 documents on
average). With my filtered list of documents I want to be able to get
TF-IDF scores calculated based on only that small subset, instead of the
scores
peelman wrote:
I have an index of about 3 million documents, and specific list of
document ids that belong in that 3 million (somewhere around 20-50
documents on average). With my filtered list of documents I want to be
able to get TF-IDF scores or run a MoreLikeThis query against ONE
On Wed, 28 Oct 2009 19:20:37 -0400
Joel Nylund jnyl...@yahoo.com wrote:
Well I tried removing those 2 letters from stopwords, didnt seem to
help, I also tried changing the field type to text_ws, didnt seem to
work. Any other ideas?
Hi Joel,
if your stop word filter was applied on index,
44 matches
Mail list logo