Thanks for the links Otis,
I will have a look.
Regards
Ericz
On Tue, Apr 26, 2011 at 10:06 PM, Otis Gospodnetic
otis_gospodne...@yahoo.com wrote:
Hi Eric,
Before using the terms component, allow me to point out:
* http://sematext.com/products/autocomplete/index.html (used on
Hi Otis,
The new Solr 3.1 Suggester also does not support filter queries.
Is anyone using shingles with faceting on large data?
Regards
Ericz
On Tue, Apr 26, 2011 at 10:06 PM, Otis Gospodnetic
otis_gospodne...@yahoo.com wrote:
Hi Eric,
Before using the terms component, allow me to point
I thought it was something simple. Here is my configuration:
<fieldType name="searchType" class="solr.TextField"
positionIncrementGap="100">
<analyzer>
<charFilter class="solr.HTMLStripCharFilterFactory"/>
<tokenizer class="solr.WhitespaceTokenizerFactory"/>
<filter
Lucene/solr's length normalization is not discriminative for very short
documents.
See Jay's excellent explanation for more details.
http://search-lucene.com/m/uGKRu1pvRjw/
- Original Message -
From: roySolr royrutten1...@gmail.com
To: solr-user@lucene.apache.org
Cc:
Sent:
Erick,
Thank you. I could fix the problem. Started from scratch considering your
advice and been successful. Thanks a lot.
Rajani Maski
On Tue, Apr 26, 2011 at 5:28 PM, Erick Erickson erickerick...@gmail.com wrote:
Sorry, but there's too much here to debug remotely. I strongly advise you
Hmmm, I don't quite get how this would work. There is really no association
between terms from one field and terms from another. As I read your message,
you have a PM from user1 with a term in some field of interest, call it
suggestion. A PM from user2 also has the term suggestion. Now you want
(2) isn't viable. Updating a multiValued field is the same as any other field, a
delete followed by an add of the entire document.
(1) could work. The problem here is how you need to search. If you need
to search your notes it would be separate from the document. In other
words, you
There's nothing special you need to do to be able to view the various
stats from admin/stats.jsp. If another look doesn't show them, could you
post a screenshot?
And please include the version of Solr you're using; I checked with 1.4.1.
Best
Erick
On Wed, Apr 27, 2011 at 1:44 AM, Solr Beginner
Query related to solr synonymfilterfactory. I am using Solr 1.4.1.
I have datatype field textSynonym
<fieldType name="textSynonym" class="solr.TextField"
positionIncrementGap="100">
<analyzer>
<filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt"
ignoreCase="true" expand="true"/>
Thanks!! It's clear now, sometimes the lengthNorm is the same. See the table
below:
# of terms    lengthNorm
1             1.0
2             .625
3             .5
4             .5
5             .4375
6             .375
7             .375
8             .3125
9             .3125
10            .3125
Is it
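The repeated values in the table are what you would expect if the norm, 1/sqrt(number of terms), is squeezed into a single byte with a 3-bit mantissa. The sketch below is a Python port, written from memory, of the one-byte float encoding that Lucene 3.x is understood to use for norms; treat the function names and the encoding details as assumptions rather than the actual Lucene source.

```python
import math
import struct

ZERO_POINT = (63 - 15) << 3  # offset for 3 mantissa bits, zero exponent 15


def float_to_byte315(f):
    """Encode a float into one byte (0..255), keeping 3 mantissa bits."""
    bits = struct.unpack(">i", struct.pack(">f", f))[0]  # float32 bit pattern
    small = bits >> (24 - 3)
    if small <= ZERO_POINT:
        return 0 if bits <= 0 else 1  # underflow: zero or smallest value
    if small >= ZERO_POINT + 0x100:
        return 255  # overflow: largest value
    return small - ZERO_POINT


def byte315_to_float(b):
    """Decode a byte produced by float_to_byte315 back into a float."""
    if b == 0:
        return 0.0
    bits = ((b & 0xFF) << (24 - 3)) + ((63 - 15) << 24)
    return struct.unpack(">f", struct.pack(">i", bits))[0]


def stored_length_norm(num_terms):
    """Length norm after the lossy one-byte round trip."""
    return byte315_to_float(float_to_byte315(1.0 / math.sqrt(num_terms)))


for n in range(1, 11):
    print(n, stored_length_norm(n))
```

With only three mantissa bits, nearby values such as 1/sqrt(3) and 1/sqrt(4) collapse to the same stored norm, which is why short fields of slightly different lengths can score identically.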
Solr version:
Solr Specification Version: 3.1.0
Solr Implementation Version: 3.1.0 1085815 - grantingersoll -
2011-03-26 18:00:07
Lucene Specification Version: 3.1.0
Lucene Implementation Version: 3.1.0 1085809 - 2011-03-26 18:06:58
Current Time: Wed Apr 27 14:28:34 CEST 2011
Server Start
thanks stefan
I did the same and the DataImportHandler error went away; then I got an slf4j
error, did the same again, and that error went away too.
Now I get a requestHandler error, so I still have some configuration problem.
I will try to fix it.
--
View this message in context:
There's the limited join patch, see:
https://issues.apache.org/jira/browse/SOLR-2272
that hasn't been applied yet
Correction: Yonik committed this feature in r1096978.
Hi everyone, I'm programming a filter system for a listing. Each filter
operates on an object that generates the query to solr at the end of
execution. After executing the query, it is returned to each filter to
display (or not) the results of the facets. Some of these filters are
multiple
Have a look:
http://wiki.apache.org/solr/SimpleFacetParameters#Multi-Select_Faceting_and_LocalParams
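A rough sketch of what that wiki section describes, in case it helps: each selected filter is sent as a tagged fq, and the facet on the same field excludes that tag so its counts are not narrowed by its own selection. The field names (color, brand) and the helper name are made up for illustration; only the {!tag=...} and {!ex=...} local params come from the wiki page.

```python
from urllib.parse import urlencode


def build_params(query, selected, facet_fields):
    """Return Solr request parameters for multi-select faceting.

    selected maps a field name to the values the user ticked;
    facet_fields lists every field that should show facet counts.
    """
    params = [("q", query), ("facet", "true")]
    for field, values in selected.items():
        clause = " OR ".join('"%s"' % v for v in values)
        # Tag the filter so a facet on the same field can exclude it.
        params.append(("fq", "{!tag=%s}%s:(%s)" % (field, field, clause)))
    for field in facet_fields:
        # Exclude the field's own filter when counting its facet values.
        prefix = "{!ex=%s}" % field if field in selected else ""
        params.append(("facet.field", prefix + field))
    return params


params = build_params("*:*", {"color": ["red", "blue"]}, ["color", "brand"])
print(urlencode(params))
```

Each filter object in your system could contribute its own fq and facet.field pair this way, so the query builder does not need to know which filters are multi-select.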
On Wed, Apr 27, 2011 at 4:10 PM, enbrol nicos@gmail.com wrote:
Hi everyone, I'm programming a filter system for a listing. Each filter
operates on an object that generates the query to solr
Thanks man... That did it.
On 4/26/11 6:00 PM, Koji Sekiguchi k...@r.email.ne.jp wrote:
(11/04/27 7:35), Alejandro Delgadillo wrote:
Hi,
I've been using solr with Coldfusion9, I've made a couple of adjustments to
it in order to fulfill the needs of my client, I'm using solr as a document
Could you try to escape white spaces like this:
Hind\ claw
Ludovic.
-
Jouve
France.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Facing-problem-with-white-space-in-synonyms-tp2870193p2870552.html
Sent from the Solr - User mailing list archive at Nabble.com.
What do you mean by segments-number?
-
--- System
One Server, 12 GB RAM, 2 Solr Instances, 7 Cores,
1 Core with 31 Million Documents other Cores 100.000
- Solr1 for Search-Requests - commit every Minute - 5GB Xmx
-
All,
We have created index with CommonGrams and the final size is around 370GB.
Everything is working fine but now when we add more documents into index it
takes forever (almost 12 hours)...seems to change all the segments file in a
commit.
The same commit used to take few mins with normal
OK, thanks.
On Wed, Apr 27, 2011 at 9:29 AM, Steven A Rowe sar...@syr.edu wrote:
There's the limited join patch, see:
https://issues.apache.org/jira/browse/SOLR-2272
that hasn't been applied yet
Correction: Yonik committed this feature in r1096978.
Are you by any chance optimizing?
Best
Erick
On Wed, Apr 27, 2011 at 11:04 AM, Salman Akram
salman.ak...@northbaysolutions.net wrote:
All,
We have created index with CommonGrams and the final size is around 370GB.
Everything is working fine but now when we add more documents into index it
No way. It just does this while committing.
Also before this when we merged multiple small indexes without optimization
- as it was done in past it again took around 12 hours and made around 20
CFS files (it never happened before)
On Wed, Apr 27, 2011 at 8:21 PM, Erick Erickson
You can turn off norms for the field. It doesn't make any sense to talk
about changing the length norm. The length norm is based on the size
of the field for the particular document, to implement the TF/IDF style
relevance algorithm. But you can turn off norms for the field if you
don't
Hi All,
I am using ClientUtil.escapeQueryChars function to escape query characters,
before I make call to Solr (using Solr 1.3) server through Solrj client in Java.
The issue is any query with space, is getting escaped. For example chicken
tikka is escaped to chicken\ tikka. When the escaped
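To illustrate the behaviour described above, here is a small Python sketch of an escaper that also escapes whitespace, next to one possible alternative that escapes each whitespace-separated term on its own and leaves the spaces between terms alone. The set of special characters is an assumption for illustration and has not been checked against the SolrJ 1.3 source; both function names are hypothetical.

```python
# Characters treated as query syntax here; an assumed list, not SolrJ's.
SPECIAL_CHARS = set('\\+-!():^[]"{}~*?|&;')


def escape_query_chars(text, escape_whitespace=True):
    """Backslash-escape special characters, optionally whitespace too."""
    out = []
    for ch in text:
        if ch in SPECIAL_CHARS or (escape_whitespace and ch.isspace()):
            out.append("\\")
        out.append(ch)
    return "".join(out)


def escape_terms(text):
    """Escape each term separately so spaces still separate terms."""
    return " ".join(escape_query_chars(term) for term in text.split())


print(escape_query_chars("chicken tikka"))
print(escape_terms("chicken tikka"))
```

Whether the second form is what you want depends on the query parser: an escaped space makes the whole input one term, while unescaped spaces leave it as separate terms.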
Hi Salman,
Sounds like somehow you are triggering merges or optimizes. What is your
mergeFactor?
Have you turned on the IndexWriter log?
In solrconfig.xml
<infoStream file="${solr.indexwriter.log.dir}">true</infoStream>
In our case we feed the directory name as a Java property in our java
The nature of my project is such that search is needed and specifically
search across related entities. We want to perform several queries involving
a correlation between two or more properties of a given entity in a
collection.
To put things in context, here is a snippet of the domain:
Student
Yes I did, but that's cool because it is useful to make the final determination
explicit here on the group for the benefit of other users. :)
Thanks
Robi
-Original Message-
From: Erick Erickson [mailto:erickerick...@gmail.com]
Sent: Tuesday, April 26, 2011 5:10 PM
To:
Thanks for the response. We got it resolved! .
We made small indexes in bulk using SOLR with Standard File Format and then
merged it with a Lucene app which for some reason made it CFS. Now when we
started adding real time documents using SOLR (with Compound File Format set
to false) it was
There is no great way.
One approach would be to 'de-normalize' at index time, to actually have
a field that looks like this:
institution_year: 2010.OHIO_ST ; 2007.YALE
Then, with some code on client side, you could more easily facet and
search how you want. It still doesn't (I don't
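A minimal sketch of that de-normalization, assuming each student record carries (institution, year) pairs: the indexing side joins them into single tokens, and the client side splits facet values back apart. The function names and the sample record are hypothetical.

```python
def institution_year_tokens(degrees):
    """Join each (institution, year) pair into one 'YEAR.INSTITUTION' token."""
    return ["%d.%s" % (year, institution) for institution, year in degrees]


def split_token(token):
    """Split a 'YEAR.INSTITUTION' token back into (year, institution)."""
    year, institution = token.split(".", 1)
    return int(year), institution


# Hypothetical student record, indexed with one multi-valued field.
doc = {
    "id": "student-1",
    "institution_year": institution_year_tokens(
        [("OHIO_ST", 2010), ("YALE", 2007)]
    ),
}
print(doc["institution_year"])
```

Because the pair lives in one token, a match on 2010.OHIO_ST cannot accidentally combine the year from one degree with the institution from another, which is the correlation problem with two separate multi-valued fields.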
Hi,
you might want to look at the SIREn plugin [1,2], which allows you to
index and query 1:N relationships such as yours, in a tabular data
format [3].
[1] http://siren.sindice.com/
[2] https://github.com/rdelbru/SIREn
[3]
Thanks Jonathan. We thought this through and have done quite a bit of digging
on this and saw there was no easy way around this. It was our last attempt
to post to the forum and see if there was some killer feature we were
missing somehow
We thought of the concatenated fields route, but quickly
Renaud,
Can you provide a brief synopsis of how your system works?
Jason
On Wed, Apr 27, 2011 at 11:17 AM, Renaud Delbru renaud.del...@deri.org wrote:
Hi,
you might want to look at the SIREn plugin [1,2], which allows you to index
and query 1:N relationships such as yours, in a tabular data
Thanks Renaud - I'll look into that asap.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Search-across-related-correlated-multivalue-fields-in-Solr-tp2871176p2871395.html
Sent from the Solr - User mailing list archive at Nabble.com.
Hi Jason,
On 27/04/11 19:25, Jason Rutherglen wrote:
Renaud,
Can you provide a brief synopsis of how your system works?
SIREn provides a new field type for Solr. In this particular SIREn
field, the data is not a piece of text, but is organised in a table.
Then, SIREn provides query objects
On 27/04/11 19:37, Renaud Delbru wrote:
Hi Jason,
On 27/04/11 19:25, Jason Rutherglen wrote:
Renaud,
Can you provide a brief synopsis of how your system works?
SIREn provides a new field type for Solr. In this particular SIREn
field, the data is not a piece of text, but is organised in a
Hi Salman,
We had a similar problem with the IndexMergeTool in Lucene contrib.
I seem to remember having to hack the IndexMergeTool code so that it wouldn't
create the CFF automatically.
Let me know if you need it and I'll dig up the modified code.
Tom.
-Original Message-
From:
This kind of thing is really easy in an XML database. That is an XPath
expression, not even a search.
MarkLogic implements it with search engine technology, but you don't have to
care about that.
wunder
Walter Underwood, MarkLogic
On Apr 27, 2011, at 11:43 AM, Renaud Delbru wrote:
On
Hi!
Thanks for the reply.
We decided to give another try with ngrams. After much tweaking/tuning for
our needs. Both the size and speed were more than good enough for our needs.
So it looks like ngrams was the solution for us after all :)
Best regards
Tor Henning Ueland
Hello All,
I'm having a very strange problem that I just can't figure out. The
slave is not able to replicate from the master, even though the master
is reachable from the slave machine. I can telnet to the port it's
running on, I can use text based browsers to navigate the master from
the
On 27/04/11 19:50, Walter Underwood wrote:
This kind of thing is really easy in an XML database. That is an XPath
expression, not even a search.
Indeed, in fact SIREn is based on an XML IR technique, i.e., a simplified
node-based indexing scheme.
--
Renaud Delbru
Thank you very much, that's exactly what I want to do. But now I have a new
problem. The server has Solr 1.4.1 installed, so we can't use a
tag in the main query. Is there any way to use the fq without caching it?
The server will be upgraded in July/August, so this is just a temporary
Hi Guys,
We have an index with more than 3 million documents, and we use the pagination
feature through the SolrQuery#setStart and SolrQuery#setRows
methods. Some queries can return a huge number of documents and I'm worried
about the integer parameter of the setStart method; this parameter
should be a
Is it possible to change the lengthNorm?
Yes, you can customize it and plug it into Solr. DefaultSimilarity and
SweetSpotSimilarity can be a starting point.
http://lucene.apache.org/java/3_0_3/api/core/org/apache/lucene/search/Similarity.html#lengthNorm%28java.lang.String,%20int%29
Hi Charles,
Yes, the threads I was referring to are in the context of the client/indexer,
so one of the params for StreamingUpdateSolrServer.
post.sh/jar are just there because they are handy. Don't use them for
production.
It's impossible to tell how long indexing of 100M documents may
In Solr's schema.xml you can use omitNorms=true to turn norms off on
field-by-field basis.
Otis
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
Lucene ecosystem search :: http://search-lucene.com/
- Original Message
From: Jonathan Rochkind rochk...@jhu.edu
To:
Yes, it is possible to implement your own Lucene Similarity in which you can
override the length norm.
Otis
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
Lucene ecosystem search :: http://search-lucene.com/
- Original Message
From: roySolr royrutten1...@gmail.com
Well, the java native int format is 32 bits, so unless you're returning
over 2 billion documents, you should be OK. But you'll run into other issues
long before you get to that range.
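For a sense of scale, here is the arithmetic as a small Python sketch: the largest value a signed 32-bit int can hold, compared with the start offset of the last page of a 3-million-document result. The helper name and the page size of 10 are assumptions for illustration.

```python
INT_MAX = 2**31 - 1  # largest signed 32-bit value: 2147483647


def page_start(page, rows):
    """Return the start offset of a zero-based page, or fail if it cannot fit."""
    start = page * rows
    if start > INT_MAX:
        raise OverflowError("start offset does not fit in a 32-bit int")
    return start


total_docs = 3000000
rows = 10
last_page = (total_docs - 1) // rows  # zero-based index of the final page
print(page_start(last_page, rows), "of", INT_MAX)
```

The deepest possible offset for 3 million documents is about three orders of magnitude below the limit, so the int parameter is not the constraint to worry about.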
Best
Erick
On Wed, Apr 27, 2011 at 5:25 PM, Leonardo Souza leonardo...@gmail.com wrote:
Hi Guys,
We have an
Hi All,
Just a quick query of no particular importance to me, but we did observe this
problem:
http://code.google.com/p/solr-geonames/wiki/DeveloperInstall
It's worth noting that the build has also been run on Mac and Solaris now,
and the Solr index is about half the size. We suspect the
Thank you for your response. I did not make the StreamingUpdate application
yet, but I did change the other settings that you mentioned. It gave me a huge
boost in indexing speed. (I am still using post.sh but hope to change that
soon).
One thing I noticed is the indexing speed was incredibly
Requesting help for someone way outside of his comfort zone. :)
I'm trying to use solr to index several hundred OpenDocument files. I
downloaded and installed the example site and got it to work on the same files.
I modified post.sh to change the mime type to vnd.oasis.opendocument.text (and
Hi Derek,
Simple Post Tool is only for posting xml docs. If you want to index
OpenDocument files then you have to use ExtractingRequestHandler (AKA solr
cell).
-
Thanx:
Grijesh
www.gettinhahead.co.in