.
-
Jouve
France.
--
View this message in context:
http://lucene.472066.n3.nabble.com/correct-XPATH-syntax-tp3951804p3959397.html
Sent from the Solr - User mailing list archive at Nabble.com.
--
Lance Norskog
goks...@gmail.com
- User mailing list archive at Nabble.com.
--
Lance Norskog
goks...@gmail.com
We did it at my last job. Took a few days to split a 500mdoc index.
On Sat, May 5, 2012 at 9:55 AM, Erick Erickson erickerick...@gmail.com wrote:
Oh, isn't that easier! Need more coffee before suggesting things..
Thanks,
Erick
On Fri, May 4, 2012 at 8:16 PM, Lance Norskog goks...@gmail.com
is:
field name=bt_rni_NameHRK_encodedName type=text_ws indexed=true
stored=true multiValued=false /
--
Lance Norskog
goks...@gmail.com
.472066.n3.nabble.com/problem-with-date-searching-tp3961761p3961833.html
Sent from the Solr - User mailing list archive at Nabble.com.
--
Regards,
Dmitry Kan
--
Lance Norskog
goks...@gmail.com
but the first 4 ones works well and they didn't
get the error. Does anybody know how to solve it? Thanks
Emma
--
Lance Norskog
goks...@gmail.com
-tp3971101.html
Sent from the Solr - User mailing list archive at Nabble.com.
--
Lance Norskog
goks...@gmail.com
think)
Thank you.
--
Lance Norskog
goks...@gmail.com
solution architect
Cominvent AS - www.cominvent.com
Solr Training - www.solrtraining.com
--
Lance Norskog
goks...@gmail.com
it be possible to index an xml file as well or
do we need to use java -jar post.jar file.xml? Or let me put it this way,
how is post.jar different than curl?
Regards,
--
Lance Norskog
goks...@gmail.com
) of the index.
Thanks,
Shawn
--
Lance Norskog
goks...@gmail.com
.html
Sent from the Solr - User mailing list archive at Nabble.com.
--
Lance Norskog
goks...@gmail.com
.n3.nabble.com/Solr-query-issues-tp3974922p3975398.html
Sent from the Solr - User mailing list archive at Nabble.com.
--
Lance Norskog
goks...@gmail.com
this message in context:
http://lucene.472066.n3.nabble.com/Newbie-Tries-to-make-a-Schema-xml-tp3974200.html
Sent from the Solr - User mailing list archive at Nabble.com.
--
Lance Norskog
goks...@gmail.com
byBRreplying to the message and delete the original
message and any copies immediately thereafter.BR BR Thank you.~BR
**BR
FAFLDBR
PRE
--
Lance Norskog
goks...@gmail.com
-tp3983579.html
Sent from the Solr - User mailing list archive at Nabble.com.
--
Lance Norskog
goks...@gmail.com
--
Regards,
Dmitry Kan
--
Lance Norskog
goks...@gmail.com
This is my json variant of solr/example/exampledocs/post.sh. It takes
an url as the first parameter.
#!/bin/sh
# Licensed to the Apache Software Foundation (ASF) under one or more
# contributor license agreements. See the NOTICE file distributed with
# this work for additional information
that this language has accents, apostrophes and comas.
I will be glad to have a pointer or answer.
Thanks,
Tom
--
View this message in context:
http://lucene.472066.n3.nabble.com/Quering-Solr-tp3983945.html
Sent from the Solr - User mailing list archive at Nabble.com.
--
Lance
the facet, and give an approximate result much quicker.
Is there any way to facet only a random sample of the results?
Thanks
Yuval
--
Lance Norskog
goks...@gmail.com
at the same time?
Thanks a lot for your help
Sergio
--
Lance Norskog
goks...@gmail.com
/biblio_view.php?bibid=26913tab=opac
*Author(s):* विनोद कॠमार मिशॠर MISHRA (VK) *
Material:* Books
How do I go about solving this language problem.
Thanks in advace.
K. P. Sanjailal
--
--
Lance Norskog
goks...@gmail.com
to configure the solr on tomcat in eclipse ,can u plz help me
out ,how to do this thing
eagerly waiting for ur reply...
On 5/20/12, Lance Norskog goks...@gmail.com wrote:
One DataImportHandler requestHandler entry is a single-threaded job.
Always.
You can make two requestHandler entries
and Regards
Rahul A. Warawdekar
--
Lance Norskog
goks...@gmail.com
www.lucidimagination.com/search and www.search-lucene.com are search
indexes for all things Lucene Solr. You can hunt for this kind of
problem there.
On Sun, May 20, 2012 at 4:16 PM, Lance Norskog goks...@gmail.com wrote:
Please file this as a JIRA. Also, is it possible to test this on the Solr
this message in context:
http://lucene.472066.n3.nabble.com/Must-match-and-terms-with-only-one-letter-tp3984139.html
Sent from the Solr - User mailing list archive at Nabble.com.
--
Walter Underwood
wun...@wunderwood.org
--
Lance Norskog
goks...@gmail.com
the following characters.
à ¤¸à ¥Åà ¤° à ¤Šà ¤°à ¥8dà ¤Åà ¤¾ Saur oorja
Please suggest what I have to do to solve this issue.
Regards,
Sanjailal KP
--
On Sun, May 20, 2012 at 6:59 AM, Lance Norskog goks...@gmail.com wrote:
Also, try saving data from a query into a file and verify
quicker.
Is there any way to facet only a random sample of the results?
Thanks
Yuval
--
Lance Norskog
goks...@gmail.com
an post that and/or include it in your sample XML
| file...
|
| Best
| Erick
|
|
| On Fri, Nov 2, 2012 at 10:02 AM, Dotan Cohen dotanco...@gmail.com
| wrote:
|
| On Thu, Nov 1, 2012 at 9:28 PM, Lance Norskog goks...@gmail.com
| wrote:
| Have you uploaded data with that field populated? Solr
LucidFind is a searchable archive of Solr documentation and email lists:
http://find.searchhub.org/?q=solrcloud
- Original Message -
| From: Jack Krupansky j...@basetechnology.com
| To: solr-user@lucene.apache.org
| Sent: Monday, November 5, 2012 4:44:46 AM
| Subject: Re: Where to get
The question you meant to ask is: Does MoreLikeThis support Distributed
Search? and the answer apparently is no. This is the issue to get it working:
https://issues.apache.org/jira/browse/SOLR-788
(Distributed Search is independent of SolrCloud.) If you want to make unit
tests, that would
You can debug this with the 'Analysis' page in the Solr UI. You pick
'text_general' and then give words with umlauts in the text box for indexing
and queries.
Lance
- Original Message -
| From: Daniel Brügge daniel.brue...@googlemail.com
| To: solr-user@lucene.apache.org
| Sent:
LucidFind collects several sources of information in one searchable archive:
http://find.searchhub.org/?q=sort=#%2Fp%3Asolr
- Original Message -
| From: Dmitry Kan dmitry@gmail.com
| To: solr-user@lucene.apache.org
| Sent: Sunday, November 11, 2012 2:24:21 AM
| Subject: Re: More
I think this means the pattern did not match any files:
str name=Total Rows Fetched0/str
The wiki example includes a '^' at the beginning of the filename pattern. This
matches a complete line.
http://wiki.apache.org/solr/DataImportHandler#Transformers_Example
More:
Add rootEntity=true. It
| dataSource=null
I think this should not be here. The datasource should default to the
dataSource listing. And 'rootEntity=true' should be in the
XPathEntityProcessor block, because you are adding each file as one document.
- Original Message -
| From: Spadez
sagarzond- you are trying to embed a recommendation system into search.
Recommendations are inherently a matrix problem, where Solr and other search
engines are one-dimensional databases. What you have is a sparse user-product
matrix. This book has a good explanation of recommender systems:
You don't need the transformers.
I think the paths should be what is in the XML file.
forEach=/add
And the paths need to use the syntax for name=fname and name=number. I
think this is it, but you should make sure.
xpath=/add/doc/field[@name='fname']
xpath=/add/doc/field[@name='number']
Look
- http://sematext.com/spm/index.html
| Search Analytics - http://sematext.com/search-analytics/index.html
|
|
|
|
| On Sat, Nov 24, 2012 at 9:30 PM, Lance Norskog goks...@gmail.com
| wrote:
|
| sagarzond- you are trying to embed a recommendation system into
| search.
| Recommendations
Maybe these are text encoding markers?
- Original Message -
| From: Eva Lacy e...@lacy.ie
| To: solr-user@lucene.apache.org
| Sent: Thursday, November 29, 2012 3:53:07 AM
| Subject: Re: Downloading files from the solr replication Handler
|
| I tried downloading them with my browser and
.nabble.com/Modeling-openinghours-using-multipoints-tp4025336p4025454.html
Sent from the Solr - User mailing list archive at Nabble.com.
--
Lance Norskog
goks...@gmail.com
to build, save, and query the bitmap
whereas working on top of existing functionality seems to me a lot more
maintainable on the user's part.
~ David
From: Lance Norskog-2 [via Lucene] [ml-node+s472066n4025579...@n3.nabble.com]
Sent: Sunday, December 09, 2012 6:35 PM
Do you use rounding in your dates? You can index a date rounded to the
nearest minute, N minutes, hour or day. This way a range query has to
look at such a small number of terms that you may not need to tune the
precision step. Hunt for NOW/DAY or 5DAYS in the queries.
To be clear: 1) is fine. Lucene index updates are carefully sequenced so
that the index is never in a bogus state. All data files are written and
flushed to disk, then the segments.* files are written that match the
data files. You can capture the files with a set of hard links to create
a
The only sure way to get the last searchable document is to use a
timestamp or sequence number in the document. I do not think that using
a timestamp with default=NOW will give a unique timestamp, so you need
your own sequence number.
On 12/19/2012 10:17 PM, Joe wrote:
I'm using SOLR 4 for
Please start a new thread.
Thanks!
On 12/22/2012 11:03 AM, J Mohamed Zahoor wrote:
Hi
I have a word completion requirement where i need to pick result from two
indexed fields.
The trick is i need to pick top 5 results from each field and display as
suggestions.
If i set fq as field1:XXX
?
On Sunday, December 23, 2012, Lance Norskog wrote:
Please start a new thread.
Thanks!
On 12/22/2012 11:03 AM, J Mohamed Zahoor wrote:
Hi
I have a word completion requirement where i need to pick result from two
indexed fields.
The trick is i need to pick top 5 results from each field
Maybe you could write a Javascript snippet that downloads and runs your
external file?
On 12/26/2012 09:12 AM, Dyer, James wrote:
I'm not very familiar with using scipting langauges with Java, but having seen the
DIH code for this, my guess is that all script code needs to be in the script
/
Cool!
On 12/25/2012 08:03 AM, Robert Muir wrote:
25 December 2012, Apache Solr™ 3.6.2 available
The Lucene PMC and Santa Claus are pleased to announce the release of
Apache Solr 3.6.2.
Solr is the popular, blazing fast open source enterprise search
platform from the Apache Lucene project. Its
A Solr facet query does a boolean query, caches the Lucene facet data
structure, and uses it as a Lucene filter. After that until you do a
full commit, using the same fq=string (you must match the string
exactly) fetches the cached data structure and uses it again as a Lucene
filter.
Have
3 problems:
a- he wanted to read it locally.
b- crawling the open web is imperfect.
c- /browse needs to get at the files with the same URL as the uploader.
a and b- Try downloading the whole thing with 'wget'. It has a 'make
links point to the downloaded files' option. Wget is great.
I have
Indexes will not work. I have not heard of an index upgrader. If you run
your 3.6 and new 4.0 Solr at the same time, you can upload all the data
with a DataImportHandler script using the SolrEntityProcessor.
How large are your indexes? 4.1 indexes will not match 4.0, so you will
have to
Also, searching can be much faster if you put all of the shards on one
machine, and the search distributor. That way, you search with multiple
simultaneous threads inside one machine. I've seen this make searches
several times faster.
On 01/03/2013 06:36 AM, Jack Krupansky wrote:
Ah... the
Please start new mail threads for new questions. This makes it much
easier to research old mail threads. Old mail is often the only
documentation for some problems.
On 01/02/2013 10:04 AM, Benjamin, Roy wrote:
Will the existing 3.6 indexes work with 4.0 binary ?
Will 3.6 solrJ clients work
What does group.query do? How is it different from q= and fq= ?
Thanks.
At this scale, your indexing job is prone to break in various ways.
If you want this to be reliable, it should be able to restart in the
middle of an upload, rather than starting over.
On 01/08/2013 10:19 PM, vijeshnair wrote:
Yes Shawn, the batchSize is -1 only and I also have the
This example may be out of date, if the RSS feeds from Slashdot have
changed. If you know XML and XPaths, try this:
Find an rss feed from somewhere that works. Compare the xpaths in it
v.s. the xpaths in the DIH script.
On 01/13/2013 07:38 PM, bibhor wrote:
Hi
I am trying to use the RSS
Will a field have different names in different languages? There is no
facility for 'aliases' for field name. Erick is right, this sounds like
you need query and update components to implement this. Also, you might
try using URL-encoding for the field names. This would save my sanity.
On
Try all of the links under the collection name in the lower left-hand
columns. There several administration monitoring tools you may find useful.
On 01/14/2013 11:45 AM, hassancrowdc wrote:
ok stats are changing, so the data is indexed. But how can i do query with
this data, or ow can i search
For this second report, it's easy: switching from a single query server
to a sharded query is going to be slower. Virtual machines add jitter to
the performance and response time of the front-end vs the query shards.
Distributed search does 2 round-trips for each sharded query. Add these
all
It is possible to do this with IP Multicast. The query goes out on the
multicast and all query servers read it. The servers wait for a random
amount of time, then transmit the answer. Here's the trick: it's
multicast. All of the query servers listen to each other's responses,
and drop out when
Thanks, Kai!
About removing non-nouns: the OpenNLP patch includes two simple
TokenFilters for manipulating terms with payloads. The
FilterPayloadFilter lets you keep or remove terms with given payloads.
In the demo schema.xml, there is an example type that keeps only
nounsverbs.
There is a
A side problem here is text analyzers: the analyzers have changed how
they split apart text for searching, and are matched pairs. That is, the
analyzer queries are created matching what the analyzer did when
indexing. If you do this binary upgrade sequence, the indexed data will
not match what
I don't have the source handy. I believe that SolrCloud hard-codes 'id'
as the field name for defining shards.
On 02/04/2013 10:19 AM, Shawn Heisey wrote:
On 2/4/2013 10:58 AM, Lance Norskog wrote:
A side problem here is text analyzers: the analyzers have changed how
they split apart text
Lucene and Solr have an aggressive upgrade schedule.From 3 to 4 got a
major rewiring,
and parts are orders of magnitude faster and smaller.
If you code using Lucene, you will never upgrade to newer versions.
(I supported SolrLucene customers for 3 years, and nobody ever did.)
Cheers,
Lance
I
Do you use replication instead, or do you just have one instance?
On 02/25/2013 07:55 PM, Otis Gospodnetic wrote:
Hi,
Quick poll to see what % of Solr users use SolrCloud vs. Master-slave setup:
http://blog.sematext.com/2013/02/25/poll-solr-cloud-or-not/
I have to say I'm surprised with the
Yes, the SolrEntityProcessor can be used for this.
If you stored the original document bodies in the Solr index!
You can also download the documents in Json or CSV format and re-upload
those to old Solr. I don't know if CSV will work for your docs. If CSV
works, you can directly upload what
Thank you (and Hoss)! I have found this concept elusive, and you two
have nailed it. I will be able to understand it for the 5 minutes I will
need to code with it.
Lance
On 03/09/2013 10:57 AM, David Smiley (@MITRE.org) wrote:
Just finished:
-tp3227720p3227720.html
Sent from the Solr - User mailing list archive at Nabble.com.
--
Lance Norskog
goks...@gmail.com
and the solrconfig file.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Problem-using-stop-words-tp3274598p3280319.html
Sent from the Solr - User mailing list archive at Nabble.com.
--
Lance Norskog
goks...@gmail.com
TieredMergePolicy help me here, or some other config I can change?
-Michael
--
Lance Norskog
goks...@gmail.com
... so if the number of documents in the set were 100, then it
would only take up 400 bytes.
-Yonik
http://www.lucidimagination.com
--
Lance Norskog
goks...@gmail.com
.nabble.com/how-to-differentiate-multiple-datasources-when-building-solr-query-tp3286309p3286309.html
Sent from the Solr - User mailing list archive at Nabble.com.
--
Lance Norskog
goks...@gmail.com
from the Solr - User mailing list archive at Nabble.com.
--
Lance Norskog
goks...@gmail.com
to index and query
data based on the above architecture.
Thanks,
Priti
--
Lance Norskog
goks...@gmail.com
)
...
Caused by: java.lang.OutOfMemoryError: Map failed
at sun.nio.ch.FileChannelImpl.map0(Native Method)
...
--
Lance Norskog
goks...@gmail.com
I remember now: by memory-mapping one block of address space that big, the
garbage collector has problems working around it. If the OOM is repeatable,
you could try watching the app with jconsole and watch the memory spaces.
Lance
On Thu, Sep 8, 2011 at 8:58 PM, Lance Norskog goks...@gmail.com
crawling even though it started out as innocent
DIH usage.
- Pulkit
--
Lance Norskog
goks...@gmail.com
SEVERE: java.lang.ClassCastException:
org.apache.solr.analysis.SmartChineseWordTokenFilterFactory cannot be cast
to org.apache.solr.analysis.TokenizerFactory
Any thought?
--
Lance Norskog
goks...@gmail.com
by
REDUCING the number of cores/threads each query was allowed to use (making
sense of our customer investment)
maybe you can get a similar effect by reducing the number of pieces your
distributed search has to merge
my 2 eurocents
federico
--
Lance Norskog
goks...@gmail.com
(DataImporter.java:421)
--
Lance Norskog
goks...@gmail.com
solr when I
use
http caching On.
thanks
--
Lance Norskog
goks...@gmail.com
there was a DIH FAQ about this, but if not there really
should be.
-Hoss
--
Lance Norskog
goks...@gmail.com
!
Kind regards,
Marc
--
Lance Norskog
goks...@gmail.com
increase to
4GB.
And
I need to do something to prevent performance downgrade.
Is there any solr official monitoring profiling tool for this?
Spark
--
Lance Norskog
goks...@gmail.com
.html
Sent from the Solr - User mailing list archive at Nabble.com.
--
Lance Norskog
goks...@gmail.com
-existant.
Thanks again,
Blaise
--
Lance Norskog
goks...@gmail.com
http://www.lucidimagination.com/search/link?url=http://wiki.apache.org/solr/UniqueKey
On Wed, Dec 7, 2011 at 5:04 PM, Lance Norskog goks...@gmail.com wrote:
Yes, the SignatureUpdateProcessor is what you want. The 128-bit hash is
exactly what you want to use in this situation. You will never
no html. So I can't
use
xpath. How do I index these pure text into different fields of the index?
How
do I make nutch/solr understand these different parts belong to different
fields? Maybe I can use existing content in the fields in my index?
Thanks.
--
Lance Norskog
goks
. YMWV
-Glen
http://zzzoot.blogspot.com/
[1]http://linuxmanpages.com/man8/numactl.8.php
--
-
--
Lance Norskog
goks...@gmail.com
am running the 1.4 war but I was having this problem with 1.3 also. Tomcat
6.0.18, Java 1.6.0. I haven't gone as far as doing any memory profiling or
java debugging because I'm inexperienced, but that will be the next thing I
try. Any help would be appreciated.
Thanks,
-Jeff
--
Lance
--
Grant Ingersoll
http://lucenerevolution.org Apache Lucene/Solr Conference, Boston Oct 7-8
--
Lance Norskog
goks...@gmail.com
queryResultCache: 1024, 512, 128
documentCache: 16384, 4096, n/a
Thanks.
--
Lance Norskog
goks...@gmail.com
and JVM tunning better then me please
sanity check me on that?)
-Hoss
--
http://lucenerevolution.org/ ... October 7-8, Boston
http://bit.ly/stump-hoss ... Stump The Chump!
--
Lance Norskog
goks...@gmail.com
The score of a document has no scale: it only has meaning against other
score in the same query.
Solr does not rank these documents correctly. Without sharing the TF/DF
information across the shards, it cannot.
If the shards each have a lot of the same kind of document, this
problem
Please start a new email thread instead of replying to an existing one
with a new subject and question.
Sharma, Raghvendra wrote:
Is there a way to specify a xslt at the server side, and make it default, i.e.
whenever a response is returned, that xslt is applied to the response
Please start a new email thread for this instead of replying to an
existing one with a new subject and question.
Sharma, Raghvendra wrote:
I have been able to load around a million rows/docs in around 5+ minutes. The
schema contains around 250+ fields. For the moment, I have kept
Oracle has a bunch of functions you can use in the SELECT statement to
translate types. You may want to translate a NULL into an empty string.
harrysmith wrote:
Anyone ever see this error on an import?
Caused by: java.lang.NullPointerException
at
with using development versions.
-Yonik
http://lucenerevolution.org Lucene/Solr Conference, Boston Oct 7-8
--
Lance Norskog
goks...@gmail.com
column in Solr,i.e. I could feed it something like:
2010-10-15T23:59:59
And it's indexable of course :-)
Dennis Gearon
Signature Warning
EARTH has a Right To Life,
otherwise we all die.
Read 'Hot, Flat, and Crowded'
Laugh at http://www.yert.com/film.php
--
Lance
Start a new thread.
Dennis Gearon wrote:
What's the difference between the filter/anayzers that have 'factory' in their
name, and the ones that don't?
Dennis Gearon
Signature Warning
EARTH has a Right To Life,
otherwise we all die.
Read 'Hot, Flat, and Crowded'
Laugh at
801 - 900 of 1360 matches
Mail list logo