Hi,
I am creating a new collection using the following GET request:
http://ec2_host:8983/solr/admin/collections?action=CREATE&name=collection_name_1&collection.configName=oem/conf&numShards=1
What I’m finding is that now and then base_url for the replica in state.json is
set to the internal IP
Hi Hoss,
Thanks for the reply. I installed the service using the install script. I
double checked it and it looks like it installs solr.in.sh in
/etc/default/solr.in.sh. It actually looks like if it is in /var the install
script moves it into /etc/default (unless I’m reading this wrong):
http
Hi Hoss,
Thanks for your help. Going over the install page again I realized I had
originally not adjusted the value of SOLR_HOST and it had started up using the
default internal IP. I changed that to the public DNS and restarted solr.
However in /live_nodes I then had 2 values: one for the publ
Hi,
Has anyone used Docker for deploying Solr? I am using it to run a single
Solr server ‘cloud’ locally on my dev box, but am wondering about the pros/cons of
using it in production.
Thanks,
Brendan
cted by a
> 1 gb network speed. The best speed I am getting for replication is 5mb/sec,
> how can this be increased.
> The replication keeps failing and this is the first time replication of an
> index over 300Gb in size. We are using solr version 4.1 on master and solr
> 4.3 on slave.
Hi,
I'm experimenting with indexing using the EmbeddedSolrServer. Just to be
sure, as I understand it, I do not need a running instance of solr to use
this, it literally is a running instance of solr.
Given the above, how safe is it to use an EmbeddedSolrServer for indexing
an index that might be
Awesome, thanks. What about indexing in a different core and then renaming it once
it's done?
Thanks
Brendan
On Jul 3, 2013, at 6:48 PM, Shawn Heisey wrote:
> On 7/3/2013 2:45 PM, Brendan Grainger wrote:
>> I'm experimenting with indexing using the EmbeddedSolrServer. Just to be
Hi All,
I changed the name of the queryAnalyzerFieldType for my spellcheck
component and the corresponding field and now when solr starts up, it hangs
at this point:
5797 [searcherExecutor-4-thread-1] INFO org.apache.solr.core.SolrCore –
QuerySenderListener sending requests to
Searcher@153d12bf
eate a spellchecking dictionary.
Thanks
Brendan
--
Brendan Grainger
www.kuripai.com
Hi All,
I have an IndexBasedSpellChecker component configured as follows (note the
field parameter is set to the spellcheck field):
text_spell
default
solr.IndexBasedSpellChecker
* spellcheck*
./spellchecker
.0001
with the corresponding f
> Ingram Content Group
> (615) 213-4311
>
>
> -Original Message-
> From: Brendan Grainger [mailto:brendan.grain...@gmail.com]
> Sent: Tuesday, July 23, 2013 1:47 PM
> To: solr-user@lucene.apache.org
> Subject: Spellcheck field element and collation issues
>
>
hat get results?
>
> My suspicion here is that you are assuming that "markup_texts" is the
> default search field for "/select" but in fact it isn't.
>
> James Dyer
> Ingram Content Group
> (615) 213-4311
>
>
> -Original Message-
> From: Brendan G
is available only if
> using dismax/edismax.
>
> http://wiki.apache.org/solr/SearchHandler#df
> http://wiki.apache.org/solr/ExtendedDisMax#qf_.28Query_Fields.29
>
> James Dyer
> Ingram Content Group
> (615) 213-4311
>
>
> -Original Message-
> From: Brendan
. You could write your own Query
> Converter but it's often just easier to strip out the keywords and send them
> over with "spellcheck.q".
>
> James Dyer
> Ingram Content Group
> (615) 213-4311
>
>
> -Original Message-
> From: Brendan Grainger [mail
Hi All,
First of all, what I was actually trying to do is get a little
space back. So if there is a better way to do this by adjusting the
MergePolicy or something else please let me know. My index is currently
200Gb. In the past (Solr 1.4) we've found that optimizing the index will
doubl
hich reduced the number of segments in my index from 12 to 10. Amazingly,
it also reduced the space used by almost 50Gb. Is that even possible?
Thanks again
Brendan
On Tue, Aug 6, 2013 at 10:55 AM, Brendan Grainger <
brendan.grain...@gmail.com> wrote:
> Hi All,
>
> First of al
g 6, 2013 at 11:45 AM, Brendan Grainger <
brendan.grain...@gmail.com> wrote:
> Well, I guess I can answer one of my questions which I didn't exactly
> explicitly state, which is: how do I force solr to merge segments to a
> given maximum. I forgot about doing this:
>
>
'
> >
> >
> http://localhost:8983/solr/update?optimize=true&maxSegments=10&waitFlush=false
> > '
> >
> > It looks to me that it was from deleted docs being completely removed
> from
> > the index.
> >
> > Thanks
> >
> > >
Hi All,
I've been debugging an issue where the query 'tpms' would make the
spellchecker throw the following exception:
21021 [qtp91486057-17] ERROR org.apache.solr.servlet.SolrDispatchFilter –
null:java.lang.StringIndexOutOfBoundsException: String index out of range:
-1
at java.lang.AbstractStri
pressure monitor system,tpms
I don't get a crash. I tried it with some other fields too. e.g.:
asdm,airbag system diagnostic module => crash
airbag system diagnostic module,asdm => no crash
Thanks
Brendan
On Thu, Aug 15, 2013 at 1:37 PM, Brendan Grainger <
brendan.grain...@gmail.com
ate a
test case if this is actually an issue and not something weird with my
configuration.
Thanks
Brendan
On Thu, Aug 15, 2013 at 1:43 PM, Brendan Grainger <
brendan.grain...@gmail.com> wrote:
> Further to this. If I change:
>
> tpms,service tire monitor,tire monitor,tire p
message in context:
> http://lucene.472066.n3.nabble.com/Where-is-the-webapps-directory-of-servlet-container-tp4084968p4085094.html
> Sent from the Solr - User mailing list archive at Nabble.com.
--
Brendan Grainger
www.kuripai.com
2:06 PM, Kamaljeet Kaur wrote:
> On Fri, Aug 16, 2013 at 10:22 PM, Brendan Grainger [via Lucene]
> wrote:
> > ou can then
> > use the packaged Jetty servlet container while you get comfortable with
> > working with solr.
>
>
> Can I ask why jetty?
>
> --
>
expected?
>
>
>
> -
> Best regards
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Solr-Ref-guide-question-tp4086142.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>
--
Brendan Grainger
www.kuripai.com
Tokenizers, etc) where can I
> put them so that the compilation process will view the new files?
>
> Regards
> Bruno Santos
>
> --
> Bruno René Santos
> Lisboa - Portugal
>
--
Brendan Grainger
www.kuripai.com
Hi,
I'm using the PathHierarchyTokenizer like this:
to create path based facets.
A query like:
http://localhost:8982/solr/articles/select?facet=true&start=0&q=*:*&facet.field=my_path_based_facet&wt=ruby&indent=on
gives facets that look like
do:
http://localhost:8982/solr/articles/select?facet=true&start=0&q=*:*&facet.field=my_path_based_facet&wt=ruby&indent=on
&facet.prefix=1/root
Is that what people do?
Thanks
Brendan
On Fri, May 10, 2013 at 6:50 PM, Brendan Grainger <
brendan.grain...@gmail
sunspot and they construct the queries
with keys as in my first example, i.e. facet.field={!ex=dt
key=my_facet_key}the_facet_field&f.my_facet_key.facet.prefix=a_given_prefix
Thanks
Brendan
--
Brendan Grainger
www.kuripai.com
Thanks for the excellent clarification. I'll ask the sunspot guys about the
localparams issue. I have a patch that would fix it
Thanks
Brendan
On May 16, 2013, at 1:42 PM, Chris Hostetter wrote:
>
> : I would then like to refer to these 'pseudo' field later in the request
> : string. I thoug
ces
> Organizational/Staff
>
> I would like to then provide facets using these names.
>
> Can someone please guide me in the right direction on how I might
> accomplish this?
>
> Thank you
>
> Cord
>
--
Brendan Grainger
www.kuripai.com
Hi All,
Not really a pressing need for this at all, but having worked through a few
tutorials, I was wondering if there was any work being done to incorporate
Lucene Facets into solr:
http://lucene.apache.org/core/4_3_0/facet/org/apache/lucene/facet/doc-files/userguide.html
Brendan
10 minutes or longer to get the cumulative word
> count results.
>
> Eventually we're hoping to build a system that will return results in a few
> seconds and scale to hundreds of millions of documents.
> Is there any way to get this level of performance out of Solr/Lucene?
>
> Thanks,
>
> David
>
--
Brendan Grainger
www.kuripai.com
> It actually has a placeholder Jira:
> https://issues.apache.org/jira/browse/SOLR-4774
>
> Feel free to add your encouragement there.
>
> -- Jack Krupansky
>
> -----Original Message- From: Brendan Grainger
> Sent
e faceting values for the field
> that in fact begin with 'dotan-', even if a document has other tags
> such as 'beatles'?
>
> 4) How to have Solr return only those faceting values which are larger
> than 0?
>
> Thank you!
>
> --
> Dotan Cohen
>
> http://gibberish.co.il
> http://what-is-what.com
>
--
Brendan Grainger
www.kuripai.com
ts for a honda accord don't have any brake pad articles.
If the user is filtering on the honda accord I wouldn't want "brake pads"
as a suggestion.
Right now, I've played with the suggest component and using faceting.
Any thoughts?
Thanks
Brendan
--
Brendan Grainger
www.kuripai.com
/sematext.com/
>
>
>
>
>
> On Thu, Jun 13, 2013 at 7:43 PM, Brendan Grainger
> wrote:
> > Hi Solr Gurus
> >
> > I am trying to implement auto suggest where solr would suggest several
> > phrases that would return results as the user type
onfig-phrasesuggest.xml
Thanks
Brendan
On Fri, Jun 14, 2013 at 11:50 AM, Brendan Grainger <
brendan.grain...@gmail.com> wrote:
> Hi Otis,
>
> Sorry, I was a bit tired when I wrote that. I think what I'd like is to be
> able to spellcheck the suggestions. For example. If a user typ
ction suggestions would
be offered for terms that exist in that book not the entire index.
Otherwise I guess I should look at using EdgeNGramFilter?
Thanks
Brendan
--
Brendan Grainger
www.kuripai.com
<http://www.linkedin.com/in/alexandrerafalovitch>
> - Time is the quality of nature that keeps events from happening all
> at once. Lately, it doesn't seem to be working. (Anonymous - via GTD
> book)
>
--
Brendan Grainger
www.kuripai.com
Hi,
Does anyone know if an optimize can be stopped once started?
Thanks
't think so, at least not gracefully. You can always do partial optimize
>> and do a few of them if you want to optimize in smaller steps.
>>
>> Otis
>>
>>
>>>
>>> From: Brendan Grainger
>>> To: sol
Hi,
The default for catenateAll is 0 which we've been using on the
WordDelimiterFilter. What would be the possible negative implications of
setting this to 1? So that:
wi-fi-800
would produce the tokens:
wi, fi, wifi, 800, wifi800
for example?
Thanks
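
For anyone wanting to see the effect being asked about, here is a rough Python sketch of the catenation step. It is only a toy model of the behaviour described above, not the real WordDelimiterFilter: the function name and both keyword arguments are made up for illustration, and it ignores case-change splitting, offsets and position increments entirely.

```python
import re


def word_delimiter_sketch(token, catenate_words=True, catenate_all=False):
    """Toy model of the catenate options (hypothetical helper, not Solr code)."""
    # Split on anything that is not a letter or digit, and at letter/digit
    # boundaries, so "wi-fi-800" becomes ["wi", "fi", "800"].
    parts = re.findall(r"[A-Za-z]+|[0-9]+", token)
    out = list(parts)

    if catenate_words:
        # Join each maximal run of adjacent word parts ("wi" + "fi" -> "wifi").
        run = []
        for part in parts + [None]:
            if part is not None and part.isalpha():
                run.append(part)
            else:
                if len(run) > 1 and "".join(run) not in out:
                    out.append("".join(run))
                run = []

    if catenate_all and len(parts) > 1:
        # Join every part regardless of type ("wi" + "fi" + "800").
        joined = "".join(parts)
        if joined not in out:
            out.append(joined)

    return out


tokens_default = word_delimiter_sketch("wi-fi-800")
tokens_all = word_delimiter_sketch("wi-fi-800", catenate_all=True)
```

In this model the only difference between the two calls is the extra all-parts token, so the practical cost is a somewhat larger index and more chances for a query to match on the fully joined form.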
There is an update to that book for Solr 3:
http://www.packtpub.com/apache-solr-3-enterprise-search-server/book
I actually bought it recently, but haven't looked at it yet.
Good luck.
Brendan
On Dec 16, 2011, at 9:01 PM, Shiv Deepak wrote:
> I am looking for a good book to read from and get a
Hi Darren,
You might want to look at the KStemmer
(http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters/Kstem) instead of
the standard PorterStemmer. It essentially has a 'dictionary' of exception
words where stemming stops if found, so in your case president won't be stemmed
any furthe
Hi Scott,
Is there a reason why you wouldn't just index these special words into another
field and then search over both fields? That would also have the nice property
of being able to boost on the special word field if you wanted.
HTH
Brendan
On Aug 20, 2010, at 6:19 AM, scott chu (朱炎詹) wrote
> On Tue, Jan 18, 2011 at 6:04 PM, Grant Ingersoll wrote:
>
>> As devs of Lucene/Solr, due to the way ASF mirrors, etc. works, we really
>> don't have a good sense of how people get Lucene and Solr for use in their
>> application. Because of this, there has been some talk of dropping Maven
>> su
Hi Briggs,
I'm not sure about Solr 4.0, but do you need to commit?
> curl http://localhost:8983/solr/coupon/update?commit=true -H "Content-Type:
> text/xml" --data-binary '*:*'
Brendan
www.kuripai.com
On Jul 18, 2012, at 7:11 PM, Briggs Thompson wrote:
> I have realized this is not specifi
You might be running into the same issue someone else had the other day:
https://issues.apache.org/jira/browse/SOLR-3432
On Jul 19, 2012, at 1:23 PM, Rohit wrote:
> We delete some data from solr, post which solr is not accepting any
> commit's. What could be wrong?
>
>
>
> We don't see any
I assume you're backgrounding solr. Maybe you just need
disown %1
Brendan
On Aug 2, 2012, at 1:04 PM, Niall wrote:
> I've got Solr 3.6 up working with Jetty but the admin page is inaccessible
> and Solr appears to stop working when I terminate my SSH connection to the
> server after running s
Hi Noble,
Just to be clear. It was not so that I could refer to my plugin class
with solr. that I suggested using SolrResourceLoader, it's
that the SolrResourceLoader loads the jars you put in solr-home/lib.
If you don't do this you get a ClassNotFoundException. It's pretty
well explained
Hi Ryan,
We do basically the same thing, using a modified ShingleFilter (http://hudson.zones.apache.org/hudson/job/Lucene-trunk/javadoc//contrib-analyzers/org/apache/lucene/analysis/shingle/ShingleFilter.html
). I have it set up to build 'shingles' of size 2, 3, 4, 5 which I
index into separat
Hi,
I think:
Can't find resource 'solrconfig.xml' in classpath or 'solr/conf/'
is a major clue no? Do you actually have a solrconfig.xml and how are
you starting solr?
Regards
Brendan
On Oct 1, 2008, at 11:11 AM, sunnyfr wrote:
Oct 1 16:45:10 solr-test jsvc.exec[23757]: eaa main
Oct 1
/etc/tomcat5.5/Catalina/localhost/
solr.xml
and then I start tomcat5.5 ... do I miss something ?
Brendan Grainger-2 wrote:
Hi,
I think:
Can't find resource 'solrconfig.xml' in classpath or 'solr/conf/'
is a major clue no? Do you actually have a solrconfig.xml and ho
re.xml in /data/solr/
sharedLib="lib"
otherwise I've my solr.xml in /etc/tomcat5.5/Catalina/localhost/
solr.xml
and then I start tomcat5.5 ... do I miss something ?
Brendan Grainger-2 wrote:
Hi,
I think:
Can't find resource 'solrconfig.xml' in cla
soon .. I tried to
rename it
... but still .
help i need somebody .. heppp LOL
Thanks Brendan
Brendan Grainger-2 wrote:
Hi Sunny,
Sorry, I've not used multicores with tomcat yet. However, I seem to
remember that multicore.xml changed its name to solr.xml. I take it
you'
https://issues.apache.org/jira/secure/attachment/12394264/apache_solr_a_red.jpg
https://issues.apache.org/jira/secure/attachment/12394282/solr2_maho_impression.png
https://issues.apache.org/jira/secure/attachment/12394266/apache_solr_b_red.jpg
https://issues.apache.org/jira/secure/attachment/12394
Are you using Porter Stemming? If so I think you can just specify your
word in the protwords.txt file (or whatever you've called it).
Check out http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters
and the example config for the Porter Stemmer:
protected="protwords.txt" />
For what it's worth, it's also really easy to implement your own
EntityProcessor. Extend from EntityProcessorBase then implement the getNext
method to return a Map representing the row you want indexed. I
did exactly this so I could reuse my Hibernate domain models to query for
the data ins
Hi Patrick,
I don't know much about couch, but if you want to return JSON from solr (which I
think couch would understand) you can do that with wt=json in the query string
when querying solr. See here for more details:
http://wiki.apache.org/solr/SolJSON
HTH a little
Brendan
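
A minimal sketch of what that looks like from Python, in case it helps. The host, port and the helper name are assumptions for illustration only; the one piece taken from the message above is the wt=json parameter.

```python
from urllib.parse import urlencode, urlsplit, parse_qs


def solr_select_url(base, query, **params):
    """Build a /select URL that asks Solr for a JSON response (hypothetical helper)."""
    qs = {"q": query, "wt": "json"}  # wt=json selects the JSON response writer
    qs.update(params)
    return base.rstrip("/") + "/select?" + urlencode(qs)


# Assumed local Solr address; adjust to your own install.
url = solr_select_url("http://localhost:8983/solr", "*:*", rows=10)

# To actually fetch and decode the response, something like this would do:
#   import json, urllib.request
#   with urllib.request.urlopen(url) as resp:
#       data = json.load(resp)
```

Building the query string with urlencode avoids hand-escaping characters such as `*` and `:` in the query.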
On Apr 28, 2010, at 11
Hi,
I am creating an index of approx 500K documents. I wrote an indexing
program using embedded solr: http://wiki.apache.org/solr/EmbeddedSolr
and am seeing probably a 10 fold increase in indexing speeds. My
problem is though, that if I try to reindex say 20K docs at a time it
slows down c
Thanks so much for your suggestions. I am attempting to index 550K
docs at once, but have found I've had to break them up into smaller
batches. Indexing seems to stop at around 47K docs (the index reaches
264M in size at this point). The index itself eventually grows to
about 2Gb. I am usin
Hi,
Thanks for answering this question a while back. I have made some of
the suggestions you mentioned, i.e. not committing until I've finished
indexing. What I am seeing though, is as the index gets larger (around
1Gb), indexing is taking a lot longer. In fact it slows down to a
crawl. Have
s
the computer thrashing, etc?
-Mike
On 19-Nov-07, at 2:44 PM, Brendan Grainger wrote:
Hi,
Thanks for answering this question a while back. I have made some
of the suggestions you mentioned, i.e. not committing until I've
finished indexing. What I am seeing though, is as the index gets
larger
mputer thrashing, etc?
-Mike
On 19-Nov-07, at 2:44 PM, Brendan Grainger wrote:
Hi,
Thanks for answering this question a while back. I have made some
of the suggestions you mentioned, i.e. not committing until I've
finished indexing. What I am seeing though, is as the index gets
larger (around
Hi,
I hope you can help me. I'm having an odd problem with solr. I have a
field that could represent a car. A car could have a name like
"Silverado" or could be something like "Silverado 2WD" to denote the
2 wheel drive version of the car. Anyway, all is well when I search
over the fie
Hi,
I'm not sure if this is a good way to do it or not (so comments are
more than welcome!), but the way we have achieved this is using the
idea that a category/subcategory/subsubcategory etc create a path
that we associate with a document. This is the simple field
definition we use:
so
rg.apache.solr.analysis.WordDelimiterFilterFactory
{catenateWords=1, catenateNumbers=1, catenateAll=0,
generateNumberParts=1, generateWordParts=1})
--matt
On Dec 9, 2007, at 6:41 PM, Brendan Grainger wrote:
Hi,
I hope you can help me. I'm having an odd problem with solr. I
have a
This approach works (I do a similar thing using solr), but you have
to be careful as BooleanQuery.TooManyClauses exception can be thrown
depending where you use the wild card. It should be fine in the case
you described however. Anyway, there is a pretty interesting
discussion about this he
Hi,
I'm using a solr nightly build and I have created my own
QueryComponent which is just a subclass of the default
QueryComponent. FYI, in most cases I just delegate to the superclass,
but I also allow a parameter to be used which will cause some custom
filtering (which is why I'm doing
I think your problem is happening because splitOnCaseChange is 1 in
your WordDelimiterFilterFactory:
So "getElementById" is tokenized to:
(get,0,3)
(Element,3,10)
(By,10,12)
(Id,12,14)
(getElementById,0,14,posIncr=0)
However getelementbyid is tokenized to:
(getelementbyid,0,14)
which woul
Hi,
Just started using the Dismax handler and it looks very promising.
However I'm a little confused about this query. Could somebody please
explain why I'm getting a phrase query here?
+(((title_t:mass) (title_t:air) (title_t:flow))~3) ()',
And is that extra () indicative of something? I h
Got it.
Thanks so much.
Brendan
On Mar 14, 2008, at 8:11 AM, Erik Hatcher wrote:
On Mar 13, 2008, at 3:06 AM, Brendan Grainger wrote:
Just started using the Dismax handler and it looks very promising.
However I'm a little confused about this query. Could somebody
please explain wh
Hi,
Not sure if this is what you want, but to search for 'empty' fields we
use something like this:
(*:* AND -color:[* TO *])
Hope that helps.
Brendan
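
A small sketch of building that query for an arbitrary field, assuming you are putting the request URL together by hand. The helper name is made up. The leading `*:*` gives the negative range clause a full set of documents to subtract from, since a purely negative clause inside parentheses has nothing to exclude from on its own.

```python
from urllib.parse import urlencode, parse_qs


def missing_field_query(field):
    """Return a query matching documents with no value in `field` (hypothetical helper)."""
    # Match everything, then remove documents that have any value in the field.
    return "(*:* AND -{0}:[* TO *])".format(field)


query = missing_field_query("color")

# URL-encode it so the brackets, colons and spaces survive the HTTP request.
params = urlencode({"q": query})
```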
On May 6, 2008, at 6:43 PM, Daniel Andersson wrote:
Hi (again)
One of the fields in my database is color. It can either contain a
valu
Hi,
I've just changed the stemming algorithm slightly and am running a few
tests against the old stemmer versus the new stemmer. I did a query
for 'hanger' and using the old stemmer I get the following scoring for
a document with the title: Converter Hanger Assembly Replacement
6.4242806
When I was asked for something similar I quickly cobbled together a
stylesheet (I'm no xsl expert so it's probably pretty bad).
Invoked like this:
http://localhost:8982/solr/select?q=testing&fl=id,title_t,score&wt=xslt&tr=csv.xsl&rows=10
YMMV, but feel free to use it if it helps, I've attach
corresponds to the same
document you queried earlier.
-Yonik
On Wed, Jun 11, 2008 at 1:09 PM, Brendan Grainger
<[EMAIL PROTECTED]> wrote:
I've just changed the stemming algorithm slightly and am running a
few tests
against the old stemmer versus the new stemmer. I did a query for
'h
o, you
might want to verify that docid=3454 still corresponds to the same
document you queried earlier.
-Yonik
On Wed, Jun 11, 2008 at 1:09 PM, Brendan Grainger
<[EMAIL PROTECTED]> wrote:
I've just changed the stemming algorithm slightly and am running a
few tests
against the old stemmer
Thanks so much, that explains it.
Brendan
On Jun 11, 2008, at 4:00 PM, Yonik Seeley wrote:
Field norms have limited precision (it's encoded as an 8 bit float) so
you are probably seeing rounding.
-Yonik
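
To make the rounding concrete, here is a Python sketch of a one-byte float with a 3-bit mantissa. It is written from memory of how Lucene's SmallFloat helper packed norms around that time, so treat the exact bit layout (and the function names) as an assumption rather than a faithful port. The point it illustrates is only that nearby length norms collapse to the same stored value.

```python
import math
import struct


def float_to_byte315(f):
    """Pack a float into one byte: 3 mantissa bits, zero-exponent 15 (assumed layout)."""
    bits = struct.unpack(">i", struct.pack(">f", f))[0]
    small = bits >> (24 - 3)
    if small <= (63 - 15) << 3:
        # Zero/negative values map to 0, tiny positive values to the smallest byte.
        return 0 if bits <= 0 else 1
    if small >= ((63 - 15) << 3) + 0x100:
        return 255  # overflow: clamp to the largest representable value
    return small - ((63 - 15) << 3)


def byte315_to_float(b):
    """Inverse of float_to_byte315."""
    if b == 0:
        return 0.0
    bits = (b << (24 - 3)) + ((63 - 15) << 24)
    return struct.unpack(">f", struct.pack(">i", bits))[0]


def stored_norm(num_terms):
    """Length norm 1/sqrt(numTerms) after a round trip through one byte."""
    return byte315_to_float(float_to_byte315(1.0 / math.sqrt(num_terms)))


norm_4_terms = stored_norm(4)  # 1/sqrt(4) = 0.5 is exactly representable
norm_5_terms = stored_norm(5)  # 1/sqrt(5) ~ 0.447 is rounded down
```

So two documents whose field lengths differ slightly can end up with identical norms, or a small change in token count from a stemmer tweak can move a score by a visible step.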
On Wed, Jun 11, 2008 at 2:13 PM, Brendan Grainger
<[EMAIL PROTECTED]> wrote
Same here. I took a look at the options from the dev list and
it seems to me (3) user education should be fine.
Thanks for all the great work.
Brendan
On Jun 13, 2008, at 4:37 PM, Brian Johnson wrote:
FWIW - I have no problem with the change.
Thanks,
Brian
- Original Message
Fr
Hi,
I set up the new DataimportHandler last night to replace some custom
import code I'd written and so far I'm loving it, thank you.
I had one issue you might want to know about. I have some solr
extensions I've written and packaged in a jar which I place in:
solr-home/lib
as per:
ht
mentations use the default
package or provide the fully qualified class name.
--Noble
On Thu, Jun 19, 2008 at 8:09 AM, Jon Baer <[EMAIL PROTECTED]> wrote:
Thanks. Yeah took me a while to figure out I needed to do
something like
transformer="com.mycompany.solr.MyTransformer"
we shall make the necessary changes .
--Noble
On Thu, Jun 19, 2008 at 10:06 AM, Brendan Grainger
<[EMAIL PROTECTED]> wrote:
Hi,
I am actually providing the fully qualified classname in the
configuration
and I was still getting a ClassNotFoundException. If you look at
the c