Re: Question on Solr/WordPress Integration

2019-03-01 Thread markus kalkbrenner
If you’re more familiar with PHP you can do the same using the Solarium library instead of SolrJ for Java. Once the PDFs are extracted and indexed, Drupal is an alternative to Wordpress as Frontend. Using the Serach API Solr module you can access and „present“ any existing Solr index without a

Re: Question on Solr/WordPress Integration

2019-03-01 Thread Erick Erickson
Writing a Java (SolrJ) program that traverses a filesystem and extracts the contents of PDF is actually quite simple, see: https://lucidworks.com/2012/02/14/indexing-with-solrj/ (you can ignore the RDBMS stuff). That code is a little out of date so may need some very minor tweaks. Tika (the

Re: CloudSolrClient Question

2019-03-01 Thread Erick Erickson
First, that resource leak is worrying. Is there any way you could take a stack trace and/or memory dump? I suppose it’d be easy enough to simulate. It’s particularly worrying because SolrJ is how Solr<->Solr communications happen so if there really is more than transitory leak that’d affect

CloudSolrClient Question

2019-03-01 Thread Webster Homer
I am using the CloudSolrClient Solrj api for querying solr cloud collections. For the most part it works well. However we recently experienced a series of outages where our production cloud became unavailable. All the nodes were down. That's a separate topic... The client application tried to

Re: Question on Solr/WordPress Integration

2019-03-01 Thread Paul Buiocchi
Thank you Shawn ! Sent from Yahoo Mail on Android On Fri, Mar 1, 2019 at 12:25 PM, Paul Buiocchi wrote: Greetings,  I have a couple of questions about Solr /Wordpress integration -  First , I am not "committed to using WordPress as a front end. If there is a better front end option , I

Re: Python Client for Solr Cloud - Leader aware

2019-03-01 Thread Walter Underwood
There is no guarantee that sending an update to a non-leader node is slower. It certainly seems like a bad idea, but forwarding a document is fast and indexing a document is slow, so it might not even be measurable. We’ve indexed a million docs per minute by sending all updates to the load

Re: Solr Reference Guide for version 7.7

2019-03-01 Thread Jason Gerlowski
Hi Edwin, I volunteered to release the 7.7 ref-guide last week but decided to wait until 7.7.1 came out to work on it. (You probably know that 7.7.0 contained some serious bugs. These would've required non-trivial documentation effort in the ref-guide, and 7.7.1 already had a release-manager

Re: Question on Solr/WordPress Integration

2019-03-01 Thread Shawn Heisey
On 3/1/2019 10:25 AM, Paul Buiocchi wrote: I have a couple of questions about Solr /Wordpress integration - You would need to talk to the person who wrote the plugin for Wordpress that integrates with Solr. If they indicate that a question can only be answered by the Solr project, then

Re: Python Client for Solr Cloud - Leader aware

2019-03-01 Thread Jason Gerlowski
Hi Ganesh, I'm not an expert on pysolr, but from a quick scan of their update code, it does look like pysolr attempts to send update requests to _a_ leader node for a particular collection. But that's all it does. It doesn't check which shard the document(s) will belong to and try to pick the

Question on Solr/WordPress Integration

2019-03-01 Thread Paul Buiocchi
Greetings,  I have a couple of questions about Solr /Wordpress integration -  First , I am not "committed to using WordPress as a front end. If there is a better front end option , I would be willing to convert. For functionality , all I am looking for is the ability to full txt search ,

Re: Giving SolrJ credentials for Zookeeper

2019-03-01 Thread Jason Gerlowski
Hi Ryan, I haven't tried this myself, but wanted to offer a sanity check based on how I understand those instructions. Are you setting the "zkCredentialsProvider", "zkDigestUsername", and "zkDigestPassword" system-properties on your client app/process as well as on your Solr/ZK servers? Or are

Re: %solr_logs_dir% does not like spaces

2019-03-01 Thread Jason Gerlowski
+1 to submitting a JIRA, even if you cannot find an edit to solr.cmd to fix the issue. And +1 to the issue likely just being a lack of double-quotes around the reference to SOLR_LOG_DIR. Best, Jason Gerlowski On Tue, Feb 26, 2019 at 11:56 AM Erick Erickson wrote: > > If you can munge the

Code review for SOLR related changes.

2019-03-01 Thread Fiz Ahmed
Hi Solr Experts, Can you please suggest Code review techniques for SOLR related changes in a Project. Thanks FIZ AML Team.

Re: Spring Boot Solr+ Kerberos+ Ambari

2019-03-01 Thread Jason Gerlowski
Hi Rushikesh, Solr's Kerberos authentication is completely independent of Ranger. You can set it up to use Ranger, as is common with Hortonworks HDP, but it's also possible to setup Kerberos+Solr without Ranger in the picture at all. I haven't come across a concise explanation of _how_ to do

Re: Old searcher to new searcher

2019-03-01 Thread Shawn Heisey
On 3/1/2019 4:42 AM, Amjad Khan wrote: We are trying to extend AbstractSolrEventListener class and override newSearcher method. Was curious to know if we can copy the existing searcher cache to new searcher instead of executing the query receiving from solrconfig.. Because we are not sure

Re: Porter Stem filter and employing

2019-03-01 Thread Shawn Heisey
On 3/1/2019 4:38 AM, Marisol Redondo wrote: When using the PorterStemFilter, I saw that the work "employing" is change to "emploi" and my document is not found in the query to solr because of that. This also happens with other words that finish in -ying as annoying or deploying. It there any

RE: Index database with SolrJ using xml file directly throws an error

2019-03-01 Thread Dyer, James
Instead of dataConfig=data-config.xml, use config=data-config.xml . From: sami Sent: Friday, March 1, 2019 3:05 AM To: solr-user@lucene.apache.org Subject: RE: Index database with SolrJ using xml file directly throws an error Hi James, Thanks for your reply. I am not absolotuely sure I

Re: cve-2017-

2019-03-01 Thread Jeff Courtade
Thank you very much On Fri, Mar 1, 2019 at 12:24 AM Tomás Fernández Löbbe wrote: > I updated the description of SOLR-12770 > a bit. The problem > stated is that, since the "shards" parameter allows any URL, someone could > make an insecure Solr

Porter Stem filter and employing

2019-03-01 Thread Marisol Redondo
Hi. When using the PorterStemFilter, I saw that the work "employing" is change to "emploi" and my document is not found in the query to solr because of that. This also happens with other words that finish in -ying as annoying or deploying. It there any path for this filter or should I create a

Old searcher to new searcher

2019-03-01 Thread Amjad Khan
We are trying to extend AbstractSolrEventListener class and override newSearcher method. Was curious to know if we can copy the existing searcher cache to new searcher instead of executing the query receiving from solrconfig.. Because we are not sure what item was mostly searched. Will

RE: Index database with SolrJ using xml file directly throws an error

2019-03-01 Thread sami
Hi James, Thanks for your reply. I am not absolotuely sure I understood everything correctly here. I would like to index my database to start with fresh index. I have already done it with DIH execute function. It works absolutely

RE: MLT and facetting

2019-03-01 Thread Martin Frank Hansen (MHQ)
Hi Walter, Thanks for your answer, it makes sense. Best regards Martin Internal - KMD A/S -Original Message- From: Walter Underwood Sent: 1. marts 2019 03:30 To: solr-user@lucene.apache.org Subject: Re: MLT and facetting The last time I looked, the MLT was a search handler but

RE: MLT and facetting

2019-03-01 Thread Martin Frank Hansen (MHQ)
Hi Dave, The problem is that we have different levels of metadata and documents. The documents are arranged such that we have a case for which there are multiple documents (files). When we use the mlt function, we do it on file-level, but it needs to be displayed at case level, which means

RE: MLT and facetting

2019-03-01 Thread Martin Frank Hansen (MHQ)
Hi Edwin, Thanks for your time, much appreciated. Best regards Martin Internal - KMD A/S -Original Message- From: Zheng Lin Edwin Yeo Sent: 1. marts 2019 02:48 To: solr-user@lucene.apache.org Subject: Re: MLT and facetting Hi Martin, I have no idea on this, as the case has not

Re: Custom search in own SearchComponent

2019-03-01 Thread Mikhail Khludnev
Just a guess QueryParser might be a Lucene class, which isn;t aware of Solr scheme and hence might not properly convert term to number or point "product_type:106" Check the particular query with Solr fq, then use QParser for parsing. See QueryComponent as a sample. On Fri, Mar 1, 2019 at 12:59 PM

Custom search in own SearchComponent

2019-03-01 Thread Moritz Schmidt
Hi there, I’m working on a custom SearchComponent to add some docs to the response with another filter query than in the original request. Here is what I currently have: public void process(ResponseBuilder rb) throws IOException { if(rb.getResults() != null) { QueryParser queryParser

Antwort: Re: Re: High CPU usage with Solr 7.7.0

2019-03-01 Thread Lukas Weiss
This is the information of the Thread Dump screen of the Solr web interface: process reaper (8195) java.util.concurrent.SynchronousQueue$TransferStack@23ec2c53 sun.misc.Unsafe.park​(Native Method) java.util.concurrent.locks.LockSupport.parkNanos​(LockSupport.java:215)

Errors during solrcloud replication (7.7.x)

2019-03-01 Thread Karl Stoney
Hey all, I’m looking for some support with replication errors we’re seeing in SolrCloud 7.7.x (tried both .0 and .1). I’ve created a StackOverflow issue: We have errors in SolrCloud (7.7.1) during replication, which we can't understand. We thought it may be