Re: Indexing information on number of attachments and their names in EML file

2019-08-02 Thread Zheng Lin Edwin Yeo
Thanks for the reply, will find out more about it. Currently I am able to retrieve the normal Metadata of the email, but not the Metadata of the attachments which are part of the contents in the EML file, which looks something like this. --d8b77b057d59ca19--

Re: Indexing information on number of attachments and their names in EML file

2019-08-02 Thread Tim Allison
I'd strongly recommend rolling your own ingest code. See Erick's superb: https://lucidworks.com/post/indexing-with-solrj/ You can easily get attachments via the RecursiveParserWrapper, e.g.

Re: Solr on HDFS

2019-08-02 Thread Kevin Risden
> > If you think about it, having a shard with 3 replicas on top of a file system that does 3x replication seems a little excessive! https://issues.apache.org/jira/browse/SOLR-6305 should help here. I can take a look at merging the patch since looks like it has been helpful to others. Kevin

Re: Solr on HDFS

2019-08-02 Thread Joe Obernberger
Hi Kyle - Thank you. Our current index is split across 3 solr collections; our largest collection is 26.8TBytes (80.5TBytes when 3x replicated in HDFS) across 100 shards.  There are 40 machines hosting this cluster. We've found that when dealing with large collections having no replicas (but

Re: Solr 7.6.0: PingRequestHandler - Changing the default query (*:*)

2019-08-02 Thread Jörn Franke
Not sure if this is possible, but why not create a query handler in Solr with any custom query and you use that as ping replacement ? > Am 02.08.2019 um 15:48 schrieb dinesh naik : > > Hi all, > I have few clusters with huge data set and whenever a node goes down its > not able to recover due

Solr 7.6.0: PingRequestHandler - Changing the default query (*:*)

2019-08-02 Thread dinesh naik
Hi all, I have few clusters with huge data set and whenever a node goes down its not able to recover due to below reasons: 1. ping request handler is taking more than 10-15 seconds to respond. The ping requesthandler however, expects it will return in less than 1 second and fails a

RE: Basic Authentication problem

2019-08-02 Thread Oakley, Craig (NIH/NLM/NCBI) [C]
Was I correct in my description yesterday (which I am pasting in below)? That you are using a hash based on the "solr" account name and expecting that to work if you change the account name but not the hash? Am I correct in assuming that everything other than security-edit functions currently

Re: Solr on HDFS

2019-08-02 Thread lstusr 5u93n4
Hi Joe, We fought with Solr on HDFS for quite some time, and faced similar issues as you're seeing. (See this thread, for example:" http://mail-archives.apache.org/mod_mbox/lucene-solr-user/201812.mbox/%3cCABd9LjTeacXpy3FFjFBkzMq6vhgu7Ptyh96+w-KC2p=-rqk...@mail.gmail.com%3e ) The Solr lock

Re: Solr on HDFS

2019-08-02 Thread Joe Obernberger
Thank you.  No, while the cluster is using Cloudera for HDFS, we do not use Cloudera to manager the solr cluster.  If it is a configuration/architecture issue, what can I do to fix it?  I'd like a system where servers can come and go, but the indexes stay available and recover automatically. 

Re: Solr 8.2.0 having issue with ZooKeeper 3.5.5

2019-08-02 Thread Jörn Franke
I just checked also the output of the telnet commands - for conf it is different for standalone compared to ensemble, will put it later in the Jira > Am 02.08.2019 um 03:46 schrieb Zheng Lin Edwin Yeo : > > Yes, I tried with space and the same error occurs. > > I have also tried to put * , but

Re: Solr 8.2.0 having issue with ZooKeeper 3.5.5

2019-08-02 Thread Jörn Franke
Telnet is working correct. The status endpoint seem to report the error that is displayed in the UI. I don’t see any obvious in the code, but it might not be working for more than one node, but i am not sure exactly why. I could not find the log line there with „membership: check 4lw“ in the

Re: Indexing information on number of attachments and their names in EML file

2019-08-02 Thread Jan Høydahl
Try the Apache Tika mailing list. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 2. aug. 2019 kl. 05:01 skrev Zheng Lin Edwin Yeo : > > Hi, > > Does anyone knows if this can be done on the Solr side? > Or it has to be done on the Tika side? > > Regards, > Edwin >

Re: Problem with uploading Large synonym files in cloud mode

2019-08-02 Thread Jörn Franke
You can use the configset API: https://lucene.apache.org/solr/guide/7_7/configsets-api.html I don’t recommend to use Schema.xml , but managed Schemas: https://lucene.apache.org/solr/guide/6_6/schema-api.html For people new to Solr I generally recommend to read a recent book about Solr from

Re: Basic Authentication problem

2019-08-02 Thread Zheng Lin Edwin Yeo
>From what I see, you are trying to change your own user's password. If I remembered correctly this might not be allowed, which is why you are getting the "Unauthorized request" error. You can try to create another user with admin role as well, and to change your existing user's password from the

Re: Problem with uploading Large synonym files in cloud mode

2019-08-02 Thread Bernd Fehling
http://lucene.apache.org/solr/guide/6_6/command-line-utilities.html "Upload a configuration directory" Take my advise and read the SolrCloud section of Solr Ref Guide. It will answer most of your questions and is a good start. Am 02.08.19 um 08:30 schrieb Salmaan Rashid Syed: Hi Bernd, Yet,

Re: Problem with uploading Large synonym files in cloud mode

2019-08-02 Thread Salmaan Rashid Syed
Hi Bernd, Yet, another noob question. Consider that my conf directory for creating a collection is _default. Suppose now I made changes to managed-schema and conf.xml, How do I upload it to external zookeeper at 2181 port? Can you please give me the command that uploads altered config.xml and

Re: Problem with uploading Large synonym files in cloud mode

2019-08-02 Thread Bernd Fehling
to 1) yes, because -Djute.maxbuffer is going to JAVA as a start parameter. to 2) I don't know because i never use internal zookeeper to 3) the configs are located at solr/server/solr/configsets/ - choose one configset, make your changes and upload it to zookeeper - when creating a

Re: Problem with uploading Large synonym files in cloud mode

2019-08-02 Thread Salmaan Rashid Syed
Hi Bernd, Sorry for noob questions. 1) What do you mean by restart? Do you mean that I shoud issue ./bin/solr stop -all? And then issue these commands, bin/solr restart -cloud -s example/cloud/node1/solr -p 8983 bin/solr restart -c -p 7574 -z localhost:9983 -s example/cloud/node2/solr 2)