RE: Solr 6.4. Can't index MS Visio vsdx files

2017-02-06 Thread Allison, Timothy B.
Ah, ConnectsType. That's fixed in the most recent version of POI [1], and will soon be fixed in Tika [2]. So, no need to open a ticket on Tika's Jira. > as tika is failing, is it could help or not? Y, that will absolutely help. In your Solr contrib/extract/lib directory, you'll see

Re: Solr 6.4. Can't index MS Visio vsdx files

2017-02-06 Thread Gytis Mikuciunas
sad, but didn't help. what I did: 1. stopped solr: bin\solr stop -p 80 2. removed poi-ooxml-schemas-3.15.jar from contrib\extraction\lib 3. add ooxml-schemas-1.3.jar to contrib\extraction\lib 4. restarted solr: bin\solr start -p 80 -m 4g 5. tried again to parse vsdx file: java -Dauto

Help with design choice: join or multiValued field

2017-02-06 Thread Karl Kildén
Hello! I have Items and I have Shops. This is a e-commerce system with items from thousands of shops all though the inventory is often similar between shops. Some users can shop from any shop and some only from their default one. One item can exist in about 1 shops. - When a user logs

XMLQueryParser support for Wildcards and Prefix Queries

2017-02-06 Thread Puneet Pawaia
Hi, I see that the Lucene XMLQueryParser still does not support some query types like Wildcard queries and Prefix queries. How is the search for terms with wildcards etc proposed to be handled by XmlQueryParser? Thanks. Puneet

Re: custom plugin version

2017-02-06 Thread Zaccheo Bagnati
Thank you all for your answers. Directory and directive suggestions are clear. Can you expand a little bit about IIRC method? I'm not so used to solr code (and btw I'm neither an experienced java programmer). Il giorno ven 3 feb 2017 alle ore 19:25 Erick Erickson < erickerick...@gmail.com> ha

Re: Issues with uniqueKey != id?

2017-02-06 Thread alessandro.benedetti
Hi Matthias, I found some scenario where could be tricky having different names for the unique key. 1) Inter collection search - In this case more than callign the uniqueKey "Id" or not, is the fact of using the same uniqueKey across all the collections. If not when you aggregate the results,

Re: Issues with uniqueKey != id?

2017-02-06 Thread Matthias X Falkenberg
Hi Susheel, My question is about the name of the "uniqueKey" field rather than the composition of its values. By default, Solr uses a field with the name "id". For reasons of ambiguity with the applications in my environment, I am considering to change the field name to, for example, "docId".

Re: XMLQueryParser support for Wildcards and Prefix Queries

2017-02-06 Thread Mikhail Khludnev
Hello, As far as I understand you can hookup lucene syntax with . On Mon, Feb 6, 2017 at 2:18 PM, Puneet Pawaia wrote: > Hi, > > I see that the Lucene XMLQueryParser still does not support some query > types like Wildcard queries and Prefix queries. > How is the search

Re: Switching from Managed Schema to Manually Edited schema.xml --IS NOT WORKING

2017-02-06 Thread Erick Erickson
You did not answer whether you uploaded your configs to Zookeeper and reloaded the collection. Providing configs will not help you with that. What I'd advise: First get it working in stand-alone mode without Solr cloud at all. That should be quite simple, all on your local machine. Then migrate

Re: How to combine third party search data as top results ?

2017-02-06 Thread shamik
Charlie, this looks something very close to what I'm looking for. Just wondering if you've made this available as a jar or can be build from source? Our Solr distribution is not built from source, I can only use an external jar. I'll appreciate if you can let me know. -- View this message in

Re: 回复: bin/post and self-signed SSL

2017-02-06 Thread Kevin Risden
I expect that the commands work the same or very close from 5.5.x through 6.4.x. There have been some cleaning up of the bin/solr and bin/post commands but not many security changes. If you find differently then please let us know. Kevin Risden On Feb 5, 2017 21:02, "alias" <524839...@qq.com>

Re: Find groups where at least one item matches a query

2017-02-06 Thread Joel Bernstein
Assuming you have a unique id for each document the graph expression below will get you what you're looking for. The nodes function is short for gatherNodes described in the docs ( https://cwiki.apache.org/confluence/display/solr/Graph+Traversal). Starting in 6.4 you can call the function "nodes"

Re: Issues with uniqueKey != id?

2017-02-06 Thread Erik Hatcher
Personally I'd leave it as "id" - and adjust your other domain specific field name to something else. Why? Keep Solr and other potential tools from having issues. I don't know exactly what may break, but I'd rather keep things straightforward. Erik > On Feb 6, 2017, at 02:33, Matthias

Re: Time of insert

2017-02-06 Thread Mahmoud Almokadem
Thanks Alex for your reply. But the field created_date will be updated every time the document inserted to the solr. I want to record the first time the document indexed to solr and I'm using DataImport handler. And I tried solr.TimestampUpdateProcessorFactory but I got NullPointerException, So I

Re: Faceting and Grouping Performance Degradation in Solr 5

2017-02-06 Thread Solr User
I am pleased to report that we are in Production on Solr 5.5.3 with comparable performance to Solr 4.8.1 through leveraging facet.method=uif as well as https://issues.apache.org/jira/browse/SOLR-9176. Thanks to everyone who worked on these! On Mon, Oct 3, 2016 at 3:55 PM, Solr User

RE: Solr 6.4. Can't index MS Visio vsdx files

2017-02-06 Thread Allison, Timothy B.
Argh. Looks like we need to add curvesapi (BSD 3-clause) to Solr. For now, add this jar: https://mvnrepository.com/artifact/com.github.virtuald/curvesapi/1.03 See also [1] [1] http://apache-poi.1045710.n5.nabble.com/support-for-reading-Microsoft-Visio-2013-vsdx-format-td5721500.html

[ANNOUNCE] Apache Solr 6.4.1 released

2017-02-06 Thread Adrien Grand
6 February 2017, Apache Solr™ 6.4.1 available The Lucene PMC is pleased to announce the release of Apache Solr 6.4.1 Solr is the popular, blazing fast, open source NoSQL search platform from the Apache Lucene project. Its major features include powerful full-text search, hit highlighting, faceted

RE: Switching from Managed Schema to Manually Edited schema.xml --IS NOT WORKING

2017-02-06 Thread Anatharaman, Srinatha (Contractor)
Erick, I did as mentioned in that URL, made changes to solrconfig and kept only required fields in schema.xml Would you mind sharing config files for indexing text document? Regards, ~Sri -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: Monday, February

Re: custom plugin version

2017-02-06 Thread Erick Erickson
Sorry, IIRC is an acronym for "If I Recall Correctly", it's not a method name ;) There should be a method in the superclass (DocTransformer) that you can use to return information about the plugin, maybe getName or toString depending on your needs. Best, Erick On Mon, Feb 6, 2017 at 4:04 AM,

RE: Solr 6.4. Can't index MS Visio vsdx files

2017-02-06 Thread Allison, Timothy B.
Shouldn't have taken you that much effort. Sorry. Y, I should probably get around to a patch for: https://issues.apache.org/jira/browse/SOLR-9552 Although, frankly, it might be time for Tika 1.15 shortly. -Original Message- From: Gytis Mikuciunas [mailto:gyt...@gmail.com] Sent:

Re: Solr 6.4. Can't index MS Visio vsdx files

2017-02-06 Thread Gytis Mikuciunas
Tim, you saved my day ;) now vsdx files were indexed successfully. Thank you very much!!! summary: as a workaround I have in solr-6.4.0\contrib\extraction\lib: 1. ooxml-schemas-1.3.jar instead of poi-ooxml-schemas-3.15.jar 2. curvesapi-1.03.jar So, now I'm waiting when this will be

Re: custom plugin version

2017-02-06 Thread Zaccheo Bagnati
What a newbie I am! :) OK, I've seen methods to override: I'll give a try. I suppose that getName output is then shown somewhere in the solr response. Thank you again Erick for your patience. Kind regards Zaccheo Il giorno lun 6 feb 2017 alle ore 17:33 Erick Erickson < erickerick...@gmail.com>

Re: Time of insert

2017-02-06 Thread Fuad Efendi
Not; historical logs for document updates is not provided. Users need to implement such functionality themselves if needed. From: Mahmoud Almokadem Reply: solr-user@lucene.apache.org

Time of insert

2017-02-06 Thread Mahmoud Almokadem
Hello, I'm using dih on solr 6 for indexing data from sql server. The document can br indexed many times according to the updates on it. Is that available to get the first time the document inserted to solr? And how to get the dates of the document updated? Thanks for help, Mahmoud

Re: Issues with uniqueKey != id?

2017-02-06 Thread David Kramer
I’m just setting that up now. I’m far from a Solr expert so I won’t swear we’re doing it right though Our issue is that we have documents, nested 3 deep. Products, Items, and SKUs. Each has an ID field that’s unique within the document type, but unfortunately we have products with the same

Re: Solr querying nested documents with ChildDocTransformerFactory, get “Parent query yields document which is not matched by parents filter”

2017-02-06 Thread David Kramer
For closure, I’ve solved the problem! It was not using my schema.xml at all. I had to change the solrconfig.xml to include and comment out the schema adding processor. My schema still didn’t work right, but I took the managed-schema and renamed it and changed uniqueKey to uuid and everything

Re: Help with design choice: join or multiValued field

2017-02-06 Thread Fuad Efendi
Correct: multivalued field with 1 shop IDs. Use case: shopping network in U.S. for example for a big brand such as Walmart, when user implicitly provides IP address or explicitly Postal Code, so that we can find items in his/her neighbourhood. You basically provide “join” information via

Re: Time of insert

2017-02-06 Thread Alexandre Rafalovitch
If you are reindexing full documents, there is no way. If you are actually doing updates using Solr updates XML/JSON, then you can have a created_date field with default value of NOW. Similarly, you could probably do something with UpdateRequestProcessor chains to get that NOW added somewhere.

DataImportHandler - Unable to load Tika Config Processing Document # 1

2017-02-06 Thread Anatharaman, Srinatha (Contractor)
Hi, I am having below error while trying to index using dataImporthandler Data-Config file is mentioned below. zookeeper is not able to read "tikaConfig.xml" on below statement processor="TikaEntityProcessor" tikaConfig="tikaConfig.xml" Please help me to resolve this issue ion: