Re: How to index and search (integer or float) vector.

2018-04-13 Thread Rick Leir
Jason One way is simply to use a multi value field. But this is not officially a vector, and the order might not be guaranteed. I suspect you can just post a document with the values, and see them in order. Search for a single value would not be very useful. Another way is to choose a textual

Re: How do I create a schema file for FIX data in Solr

2018-04-02 Thread Rick Leir
Google fix to json, there are a few interesting leads. On April 2, 2018 12:34:44 AM EDT, Raymond Xie wrote: >Thank you, Shawn, Rick and other readers, > >To Shawn: > >For *8=FIX.4.4 9=653 35=RIO* as an example, in the FIX standard: 8 >means BeginString, in this

Re: How do I create a schema file for FIX data in Solr

2018-04-02 Thread Rick Leir
Ray Have you looked around for an existing FIX to Solr conduit? If FIX is a common standard then I would expect that someone has done some work on this and github'd it. Even just FIX to JSON. Cheers -- Rick On April 2, 2018 12:34:44 AM EDT, Raymond Xie wrote: >Thank you,

Re: Need help to get started on Solr, searching get nothing. Thank you very much in advance

2018-04-02 Thread Rick Leir
sn't want to specify the >search >scope to be restricted in field "name" but anywhere in the index'ed >documents? > > >** >*Sincerely yours,* > > >*Raymond* > >On Sun, Apr 1, 2018 at 2:10 PM, Rick Leir <rl...@leirt

Re: Need help to get started on Solr, searching get nothing. Thank you very much in advance

2018-04-01 Thread Rick Leir
Raymond The output is not visible to me because the mailing list strips images. Please try a different way to show the output. Cheers -- Rick On March 29, 2018 10:17:13 PM EDT, Raymond Xie wrote: > I am new to Solr, following Steve Rowe's example on

Re: How do I create a schema file for FIX data in Solr

2018-04-01 Thread Rick Leir
Raymond Folks are quiet, maybe because of Easter. Solr has a RESTful interface, and all the details are in the manual. Briefly, you need to create a JSON document containing all the fields in a FIX, then POST it to Solr. POST all your FIX's to Solr, perhaps in batches. Then search for a FIX

Re: How do I create a schema file for FIX data in Solr

2018-03-31 Thread Rick Leir
Raymond Will you be streaming the FIX data, perhaps with aggregation? Just a thought, I have no experience with FIX. Streaming opens up lots of questions. Cheers -- Rick On March 31, 2018 2:33:25 PM EDT, Walter Underwood wrote: >Looks like Financial Information Exchange

Re: Add remote ip address in solr log

2018-03-29 Thread Rick Leir
Vince Something as simple as an Apache proxypass would help, then your Apache log would tell you. Cheers -- Rick -- Sorry for being brief. Alternate email is rickleir at yahoo dot com

Re: Solr on HDInsight to write to Active Data Lake

2018-03-28 Thread Rick Leir
Hi, The class that is not found is likely in the Azure related libraries. As Erick said, are you sure that you have a library containing it? Cheers Rick -- Sorry for being brief. Alternate email is rickleir at yahoo dot com

Re: Some performance questions....

2018-03-23 Thread Rick Leir
Deep, What is the test so I can try it. 75 or 90 ms .. is that the JVM startup time? Cheers -- Rick >> >> >I have stated the numbers which I found during my test. The best way to >verify them is for someone else to run the same test. Otherwise I don't >see >how we can verify the results --

Re: Solr on HDInsight to write to Active Data Lake

2018-03-23 Thread Rick Leir
Abhi Check your lib directives. https://lucene.apache.org/solr/guide/6_6/lib-directives-in-solrconfig.html#lib-directives-in-solrconfig I suspect your jars are not in a lib dir mentioned in solrconfig.xml Cheers -- Rick On March 23, 2018 11:12:17 AM EDT, Abhi Basu <9000r...@gmail.com> wrote:

Re: Looking for design ideas

2018-03-18 Thread Rick Leir
Steve Does a document have a different URL when it is in a personal DB? I suspect the easiest solution is to use just one index. You can have a field containing an integer identifying the personal DB. For public, set this to zero. Call it DBid. Update the doc to change this and the URL when

Re: Expose a metric for percentage-recovered during full recoveries

2018-03-15 Thread Rick Leir
S Were there errors in the logs just before recoveries? Rick -- Sorry for being brief. Alternate email is rickleir at yahoo dot com

Re: Solr on DC/OS ?

2018-03-14 Thread Rick Leir
Søren, DC/OS installs on top of Ubuntu or RedHat, and it is used to coordinate many machines so they appear as a cluster. Solr needs to be on a single machine, or in the case of SolrCloud, on many machines. It has no need of the coordination which DC/OS provides. Solr depends on direct access

Re: SynonymGraphFilterFactory with WordDelimiterGraphFilterFactory usage

2018-03-14 Thread Rick Leir
Jay Did you try using text_en_splitting copied out of another release? Though if someone went to the trouble of removing it from the example, there could be something broken in it. Cheers -- Rick -- Sorry for being brief. Alternate email is rickleir at yahoo dot com

Re: How to store files larger than zNode limit

2018-03-14 Thread Rick Leir
Could you manage userdict using Puppet or Ansible? Or whatever your automation system is. -- Sorry for being brief. Alternate email is rickleir at yahoo dot com

RE: How to store files larger than zNode limit

2018-03-14 Thread Rick Leir
Markus, Atita We set it higher too. When zk is recovering from a disconnected state it re-sends all the messages that it had been trying to send while the machines were disconnected. Is this stored in a ' transaction log' .tlog file? I am not clear on this. Zk also goes through the unsent

Re: Navigation/Paging

2018-03-13 Thread Rick Leir
Sebastien Can you not just handle this in your Javascript? Your request will always get 15 rows, start=0 then start=15 and so on. In the details view you only show one of the documents of course, and when the user is viewing the last of 15 and clicks next, you will request the next 15. When

Re: Altering the query if query contains all stopwods

2018-03-09 Thread Rick Leir
Tav, Ryan Now you have me wondering, should it be returning *:* or some general landing page. Suppose you had typeahead or autocomplete, it should ignore any stopwords list. By the way, people on this list have had good reasons why we should stop using stopwords. Cheers -- Rick On March 9,

Re: Indexing nested json

2018-03-08 Thread Rick Leir
Hi James Yonick has a great blog explaining that, but I am on the bus so I do not have a link for you. Yes, you can use nesting, and there are good reasons for doing so, but you will find it much easier to use flat fields. YMMV Cheers -- Rick On March 8, 2018 5:22:13 PM EST, "kasinger, james"

RE: Highlighter throwing InvalidTokenOffsetsException for field with large number of synonyms

2018-03-08 Thread Rick Leir
David Yes, highlighting is tricky, especially with synonyms. Sorry, I would need to see a bit more of your config before saying more about it. Thanks -- Rick -- Sorry for being brief. Alternate email is rickleir at yahoo dot com

Re: Highlighter throwing InvalidTokenOffsetsException for field with large number of synonyms

2018-03-08 Thread Rick Leir
David When you have "lcx__balmoral__cannum__clear_lake__lower_norton" in a field, would you search for *cannum* ? That might not perform well. Why not have a multivalue field for this information? It could be that you have a good reason for this, and I just do not understand. Cheers -- Rick --

Re: Atomic updates using solr-php-client

2018-03-06 Thread Rick Leir
Sami Why not do the simple case first, with complete document updates. When you have that working, you can decide if you want atomic updates too. Cheers -- Rick On March 6, 2018 2:26:50 AM EST, Sami al Subhi wrote: >Thank you for replying, > >Yes that is the one. Unfortunately

Re: Need a Query syntax for fetching results

2018-03-06 Thread Rick Leir
Hi Raj Maybe this would be what you need. "Keyword Tokenizer This tokenizer treats the entire text field as a single token." There used to be an example showing the use of this in schema.xml, but I am away from my computer so it is hard to check. And everything Emir says is spot-on. Then you

Re: Alias field names when searching (not for results)

2018-03-06 Thread Rick Leir
Christopher The first thing that came to mind is that you are planning not to have an app in front of Solr. Without a web app, you will need to trust whoever can get access to Solr. Maybe you are on an intranet. Thanks -- Rick On March 6, 2018 2:42:26 AM EST, "Emir Arnautović"

Re: Need help with match contains query in SOLR

2018-03-01 Thread Rick Leir
Hi Would a pf2 boost suit your needs? You would match loosely on any term, and your results containing bigrams would be at the top. HTH -- Rick On March 1, 2018 11:54:19 AM EST, bbarani wrote: >Hi, > >I want to do a complete "phrase contain" match. > >For ex: Value is

Re: SOLR Similarity Difference

2018-02-27 Thread Rick Leir
Rick Did you experiment in the SolrAdmin analysis page? It would possibly tell you whether your chain is doing what you expect. Then you need to consider that boolean logic is not strictly boolean in Solr. There is a Lucidworks blog which explains this nicely; every now and then someone posts

Re: StandardTokenizer and splitting on mixedcase strings

2018-02-23 Thread Rick Leir
Dan, Lowercase filter before the tokenizer? Cheers -- Rick On February 23, 2018 6:08:27 AM EST, "Dan ." wrote: >Hi, > >The StandardTokenizerFactory splits strings like 'JavaScript' into >'Java' >and 'Script', but then searches with 'javascript' do not match the >document. >

Re: Object not fetched because its identifier appears to be already in processing

2018-02-23 Thread Rick Leir
Ven, Where do you see that message? Rick -- Sorry for being brief. Alternate email is rickleir at yahoo dot com

Re: Indexing timeout issues with SolrCloud 7.1

2018-02-23 Thread Rick Leir
Tom I think you are saying that all updates fail? Need to do a bit of troubleshooting. How about queries? What else is in the logs? Cheers -- Rick -- Sorry for being brief. Alternate email is rickleir at yahoo dot com

Re: Filesystems supported by Solr

2018-02-20 Thread Rick Leir
Ritesh The filesystems you mention are used by Spark so it can stream huge quantities of data (corrections please). By comparison, Solr uses a more 'reasonable' sized filesystem, but needs enough memory that all the index data can be resident. The regular Linux ext3 or ext4 is fine. If you

Re: Index data from mysql DB to Solr - From Scratch

2018-02-17 Thread Rick Leir
Nandan Work backwards from your results screen. When a user has done a search, what information would you like to appear on the screen? That tells you what your Solr document needs to contain. How will you get that information into the Solr document? You will do the SQL select(s) as necessary,

Re: Using SolrJ for digest authentication

2018-01-31 Thread Rick Leir
Eddy Maybe your request is getting through twice. Check your logs to see. Cheers -- Rick On January 31, 2018 5:59:53 AM EST, ddramireddy wrote: >We are currently deploying Solr in war mode(Yes, recommendation is not >war. >But this is something I can't change now. Planned

Re: Searching for an efficient and scalable way to filter query results using non-indexed and dynamic range values

2018-01-31 Thread Rick Leir
Luigi Is there a reason for not indexing all of your on-disk pages? That seems to be the first step. But I do not understand what your goal is. Cheers -- Rick On January 30, 2018 1:33:27 PM EST, Luigi Caiazza wrote: >Hello, > >I am working on a project that simulates a

Re: SolrCloud installation troubles...

2018-01-29 Thread Rick Leir
SELinux? Number open File limits? Number of Process limits? -- Sorry for being brief. Alternate email is rickleir at yahoo dot com

Re: pf2

2018-01-26 Thread Rick Leir
g Management - Alerting - Anomaly Detection >Solr & Elasticsearch Consulting Support Training - http://sematext.com/ > > > >> On 26 Jan 2018, at 13:38, Rick Leir <rl...@leirtech.com> wrote: >> >> Emir >> Thanks, I will do when I get off this bus

RE: solr usage reporting

2018-01-26 Thread Rick Leir
Becky, There are excellent log analysis systems. Logstash? Awstats? I do not think Solr should do this. Some people index their logs into a separate Solr core for analysis, but it might be a challenge to do this in a useful way. Cheers -- Rick On January 25, 2018 2:56:01 PM EST, Becky Bonner

Re: pf2

2018-01-26 Thread Rick Leir
e sample of your query and text that should match. > >Thanks, >Emir >-- >Monitoring - Log Management - Alerting - Anomaly Detection >Solr & Elasticsearch Consulting Support Training - http://sematext.com/ > > > >> On 25 Jan 2018, at 23:13, Rick Leir <rl...@leir

pf2

2018-01-25 Thread Rick Leir
Hi all My pf2 keywords^11.0 works for english not for french. Here are the fieldtypes, actually from two schema.xml's in separate cores. Solr 5.2.2, edismax, q.op AND I suspect there are several problems with the french schema. Maybe I only needed to show the query analyzer, not the index

Re: Solr 7.2.1 - cursorMark and elevateIds

2018-01-25 Thread Rick Leir
Greg Does the CursorMark run correctly on its own, with no elevate? cheers -- Rick On 01/23/2018 08:36 PM, Greg Roodt wrote: Hi I'm trying to use the Query Eleveation Component in conjunction with CursorMark pagination. It doesn't seem to work. I get an exception. Are these components meant

Re: SOLR Data Backup

2018-01-22 Thread Rick Leir
. > >BTW, why do we not recommend having Solr as a source of truth? > One reason is that you might want to tune the analysis chain and then reindex. Or your data gets progressively larger, and you want to be able to recover from an OOM during indexing. Rick -- Sorry for being brief.

Re: Solr Exception: Undefined Field

2018-01-17 Thread Rick Leir
Deepak Would you like to write your post again without asterisks? Include the asterisks which are necessary to the query of course. Rick On January 17, 2018 1:10:28 PM EST, Deepak Goel wrote: >*Hello* > >*In Solr Admin: I type the q parameter as - * > >*text_entry:** > >*It

Re: How to implement the function of W/N in Solr?

2018-01-16 Thread Rick Leir
Xi Might this be something you can solve with pf or pf2? Google solr pf will find this for you. Adjust the slop to allow for terms which are not immediately adjacent. Rick On January 15, 2018 3:04:40 AM EST, "xizhen.w...@incoshare.com" wrote: >Hello, > >I'm using

Re: ClassicTokenizer

2018-01-10 Thread Rick Leir
me to work better the old way? Thanks Rick On January 9, 2018 7:07:59 PM EST, Shawn Heisey <apa...@elyograg.org> wrote: >On 1/9/2018 9:36 AM, Rick Leir wrote: >> A while ago the default was changed to StandardTokenizer from >ClassicTokenizer. The biggest difference seems to be

ClassicTokenizer

2018-01-09 Thread Rick Leir
Hi all A while ago the default was changed to StandardTokenizer from ClassicTokenizer. The biggest difference seems to be that Classic does not break on hyphens. There is also a different character pr(mumble). I prefer the Classic's non-break on hyphens. What was the reason for changing this

Re: Deliver static html content via solr

2018-01-05 Thread Rick Leir
’t slow. > >For more on /browse, VrW, and example/files usage of those, check out >https://lucidworks.com/2015/12/08/browse-new-improved-solr-5/ > > Erik > > > >> On Jan 5, 2018, at 4:19 AM, Rick Leir <rl...@leirtech.com> wrote: >> >> Using Ve

Re: Deliver static html content via solr

2018-01-05 Thread Rick Leir
Using Velocity, you can have some results-driven HTML served by Solr and all your JS, CSS etc 'assets' served by Apache from /var/www/html. Warning: the Velocity learning curve is steep and you still need a separate front-end web app for security because Velocity is a templating output filter.

Re: SOLR SSL Java command line properties

2018-01-05 Thread Rick Leir
Bob Thanks for mentioning the jetty-ssl.xml file. I have a follow-on question: since it is strongly recommended that you host Solr behind a web app (perhaps solr-security-proxy is adequate), the Solr REST interface will not be on the open Internet, so perhaps HTTP is the appropriate

Re: Small Tokenization issue

2018-01-05 Thread Rick Leir
Nawab Look at classicTokenizer. It is a good choice if you have part numbers with hyphens. The second tokenizer on this page: https://lucene.apache.org/solr/guide/6_6/tokenizers.html Cheers -- Rick On 01/03/2018 04:52 PM, Shawn Heisey wrote: On 1/3/2018 1:56 PM, Nawab Zada Asad Iqbal

Re: DIH XPathEntityProcessor XPath subset?

2018-01-05 Thread Rick Leir
Stefan There is at least one free Solr WP plugin. There are several Solr PHP toolkits on github. Start with these unless your WP is wildly custo..  .. cheers -- Rick On 01/03/2018 11:50 AM, Erik Hatcher wrote: Stefan - If you pre-transform the XML, I’d personally recommend either

Re: SolrJ with Async Http Client

2018-01-02 Thread Rick Leir
Agrawal There is good reading on the topic at https://wiki.apache.org/solr/IntegratingSolr Cheers -- Rick On January 2, 2018 10:31:28 AM EST, RAUNAK AGRAWAL wrote: >Hi Guys, > >I am trying to write fully async service where solr calls are also >async. >Just wondering

Re: Solr Issue

2018-01-02 Thread Rick Leir
Lewin Is this not a job for a database like MySQL? Solr is a search engine, which can be used as a DB with some effort. Choose the right tool for the job . Cheers -- Rick On January 2, 2018 4:35:47 PM EST, "Lewin Joy (TMNA)" wrote: >** PROTECTED 関係者外秘 >Hi, > >I am using

Re: Enable default wildcard search

2017-12-29 Thread Rick Leir
Siarhei: Will you be putting up your system at github? I would like to Solr-ize my dovecot. Maybe you saw this already: https://github.com/dovecot/core/blob/master/doc/solr-schema.xml https://github.com/dovecot/core/blob/master/src/plugins/fts-solr/solr-connection.c

Re: solrcloud through aws elb

2017-12-26 Thread Rick Leir
Per, This is more of a question for the Drupal folks. But in passing, I would suggest that you show your config and what you saw in your logs. And my guess is firewall problems! Cheers -- Rick On December 26, 2017 3:37:39 AM EST, Per Qvindesland wrote: >Hi All > >I am trying to

Re: start with techproducts example in docker

2017-12-19 Thread Rick Leir
Christine I think this is a long-lived docker container , meaning that it does not all terminate after the command you showed. If so, you should be able to start a console or ssh session to it. Have a look at the solr.log. Better still, start the techproducts example within this ssh session,

Re: How to restrict the fields solr returns?

2017-12-19 Thread Rick Leir
The fl parameter is used for this. On December 19, 2017 3:22:59 AM EST, Solrmails wrote: >Hey > >I'm using a custom "QParserPlugin" to restrict which documents are >returned to the user. >Now I'd like to restrict also the fields that are returned with a >document. I

Re: How to restart solr in docker?

2017-12-18 Thread Rick Leir
Christine Have a look at the API Lucene.apache.org/solr/guide/6_6/config-api.html (Choose whatever version of the doc that is appropriate) Various parts of solrconfig can be overlayed. Or do your debugging with a normally installed Solr, then dockerize. Cheers -- Rick On December 18, 2017

Synonyms

2017-12-17 Thread Rick Leir
Hi All Doug Turnbull's latest blog http://opensourceconnections.com/blog/2017/11/21/solr-synonyms-mea-culpa/ at OpenSourceConnections is great, I learned lots. He mentions Wordnet the lexical database for the English language. If you are using his suggested synonyms in Solr to 'tune' tf/idf

Re: How to implement Incremental Indexing.

2017-12-11 Thread Rick Leir
Fiz Here is a blog article that seems to cover your plans https://www.toadworld.com/platforms/nosql/b/weblog/archive/2017/02/03/indexing-mongodb-data-in-apache-solr Also look at github, there are several projects which could do it for you. Cheers -- Rick On December 11, 2017 5:19:43 PM EST, Fiz

RE: FW: Need Help Configuring Solr To Work With Nutch

2017-12-09 Thread Rick Leir
> >Kind regards, >Aratrika Mukhopadhyay > >-Original Message- >From: Rick Leir [mailto:rl...@leirtech.com] >Sent: Friday, December 08, 2017 4:06 PM >To: solr-user@lucene.apache.org >Subject: Re: FW: Need Help Configuring Solr To Work With Nutch > >Ara >So

Re: FW: Need Help Configuring Solr To Work With Nutch

2017-12-08 Thread Rick Leir
Ara Softcommit might be the default in Solrconfig.xml, and if not then you should probably make it so. Then you need to have a look in solr.log if things are not working as you expect. Cheers -- Rick On December 8, 2017 3:23:35 PM EST, "Mukhopadhyay, Aratrika"

Re: indexing XML stored on HDFS

2017-12-07 Thread Rick Leir
in JSONL. Then use DIH or post tool or Python to post the docs to Solr. Rick On December 7, 2017 10:14:37 AM EST, Rick Leir <rl...@leirtech.com> wrote: >Matthew, >Do you have some sort of script calling xslt? Sorry, I do not know >Scala and I did not have time to look into your spark util

Re: indexing XML stored on HDFS

2017-12-07 Thread Rick Leir
Matthew, Do you have some sort of script calling xslt? Sorry, I do not know Scala and I did not have time to look into your spark utils. The script or Scala could then shell out to curl, or if it is python it could use the request library to send a doc to Solr. Extra points for batching the

Re: Howto search for § character

2017-12-06 Thread Rick Leir
Bernd, What is the analysis chain you have in schema.xml? The chain tokenizes text and filters characters. There is an index time chain and a query time chain. My suspicion is that your analysis chain is mapping that char to a plain ascii char. Use the SolrAdmin analysis tab to debug this.

Re: Fwd: solr-security-proxy

2017-12-01 Thread Rick Leir
a few AJAX GETs? Cheers -- Rick On November 30, 2017 3:10:14 PM EST, Rick Leir <rl...@leirtech.com> wrote: >Hi all >I have just been looking at solr-security-proxy, which seems to be a >great little app to put in front of Solr (link below). But would it >make more sense to use a

Fwd: solr-security-proxy

2017-11-30 Thread Rick Leir
Hi all I have just been looking at solr-security-proxy, which seems to be a great little app to put in front of Solr (link below). But would it make more sense to use a whitelist of Solr parameters instead of a blacklist? Thanks Rick https://github.com/dergachev/solr-security-proxy

Re: Solr Wildcard Search

2017-11-30 Thread Rick Leir
George, When you get those results it could be due to stemming. Wildcard processing expands your term to multiple terms, OR'd together. It also takes you down a different analysis pathway, as many analysis components do not work with multiple terms. Look into the SolrAdmin console, and use the

Re: Solr7 org.apache.lucene.index.IndexUpgrader

2017-11-27 Thread Rick Leir
Leo Your low priority data could be accumulated in a Couchbase DB or just in JSONL. Then it would be easy to re-index. Cheers -- Rick -- Sorry for being brief. Alternate email is rickleir at yahoo dot com

Re: Difference between UAX29URLEmailTokenizerFactory and ClassicTokenizerFactory

2017-11-24 Thread Rick Leir
Edwin There is a spec for which characters are acceptable in an email name, and another spec for chars in a domain name. I suspect you will have more success with a tokenizer which is specialized for email, but I have not looked at UAX29URLEmailTokenizerFactory. Does ClassicTokenizerFactory

Re: Solr7: Very High number of threads on aggregator node

2017-11-23 Thread Rick Leir
gt; >I couldn't find a decent way to copy long xml here, so i created this >stackoverflow thread:- > >https://stackoverflow.com/questions/47439503/solr-7-0-1-aggregator-node-spinning-many-threads > > >Thanks! >Nawab > > >On Mon, Nov 20, 2017 at 3:10 PM, Rick Leir <r

Re: Issue facing with spell text field containing hyphen

2017-11-21 Thread Rick Leir
Chirag Look in Sor Admin, the Analysis panel. Put spider-man in the left and right text inputs, and see how it gets analysed. Cheers -- Rick On November 20, 2017 10:00:49 PM EST, Chirag garg wrote: >Hi Rick, > >Actually my spell field also contains text with hyphen i.e. it

Re: Issue facing with spell text field containing hyphen

2017-11-20 Thread Rick Leir
Chirag Some scattered clues: StandardTokenizer splits on punctuation, so your spell field might not contain spider-man. When you do a wildcard search, the analysis chain can be different from what you expected. Cheers -- Rick On November 20, 2017 9:58:54 AM EST, Chirag Garg

Re: Solr7: Very High number of threads on aggregator node

2017-11-20 Thread Rick Leir
Nawab Why it would be good to share the solrconfigs: I had a suspicion that you might be using the same solrconfig for version 7 and 4.5. That is unlikely to work well. But I could be way off base. Rick -- Sorry for being brief. Alternate email is rickleir at yahoo dot com

Re: Fwd: CVE-2017-3163 - SOLR-5.2.1 version

2017-11-20 Thread Rick Leir
Pad Read the CVE. Do you have an affected version of Solr? Do you have the replication feature enabled in solrconfig.xml? Note that it might be enabled by default. Test directory traversal on your system: can you read files remotely? No? Then you are finished. A better plan: upgrade to a newer

Re: Solr7: Very High number of threads on aggregator node

2017-11-18 Thread Rick Leir
Nawab You probably need to share the relevant config to get an answer to this. Cheers -- Rick On November 17, 2017 2:19:03 PM EST, Nawab Zada Asad Iqbal wrote: >Hi, > >I have a sharded solr7 cluster and I am using an aggregator node (which >has >no data/index of its own) to

Re: DIH not stop

2017-11-16 Thread Rick Leir
Can, I would like to learn many languages, but so far only two. Shawn suggested you get help from a friend who knows English. As well, Google translate is great for me, but I have not used it with Turkish. Cheers -- Rick On November 16, 2017 5:19:33 AM EST, Shawn Heisey

Re: TimeZone issue

2017-11-16 Thread Rick Leir
Renuka Are your clients all in the same time zone? Solr should support clients in several timezones, and UTC conversion to local is best done in the client in my mind. Thanks -- Rick On November 16, 2017 6:54:47 AM EST, Renuka Srishti wrote: >Thanks for your

Re: How to get a solr core to persist

2017-11-15 Thread Rick Leir
Hi Shawn, Amanda When we put the data under /var/lib, I feel a need to put the config under /etc. Is this recommended, and would you use a symbolic link for the conf dir? Cheers--Rick -- Sorry for being brief. Alternate email is rickleir at yahoo dot com

Re: Admin Console Question

2017-11-14 Thread Rick Leir
Homer In chrome, right-click and choose 'inspect' at the bottom. Now go to the network tab then reload the page. Are you seeing errors? Tell! Thanks Rick On November 14, 2017 3:14:46 PM EST, Shawn Heisey wrote: >On 11/14/2017 11:43 AM, Webster Homer wrote: >> I am using

Re: recent utf8 problems

2017-11-07 Thread Rick Leir
Dr Krell Item 11): It is best to get the solrconfig.xml provided with the new version of Solr, and change it to suit your needs. Do not try to work from the old version's solrconfig.xml. I did not have time to read the other items. Look in solr.log, and compare the successful query with the

Re: recent utf8 problems

2017-11-06 Thread Rick Leir
Hoss Clearly it is U+00FC ü c3 bc LATIN SMALL LETTER U WITH DIAERESIS As in Tübingen "With the Yahoo Flickr Creative Commons 100 Million (YFCC100m) dataset, a great novel dataset was introduced to the computer vision and multimedia research community." -- cool I think it is strange

Re: recent utf8 problems

2017-11-06 Thread Rick Leir
Dr. Krell You could look at your /select query handler, and compare it with the /query query handler in the Admin config. Did you upgrade from a previous version of Solr? Or change your config ( no, you must have thought of that). If it is a bug related to the Java upgrade then you need to

Re: Fwd: configuring Solr with Tesseract

2017-11-06 Thread Rick Leir
Anand, As Charlie says you should have a separate process for this. Also, if you go back about ten months in this mailing list you will see some discussion about how OCR can take minutes of CPU per page, and needs some preprocessing with Imagemagick or Graphicsmagick. You will want to do some

Re: Incomplete Index

2017-10-31 Thread Rick Leir
Dawg, I have a similar setup, and this is what works for me. I have a field which contains a timestamp. The timestamp is set to be identical for all documents added/updated in a run. Whe the run is complete and some/many documents have been overwritten then I can delete all un-updated documents

Re: Some problems in SOLR-6.5.1

2017-10-25 Thread Rick Leir
Klin, You need to use the new version's solrconfig.xml, with modifications as necessary. Start by looking at the current solrconfig, what was modified there? Did you re-index? If you cannot reindex then you should upgrade to 5.n then to 6.m. Cheers -- Rick On October 24, 2017 11:21:48 PM EDT,

Re: BlendedTermQuery for Solr?

2017-10-25 Thread Rick Leir
James It looks as if Markus could help: http://lucene.472066.n3.nabble.com/BlendedTermQuery-causing-negative-IDF-td4271289.html Also, ES has a query. You could look at the source there. "BlendedTermQuery forms the guts behind Elasticsearch’s cross_field search. -- Doug Turnbull Cheers -- Rick

Re: Upload/update full schema and solrconfig in standalone mode

2017-10-21 Thread Rick Leir
Alessandro, Scp is "secure cp" and is a part of the ssh service. So if you have ssh access then you can use scp. From Windows you would be using winscp. Many hosts provide this but not all. If you send files to the ops staff then they can coordinate the restart and any fallback planning, so

Re: Upload/update full schema and solrconfig in standalone mode

2017-10-20 Thread Rick Leir
Alessandro First, let me say that the whole idea makes me nervous. 1/ are you better off with scp? I would not want to do this via Solr API 2/ the right way to do this is with Ansible, Puppet or Docker, 3/ would you like to update a 'QA' installation, test it, then flip it into production?

Re: Goal: reverse chronological display Methods? (1) boost, and/or (2) disable idf

2017-10-20 Thread Rick Leir
Bill, In the debug score calculations, the bf boosting does not appear at all. I would expect it to at least show up with a small value. So maybe we need to look at the query. Cheers -- Rick -- Sorry for being brief. Alternate email is rickleir at yahoo dot com

Re: Deploy Solr to Production: guides, best practices

2017-10-19 Thread Rick Leir
Maximka The app server is bundled in Solr, so you do not install Tomcat or JEtty separately. Cheers -- Rick On October 19, 2017 2:01:30 AM EDT, maximka19 wrote: >Hi everyone! > >I was looking for full-text search engine and chosen Solr. Quickly >introduced with Solr.

Re: Several critical vulnerabilities discovered in Apache Solr (XXE & RCE)

2017-10-15 Thread Rick Leir
hl, search solution architect >Cominvent AS - www.cominvent.com > >> 15. okt. 2017 kl. 15:29 skrev Florian Gleixner <f...@redflo.de>: >> >> On 13.10.2017 15:13, Rick Leir wrote: >>> Hi all, >>> What is the earliest version which was vulnerable? >>> Thanks -- R

Re: solr 7.0.1: exception running post to crawl simple website

2017-10-13 Thread Rick Leir
On 2017-10-13 04:19 PM, Kevin Layer wrote: Amrit Sarkar wrote: Kevin, fileType => md is not recognizable format in SimplePostTool, anyway, moving on. OK, thanks. Looks like I'll have to abandon using solr for this project (or find another way to crawl the site). Thank you for all the help,

Re: Several critical vulnerabilities discovered in Apache Solr (XXE & RCE)

2017-10-13 Thread Rick Leir
Hi all, What is the earliest version which was vulnerable? Thanks -- Rick -- Sorry for being brief. Alternate email is rickleir at yahoo dot com

Re: Solr related questions

2017-10-13 Thread Rick Leir
1/ the _version_ field is necessary. 2/ there is a Solr api for editing the manged schema 3/ not having used solrnet, I suspect you can bypass it and use the solr REST api directly. Cheers -- Rick On October 13, 2017 5:40:26 AM EDT, startrekfan wrote: >Hello, > >I

Re: Appending fields to pre-existed document

2017-10-13 Thread Rick Leir
Hi Show us the solr version, field types, the handler definition, and the query you send. Any log entries? Cheers -- Rick On October 13, 2017 5:57:16 AM EDT, "Игорь Абрашин" wrote: >Hello, solr community. >We are getting strugled with updating already existing docs. For

Re: book on solr

2017-10-13 Thread Rick Leir
Jay, get info on this with a search: https://www.google.ca/search?q=solr+shard+size cheers -- Rick On 2017-10-13 01:42 AM, Jay Potharaju wrote: Any blog or documentation also that would provide some basic rules or guidelines for scaling would also be great. Thanks Jay Potharaju

Re: ERROR ipc.AbstractRpcClient: SASL authentication failed

2017-10-04 Thread Rick Leir
Ascot, At the risk of ...   Can you disable Kerberos in Hbase? If not, then you will have to provide a password! Rick On 2017-10-04 07:32 PM, Ascot Moss wrote: Does anyone use hbase indexer in index kerberos Hbase to solr? Pls help! On Wed, Oct 4, 2017 at 10:18 PM, Ascot Moss

Re: Time to Load a Solr Core with Hdfs Directory Factory

2017-10-04 Thread Rick Leir
Shashank, I had a quick look at: https://lucene.apache.org/solr/guide/6_6/running-solr-on-hdfs.html Did you enable the Block Cache and the solr.hdfs.nrtcachingdirectory? cheers -- Rick On 2017-10-03 09:22 PM, Shashank Pedamallu wrote: Hi, I’m trying an experiment in which, I’m loading a

Re: SOLR terminology

2017-09-28 Thread Rick Leir
Gunalan, Solr Core (core), is one-to-one with a Solr process and its data directory. It can be a shard, or part of a replica. Collection - is one or more shards grouped together, and can be replicated for reliability, availability and performance Node - is a machine in a Zookeeper group

Re: Replicates not recovering after rolling restart

2017-09-22 Thread Rick Leir
Wunder, Erick $ dc 16o 1578578283947098112p 15E83C95E8D0 That is an interesting number. Is it, as a guess, machine instructions or an address pointer? It does not look like UTF-8 or ASCII. Machine code looks promising: Disassembly: 0: 15 e8 3c 95 e8 adceax,0xe8953ce8 5:

  1   2   3   >