Is it possible to profile the code to find the exact points which are
taking more time comparatively?
On Sun, 27 May 2018, 06:02 Will Currie, wrote:
> I raised https://issues.apache.org/jira/browse/SOLR-12407. In case anybody
> else sees a similar slowdown with boosts.
>
> On
I raised https://issues.apache.org/jira/browse/SOLR-12407. In case anybody
else sees a similar slowdown with boosts.
On Sat, May 26, 2018 at 4:10 PM, Will Currie wrote:
> I did some more (micro)benchmarking with a single query. Setting the query
> cache size to zero I see
Thanks! now I can just record the URL and then paste it in ;)
Who knows, maybe people will see it first too!
On Sat, May 26, 2018 at 9:48 AM, Tim Allison wrote:
> W00t! Thank you, Shawn!
>
> The "don't use ERH in production" response comes up frequently enough
>> that I
Thanks- It's actually more like a localhost/app2:
app2 in question is Omeka (digital publishing platform)
When Omeka is installed on a server, it's usually all alone on the server.
So you *tell *it to index something and what core corresponds to that index
and it indexes it?
If so, I think I'll
I think you may have other pieces of software in that equation. Solr does
not normally pull data from websites, it gets data pushed.
Well, data import handler can do it. Then you normally start indexing by a
command to Solr. That commans corresponds to a request handler in
solrconfig.xml that
Hello.
I have a page that consists of a domain name and several folders in it
corresponding to different web applications.
eg:
website.university.edu/app1
website.university.edu/app2
website.university.edu/app3
And all the pages are stored in separate folders in an html directory.
There is
W00t! Thank you, Shawn!
The "don't use ERH in production" response comes up frequently enough
> that I have created a wiki page we can use for responses:
>
> https://wiki.apache.org/solr/RecommendCustomIndexingWithTika
>
> Tim, you are extremely well-qualified to expand and correct this page.
>
On 5/26/2018 4:52 AM, Tim Allison wrote:
Please see Erick Erickson’s evergreen advice and linked blog post:
https://mail-archives.apache.org/mod_mbox/lucene-solr-user/201805.mbox/%3ccan4yxve_0gn0a1y7wjpr27inuddo6+jzwwfgvzkfs40gh3r...@mail.gmail.com%3e
The "don't use ERH in production"
+1 as always to Erick’s advice. DIH is only a PoC.
We do have a DigestingParser in Tika, and when you combine that w the
RecursiveParserWrapper, you can get digests not only of the main file but
also on all embedded files/attachments...which can be pretty neat for some
use cases.
Operators are
On third thought, I can’t think of how you’d easily inject a
PasswordProvider into Solr’s integration.
Please see Erick Erickson’s evergreen advice and linked blog post:
You’ll need to provide a PasswordProvider in the ParseContext. I don’t
think that is currently possible in the Solr integration. Please open a
ticket if SolrJ doesn’t meet your needs.
On Thu, May 24, 2018 at 1:03 PM Alexandre Rafalovitch
wrote:
> Hmm. If it works, then it
I did some more (micro)benchmarking with a single query. Setting the query
cache size to zero I see 400ms response time on 7.2 and 600ms on 7.3.
Running curl in a loop on my laptop. ~4M docs. ~3G index. 1M total hits for
the query.. Yup. I'm reluctant to post the query. It has multiple 300+
12 matches
Mail list logo