Hi Shawn et al,
Thanks a lot for the prompt answer.
It looks to me that I made quite a few mistakes in formulating those solr
queries. Setting shards.qt to the name of the core was completely wrong. I
tried to search for shards.qt in http://lucene.apache.org/solr/guide/7_3/
but it did not give
On 6/22/2018 8:12 AM, Shawn Heisey wrote:
I wonder if having an invalid handler contributed to the speed.
Further thought about this:
I can't say whether having an invalid handler name would cause speed
problems, but based on my limited understanding of the code involved, I
don't think it
On 6/22/2018 6:50 AM, Arturas Mazeika wrote:
I grabbed the 2.7.1 version of solr, created a 4 core setup with
replication factor 2 on windows using [1], I've restarted the setup with
2GB for each node [2], inserted the html docs from the german wikipedia
archive [3], and obtained top 10 terms
Hi Solr-Team,
I am familiarizing myself with solr cloud and I am trying out and compare
different processing setups. Short story: term-query ran on shard gives
lower numbers compared querying the complete index. I wonder why.
Long story:
I grabbed the 2.7.1 version of solr, created a 4 core
heng Lin Edwin Yeo [mailto:edwinye...@gmail.com]
Sent: Thursday, April 21, 2016 12:13 AM
To: solr-user@lucene.apache.org
Subject: Re: Overall large size in Solr across collections
Hi Shawn,
Yes, I'm using the Extracting Request Handler.
The 0.7GB/hr is the indexing rate at which the size of the
n command prompt in 4 different command windows.
> > The ID has been configured in such a way that it will not overwrite each
> > other during the indexing. Is that considered multi-threading?
> >
> > The rate are all below 0.2GB/hr for each individual threads, and overall
> &
other during the indexing. Is that considered multi-threading?
>
> The rate are all below 0.2GB/hr for each individual threads, and overall
> rate is just 0.7GB/hr.
Was I right to think you're using the Extracting Request Handler?
If you have enough CPU resources on the Solr server, you could s
?
The rate are all below 0.2GB/hr for each individual threads, and overall
rate is just 0.7GB/hr.
Regards,
Edwin
On 20 April 2016 at 21:43, Shawn Heisey <apa...@elyograg.org> wrote:
> On 4/19/2016 10:12 PM, Zheng Lin Edwin Yeo wrote:
> > Thanks for the information Shawn.
> >
&
On 4/19/2016 10:12 PM, Zheng Lin Edwin Yeo wrote:
> Thanks for the information Shawn.
>
> I believe it could be due to the types of file that is being indexed.
> Currently, I'm indexing the EML files which are in HTML format, and they
> are more rich in content (with in line images and full text),
Thanks for the information Shawn.
I believe it could be due to the types of file that is being indexed.
Currently, I'm indexing the EML files which are in HTML format, and they
are more rich in content (with in line images and full text), while
previously the EML files are in Plain Text format,
On 4/19/2016 9:28 AM, Zheng Lin Edwin Yeo wrote:
> Currently, the searching performance is still doing fine, but it is the
> indexing that is slowing down. Not sure if increasing the RAM, or changing
> to a SSD hard disk will help with the indexing speed?
You need to figure out exactly what is
Hi Shawn,
Currently, the searching performance is still doing fine, but it is the
indexing that is slowing down. Not sure if increasing the RAM, or changing
to a SSD hard disk will help with the indexing speed?
Regards,
Edwin
On 19 April 2016 at 21:57, Shawn Heisey wrote:
On 4/18/2016 8:50 PM, Zheng Lin Edwin Yeo wrote:
> Thanks for your explanation.
>
> I have set my segment size to 20GB under the TieredMergePolicy
>
> "maxMergeAtOnce">10 10 "maxMergedSegmentMB">20480
That just controls the maximum size of a segment. This defaults to
5GB. When segments
s passed the 1TB mark. Currently
> > the query speed is still normal, but the indexing speed seems to be
> become
> > slower.
> >
> > Will it affect the performance if I continue to increase the index size
> but
> > stick to 1 shard?
>
> I have noticed over
> slower.
>
> Will it affect the performance if I continue to increase the index size but
> stick to 1 shard?
I have noticed overall *bulk* indexing speed slows down as the index
gets bigger, but I suspect that a big part of the reason this happens is
segment merging involves more *large* s
Hi,
I have many collections in Solr, but with only 1 shard. I found that the
index size across all the collections has passed the 1TB mark. Currently
the query speed is still normal, but the indexing speed seems to be become
slower.
Will it affect the performance if I continue to increase the
Hi all,
Is there a setting to enforce an overall timeout for Solr? For example, we are
using setting timeallowed=2000 in solrconfig.xml (using version 3.5), but as
far as I can tell, that only applies to the search part that returns partial
results if it takes more than 2 seconds and returns
overall Solr timeout
Hi all,
Is there a setting to enforce an overall timeout for Solr? For example, we
are
using setting timeallowed=2000 in solrconfig.xml (using version 3.5), but as
far
as I can tell, that only applies to the search part that returns partial
results
if it takes
Hi!
Some questions:
1) Is it possible to make Solr to use, for example, MySQL database,
or it only supports *.xml files as a database?
2) Is there a way to add data in the search database using some
online interface, or the only way is manually adding the data in the
*.xml files?
3) Is there any
2008/6/9 Mihails Agafonovs [EMAIL PROTECTED]:
Hi!
Some questions:
1) Is it possible to make Solr to use, for example, MySQL database,
or it only supports *.xml files as a database?
you can use DataImportHandler to index from MySql (or other databases)
2) Is there a way to add data in
1) ok
2) This means developing some custom program, so there is no such
functionality in Solr :(
3) I have some connection problems and I really can't load these
mailing list archives at all! Anyway, I want to understand, how can I
use Solr in my site or any other usage?
Quoting Umar Shah :
Hi Mihails,
I don't know about points 1 and 2 as I'm just starting with Solr but
for point 3 you need to understand that Solr is just going to return
xml for your queries so you can use any web language to parse the xml
of the results. It might return other formats like json as well,
2008/6/9 Mihails Agafonovs [EMAIL PROTECTED]:
Hi!
Some questions:
1) Is it possible to make Solr to use, for example, MySQL database,
or it only supports *.xml files as a database?
If you do that, use MySQL own full text search capabilities and not Solr, as
it is built from Lucene.
2) Is
2) Take a look at DataImportHandler for indexing data at
http://wiki.apache.org/solr/DataImportHandler
2008/6/10 Alexander Ramos Jardim [EMAIL PROTECTED]:
2008/6/9 Mihails Agafonovs [EMAIL PROTECTED]:
Hi!
Some questions:
1) Is it possible to make Solr to use, for example, MySQL
]
To: solr-user@lucene.apache.org
Sent: Thursday, October 18, 2007 9:49:25 PM
Subject: Re: Overall performance: network v.s. SAN file system
Hi,
I have not done any comparisons with Solr but have done some with
another enterprise search engine. Are you looking for performance data
or architecture
- Original Message
From: Lance Norskog [EMAIL PROTECTED]
To: solr-user@lucene.apache.org
Sent: Wednesday, August 15, 2007 6:28:27 PM
Subject: Overall performance: network v.s. SAN file system
Is anyone doing Solr installations with a SAN file system? Like IBM
Storage
Tank or Apple
, August 15, 2007 6:28:27 PM
Subject: Overall performance: network v.s. SAN file system
Is anyone doing Solr installations with a SAN file system? Like IBM
Storage
Tank or Apple XSAN or Red Hat GFS? What are your experiences?
Thanks,
Lance
--
Christopher
Triggs
ABN:24 544 055 144
Is anyone doing Solr installations with a SAN file system? Like IBM Storage
Tank or Apple XSAN or Red Hat GFS? What are your experiences?
Thanks,
Lance
28 matches
Mail list logo