Re: top 10 query overall vs shard

2018-06-22 Thread Arturas Mazeika
Hi Shawn et al, Thanks a lot for the prompt answer. It looks to me that I made quite a few mistakes in formulating those solr queries. Setting shards.qt to the name of the core was completely wrong. I tried to search for shards.qt in http://lucene.apache.org/solr/guide/7_3/ but it did not give

Re: top 10 query overall vs shard

2018-06-22 Thread Shawn Heisey
On 6/22/2018 8:12 AM, Shawn Heisey wrote: I wonder if having an invalid handler contributed to the speed. Further thought about this: I can't say whether having an invalid handler name would cause speed problems, but based on my limited understanding of the code involved, I don't think it

Re: top 10 query overall vs shard

2018-06-22 Thread Shawn Heisey
On 6/22/2018 6:50 AM, Arturas Mazeika wrote: I grabbed the 2.7.1 version of solr, created a 4 core setup with replication factor 2 on windows using [1], I've restarted the setup with 2GB for each node [2], inserted the html docs from the german wikipedia archive [3], and obtained top 10 terms

top 10 query overall vs shard

2018-06-22 Thread Arturas Mazeika
Hi Solr-Team, I am familiarizing myself with solr cloud and I am trying out and compare different processing setups. Short story: term-query ran on shard gives lower numbers compared querying the complete index. I wonder why. Long story: I grabbed the 2.7.1 version of solr, created a 4 core

RE: Overall large size in Solr across collections

2016-04-26 Thread Allison, Timothy B.
heng Lin Edwin Yeo [mailto:edwinye...@gmail.com] Sent: Thursday, April 21, 2016 12:13 AM To: solr-user@lucene.apache.org Subject: Re: Overall large size in Solr across collections Hi Shawn, Yes, I'm using the Extracting Request Handler. The 0.7GB/hr is the indexing rate at which the size of the

Re: Overall large size in Solr across collections

2016-04-20 Thread Zheng Lin Edwin Yeo
n command prompt in 4 different command windows. > > The ID has been configured in such a way that it will not overwrite each > > other during the indexing. Is that considered multi-threading? > > > > The rate are all below 0.2GB/hr for each individual threads, and overall > &

Re: Overall large size in Solr across collections

2016-04-20 Thread Shawn Heisey
other during the indexing. Is that considered multi-threading? > > The rate are all below 0.2GB/hr for each individual threads, and overall > rate is just 0.7GB/hr. Was I right to think you're using the Extracting Request Handler? If you have enough CPU resources on the Solr server, you could s

Re: Overall large size in Solr across collections

2016-04-20 Thread Zheng Lin Edwin Yeo
? The rate are all below 0.2GB/hr for each individual threads, and overall rate is just 0.7GB/hr. Regards, Edwin On 20 April 2016 at 21:43, Shawn Heisey <apa...@elyograg.org> wrote: > On 4/19/2016 10:12 PM, Zheng Lin Edwin Yeo wrote: > > Thanks for the information Shawn. > > &

Re: Overall large size in Solr across collections

2016-04-20 Thread Shawn Heisey
On 4/19/2016 10:12 PM, Zheng Lin Edwin Yeo wrote: > Thanks for the information Shawn. > > I believe it could be due to the types of file that is being indexed. > Currently, I'm indexing the EML files which are in HTML format, and they > are more rich in content (with in line images and full text),

Re: Overall large size in Solr across collections

2016-04-19 Thread Zheng Lin Edwin Yeo
Thanks for the information Shawn. I believe it could be due to the types of file that is being indexed. Currently, I'm indexing the EML files which are in HTML format, and they are more rich in content (with in line images and full text), while previously the EML files are in Plain Text format,

Re: Overall large size in Solr across collections

2016-04-19 Thread Shawn Heisey
On 4/19/2016 9:28 AM, Zheng Lin Edwin Yeo wrote: > Currently, the searching performance is still doing fine, but it is the > indexing that is slowing down. Not sure if increasing the RAM, or changing > to a SSD hard disk will help with the indexing speed? You need to figure out exactly what is

Re: Overall large size in Solr across collections

2016-04-19 Thread Zheng Lin Edwin Yeo
Hi Shawn, Currently, the searching performance is still doing fine, but it is the indexing that is slowing down. Not sure if increasing the RAM, or changing to a SSD hard disk will help with the indexing speed? Regards, Edwin On 19 April 2016 at 21:57, Shawn Heisey wrote:

Re: Overall large size in Solr across collections

2016-04-19 Thread Shawn Heisey
On 4/18/2016 8:50 PM, Zheng Lin Edwin Yeo wrote: > Thanks for your explanation. > > I have set my segment size to 20GB under the TieredMergePolicy > > "maxMergeAtOnce">10 10 "maxMergedSegmentMB">20480 That just controls the maximum size of a segment. This defaults to 5GB. When segments

Re: Overall large size in Solr across collections

2016-04-18 Thread Zheng Lin Edwin Yeo
s passed the 1TB mark. Currently > > the query speed is still normal, but the indexing speed seems to be > become > > slower. > > > > Will it affect the performance if I continue to increase the index size > but > > stick to 1 shard? > > I have noticed over

Re: Overall large size in Solr across collections

2016-04-18 Thread Shawn Heisey
> slower. > > Will it affect the performance if I continue to increase the index size but > stick to 1 shard? I have noticed overall *bulk* indexing speed slows down as the index gets bigger, but I suspect that a big part of the reason this happens is segment merging involves more *large* s

Overall large size in Solr across collections

2016-04-18 Thread Zheng Lin Edwin Yeo
Hi, I have many collections in Solr, but with only 1 shard. I found that the index size across all the collections has passed the 1TB mark. Currently the query speed is still normal, but the indexing speed seems to be become slower. Will it affect the performance if I continue to increase the

Enforce overall Solr timeout

2012-01-18 Thread Jose Aguilar
Hi all, Is there a setting to enforce an overall timeout for Solr? For example, we are using setting timeallowed=2000 in solrconfig.xml (using version 3.5), but as far as I can tell, that only applies to the search part that returns partial results if it takes more than 2 seconds and returns

Re: Enforce overall Solr timeout

2012-01-18 Thread Otis Gospodnetic
overall Solr timeout Hi all, Is there a setting to enforce an overall timeout for Solr? For example, we are using setting timeallowed=2000 in solrconfig.xml (using version 3.5), but as far as I can tell, that only applies to the search part that returns partial results if it takes

Overall

2008-06-09 Thread Mihails Agafonovs
Hi! Some questions: 1) Is it possible to make Solr to use, for example, MySQL database, or it only supports *.xml files as a database? 2) Is there a way to add data in the search database using some online interface, or the only way is manually adding the data in the *.xml files? 3) Is there any

Re: Overall

2008-06-09 Thread Umar Shah
2008/6/9 Mihails Agafonovs [EMAIL PROTECTED]: Hi! Some questions: 1) Is it possible to make Solr to use, for example, MySQL database, or it only supports *.xml files as a database? you can use DataImportHandler to index from MySql (or other databases) 2) Is there a way to add data in

Re: Overall

2008-06-09 Thread Mihails Agafonovs
1) ok 2) This means developing some custom program, so there is no such functionality in Solr :( 3) I have some connection problems and I really can't load these mailing list archives at all! Anyway, I want to understand, how can I use Solr in my site or any other usage? Quoting Umar Shah :

Re: Overall

2008-06-09 Thread Dom Stockdale
Hi Mihails, I don't know about points 1 and 2 as I'm just starting with Solr but for point 3 you need to understand that Solr is just going to return xml for your queries so you can use any web language to parse the xml of the results. It might return other formats like json as well,

Re: Overall

2008-06-09 Thread Alexander Ramos Jardim
2008/6/9 Mihails Agafonovs [EMAIL PROTECTED]: Hi! Some questions: 1) Is it possible to make Solr to use, for example, MySQL database, or it only supports *.xml files as a database? If you do that, use MySQL own full text search capabilities and not Solr, as it is built from Lucene. 2) Is

Re: Overall

2008-06-09 Thread Shalin Shekhar Mangar
2) Take a look at DataImportHandler for indexing data at http://wiki.apache.org/solr/DataImportHandler 2008/6/10 Alexander Ramos Jardim [EMAIL PROTECTED]: 2008/6/9 Mihails Agafonovs [EMAIL PROTECTED]: Hi! Some questions: 1) Is it possible to make Solr to use, for example, MySQL

Re: Overall performance: network v.s. SAN file system

2007-10-29 Thread Otis Gospodnetic
] To: solr-user@lucene.apache.org Sent: Thursday, October 18, 2007 9:49:25 PM Subject: Re: Overall performance: network v.s. SAN file system Hi, I have not done any comparisons with Solr but have done some with another enterprise search engine. Are you looking for performance data or architecture

Re: Overall performance: network v.s. SAN file system

2007-10-18 Thread Walter Underwood
- Original Message From: Lance Norskog [EMAIL PROTECTED] To: solr-user@lucene.apache.org Sent: Wednesday, August 15, 2007 6:28:27 PM Subject: Overall performance: network v.s. SAN file system Is anyone doing Solr installations with a SAN file system? Like IBM Storage Tank or Apple

Re: Overall performance: network v.s. SAN file system

2007-10-18 Thread Christopher Triggs
, August 15, 2007 6:28:27 PM Subject: Overall performance: network v.s. SAN file system Is anyone doing Solr installations with a SAN file system? Like IBM Storage Tank or Apple XSAN or Red Hat GFS? What are your experiences? Thanks, Lance -- Christopher Triggs ABN:24 544 055 144

Overall performance: network v.s. SAN file system

2007-08-15 Thread Lance Norskog
Is anyone doing Solr installations with a SAN file system? Like IBM Storage Tank or Apple XSAN or Red Hat GFS? What are your experiences? Thanks, Lance