Hi Susheel,

We have enabled Kerberos, so Solr is accessed only through Hue. I will check
whether I can get similar information using Hue. Thanks.
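
If the Admin Panel is not reachable, the same memory figures can usually be read from the JSON of Solr's system-info handler. Below is a minimal sketch that summarizes such a response once it has been saved. The handler path (`/solr/admin/info/system?wt=json`) and the field names used here are assumptions about this Solr version, and fetching the JSON through Kerberos is outside the sketch. `summarize_memory` is a hypothetical helper name, and the sample values are made up, not taken from this cluster.

```python
import json

GIB = 1024 ** 3


def summarize_memory(info: dict) -> dict:
    """Return used/total GiB for physical memory, swap, and JVM heap.

    Field names are assumed from the JSON that Solr's system-info handler
    typically returns; missing fields yield None instead of raising.
    """
    system = info.get("system", {})
    jvm_raw = info.get("jvm", {}).get("memory", {}).get("raw", {})

    def used_total(total, free):
        # Both values are byte counts; used = total - free.
        if total is None or free is None:
            return None
        return {"used_gib": round((total - free) / GIB, 2),
                "total_gib": round(total / GIB, 2)}

    jvm = None
    if "used" in jvm_raw and "max" in jvm_raw:
        jvm = {"used_gib": round(jvm_raw["used"] / GIB, 2),
               "total_gib": round(jvm_raw["max"] / GIB, 2)}

    return {
        "physical": used_total(system.get("totalPhysicalMemorySize"),
                               system.get("freePhysicalMemorySize")),
        "swap": used_total(system.get("totalSwapSpaceSize"),
                           system.get("freeSwapSpaceSize")),
        "jvm": jvm,
    }


if __name__ == "__main__":
    # Hypothetical sample shaped like the assumed response; not real data.
    sample = json.loads("""
    {"system": {"totalPhysicalMemorySize": 8589934592,
                "freePhysicalMemorySize": 2147483648},
     "jvm": {"memory": {"raw": {"used": 1073741824, "max": 4294967296}}}}
    """)
    print(summarize_memory(sample))
```

Physical memory that is reported as free, plus whatever the OS already holds as page cache, is what matters for the disk-cache question asked earlier in the thread, so the physical figures are the ones to look at first.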

Regards,
Anil

On 14 March 2016 at 19:34, Susheel Kumar <susheel2...@gmail.com> wrote:

> Hello Anil,
>
> Can you go to Solr Admin Panel -> Dashboard and share all 4 memory
> parameters under System, or share a snapshot?
>
> Thanks,
> Susheel
>
> On Mon, Mar 14, 2016 at 5:36 AM, Anil <anilk...@gmail.com> wrote:
>
> > Hi Toke and Jack,
> >
> > Please find the details below.
> >
> > * How large are your 3 shards in bytes? (total index across replicas)
> >           --  *146G. I am using CDH (Cloudera) and am not sure how to check the
> > index size of each collection on each shard.*
> > * What storage system do you use (local SSD, local spinning drives, remote
> > storage...)? *Local (HDFS) spinning drives.*
> > * How much physical memory does your system have? *We have 15 data nodes,
> > with multiple services installed on each data node (252 GB RAM per data
> > node). 25 GB RAM is allocated to the Solr service.*
> > * How much memory is free for disk cache? *I could not find out.*
> > * How many concurrent queries do you issue? *Very few. I don't see any
> > concurrent queries to this file_collection for now.*
> > * Do you update while you search? *Yes, but very little.*
> > * What does a full query (rows, faceting, grouping, highlighting,
> > everything) look like? *For the file_collection: rows = 100, highlights =
> > false, no facets, expand = false.*
> > * How many documents does a typical query match (hitcount)? *It varies with
> > each file. I sort on an int field to order commands in the query.*
> >
> > We have two sets of collections on the Solr cluster (17 data nodes):
> >
> > 1. main_collection - one collection created per year. Each collection uses 8
> > shards and 2 replicas, e.g. main_collection_2016, main_collection_2015, etc.
> >
> > 2. file_collection (where files containing commands are indexed) - one
> > collection created per 2 years. It uses 3 shards and 2 replicas, e.g.
> > file_collection_2014, file_collection_2016.
> >
> > The slowness is happening for file_collection. Though it has 3 shards,
> > documents are present in only 2 of them: shard1 has 150M docs, shard2 has
> > 330M docs, and shard3 is empty.
> >
> > main_collection looks good.
> >
> > Please let me know if you need any additional details.
> >
> > Regards,
> > Anil
> >
> >
> > On 13 March 2016 at 21:48, Anil <anilk...@gmail.com> wrote:
> >
> > > Thanks Toke and Jack.
> > >
> > > Jack,
> > >
> > > Yes, it is 480 million :)
> > >
> > > I will share the additional details soon. Thanks.
> > >
> > >
> > > Regards,
> > > Anil
> > >
> > >
> > >
> > >
> > >
> > > On 13 March 2016 at 21:06, Jack Krupansky <jack.krupan...@gmail.com>
> > > wrote:
> > >
> > >> (We should have a wiki/doc page for the "usual list of suspects" when
> > >> queries are/appear slow, rather than need to repeat the same mantra(s) for
> > >> every inquiry on this topic.)
> > >>
> > >>
> > >> -- Jack Krupansky
> > >>
> > >> On Sun, Mar 13, 2016 at 11:29 AM, Toke Eskildsen <t...@statsbiblioteket.dk> wrote:
> > >>
> > >> > Anil <anilk...@gmail.com> wrote:
> > >> > > I have indexed data (commands from files) with 10 fields, 3 of which
> > >> > > are text fields. The collection is created with 3 shards and 2 replicas.
> > >> > > I have used document routing as well.
> > >> >
> > >> > > Currently the collection holds 47,80,01,405 records.
> > >> >
> > >> > ...480 million, right? Funny digit grouping in India.
> > >> >
> > >> > > A text search against the text field takes around 5 sec. The Solr query
> > >> > > is just an AND of two terms, with fl set to 7 fields:
> > >> >
> > >> > > fileId:"file unique id" AND command_text:(system login)
> > >> >
> > >> > While not an impressive response time, it might just be that your hardware
> > >> > is not enough to handle that amount of documents. The usual culprit is IO
> > >> > speed, so chances are you have a system with spinning drives and not enough
> > >> > RAM: Switch to SSD and/or add more RAM.
> > >> >
> > >> > To give better advice, we need more information.
> > >> >
> > >> > * How large are your 3 shards in bytes?
> > >> > * What storage system do you use (local SSD, local spinning drives, remote
> > >> > storage...)?
> > >> > * How much physical memory does your system have?
> > >> > * How much memory is free for disk cache?
> > >> > * How many concurrent queries do you issue?
> > >> > * Do you update while you search?
> > >> > * What does a full query (rows, faceting, grouping, highlighting,
> > >> > everything) look like?
> > >> > * How many documents does a typical query match (hitcount)?
> > >> >
> > >> > - Toke Eskildsen
> > >> >
> > >>
> > >
> > >
> >
>
