[ 
https://issues.apache.org/jira/browse/SOLR-14314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17057767#comment-17057767
 ] 

Aaron Sun edited comment on SOLR-14314 at 3/12/20, 10:09 AM:
-------------------------------------------------------------

[~ichattopadhyaya]   Thanks for the valuable answer.  After change the JVM heap 
size to 25 GB, it indeed become much better, still a bit pause in log here and 
there, but much shorter, around 1~2 seconds.  Is it possible to make it even 
better?  Also notice the pause happen more often around "HttpSolrCall Closing 
out SolrRequest" which does not seem related with GC pause.

Regarding the muliple solr nodes(JVMs), I guess you refer to this page: 
[https://lucene.apache.org/solr/guide/7_2/taking-solr-to-production.html#running-multiple-solr-nodes-per-host|https://lucene.apache.org/solr/guide/7_2/taking-solr-to-production.html#running-multiple-solr-nodes-per-host,]
  , is that mean each solr instance have it's own solr home directory and port? 
 if so how to split the data? one core with one instance?  Is that mean client 
need to manage which solr instance to talk with?

I couldn't find good example on internet, appreciate if you could provide some 
guidance.  
{noformat}
2020-03-12 10:37:19.804 DEBUG (qtp1668016508-3474) [ x:aggprogram] 
o.a.s.u.p.LogUpdateProcessorFactory PRE_UPDATE 
commit\{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=false,prepareCommit=false}
 \{{params(commit=true),defaults(wt=json)}}
2020-03-12 10:37:20.543 DEBUG (qtp1668016508-4857) [ x:aggprogram] 
o.a.s.u.p.LogUpdateProcessorFactory PRE_UPDATE add\{,id=2101608110097976031} 
\{{params(commit=true),defaults(wt=json)}}

2020-03-12 10:39:11.250 DEBUG (qtp1668016508-6123) [ x:aggasset] 
o.a.s.u.p.LogUpdateProcessorFactory PRE_UPDATE 
commit\{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=false,prepareCommit=false}
 \{{params(commit=true),defaults(wt=json)}}
2020-03-12 10:39:11.915 TRACE (qtp1668016508-3376) [ x:agglogtrackitem] 
o.a.s.u.UpdateLog TLOG: added id 2102003090810779924 to 
tlog\{file=/data1/solr8/agglogtrackitem/data/tlog/tlog.0000000000000002583 
refcount=1} LogPtr(1081326) map=1784607161

2020-03-12 10:40:08.746 DEBUG (qtp1668016508-382) [ x:aggasset] 
o.a.s.s.HttpSolrCall Closing out SolrRequest: 
\{{params(commit=true),defaults(wt=json)}}
2020-03-12 10:40:09.640 DEBUG (qtp1668016508-3239) [ x:aggasset] 
o.a.s.u.TransactionLog New TransactionLog 
file=/data1/solr8/aggasset/data/tlog/tlog.0000000000000001116, exists=false, 
size=0, openExisting=false


2020-03-12 10:40:58.182 DEBUG (qtp1668016508-3413) [ x:agglogtrackitem] 
o.a.s.u.p.LogUpdateProcessorFactory PRE_UPDATE 
commit\{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=false,prepareCommit=false}
 \{{params(commit=true),defaults(wt=json)}}
2020-03-12 10:41:00.318 TRACE (qtp1668016508-381) [ x:agglogtrackitem] 
o.a.s.u.UpdateLog TLOG: added id 2101701290647113224 to 
tlog\{file=/data1/solr8/agglogtrackitem/data/tlog/tlog.0000000000000002593 
refcount=1} LogPtr(1940077) map=1984880505


2020-03-12 10:41:33.880 DEBUG (qtp1668016508-771) [ x:agglogtrackitem] 
o.a.s.u.p.LogUpdateProcessorFactory PRE_UPDATE 
commit\{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=false,prepareCommit=false}
 \{{params(commit=true),defaults(wt=json)}}
2020-03-12 10:41:35.754 TRACE (qtp1668016508-3298) [ x:agglogtrackitem] 
o.a.s.u.UpdateLog TLOG: added id 2102003070806775224 to 
tlog\{file=/data1/solr8/agglogtrackitem/data/tlog/tlog.0000000000000002598 
refcount=1} LogPtr(4246525) map=1493020555


2020-03-12 10:42:23.140 DEBUG (qtp1668016508-107) [ x:agglogtrackitem] 
o.a.s.u.DirectUpdateHandler2 
updateDocument(add\{_version_=1660950824311848960,id=2101702170007764324})
2020-03-12 10:42:23.935 TRACE (qtp1668016508-380) [ x:agglogtrackitem] 
o.a.s.u.UpdateLog TLOG: added id 2101806210189104124 to 
tlog\{file=/data1/solr8/agglogtrackitem/data/tlog/tlog.0000000000000002605 
refcount=1} LogPtr(5096503) map=2041040637

{noformat}
 And the QTime with 100+s still not sound too good

{noformat}

2020-03-12 11:08:27.586 INFO (qtp1668016508-15663) [ x:agglogtrackitem] 
o.a.s.u.p.LogUpdateProcessorFactory [agglogtrackitem] webapp=/solr path=/update 
params=\{commit=true}{add=[2102001130729569124 (1660952325002362880), 
2102001220746002624 (1660952325018091520), 2102002130766975424 
(1660952325216272385), 2102003020799380624 (1660952325224660992), 
2102001150733370324 (1660952325239341056), 2102003090811568924 
(1660952325280235520), 2102002130766460924 (1660952325295964161), 
2102001220746002024 (1660952325313789954), 2102002200779134024 
(1660952325333712896), 2102002280792794524 (1660952325357830145), ... (200 
adds)],commit=} 0 134457
2020-03-12 11:08:27.586 DEBUG (qtp1668016508-15663) [ x:agglogtrackitem] 
o.a.s.s.HttpSolrCall Closing out SolrRequest: 
\{{params(commit=true),defaults(wt=json)}}
2020-03-12 11:08:30.588 INFO (qtp1668016508-354) [ x:agglogtrackitem] 
o.a.s.u.p.LogUpdateProcessorFactory [agglogtrackitem] webapp=/solr path=/update 
params=\{commit=true}{add=[2102003030801493024 (1660952327142506496), 
2101910280679323524 (1660952329722003458), 2102002280793200424 
(1660952329870901249), 2102002280793200524 (1660952330735976448), 
2102003030801513824 (1660952333609074688), 2102002280793203224 
(1660952335028846593), 2102002280793196024 (1660952337146970113), 
2102002180774885124 (1660952339094175744), 2101910280679328124 
(1660952340238172160), 2101910280679298024 (1660952343581032448), ... (200 
adds)],commit=} 0 135418
2020-03-12 11:08:30.588 DEBUG (qtp1668016508-354) [ x:agglogtrackitem] 
o.a.s.s.HttpSolrCall Closing out SolrRequest: 
\{{params(commit=true),defaults(wt=json)}}
2020-03-12 11:08:34.284 INFO (qtp1668016508-400) [ x:agglogtrackitem] 
o.a.s.u.p.LogUpdateProcessorFactory [agglogtrackitem] webapp=/solr path=/update 
params=\{commit=true}{add=[2102002200779000724 (1660952325957615618), 
2102002030765009024 (1660952325973344258), 2102002130766413424 
(1660952325979635712), 2102001170736628124 (1660952325984878594), 
2102002250788105624 (1660952325989072896), 2102003020799080224 
(1660952325993267200), 2102002170773995424 (1660952325996412929), 
2102001280755164624 (1660952326000607232), 2102002250788101824 
(1660952326004801537), 2102002200778996424 (1660952326008995841), ... (200 
adds)],commit=} 0 140243

{noformat}


was (Author: aaronsuns):
[~ichattopadhyaya]   Thanks for the valuable answer.  After change the JVM heap 
size to 25 GB, it indeed become much better, still a bit pause in log here and 
there, but much shorter, around 1~2 seconds.  Is it possible to make it even 
better?  Also notice the pause happen more often around "HttpSolrCall Closing 
out SolrRequest" which does not seem related with GC pause.

Regarding the muliple solr nodes(JVMs), I guess you refer to this page: 
[https://lucene.apache.org/solr/guide/7_2/taking-solr-to-production.html#running-multiple-solr-nodes-per-host|https://lucene.apache.org/solr/guide/7_2/taking-solr-to-production.html#running-multiple-solr-nodes-per-host,]
  , is that mean each solr instance have it's own solr home directory and port? 
 if so how to split the data? one core with one instance?  Is that mean client 
need to manage which solr instance to talk with?

I couldn't find good example on internet, appreciate if you could provide some 
guidance.  

{noformat}

2020-03-12 10:37:19.804 DEBUG (qtp1668016508-3474) [ x:aggprogram] 
o.a.s.u.p.LogUpdateProcessorFactory PRE_UPDATE 
commit\{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=false,prepareCommit=false}
 \{{params(commit=true),defaults(wt=json)}}
2020-03-12 10:37:20.543 DEBUG (qtp1668016508-4857) [ x:aggprogram] 
o.a.s.u.p.LogUpdateProcessorFactory PRE_UPDATE add\{,id=2101608110097976031} 
\{{params(commit=true),defaults(wt=json)}}

2020-03-12 10:39:11.250 DEBUG (qtp1668016508-6123) [ x:aggasset] 
o.a.s.u.p.LogUpdateProcessorFactory PRE_UPDATE 
commit\{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=false,prepareCommit=false}
 \{{params(commit=true),defaults(wt=json)}}
2020-03-12 10:39:11.915 TRACE (qtp1668016508-3376) [ x:agglogtrackitem] 
o.a.s.u.UpdateLog TLOG: added id 2102003090810779924 to 
tlog\{file=/data1/solr8/agglogtrackitem/data/tlog/tlog.0000000000000002583 
refcount=1} LogPtr(1081326) map=1784607161

2020-03-12 10:40:08.746 DEBUG (qtp1668016508-382) [ x:aggasset] 
o.a.s.s.HttpSolrCall Closing out SolrRequest: 
\{{params(commit=true),defaults(wt=json)}}
2020-03-12 10:40:09.640 DEBUG (qtp1668016508-3239) [ x:aggasset] 
o.a.s.u.TransactionLog New TransactionLog 
file=/data1/solr8/aggasset/data/tlog/tlog.0000000000000001116, exists=false, 
size=0, openExisting=false


2020-03-12 10:40:58.182 DEBUG (qtp1668016508-3413) [ x:agglogtrackitem] 
o.a.s.u.p.LogUpdateProcessorFactory PRE_UPDATE 
commit\{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=false,prepareCommit=false}
 \{{params(commit=true),defaults(wt=json)}}
2020-03-12 10:41:00.318 TRACE (qtp1668016508-381) [ x:agglogtrackitem] 
o.a.s.u.UpdateLog TLOG: added id 2101701290647113224 to 
tlog\{file=/data1/solr8/agglogtrackitem/data/tlog/tlog.0000000000000002593 
refcount=1} LogPtr(1940077) map=1984880505


2020-03-12 10:41:33.880 DEBUG (qtp1668016508-771) [ x:agglogtrackitem] 
o.a.s.u.p.LogUpdateProcessorFactory PRE_UPDATE 
commit\{,optimize=false,openSearcher=true,waitSearcher=true,expungeDeletes=false,softCommit=false,prepareCommit=false}
 \{{params(commit=true),defaults(wt=json)}}
2020-03-12 10:41:35.754 TRACE (qtp1668016508-3298) [ x:agglogtrackitem] 
o.a.s.u.UpdateLog TLOG: added id 2102003070806775224 to 
tlog\{file=/data1/solr8/agglogtrackitem/data/tlog/tlog.0000000000000002598 
refcount=1} LogPtr(4246525) map=1493020555


2020-03-12 10:42:23.140 DEBUG (qtp1668016508-107) [ x:agglogtrackitem] 
o.a.s.u.DirectUpdateHandler2 
updateDocument(add\{_version_=1660950824311848960,id=2101702170007764324})
2020-03-12 10:42:23.935 TRACE (qtp1668016508-380) [ x:agglogtrackitem] 
o.a.s.u.UpdateLog TLOG: added id 2101806210189104124 to 
tlog\{file=/data1/solr8/agglogtrackitem/data/tlog/tlog.0000000000000002605 
refcount=1} LogPtr(5096503) map=2041040637

{noformat}

 

> Solr does not response most of the update request some times
> ------------------------------------------------------------
>
>                 Key: SOLR-14314
>                 URL: https://issues.apache.org/jira/browse/SOLR-14314
>             Project: Solr
>          Issue Type: Bug
>      Security Level: Public(Default Security Level. Issues are Public) 
>            Reporter: Aaron Sun
>            Priority: Critical
>         Attachments: jstack_bad_state.log, solrlog.tar.gz
>
>
> Solr version:
> {noformat}
> solr-spec
> 8.4.1
> solr-impl
> 8.4.1 832bf13dd9187095831caf69783179d41059d013 - ishan - 2020-01-10 13:40:28
> lucene-spec
> 8.4.1
> lucene-impl
> 8.4.1 832bf13dd9187095831caf69783179d41059d013 - ishan - 2020-01-10 13:35:00
> {noformat}
>  
> Java process:
> {noformat}
> java -Xms100G -Xmx200G -DSTOP.PORT=8078 -DSTOP.KEY=ardsolrstop 
> -Dsolr.solr.home=/ardome/solr -Djetty.port=8983 
> -Dsolr.log.dir=/var/ardendo/log -jar start.jar --module=http
> {noformat}
> Run on a powerful server with 32 cores, 265GB RAM.
> The problem is that time to time it start to get very slow to update solr 
> documents, for example time out after 30 minutes.
> document size is around 20k~50K each, each http request send to /update is 
> around 4MB~10MB.
> /update request is done by multi processes.
> Some of the update get response, but the differences between "QTime"  and 
> http response time is big, one example, qtime = 66s, http response time is 
> 2304s.
> According to jstack for the thread state, lots of BLOCKED state.
> thread dump log is attached.
> Any hint would be appreciate, thanks!
>  
>  
>  
>  
>  
>  
>  
>  
>  
>  
>  
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to