Hi Ram, Thank you very much for the detailed information. The only strange thing I see in the logs is the following
--> com.orientechnologies.orient.core.exception.OCommandExecutionException: Class 'NULL' was not found in current database [ONetworkProtocolHttpDb] 2019-08-21 12:03:39:675 SEVERE Internal server error: com.orientechnologies.orient.core.exception.OQueryParsingException: Error on parsing query at position #5: Error on parsing query Query: null timeout 5000 ----------^ but it seems more like a wrong query than a problem with the server, so it's unlikely to be the reason for your problem. There is one thing that could give us more information on what's actually happening: could you please take a thread dump when the server is stuck? Thanks Luigi Il giorno mer 21 ago 2019 alle ore 18:35 Ram Karthik < ramkarthik.m...@gmail.com> ha scritto: > To add more details the problem, > > We have more than 500+ queries/API built on 200+ vertex based schema. All > the query were written optimally with right indexes so that at any normal > condition the response time of the queries will be under 50 ms. Most of the > query response in less than 20 ms. You can refer the screenshot shared > before. > > All of the sudden one of these query freeze at the database indefinitely > and all the subsequent queries fired from application also start to freeze > indefinitely. This leads to an increase in concurrent connections to the > database, with none of the query responding back. This leads to the maximum > connection limit at the database level and the database stop accepting new > connections. Looking at the database, the CPU, Memory remains stable. There > is a very slight increase in CPU (due to too high concurrent connection). > This indicates the query is not executed in the database and are waiting > for resource/lock. > > To bring the server back to normal, we have to stop the database (thus > kill the connections), bounce back again to access. This happens very > frequently and sometime during restart the index crashes. So we have to > restore the database from backup. > > We log every query being executed. After bouncing the server, we tried to > run the frozen queries (same query with same parameter), they executed > normally as usual and responded in usual latency (10 - 20 ms). We tried > running all the queries (first query, some random query from all frozen > query set), all executed as expected. > > When the database goes to freeze mode, even simple query that supposes > pick single record by primary Id also freezes. We have no clue why the > database goes to freeze state all of sudden. > > We have been using OrientDb for last 5 years and never faced such a > situation. > > We tried passing timeout argument along with all the read query (with > timeout as 5000 ms), we reduced record.locktimeout, network level various > timeout to lower the number, session time out, connection timeout, etc. > None of them helped. The queries are not timing out. The connection breaks > and application is getting SocketTimeoutException, but connection/query > seems to be staying in frozen/lock state in the database side and not > allowing the new connection. > > We tried to kill the connection using Command "Kill", "interrupt", both > have failed, the command just hangs in waiting to get the response from the > server for the first connections. > > > We are currently rebuilding the index for the entire database on one go as > last resort. > > We are a startup, built the entire product using OrientDB. Due to this, > our service is down for the last 5 days and we are losing our customer > trust and we are having big crisis. > > Help us identify the root cause and overcome the issue. > > Regards, > Ram > > > > > > > > On Wed, Aug 21, 2019 at 7:36 PM Ram Karthik <ramkarthik.m...@gmail.com> > wrote: > >> Hi Luigi, >> >> Thanks for your reply. >> >> Database size 40GB >> >> Typical workload - 50 TPS >> >> Added Server logs and sample schema below >> we have around 200+ schema >> >> Thanks in advance. >> >> >> On Wed, Aug 21, 2019 at 6:58 PM Luigi Dell'Aquila < >> luigi.dellaqu...@gmail.com> wrote: >> >>> Hi Ram >>> >>> It's hard to give you a quick solution with so few information. >>> V 2.0 is EOL so we will hardly release a community patch, but we can try >>> to troubleshoot the problem and see if we can work around it. >>> Can you provide a bit more information, eg. server logs, typical >>> workload, some information about DB size and schema... >>> >>> Thanks >>> >>> Luigi >>> >>> >>> Il giorno mer 21 ago 2019 alle ore 14:57 Ram Karthik < >>> ramkarthik.m...@gmail.com> ha scritto: >>> >>>> We are using OrientDB ver 2.0.18, and we are facing a critical issue >>>> for the past 5 days. The following issues we are facing >>>> >>>> 1. Orient DB server is unreachable frequently >>>> 2. We cannot able to shut down the server. We are forced to kill >>>> the DB >>>> 3. Sometimes the Index gets crashed. >>>> >>>> The above issues occur when we open the traffic to use our application. >>>> >>>> This is a very critical issue, many users are unable to use the >>>> application due to this issue. We depend on the OrientDB, due to this we >>>> are facing many issues. >>>> >>>> Please help us to resolve this issue soon. >>>> >>>> Thanks, >>>> Ram >>>> >>>> -- >>>> >>>> --- >>>> You received this message because you are subscribed to the Google >>>> Groups "OrientDB" group. >>>> To unsubscribe from this group and stop receiving emails from it, send >>>> an email to orient-database+unsubscr...@googlegroups.com. >>>> To view this discussion on the web visit >>>> https://groups.google.com/d/msgid/orient-database/80c7ea2d-4214-432a-9679-5e09a4a1cc99%40googlegroups.com >>>> <https://groups.google.com/d/msgid/orient-database/80c7ea2d-4214-432a-9679-5e09a4a1cc99%40googlegroups.com?utm_medium=email&utm_source=footer> >>>> . >>>> >>> -- >>> >>> --- >>> You received this message because you are subscribed to a topic in the >>> Google Groups "OrientDB" group. >>> To unsubscribe from this topic, visit >>> https://groups.google.com/d/topic/orient-database/fmo0WKnfXUc/unsubscribe >>> . >>> To unsubscribe from this group and all its topics, send an email to >>> orient-database+unsubscr...@googlegroups.com. >>> To view this discussion on the web visit >>> https://groups.google.com/d/msgid/orient-database/CAFZLH8kGV%3Di1O1BWhtmSG4YKKXLW4x6DXEu88N%2B55QJf19opkA%40mail.gmail.com >>> <https://groups.google.com/d/msgid/orient-database/CAFZLH8kGV%3Di1O1BWhtmSG4YKKXLW4x6DXEu88N%2B55QJf19opkA%40mail.gmail.com?utm_medium=email&utm_source=footer> >>> . >>> >> -- > > --- > You received this message because you are subscribed to the Google Groups > "OrientDB" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to orient-database+unsubscr...@googlegroups.com. > To view this discussion on the web visit > https://groups.google.com/d/msgid/orient-database/CAJybfWepOurPSvDRjZa%2B2CV%3DPU%3DSj7aymt0rMeEv4TkEisy8Eg%40mail.gmail.com > <https://groups.google.com/d/msgid/orient-database/CAJybfWepOurPSvDRjZa%2B2CV%3DPU%3DSj7aymt0rMeEv4TkEisy8Eg%40mail.gmail.com?utm_medium=email&utm_source=footer> > . > -- --- You received this message because you are subscribed to the Google Groups "OrientDB" group. To unsubscribe from this group and stop receiving emails from it, send an email to orient-database+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/orient-database/CAFZLH8nQ4aB3YKJRQ_wjbonWMk9Bvts69qrJiyxrSBa94eBLxw%40mail.gmail.com.