Thomas,
   Thank you for raising this on the Solr list. Please let us know if we
can help you help us figure out what’s going on...or if you’ve already
figured it out!
    Thank you!

    Best,
       Tim

---------- Forwarded message ---------
From: Thomas Scheffler <[email protected]>
Date: Thu, Aug 2, 2018 at 6:06 AM
Subject: Memory Leak in 7.3 to 7.4
To: [email protected] <[email protected]>


Hi,

We noticed a memory leak in a rather small setup: 40,000 metadata documents
with nearly as many files that have „literal.*“ fields with them. While 7.2.1
brought some Tika issues (due to a beta version), the real problems
started to appear with version 7.3.0 and are still unresolved in
7.4.0. Memory consumption is through the roof: where previously a 512MB heap
was enough, now 6G isn’t enough to index all files.
I am now at a point where I can track this down to the libraries in
solr-7.4.0/contrib/extraction/lib/. If I replace them all with the libraries
shipped with 7.2.1, the problem disappears. As most files are PDF documents,
I tried updating pdfbox to 2.0.11 and tika to 1.18, but that did not solve
the problem. Next I will try downgrading these individual libraries to 2.0.6
and 1.16 to see whether they are the source of the memory leak.

In the meantime, I would like to know whether anybody else has experienced
the same problem.

kind regards,

Thomas
