FYI, I recently became aware of a bug in the indexing of large text 
documents. It is in 7.6.2 and was introduced in February 2024. I created a 
small PR that fixes the bug: https://github.com/DSpace/DSpace/pull/9893

It manifests when indexing text files that are larger than the configured 
character limit (default 100000). A message is logged about the large file, 
suggesting that the character limit be raised and stating that only the 
first 100000 characters are indexed. In fact, those characters are never 
indexed, so the document will not be found in search results.
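
To illustrate the behaviour the log message promises, here is a minimal 
sketch of reading only the first N characters of a document and returning 
them so they can be indexed. It is not the code from the PR or from DSpace 
itself: the class name TruncatedTextReader, the method readUpTo, and the 
small charLimit value are all hypothetical and chosen only for this example.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;

// Hypothetical helper for illustration only; not DSpace's indexing code.
final class TruncatedTextReader {
    private TruncatedTextReader() {}

    /** Reads at most charLimit characters from the reader and returns them. */
    static String readUpTo(Reader reader, int charLimit) throws IOException {
        StringBuilder text = new StringBuilder();
        char[] buffer = new char[8192];
        while (text.length() < charLimit) {
            // Never ask for more than the remaining allowance.
            int wanted = Math.min(buffer.length, charLimit - text.length());
            int read = reader.read(buffer, 0, wanted);
            if (read == -1) {
                break; // end of document reached before the limit
            }
            text.append(buffer, 0, read);
        }
        // The truncated text is returned so the caller can actually index it.
        return text.toString();
    }

    public static void main(String[] args) throws IOException {
        int charLimit = 10; // stand-in for the configured limit (default 100000)
        String indexed = readUpTo(new StringReader("0123456789ABCDEF"), charLimit);
        System.out.println(indexed); // prints 0123456789
    }
}
```

The point of the sketch is that the truncated text must still reach the 
index. Logging that the first characters were indexed while discarding them 
is the behaviour described above.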
