Just as I thought: something simple needed to be changed. Now it's working perfectly.

Thanks!!!!


Em 29/11/2024 13:17, Vipul Solanki escreveu:
Hi Roberto


By default, DSpace has a character limit of 100,000 for full-text indexing in the Discovery module. If you need to increase this limit, you can follow these steps:

Navigate to the following file in your DSpace installation:

      -[dspace] config/modules/discovery.cfg
      -Look for the property:
           #discovery.solr.fulltext.charLimit=100000

Uncomment this property (if it is commented) and update the value to your desired limit, e.g.,
discovery.solr.fulltext.charLimit=200000


After updating the character limit, you must reindex all the objects to apply the changes. Run the following command:

*[dspace]/bin/dspace index-discovery -f*

On Fri, Nov 29, 2024 at 8:10 PM Roberto Greiner <[email protected]> wrote:

    Hi,

    I'm having some trouble with searches inside of PDF files.

    For some reason, the search only seems to work when the PDF is small
    (less than 15MB or 100 pages).

    I've uploaded some small files, and after running "dspace
    filter-media",
    I get results for searches inside of files with these sizes:
    1.4MB and 10 pages
    11MB and 57 pages

    I uploaded somewhat larger files:
    17MB  and  101 pages
    58MB   and 218 pages
    15MB   and 137 pages

    For these documents, the search does not find anything at all
    inside the
    documents, even if I search for something in the first page.

    Additionally, I've downloaded a large file from another DSpace from a
    contact of mine. This file has 625MB and 188 pages. In his DSpace
    (running dspace 7) the search works. In mine it does not.

    When I run "dspace filter-media" I don't get any errors, and
    thumbnails/miniatures are being generated for all documents.

    Does anyone know what could be happening?

    Thank you,

    Roberto Greiner

    Notes:
    All files are PDF with OCR. My platform:
    Ubuntu 24.04.1
    Dspace 8.0
    Installed based in
    https://wiki.lyrasis.org/display/DSDOC8x/Installing+DSpace
    Using Apache as proxy


--    -----------------------------------------------------
                     Marcos Roberto Greiner

        Os otimistas acham que estamos no melhor dos mundos
         Os pessimistas tem medo de que isto seja verdade
                                  James Branch Cabell
       -----------------------------------------------------

-- All messages to this mailing list should adhere to the Code of
    Conduct: https://www.lyrasis.org/about/Pages/Code-of-Conduct.aspx
    ---
    You received this message because you are subscribed to the Google
    Groups "DSpace Technical Support" group.
    To unsubscribe from this group and stop receiving emails from it,
    send an email to [email protected]
    <mailto:dspace-tech%[email protected]>.
    To view this discussion visit
    
https://groups.google.com/d/msgid/dspace-tech/425d4e56-8724-457a-a400-3fd29e101c97%40gmail.com.

--
  -----------------------------------------------------
                Marcos Roberto Greiner

   Os otimistas acham que estamos no melhor dos mundos
    Os pessimistas tem medo de que isto seja verdade
                             James Branch Cabell
  -----------------------------------------------------

--
All messages to this mailing list should adhere to the Code of Conduct: 
https://www.lyrasis.org/about/Pages/Code-of-Conduct.aspx
--- You received this message because you are subscribed to the Google Groups "DSpace Technical Support" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion visit 
https://groups.google.com/d/msgid/dspace-tech/091452a5-dccb-4774-b766-c1ff732a9ea7%40gmail.com.

Reply via email to