[Dspace-tech] Searching of text from PDF files

2012-06-11 Thread Amit Rami
Today I try to search text from pdf in dspace search plz help me...! Amit C. Rami Student at Gujarat Vidyapith -- Live Security Virtual Conference Exclusive live event will

Re: [Dspace-tech] Searching of text from PDF files

2012-06-11 Thread helix84
On Mon, Jun 11, 2012 at 8:28 AM, Amit Rami mca.rami_a...@hotmail.com wrote: Today   I try to search text from pdf in dspace search plz help me...! Do you have any problem with that? Did you run [dspace]/bin/dspace filter-media to extract the text for indexing from the PDF files and did you run

Re: [Dspace-tech] Searching of text from PDF files

2009-09-10 Thread Vishal Kakapuri
1) pdf is an image - needs to be ocr'd - then uploaded - metadata filtermedia will try to extract the text out of the pdf and save it as a text file along with the pdf files..-- search happens on the extracted text OR 2) pdf is an text - to be uploaded - metadata filtermedia will try to extract

Re: [Dspace-tech] Searching of text from PDF files

2009-09-09 Thread Mark H. Wood
On Tue, Sep 01, 2009 at 03:55:11PM +1000, Gary Browne wrote: When a user searches via the dspace web interface, is the search run across the content of text pdfs or just the metadata? If so, does the pdf submitted to the repository need to have been previously OCR'd, or does the repository

[Dspace-tech] Searching of text from PDF files

2009-08-31 Thread Gary Browne
Hi all, I have a query about searching of pdf documents which I can't seem to find a definitive answer for: When a user searches via the dspace web interface, is the search run across the content of text pdfs or just the metadata? If so, does the pdf submitted to the repository need to have been