Hello James,

I saw your note at
http://www.loc.gov/standards/mets/mets-registry.html about adding METS
support to DSpace. I am new to DSpace and to digital
libraries generally, so I could be wrong, but AFAIK DSpace does not support
METS yet. Am I right? If so then I wonder what the status of this work is. I
would be really keen to see it in there.

The reason I ask is that I am prototyping a digital library using DSpace as
a starting point and I have a number of non-searchable PDFs that I want to
import. Because they are non-searchable (I wish they were searchable, they
just happen not to be) I want to get METS metadata for them. This would
enable full-text search and, hopefully, rendering as HTML as well. As a
newcomer to DSpace, it seems to me that one normally imports searchable PDFs
(e.g. OCR'd documents) or HTML originals. Unfortunately, that is not where I
am starting from.

Something else I am keen to know is whether it is possible to generate a
METS document from the PDF. The PDFs I have do carry metadata in a
proprietary schema. I am not sure about using that to move to METS; it might
be more feasible to extract the data directly from the PDF.

Also, there might be some indexing operation in DSpace that I am not aware
of. I tried uploading an HTML version of a PDF, which I generated using
Gmail. Once in DSpace it did not seem to be searchable either, just like the
PDF. I checked the HTML source and the words were there. So maybe I have to
tell DSpace to index new material manually? I thought it indexed material as
it was uploaded.

-- 
Regards,

Andrew M.
_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech
