Hi RIchard:

I cannot speak to it's quality (and indeed we have had quality issues with 
other formats) but Apache POI library supports Powerpoint text extraction.
I would study the doc at http://poi.apache.org/ for how to use the library,

and look in:
http://scm.dspace.org/svn/repo/dspace/trunk/dspace-api/src/main/java/org/dspace/app/mediafilter

for examples of other extractor media filters.

Then post any questions to the tech or dev list.

Hope that is helpful,

Richard Rodgers

On Sep 29, 2010, at 10:14 AM, Jizba, Richard wrote:

Hello,

Are there plans to add a PPT text extractor to DSpace?
In the meantime, can some provide information on how to implement one?

Thanks,
Richard Jizba
Creighton University.
<ATT00001..c><ATT00002..c>

------------------------------------------------------------------------------
Start uncovering the many advantages of virtual appliances
and start using them to simplify application deployment and
accelerate your shift to cloud computing.
http://p.sf.net/sfu/novell-sfdev2dev
_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech

Reply via email to