Pan,

You'll need to write your own media filter class to handle the 
extraction of text from PowerPoint files as ppt text extraction isn't 
currently supported by the default set of media filters. Hopefully 
someone may have already done this and will share, but if not you'll 
have to write your own using OpenOffice or some other means.

Scott.

>Date: Wed, 31 Jan 2007 15:07:21 -0800
>From: "Pan Family" <[EMAIL PROTECTED]>
>Subject: [Dspace-tech] DSpace not indexing MS Powerpoint files?
>To: [email protected]
>Message-ID:
>       <[EMAIL PROTECTED]>
>Content-Type: text/plain; charset="iso-8859-1"
>
>Hi,
>
>I submitted a MS ppt file to my collection, but filter-media
>does not want to index this ppt file.  I tried to shut down
>the database (PostgreSQL) and restarted it, and ran
>filter-media several times, but it did not help.  I made
>sure that this ppt file is indeed in the collection by openning
>it using View/Open.
>
>I have no problem indexing MS Word, text, html, or pdf
>files.  Do I need to do anything special for ppt files?
>
>Thanks a lot!
>
>-Pan
>  
>


-------------------------------------------------------------------------
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier.
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642
_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech

Reply via email to