Hi Sue,

The format registry should have two PPT entries:

PPT
---
Name: Microsoft Powerpoint
MimeType: application/vnd.ms-powerpoint
Description: Microsoft Powerpoint
Support Level: Known
File Extensions: ppt

PPTX
----
Name: Microsoft Powerpoint XML
MimeType: 
application/vnd.openxmlformats-officedocument.presentationml.presentation
Description: Microsoft Powerpoint XML
Support Level: Known
File Extensions: pptx

This information can also be found in the 
'[dspace]/config/registries/bitstream-formats.xml' file:
https://fisheye3.atlassian.com/browse/dspace/dspace/trunk/dspace/config/registries/bitstream-formats.xml?hb=true#to142


- Tim

On 3/13/2012 4:26 PM, Thornton, Susan M. (LARC-B702)[LITES] wrote:
> Hi Tim,
>       Can you give me a screen shot of your definition(s) for PowerPoint in 
> the bitstream_format_registy?  Something's still not working right and I 
> suspect it may be here.
> Thanks,
> Sue
>
>
> Sue Walker-Thornton
> (w):  (757) 864-2368
> (m):  (757) 506-9903
>
>
> -----Original Message-----
> From: Tim Donohue [mailto:[email protected]]
> Sent: Tuesday, March 13, 2012 10:44 AM
> To: Thornton, Susan M. (LARC-B702)[LITES]
> Cc: [email protected]; Dedmond, Nicole K. (LARC-B702)[LITES]
> Subject: Re: [Dspace-tech] Are PDF-A documents filterable in DSpace?
>
> Sue,
>
> A few responses inline...
>
> On 3/13/2012 9:22 AM, Thornton, Susan M. (LARC-B702)[LITES] wrote:
>> For Word docs:
>>
>> --------------
>>
>> * The rather outdated "Text-mining" tools at:
>>
>> http://code.google.com/p/text-mining/
>>
>> * Unfortunately it looks like these do NOT support docx
>>
>> * But, it looks like POI (used for PPTs, see below) does work for docx.
>> Unfortunately, this is not enabled/built out in DSpace yet. I just
>> created an issue for it at: https://jira.duraspace.org/browse/DS-1140
>>
>> *Great! Can you let us know when it's been successfully implemented?*
>
> You are welcome to subscribe to the JIRA ticket itself to receive updates. 
> Just login to JIRA (uses the same acct as the DSpace wiki), and click "Watch" 
> icon in the far right.  You'll then get an email any time that ticket is 
> updated.
>
> https://jira.duraspace.org/browse/DS-1140
>
> Currently, we need to locate a volunteer developer to take on this work.
> So, I'm not sure how long it will take before it is implemented.
>
>> For PPT:
>>
>> --------
>>
>> * POI 3.6: http://poi.apache.org/
>>
>> * This software supports pptx as well
>>
>> *How would I integrate this with DSpace version 1.7.1 to tell DSpace
>> to use POI to filter .pptx files?*
>
> This PPT/PPTX Filter was first made available in DSpace 1.7.0.  So, it should 
> already work in your DSpace 1.7.1 installation.  In your dspace.cfg you'd 
> just want to make sure the following is setup (it should be by default):
>
> 'filter.plugins' setting: make sure this includes "PowerPoint Text 
> Extractor", like displayed here:
> https://fisheye3.atlassian.com/browse/dspace/dspace/trunk/dspace/config/dspace.cfg?hb=true#to400
>
> 'FormatFilter' setting: make sure it *defines* a "PowerPoint Text Extractor, 
> like displayed here:
> https://fisheye3.atlassian.com/browse/dspace/dspace/trunk/dspace/config/dspace.cfg?hb=true#to411
>
> Finally, make sure the  "PowerPoint Text Extractor" is setup to take in two 
> input formats: "Microsoft Powerpoint, Microsoft Powerpoint XML", like 
> displayed here:
> https://fisheye3.atlassian.com/browse/dspace/dspace/trunk/dspace/config/dspace.cfg?hb=true#to419
>
> You'll then need to make sure your "Bitstream Format Registry" has a 
> definition for "Microsoft Powerpoint XML" (pptx).
>
> Again, assuming you are running on an out-of-the-box 1.7.x, all of the above 
> settings should be enabled by default. So, it should just work.
>
> - Tim
>
>
>
>
>

------------------------------------------------------------------------------
Keep Your Developer Skills Current with LearnDevNow!
The most comprehensive online learning library for Microsoft developers
is just $99.99! Visual Studio, SharePoint, SQL - plus HTML5, CSS3, MVC3,
Metro Style Apps, more. Free future releases when you subscribe now!
http://p.sf.net/sfu/learndevnow-d2d
_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech

Reply via email to