[ 
https://jira.duraspace.org/browse/DS-638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=19236#action_19236
 ] 

Tim Donohue commented on DS-638:
--------------------------------

Just a note to mention that the Developers discussed the DCAT Review of this 
issue in our meeting on Feb 23, 2011.

The DCAT Review of this issue is available at: 
https://wiki.duraspace.org/pages/viewpage.action?pageId=23268540
Full Developer discussion thread is available at: 
http://irclogs.duraspace.org/index.php?date=2011-02-23

Here's a brief summary of what the developers came up with:
* First, this request is actually 2 separate requests:  (1) Integrated Virus 
Scanning, and (2) Integrated File Format Verification
* The DSpace 1.7.0 Curation System already come with a Virus Scanning tool 
(requires & uses ClamAV virus scanner: http://www.clamav.net)
** More info on the Curation System (and existing tasks) is in our 1.7 docs: 
https://wiki.duraspace.org/display/DSDOC/Curation+System
* MIT (Richard Rodgers) is also currently working on a Curation Task to perform 
File Format Verification via DROID 
(http://www.nationalarchives.gov.uk/PRONOM/).  This will be released in 1.7.1 
or 1.8.0.
* The 1.7.0 Curation System currently is only integrated into the DSpace 
Administration UI and DSpace Workflow Approval processes (it can also be kicked 
off via the Commandline).  Any curation task can be kicked off automatically 
during a Workflow approval step, or "on demand" via the Administration UI or 
commandline.
* So, one feature that is definitely missing is the ability to kick off these 
Curation tasks automatically during the DSpace Submission UI
* Robin Taylor is currently looking into similar Submission UI changes (and 
also wants to validate a file during submission), so he volunteered to 
investigate some of this work. Elin & Robin agreed to get in touch about this 
work.

> check files on input for viruses, and verify file format 
> ---------------------------------------------------------
>
>                 Key: DS-638
>                 URL: https://jira.duraspace.org/browse/DS-638
>             Project: DSpace
>          Issue Type: New Feature
>          Components: JSPUI
>    Affects Versions: 1.6.2
>         Environment: to use this patch you will need to have ClamAV, and 
> jhove installed on your system.
>            Reporter: Jose Blanco
>            Assignee: Richard Rodgers
>         Attachments: java_files.zip, jhove_config_files.zip, jsp_files.zip
>
>
> This patch uses JHOVE to provide rough-and-ready format checking by 
> identifying that the file/bitstream extension matches  formats verifiable by 
> JHOVE. (Currently DSpace accepts a deposit's file extension as gospel, so a 
> user could tack a ".txt" extension onto a GIF and DSpace would assign the 
> incorrect format to the file based on that incorrect extension.) 
> This patch also also contains code to check the file for the presence of 
> viruses.
> In order to use this patch you must have jhove and ClamAV installed on your 
> system. 
> Important notes:
> (1) HTML identification has proved unreliable ( by jhove ), so this patch 
> does not return accurate results for that 
> file format.
> (2) This code does not fully incorporate JHOVE's validation functions; it 
> only verifies that what depositors intended to submit is in fact what they 
> submitted.
> The following are returned messages when an error is detected:
> Text in [brackets] is a returned value, ALLCAPS can/should be modified to 
> reflect your current installation.
> Questionable AIFF, GIF, JPG, PDF, TIF, WAVE, XML:
> DSPACE could not verify that your file is a valid [file_format_extension]. 
> Please check the file format and ".[file_format_extension]" extension.
> Questionable TXT:
> DSPACE found the text file you are trying to upload is neither UTF-8 nor 
> ASCII. Please verify that your file is in the format you wanted.
> Spaces in filenames ( this is an additional check ):
> The file name contains spaces; this is not recommended. If possible, please 
> replace spaces with underscores: "_".
> Virus detected:
> DSPACE detected a virus in this file. Please repair it and resume the 
> deposit. If you need assistance, please contact us: EMAIL_ADDRESS.
> To get the patch working:
> Add the jhove conf files to
> [dspace]/jhove direcoty
> Here are the conf files:
> jhove-aiff.conf
> jhove-ascii.conf
> jhove-gif.conf
> jhove-jpeg.conff
> jhove-pdf.conf
> jhove-tiff.conf
> jhove-utf8.conf
> jhove-wave.conf
> jhove-xml.conf
> Also the following files were changed:
> dspace-api/src/main/java/org/dspace/submit/step/UploadStep.java
> dspace-jspui/dspace-jspui-api/src/main/java/org/dspace/app/webui/submit/step/JSPUploadStep.java
> dspace-api/src/main/java/org/dspace/content/FormatIdentifier.java
> dspace/modules/jspui/src/main/webapp/submit/get-file-format.jsp ( locally 
> customized )
> dspace/modules/jspui/src/main/webapp/submit/upload-error-virus.jsp ( new file 
> - placed in locally modified area for the jspui interface)
> These files are attached with this patch.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
https://jira.duraspace.org/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

------------------------------------------------------------------------------
What You Don't Know About Data Connectivity CAN Hurt You
This paper provides an overview of data connectivity, details
its effect on application quality, and explores various alternative
solutions. http://p.sf.net/sfu/progress-d2d
_______________________________________________
Dspace-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-devel

Reply via email to