Archiva should clean bad UTF-8 data in poms
-------------------------------------------

                 Key: MRM-1069
                 URL: http://jira.codehaus.org/browse/MRM-1069
             Project: Archiva
          Issue Type: Improvement
    Affects Versions: 1.1.2
            Reporter: Wendy Smoak


Archiva should have a mechanism to clean the repository of bad poms that 
contain improper UTF-8 data.

Possible places for this to occur:

1) During the repository scan for content.
   Have a process that reads the XML data as a stream (not as an XML document) 
and replaces any improper UTF-8 characters with the appropriate XML Latin 
entities (example: changing "ø" to "&oslash;").

2) During pom download, check the pom and clean it inline on the way down to 
the user.
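
A rough sketch of how the cleaning step for either option might look. This is
not Archiva code: the class PomUtf8Cleaner and its methods are hypothetical
names, and it assumes that any byte which is not valid UTF-8 was meant as
ISO-8859-1 (Latin-1). It emits numeric character references (for example
"&#248;") rather than named entities such as "&oslash;", because XML only
predefines amp, lt, gt, apos and quot unless a DTD declares more.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.nio.ByteBuffer;
import java.nio.CharBuffer;
import java.nio.charset.CharsetDecoder;
import java.nio.charset.CoderResult;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;

// Hypothetical helper for illustration only; not part of Archiva.
final class PomUtf8Cleaner {

    // Decodes raw pom bytes as strict UTF-8. Valid sequences pass through
    // unchanged; each invalid byte is treated as Latin-1 (an assumption)
    // and written as a numeric character reference.
    static String clean(byte[] raw) {
        CharsetDecoder decoder = StandardCharsets.UTF_8.newDecoder()
                .onMalformedInput(CodingErrorAction.REPORT)
                .onUnmappableCharacter(CodingErrorAction.REPORT);
        ByteBuffer in = ByteBuffer.wrap(raw);
        // UTF-8 never decodes to more chars than it has bytes.
        CharBuffer chunk = CharBuffer.allocate(Math.max(raw.length, 1));
        StringBuilder out = new StringBuilder(raw.length);

        while (true) {
            CoderResult result = decoder.decode(in, chunk, true);
            // Keep whatever decoded cleanly before the decoder stopped.
            chunk.flip();
            out.append(chunk);
            chunk.clear();
            if (result.isError()) {
                // The buffer is positioned at the bad bytes; consume them.
                for (int i = 0; i < result.length(); i++) {
                    out.append("&#").append(in.get() & 0xFF).append(';');
                }
            } else if (result.isUnderflow()) {
                break; // all input consumed
            }
        }
        return out.toString();
    }

    // Option 2 (clean on download): reads the whole pom, cleans it, and
    // writes it back out as UTF-8. Uses InputStream.readAllBytes (Java 9+).
    static void copyCleaned(InputStream in, OutputStream out) throws IOException {
        out.write(clean(in.readAllBytes()).getBytes(StandardCharsets.UTF_8));
    }
}
```

The same clean() call could be run from the repository scanner (option 1) or
wrapped around the download stream via copyCleaned() (option 2). Buffering the
whole pom is a simplification that seems acceptable for files of this size; a
real implementation would probably also want to log which poms were altered.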

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
http://jira.codehaus.org/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

