[ 
https://issues.apache.org/jira/browse/NET-516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13906796#comment-13906796
 ] 

pavan commented on NET-516:
---------------------------

The issue is caused by the regular expression used in commons-net-2.0:
(\\S+)\\s+(\\S+)\\s+(?:(<DIR>)|([0-9]+))\\s+(\\S.*). Look carefully at the
last capturing group, (\\S.*): it first matches one non-whitespace character
and then any run of characters that . accepts.

According to the Java Pattern class documentation, 0x85 (U+0085, next line) is
a line terminator, and by default . matches any character except line
terminators, so .* stops at that character and the rest of the file name is
not matched. Compiling the pattern with Pattern.DOTALL makes . match line
terminators as well. I am attaching a sample program for reproducing the issue.
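
For illustration, here is a minimal sketch of the behaviour described above. It
is not the attached program. The class name Net516Demo and the listing line are
made up for this example, and it assumes the parser requires the whole line to
match (Matcher.matches()) and that the decoded listing line contains U+0085.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class Net516Demo {
    // The regex quoted above, written as a Java string literal.
    static final String REGEX =
        "(\\S+)\\s+(\\S+)\\s+(?:(<DIR>)|([0-9]+))\\s+(\\S.*)";

    // Hypothetical listing line whose file name contains U+0085.
    static final String LINE = "02-20-14  10:30AM  1234 ab\u0085cd.txt";

    // Returns the file name (group 5) if the whole line matches, else null.
    static String fileName(String line, int flags) {
        Matcher m = Pattern.compile(REGEX, flags).matcher(line);
        return m.matches() ? m.group(5) : null;
    }

    public static void main(String[] args) {
        // Default flags: '.' rejects U+0085, so the full-line match fails.
        System.out.println("default matched: " + (fileName(LINE, 0) != null));
        // DOTALL: '.' also accepts line terminators, so the line matches.
        System.out.println("DOTALL matched:  "
            + (fileName(LINE, Pattern.DOTALL) != null));
    }
}
```

With the default flags the line is rejected, while with Pattern.DOTALL group 5
holds the complete file name including U+0085.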

>  problem occurs if the filename contains one or more characters of which the 
> second byte of Shift-JIS code is 0x85
> ------------------------------------------------------------------------------------------------------------------
>
>                 Key: NET-516
>                 URL: https://issues.apache.org/jira/browse/NET-516
>             Project: Commons Net
>          Issue Type: Bug
>          Components: FTP
>    Affects Versions: 2.0
>         Environment: windows
>            Reporter: Asha K S
>             Fix For: 2.0
>
>         Attachments: notworking.png, working.png
>
>
> Problem occurs if the filename contains one or more characters of which the 
> second byte of Shift-JIS code is 0x85, on a windows Japanese machine when 
> listing file names from IIS FTP server.
> This was working fine in commons-net-1.4.0.jar . 



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)
