[
https://issues.apache.org/jira/browse/CONNECTORS-696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13674110#comment-13674110
]
Minoru Osuka commented on CONNECTORS-696:
-----------------------------------------
Hi Karl,
I have compared the encoding of the filename which are outputted by Wget and
ManifoldCF.
Wget version is following.
[minoru@MacBook-Air:~/tmp]$ wget --version
GNU Wget 1.14 built on darwin12.1.0.
+digest +https +ipv6 +iri +large-file +nls -ntlm +opie +ssl/gnutls
Wgetrc:
/opt/local/etc/wgetrc (system)
Locale: /opt/local/share/locale
Compile: /usr/bin/clang -DHAVE_CONFIG_H -DSYSTEM_WGETRC="/opt/local/etc/wgetrc"
-DLOCALEDIR="/opt/local/share/locale" -I. -I../lib -I../lib
-I/opt/local/include -pipe -O2 -arch x86_64
Link: /usr/bin/clang -pipe -O2 -arch x86_64 -L/opt/local/lib -liconv -lintl
-arch x86_64 -lintl -liconv -lc -Wl,-framework -Wl,CoreFoundation
-lgnutls -ltasn1 -lgcrypt -lgpg-error -lz -lintl -liconv -lc
-lpthread -lgcrypt -lgpg-error -lz -lz -lidn ftp-opie.o gnutls.o
../lib/libgnu.a
Copyright (C) 2011 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later
<http://www.gnu.org/licenses/gpl.html>.
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law.
Originally written by Hrvoje Niksic <[email protected]>.
Please send bug reports and questions to <[email protected]>.
Wget command is here.
[minoru@MacBook-Air:~/tmp]$ wget -P wget --protocol-directories -r -l 1
http://www.rondhuit.com/
This is filename which is encoded by Wget, and decoded by Nkf command.
[minoru@MacBook-Air:~/tmp]$ ls
wget/http/www.rondhuit.com/{color:red}lucenesolr-revolution-2013%E3%81%AB%E5%8F%82%E5%8A%A0%E3%81%97%E3%81%BE%E3%81%97%E3%81%9F%EF%BC%81.html{color}
| nkf --url-input -w
wget/http/www.rondhuit.com/{color:blue}lucenesolr-revolution-2013に参加しました!.html{color}
This is filename which is encoded by ManifoldCF, and decoded by Nkf command.
[minoru@MacBook-Air:~/tmp]$ ls
out/http/www.rondhuit.com/{color:red}lucenesolr-revolution-2013%E3%81%AB%E5%8F%82%E5%8A%A0%E3%81%97%E3%81%BE%E3%81%97%E3%81%9F%EF%BC%81.html{color}
| nkf --url-input -w
out/http/www.rondhuit.com/{color:blue}lucenesolr-revolution-2013に参加しました!.html{color}
Wget and ManifoldCF both show the same results. So I think ManifoldCF has the
compatibility of Wget for filename encoding.
> FileSystem Output Connector
> ---------------------------
>
> Key: CONNECTORS-696
> URL: https://issues.apache.org/jira/browse/CONNECTORS-696
> Project: ManifoldCF
> Issue Type: Improvement
> Components: File system connector
> Affects Versions: ManifoldCF 1.2
> Reporter: Minoru Osuka
> Assignee: Minoru Osuka
> Fix For: ManifoldCF 1.3
>
> Attachments: CONNECTORS-696.patch
>
>
> ManifoldCF already has FileSystem Repository Connector, but doesn't have
> FileSystem Output Connector.
> I'd like to use ManifoldCF like a wget in combination with WebConnector.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira