Issue #23040 has been reported by Paul Behrisch.
----------------------------------------
Bug #23040: Puppet can't read UTF-8 filenames
https://projects.puppetlabs.com/issues/23040
* Author: Paul Behrisch
* Status: Unreviewed
* Priority: Urgent
* Assignee:
* Category:
* Target version:
* Affected Puppet version:
* Keywords: puppet utf-8
* Branch:
----------------------------------------
Hey guys,
I have a quite annoying problem with puppet.
I have a manifest which includes code like:
<pre>
file { "/tmp/TaiwanYo-台灣遊-11.jpg":
ensure => file,
#content => "yeah",
source =>
"puppet:///modules/wordpress-blog/wordpress/wp-content/uploads/TaiwanYo11.jpg",
#recurse => true,
owner => 'www-data',
group => 'www-data',
mode => '0644',
}
file { "/tmp/TaiwanYo-台灣遊-12.jpg":
ensure => file,
source =>
"puppet:///modules/wordpress-blog/wordpress/wp-content/uploads/TaiwanYo-台灣遊-11.jpg",
owner => 'www-data',
group => 'www-data',
mode => '0644',
}
file { "/tmp/TaiwanYo-台灣遊-13.jpg":
ensure => file,
source =>
"puppet:///modules/wordpress-blog/wordpress/wp-content/uploads/TaiwanYo-ü.jpg",
owner => 'www-data',
group => 'www-data',
mode => '0644',
}
</pre>
The first one runs fine, but the other both fail.
The puppet version on server and client is 3.3.1. The operating system is
Debian Wheezy and the installed package is from the puppetlabs repository.
The installed ruby version is 1.9.3p194 from the Debian repo. I'm quite sure
that it's a puppet problem, because i can print the characters fine in ruby
<pre>
ruby 1.9.3p194 (2012-04-20 revision 35410) [x86_64-linux]
root@manager:/etc/puppet/modules/wordpress-blog# irb
irb(main):001:0> puts "台灣遊 ü"
台灣遊 ü
</pre>
The client output is the following:
<pre>
Error: /Stage[main]/Wordpress-blog/File[/tmp/TaiwanYo-台灣遊-13.jpg]: Could not
evaluate: invalid byte sequence in US-ASCII Could not retrieve file metadata
for
puppet:///modules/wordpress-blog/wordpress/wp-content/uploads/TaiwanYo-ü.jpg:
invalid byte sequence in US-ASCII
Notice: /Stage[main]/Wordpress-blog/File[/tmp/TaiwanYo-台灣遊-11.jpg]/ensure:
defined content as '{md5}30dfed09a228b46ebdd200feeeac8028'
Error: /Stage[main]/Wordpress-blog/File[/tmp/TaiwanYo-台灣遊-12.jpg]: Could not
evaluate: invalid byte sequence in US-ASCII Could not retrieve file metadata
for
puppet:///modules/wordpress-blog/wordpress/wp-content/uploads/TaiwanYo-台灣遊-11.jpg:
invalid byte sequence in US-ASCII
</pre>
As you can see it works if the output filename is in chinese, but the source in
English. However, if the source file has unicode characters in it, it will fail.
Cheers,
Paul
--
You have received this notification because you have either subscribed to it,
or are involved in it.
To change your notification preferences, please click here:
http://projects.puppetlabs.com/my/account
--
You received this message because you are subscribed to the Google Groups
"Puppet Bugs" group.
To unsubscribe from this group and stop receiving emails from it, send an email
to [email protected].
To post to this group, send email to [email protected].
Visit this group at http://groups.google.com/group/puppet-bugs.
For more options, visit https://groups.google.com/groups/opt_out.