I know what's going on. The file is missing a metadata that tells it
what coding was used for the text.

http://www.sno.phy.queensu.ca/~phil/exiftool/faq.html#Q10

It reads: "IPTC†: The value of the IPTC:CodedCharacterSet tag determines
how the internal IPTC string values are interpreted. If
CodedCharacterSet exists and has a value of "UTF8" (or "ESC % G") then
string values are assumed to be stored as UTF‑8. Otherwise the internal
IPTC encoding is assumed to be Windows Latin1 (cp1252), but this can be
changed with "-charset iptc=CHARSET"."

So I took the GottenGeography tagged image and added the tag

exiftool -CodedCharacterSet=UTF8 08.tif

Now the output is correct in both tools

exiftool 08.tif 
Coded Character Set             : UTF8
Envelope Record Version         : 4
City                            : San Martín de los Andes
Province-State                  : Neuquén

exiv2 -pi 08.tif
Iptc.Envelope.CharacterSet                   String      3  
Iptc.Envelope.ModelVersion                   Short       1  4
Iptc.Application2.City                       String     24  San Martín de los 
Andes
Iptc.Application2.ProvinceState              String      8  Neuquén
Iptc.Application2.CountryName                String      9  Argentina
Iptc.Application2.RecordVersion              Short       1  4




** Changed in: gottengeography
       Status: Invalid => New

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1215249

Title:
  Character set (UTF-8) issues when geotagging

To manage notifications about this bug go to:
https://bugs.launchpad.net/gottengeography/+bug/1215249/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

Reply via email to