Hi Alexei,
Thank you for looking into this. I am glad to hear that Arches should support
diacriticals.
Here is the error message on loading the 'Ruler' Authority document:
RULER_AUTHORITY_DOCUMENT.csv
ERRORS IN FILE: RULER_AUTHORITY_DOCUMENT.values.csv
ERRORS IN FILE: RULER_AUTHORITY_DOCUMENT.csv
ERROR: Make sure the file is saved with UTF-8 encoding
'utf8' codec can't decode byte 0xea in position 30: invalid continuation byte
Traceback (most recent call last):
File
"/opt/projects/ENV/lib/python2.7/site-packages/arches/management/commands/package_utils/authority_files.py",
line 112, in load_authority_file
for row in rows:
File "/opt/projects/ENV/lib/python2.7/site-packages/unicodecsv/py2.py", line
217, in next
row = csv.DictReader.next(self)
File "/usr/local/lib/python2.7/csv.py", line 104, in next
row = self.reader.next()
File "/opt/projects/ENV/lib/python2.7/site-packages/unicodecsv/py2.py", line
128, in next
for value in row]
File "/opt/projects/ENV/lib/python2.7/encodings/utf_8_sig.py", line 22, in
decode
(output, consumed) = codecs.utf_8_decode(input, errors, True)
UnicodeDecodeError: 'utf8' codec can't decode byte 0xea in position 30: invalid
continuation byte
ERROR in row 31 (Legacyoid (RULER_UID:30) not found. Make sure your
ParentConceptid in the
This caused further errors in the Ruler Values files as can be seen from above.
I do not have a copy of the authority file that caused the error asI have since
corrected it and changed it in a few places. But the alternative name was
Ptolemaîos Philadelphos
and I believe it was the circumflex above the 'i' that caused the problem.
Certainly when I removed the circumflex, the file loaded OK.
Thank you,
Lucy
----- Original Message -----
From: Alexei Peters
To: Lucy FJ
Cc: Arches Project
Sent: Wednesday, January 20, 2016 8:24 PM
Subject: Re: [Arches] Diacriticals in authority and .Arches files problems
Hi Lucy,
The .arches file should support diacritics. I'm actually surprised that the
authority files don't. I just tested a local file and I was able to add these
records:
conceptid,PrefLabel,AltLabels,ParentConceptid,ConceptType,Provider
20000001-0000-0000-0000-000000000000,Portland,,CITY_AUTHORITY_DOCUMENT.csv,Index,GCI
20000002-0000-0000-0000-000000000000,San Francisco,The Bay
Area,CITY_AUTHORITY_DOCUMENT.csv,Index,GCI
20000003-0000-0000-0000-000000000000,San Jose,San
José,CITY_AUTHORITY_DOCUMENT.csv,Index,GCI
Notice that the alt label for San Jose, is San José
Can you share the authority file that you're having trouble with?
Cheers,
Alexei
Director of Web Development - Farallon Geographics, Inc. - 971.227.3173
On Wed, Jan 20, 2016 at 12:32 AM, Lucy FJ <[email protected]> wrote:
Hi all,
We have been loading customised authority files and have noticed that
Arches rejects words with diacriticals (accents etc). This is not a problem for
us as we were happy to remove them and if we really want them we can enter
then through the RDM. But will this problem occur when loading resource data
through .arches? We need to input place names as alternative names using
diacriticals and it would be much easier if we can do this via .arches files.
We know we can input them using the resource data manager but obviously when
dealing with about 3000 entries,,this is time consuming.
Any ideas?
Lucy
--
-- To post, send email to [email protected]. To unsubscribe,
send email to [email protected]. For more information,
visit https://groups.google.com/d/forum/archesproject?hl=en
---
You received this message because you are subscribed to the Google Groups
"Arches Project" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to [email protected].
For more options, visit https://groups.google.com/d/optout.
--
-- To post, send email to [email protected]. To unsubscribe, send
email to [email protected]. For more information,
visit https://groups.google.com/d/forum/archesproject?hl=en
---
You received this message because you are subscribed to the Google Groups
"Arches Project" group.
To unsubscribe from this group and stop receiving emails from it, send an email
to [email protected].
For more options, visit https://groups.google.com/d/optout.