Hi Alexei,

Thank you for looking into this. I am glad to hear that Arches should support 
diacriticals. 

Here is the error message on loading the 'Ruler' Authority document:

RULER_AUTHORITY_DOCUMENT.csv

ERRORS IN FILE: RULER_AUTHORITY_DOCUMENT.values.csv

ERRORS IN FILE: RULER_AUTHORITY_DOCUMENT.csv

ERROR: Make sure the file is saved with UTF-8 encoding
'utf8' codec can't decode byte 0xea in position 30: invalid continuation byte
Traceback (most recent call last):
  File 
"/opt/projects/ENV/lib/python2.7/site-packages/arches/management/commands/package_utils/authority_files.py",
 line 112, in load_authority_file
    for row in rows:
  File "/opt/projects/ENV/lib/python2.7/site-packages/unicodecsv/py2.py", line 
217, in next
    row = csv.DictReader.next(self)
  File "/usr/local/lib/python2.7/csv.py", line 104, in next
    row = self.reader.next()
  File "/opt/projects/ENV/lib/python2.7/site-packages/unicodecsv/py2.py", line 
128, in next
    for value in row]
  File "/opt/projects/ENV/lib/python2.7/encodings/utf_8_sig.py", line 22, in 
decode
    (output, consumed) = codecs.utf_8_decode(input, errors, True)
UnicodeDecodeError: 'utf8' codec can't decode byte 0xea in position 30: invalid 
continuation byte

ERROR in row 31 (Legacyoid (RULER_UID:30) not found.  Make sure your 
ParentConceptid in the  

This caused further errors in the Ruler Values files as can be seen from above. 
I do not have a copy of the authority file that caused the error asI have since 
corrected it and changed it in a few places. But the alternative name was 

Ptolemaîos Philadelphos

and I believe it was the circumflex above the 'i' that caused the problem. 
Certainly when I removed the circumflex, the file loaded OK.

Thank you, 
Lucy


----- Original Message ----- 
  From: Alexei Peters 
  To: Lucy FJ 
  Cc: Arches Project 
  Sent: Wednesday, January 20, 2016 8:24 PM
  Subject: Re: [Arches] Diacriticals in authority and .Arches files problems


  Hi Lucy,
  The .arches file should support diacritics.  I'm actually surprised that the 
authority files don't.  I just tested a local file and I was able to add these 
records:


  conceptid,PrefLabel,AltLabels,ParentConceptid,ConceptType,Provider 
  
20000001-0000-0000-0000-000000000000,Portland,,CITY_AUTHORITY_DOCUMENT.csv,Index,GCI
  20000002-0000-0000-0000-000000000000,San Francisco,The Bay 
Area,CITY_AUTHORITY_DOCUMENT.csv,Index,GCI
  20000003-0000-0000-0000-000000000000,San Jose,San 
José,CITY_AUTHORITY_DOCUMENT.csv,Index,GCI


  Notice that the alt label for San Jose, is San José


  Can you share the authority file that you're having trouble with?
  Cheers,
  Alexei




  Director of Web Development - Farallon Geographics, Inc. - 971.227.3173



  On Wed, Jan 20, 2016 at 12:32 AM, Lucy FJ <[email protected]> wrote:

    Hi all,
    We have been loading customised authority files and have noticed that 
Arches rejects words with diacriticals (accents etc). This is not a problem for 
us as we were happy to remove them  and if we really want them we can enter 
then through the RDM. But will this problem occur when loading resource data 
through .arches? We need to input place names as alternative names using 
diacriticals and it would be much easier if we can do this via .arches files. 
We know we can input them using the resource data manager but obviously when 
dealing with about 3000 entries,,this is time consuming.
    Any ideas?
    Lucy

    --
    -- To post, send email to [email protected]. To unsubscribe, 
send email to [email protected]. For more information, 
visit https://groups.google.com/d/forum/archesproject?hl=en
    ---
    You received this message because you are subscribed to the Google Groups 
"Arches Project" group.
    To unsubscribe from this group and stop receiving emails from it, send an 
email to [email protected].
    For more options, visit https://groups.google.com/d/optout.


-- 
-- To post, send email to [email protected]. To unsubscribe, send 
email to [email protected]. For more information, 
visit https://groups.google.com/d/forum/archesproject?hl=en
--- 
You received this message because you are subscribed to the Google Groups 
"Arches Project" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/d/optout.

Reply via email to