Hi Lucy, you can check the encoding in Notepad ++. Open your authority document with that program, and click the Encoding menu. Your file should be in "UTF-8" or "UTF-8 without BOM" (depends on the version of Notepad ++ you have). The î character should work as far as I know...
On Thu, Jan 21, 2016 at 7:18 AM, 'Lucy Fletcher-Jones' via Arches Project < [email protected]> wrote: > Hi Alexei, > > Thank you for looking into this. I am glad to hear that Arches should > support diacriticals. > > Here is the error message on loading the 'Ruler' Authority document: > > RULER_AUTHORITY_DOCUMENT.csv > > ERRORS IN FILE: RULER_AUTHORITY_DOCUMENT.values.csv > > ERRORS IN FILE: RULER_AUTHORITY_DOCUMENT.csv > > ERROR: Make sure the file is saved with UTF-8 encoding > 'utf8' codec can't decode byte 0xea in position 30: invalid continuation > byte > Traceback (most recent call last): > File > "/opt/projects/ENV/lib/python2.7/site-packages/arches/management/commands/package_utils/authority_files.py", > line 112, in load_authority_file > for row in rows: > File "/opt/projects/ENV/lib/python2.7/site-packages/unicodecsv/py2.py", > line 217, in next > row = csv.DictReader.next(self) > File "/usr/local/lib/python2.7/csv.py", line 104, in next > row = self.reader.next() > File "/opt/projects/ENV/lib/python2.7/site-packages/unicodecsv/py2.py", > line 128, in next > for value in row] > File "/opt/projects/ENV/lib/python2.7/encodings/utf_8_sig.py", line 22, > in decode > (output, consumed) = codecs.utf_8_decode(input, errors, True) > UnicodeDecodeError: 'utf8' codec can't decode byte 0xea in position 30: > invalid continuation byte > > ERROR in row 31 (Legacyoid (RULER_UID:30) not found. Make sure your > ParentConceptid in the > > This caused further errors in the Ruler Values files as can be seen from > above. > I do not have a copy of the authority file that caused the error asI have > since corrected it and changed it in a few places. But the alternative > name was > > Ptolemaîos Philadelphos > > and I believe it was the circumflex above the 'i' that caused the problem. > Certainly when I removed the circumflex, the file loaded OK. > > Thank you, > Lucy > > > ----- Original Message ----- > > *From:* Alexei Peters <[email protected]> > *To:* Lucy FJ <[email protected]> > *Cc:* Arches Project <[email protected]> > *Sent:* Wednesday, January 20, 2016 8:24 PM > *Subject:* Re: [Arches] Diacriticals in authority and .Arches files > problems > > Hi Lucy, > The .arches file should support diacritics. I'm actually surprised that > the authority files don't. I just tested a local file and I was able to > add these records: > > conceptid,PrefLabel,AltLabels,ParentConceptid,ConceptType,Provider > > 20000001-0000-0000-0000-000000000000,Portland,,CITY_AUTHORITY_DOCUMENT.csv,Index,GCI > 20000002-0000-0000-0000-000000000000,San Francisco,The Bay > Area,CITY_AUTHORITY_DOCUMENT.csv,Index,GCI > 20000003-0000-0000-0000-000000000000,San Jose,San > José,CITY_AUTHORITY_DOCUMENT.csv,Index,GCI > > Notice that the alt label for San Jose, is San José > > Can you share the authority file that you're having trouble with? > Cheers, > Alexei > > > Director of Web Development - Farallon Geographics, Inc. - 971.227.3173 > > On Wed, Jan 20, 2016 at 12:32 AM, Lucy FJ <[email protected]> wrote: > >> Hi all, >> We have been loading customised authority files and have noticed that >> Arches rejects words with diacriticals (accents etc). This is not a problem >> for us as we were happy to remove them and if we really want them we can >> enter then through the RDM. But will this problem occur when loading >> resource data through .arches? We need to input place names as alternative >> names using diacriticals and it would be much easier if we can do this via >> .arches files. We know we can input them using the resource data manager >> but obviously when dealing with about 3000 entries,,this is time consuming. >> Any ideas? >> Lucy >> >> -- >> -- To post, send email to [email protected]. To >> unsubscribe, send email to [email protected]. >> For more information, visit >> https://groups.google.com/d/forum/archesproject?hl=en >> --- >> You received this message because you are subscribed to the Google Groups >> "Arches Project" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to [email protected]. >> For more options, visit https://groups.google.com/d/optout. >> > > -- > -- To post, send email to [email protected]. To unsubscribe, > send email to [email protected]. For more > information, visit https://groups.google.com/d/forum/archesproject?hl=en > --- > You received this message because you are subscribed to the Google Groups > "Arches Project" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > For more options, visit https://groups.google.com/d/optout. > -- -- To post, send email to [email protected]. To unsubscribe, send email to [email protected]. For more information, visit https://groups.google.com/d/forum/archesproject?hl=en --- You received this message because you are subscribed to the Google Groups "Arches Project" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/d/optout.
