Where can I find Adam Cox's excellent guide to creating a .Arches file? :)

I am importing an old database that is a huge mess, and it seems I will 
need to prepare a .arches file by querying every column of every table into 
a .arches format. 
E.g. this query:

select CONCAT('HERITAGE_RESOURCE_', SpecimenID) as "RESOURCEID", 
'HERITAGE_RESOURCE.E18' as "RESOURSETYPE", 
'NAME.E41' as "ATTRIBUTENAME", 
CommonName as "ATTRIBUTEVALUE", 
'NAME.E41' as "GROUPID"
from dbo.Specimens;

will result in: 

RESOURCEID   RESOURCETYPE ATTRIBUTENAME    ATTRIBUTEVALUE  GROUPID
HERITAGE_RESOURCE_4  HERITAGE_RESOURCE.E18    NAME.E41    Hot Springs NAME.E41
HERITAGE_RESOURCE_5   HERITAGE_RESOURCE.E18 NAME.E41     Sapsuk NAME.E41
HERITAGE_RESOURCE_6   HERITAGE_RESOURCE.E18 NAME.E41     Sapsuk NAME.E41
HERITAGE_RESOURCE_7   HERITAGE_RESOURCE.E18 NAME.E41    Adamagan NAME.E41



And then of course there are the relations...

Any advise/tactics/example sql queries are very welcome! :D



On Wednesday, 17 February 2016 12:46:48 UTC-5, Adam Cox wrote:
>
> Hi Lucy, I have come to a similar conclusion in the past.  It's been 
> beneficial to batch import a single .arches file with a good deal of 
> information in it, but with no attempt to get absolutely everything.  
> Programming a way to convert some of the extra random pieces to .arches 
> seemed too costly (time-costly) compared to the relative ease of entering 
> that information through the arches interface.  However, populating the 
> entire database with a core of base data that could be added to later is 
> extremely valuable.
>
> Also, heads-up!  In v4 it looks like 
> <https://github.com/archesproject/arches/wiki/Import-Export> the .arches 
> format will give way to a more straightforward import approach.  That won't 
> be relevant to you for a while, but just something to keep in mind before 
> you spend extra time building processes for .arches that you expect to use 
> for years...
>
> Adam
>
> On Wed, Feb 17, 2016 at 4:07 AM, Lucinda Fletcher-Jones <[email protected] 
> <javascript:>> wrote:
>
>> Hi Adam,
>>
>> Your help is always appreciated and I really like the MS Excel query 
>> method of converting the excel file into an .arches file but that perennial 
>> problem, diacritics, is rearing its ugly head again. The MS query cannot 
>> import the diacritics from the Excel file (I have tried many times using 2 
>> different versions of Excel) and so adding those extra rows as you suggest 
>> below would be necessary for every resource as we have the Arabic name and 
>> an English name with diacritics for each. It probably wouldn't take that 
>> long but allows for human error to creep in...
>>
>> I also looked into using Open Office. By googling, I found a method by 
>> which I could use OO Base, import the Excel file and create a query from 
>> that. But, the modified query is not working at present. I have some syntax 
>> problems but I also believe that this method can only handle very simple 
>> queries.
>> As for the macros, I don't know python and my Javascript is rusty, so I 
>> might leave that to a new programmer coming on board soon. 
>> I am rapidly coming to the conclusion that a combination of batch input 
>> through .arches files and user input might be best for our project!
>>
>>
>>
>>
>> On Monday, February 15, 2016 at 7:42:42 PM UTC+2, Adam Cox wrote:
>>>
>>> Hi Lucy, personally I'd recommend modifying (a copy of!) the excel file, 
>>> because then you can continually recreate the .arches file from it--I've 
>>> always found it beneficial to spend the most time on replicable steps 
>>> rather than one time (copy/paste) operations.
>>>
>>> Here's something that could help the process, and I'll use the Name, 
>>> Name Type as an example:
>>>
>>> If you have a column of names, and you know that they are all primary 
>>> names, you can avoid creating a new column where every value is 
>>> "NAME_TYPE:1" by just hard-coding that term into a statement in the query.  
>>> So, your new statement would not reference a column name, but would just 
>>> have "NAME_TYPE:1" written into it.  In the example I made, the geometry 
>>> column shows an example of combining strings, 'POINT (', with column 
>>> names, lat.  So that may be helpful for reference.
>>>
>>> A couple of other points related to this:
>>>
>>> If you have begun to use open office, it would be really great to see if 
>>> this same SQL stuff can be used there.  Just a word of warning, even though 
>>> it's all SQL, the syntax (double vs. single quotes, the use of AS) may 
>>> differ between Excel and Open Office.
>>>
>>> Also, it occurs to me that once you have created a spreadsheet that is 
>>> formatted like a .arches file, you could probably do some pretty nimble 
>>> things with it by using the sorting and filtering capabilities that Excel 
>>> has and I'm sure open office has.  A .arches file is just a bunch of rows 
>>> with two levels of sorting: first by RESOURCEID and second by GROUPID.  To 
>>> add a line to a specific resource, you could just filter the rows based on 
>>> a RESOURCEID, add your new row or two, and then remove the filter, re-sort, 
>>> and you will have properly "inserted" a row.
>>>
>>> Finally, I just downloaded Open Office, and it looks like you can write 
>>> Python or Javascript macros.  This is pretty huge (if you're into that sort 
>>> of thing) and a simple python or js macro could be used in place of the SQL 
>>> query I made.  Sharing open office macros is probably really easy, so this 
>>> would be a great way to collaborate.
>>>
>>> Good luck!
>>>
>>>
>>> On Sunday, February 14, 2016 at 6:19:35 AM UTC-6, Lucy FJ wrote:
>>>>
>>>> Hi Adam,
>>>>
>>>> I have now thoroughly read the documentation on .Arches file and have 
>>>> played around with the Microsoft query and have a much better idea of what 
>>>> I need to do, so there is no need to answer some of the questions I asked 
>>>> below! 
>>>>
>>>> It seems that you either need to do a lot of manipulation of the excel 
>>>> file or a lot of copy and pasting in the .Arches file which will then be 
>>>> enormous even for our modest database!  
>>>>
>>>>
>>>>
>>>> Lucy
>>>>
>>>>
>>>>
>>>> On Sunday, February 14, 2016 at 11:01:23 AM UTC+2, Lucy FJ wrote:
>>>>>
>>>>> Hi Adam,
>>>>>
>>>>> Finally I am back testing out creating an .Arches file (but I am not 
>>>>> able to load them yet and see the results) and I do have further 
>>>>> questions 
>>>>> about adding alternative names and several resource classifications. 
>>>>>
>>>>> Just to be clear that I understood you correctly, the group ID keeps 
>>>>> records that are related within a heritage resource together, and the 
>>>>> resource ID ensures that all data for a particular resource is kept 
>>>>> together? Therefore, each alternative name/nametype would have a separate 
>>>>> Group ID from the preferred name/nametype, but the same Resource ID?
>>>>>
>>>>> Secondly, I am not very familiar with Microsoft query, but it seems 
>>>>> that the original Excel file for creating the query from, can only have 
>>>>> one 
>>>>> 'row' per record or archaeological site in our case. So as you suggested, 
>>>>> we would need to put in a new column for each alternative name we have 
>>>>> and 
>>>>> for each site type classification and of course modify the query to add 
>>>>> more statements for the extra names etc. I am looking for a method of 
>>>>> reading the Excel file in the way we have set up with all the alternative 
>>>>> name being in *one* column, as it would be more efficient for us - 
>>>>> sometimes we have 10 alternative names but I don't think Microsoft Query 
>>>>> can handle this. 
>>>>>
>>>>> I don't know if you have any further thoughts on this or know how 
>>>>> others have handles this.
>>>>> Thank you very much for your help, which is always appreciated! 
>>>>> Lucy
>>>>>
>>>>> On Monday, January 18, 2016 at 7:27:24 PM UTC+2, Adam Cox wrote:
>>>>>>
>>>>>> Hi Lucy, I'm happy to hear the documentation was helpful.
>>>>>>
>>>>>> To add more attributes to your .arches file, you'll just need to add 
>>>>>> more statements to the SQL query.  First, because the SQL window is 
>>>>>> pretty 
>>>>>> cramped in Excel, I'd recommend pasting your query into Notepad ++ so 
>>>>>> it's 
>>>>>> easier to work with.
>>>>>>
>>>>>> If you look at the two statements that define Name and Name_Type, 
>>>>>> you'll see that they both reference "group1".  What you want to do is 
>>>>>> make 
>>>>>> more groups, so I'd recommend copying and pasting the Name_Type 
>>>>>> statement 
>>>>>> from the existing query (because it is the most standard) and pasting it 
>>>>>> below, as many times as you need--one time per new attribute that you 
>>>>>> need 
>>>>>> to add.  Then just make sure to set the group value appropriately: if 
>>>>>> you 
>>>>>> have an alternate name, you'll have to make a new column for that name, 
>>>>>> and 
>>>>>> make a column with the correct conceptid (or hard-code the conceptid 
>>>>>> into 
>>>>>> the new statement), and then put "group3" into each statement. Or, if 
>>>>>> you 
>>>>>> just have a single column for a standalone attribute like "Site Type", 
>>>>>> make 
>>>>>> sure it has its own unique group ID.
>>>>>>
>>>>>> I hope that makes sense.  I have to run now, but if you have more 
>>>>>> questions don't hesitate to ask them!
>>>>>>
>>>>>> Adam
>>>>>>
>>>>>> On Sunday, January 17, 2016 at 12:17:31 PM UTC-6, Lucinda 
>>>>>> Fletcher-Jones wrote:
>>>>>>>
>>>>>>>
>>>>>>> <https://lh3.googleusercontent.com/-Qu9c0GYT4ig/VpvaZj1w5yI/AAAAAAAAFXo/aMjV0c5GLJg/s1600/Dabanarti.JPG>
>>>>>>>
>>>>>>> Hi all,
>>>>>>>
>>>>>>> I have worked through the example in Adam Cox's excellent guide to 
>>>>>>> creating a .Arches file from a Microsoft Excel file which is where we 
>>>>>>> have 
>>>>>>> all our archaeological site data at present, and it worked very well. 
>>>>>>> Now I 
>>>>>>> have a question. Our data, as everyone else's is a little more 
>>>>>>> complicated 
>>>>>>> than the example. How do you deal with a site that has more than one 
>>>>>>> name 
>>>>>>> and more than one classification etc? for example: Dabanarti with three 
>>>>>>> names and three site types. 
>>>>>>>
>>>>>>>
>>>>>>> <https://lh3.googleusercontent.com/-Qu9c0GYT4ig/VpvaZj1w5yI/AAAAAAAAFXo/aMjV0c5GLJg/s1600/Dabanarti.JPG>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> Thank you very much, Lucy
>>>>>>>
>>>>>>>
>>>>>>> -- 
>> -- To post, send email to [email protected] <javascript:>. To 
>> unsubscribe, send email to [email protected] <javascript:>. 
>> For more information, visit 
>> https://groups.google.com/d/forum/archesproject?hl=en
>> --- 
>> You received this message because you are subscribed to the Google Groups 
>> "Arches Project" group.
>> To unsubscribe from this group and stop receiving emails from it, send an 
>> email to [email protected] <javascript:>.
>> For more options, visit https://groups.google.com/d/optout.
>>
>
>

-- 
-- To post, send email to [email protected]. To unsubscribe, send 
email to [email protected]. For more information, 
visit https://groups.google.com/d/forum/archesproject?hl=en
--- 
You received this message because you are subscribed to the Google Groups 
"Arches Project" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/d/optout.

Reply via email to