[CODE4LIB] Digital Humanities workshop in Dublin

2012-05-19 Thread Jodi Schneider
I'm helping organize this Digital Humanities workshop in Dublin. Please 
forward, especially to European groups! (Is there an ELAG mailing list?)

If any of the North American contingent will be in Europe around that time, 
we'd love to get you involved!

:) -Jodi

 Date: Tuesday 23 October - Thursday 25 October 2012
 Venue: Dublin and Maynooth, Ireland
 
 Realising the Opportunities of Digital Humanities
 
 A workshop jointly organised by:
 
 Digital Repository of Ireland (DRI)
 Digital Enterprise Research Institute (DERI)
 Digital Humanities Observatory (DHO)
 Digital Research Infrastructure for the Arts and Humanities (DARIAH-EU)
 
 The focus of this event is to engage academia, industry, cultural 
 institutions and public bodies to identify the key research challenges in 
 digital humanities, and to further build the academic-industry partnerships 
 that will enable adoption of digital humanities skills, technologies and 
 tools. This focus will extend to the innovative use of digital humanities 
 technologies in the public sector demonstrating social benefit, such as the 
 digitisation of the Irish census and the use of open linked public data.
 
 This is the first in a series of workshops targeting the industry-academic 
 interface for identifying and realising the opportunities of the digital 
 humanities. The event is jointly organised by the two major digital 
 humanities national infrastructures (the DRI and the DHO), and the largest 
 semantic web research Institute (DERI), together with a large-scale European 
 digital infrastructure for the Arts and Humanities (DARIAH).
 
 During the three days, participants will expand their skills through a series 
 of digital labs, lectures and masterclasses lead by subject matter experts, 
 in the areas of data visualisation and in the application of linked data to 
 leverage the semantic web for scholarly humanities research. This workshop 
 will combine a day of hands-on practical application with master classes and 
 lectures and a day long symposium drawing together academic and industry 
 practitioners.
 
 Themes will include:
 
   • Data Visualisation and Data Analytics for Digital Humanities 
 Scholarship
   • Leveraging Linked Data and the Semantic Web for Scholarly Research
   • IP and Licensing  'What You Can and Can't Do With Data'
   • Infrastructure and Data Modelling
 
 This event will take place in Dublin and Maynooth from Tuesday 23 October - 
 Thursday 25 October 2012. 
 Registration and programme details to be announced  http://www.dri.ie/events/
 For more information please email d...@ria.ie or phone +353 1 609 0674


Re: [CODE4LIB] WorldCat SRU queries - elimination of records without a DDC no from the result set

2012-05-19 Thread Roy Tennant
Arash,
Yes, we have made WorldCat available to researchers under a special
license agreement. I suggest contacting Thom Hickeyhic...@oclc.org
about such an arrangement. Thanks,
Roy

On Fri, May 18, 2012 at 3:46 AM, Arash.Joorabchi arash.joorab...@ul.ie wrote:
 Dear Karen,

 I am conducting a research experiment on automatic text classification and I 
 am trying to retrieve top matching bib records (which include DDC fields) for 
 a set of keyphrases extracted from a given document. So, I suppose this is a 
 rather exceptional use case. In fact, the right approach for this experiment 
 is to process the full dump of WorldCat database directly rather than sending 
 a limited number of queries via the API.

 I read here:
 http://dltj.org/article/worldcat-lld-may-become-available under-odc-by/
 that WorldCat might become available as open linked data in future, which 
 would solve my problem and help similar text mining projects. However, I 
 wonder if it is currently available to researchers under a 
 research/non-commercial use license agreement.

 Regards,
 Arash

 -Original Message-
 From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Karen 
 Coombs
 Sent: 17 May 2012 08:37
 To: CODE4LIB@LISTSERV.ND.EDU
 Subject: Re: [CODE4LIB] WorldCat SRU queries - elimination of records without 
 a DDC no from the result set

 I forwarded this thread to the Product Manager for the WorldCat Search
 API. She responded back that unfortunately this query is not possible
 using the API at this time.

 FYI, the SRU interface to WorldCat Search API doesn't currently
 support any scan type searches either.

 Is there a particular use case you're trying to support? Know that
 would help us document this as a possible enhancement.

 Karen

 Karen Coombs
 Senior Product Analyst
 Web Services
 OCLC
 coom...@oclc.org

 On Wed, May 16, 2012 at 9:49 PM, Arash.Joorabchi arash.joorab...@ul.ie 
 wrote:
 Hi Andy,



 I am a SRU newbie myself, so I don't know how this could be achieved
 using scan operations and could not find much info on SRU website
 (http://www.loc.gov/standards/sru/).

 As for the wildcards, according to this guide:
 http://www.oclc.org/support/documentation/worldcat/searching/refcard/sea
 rchworldcatquickreference.pdf the symbols should be preceded by at least
 3 characters, and therefore clauses like:



 ... AND srw.dd=*

 ... AND srw.dd=?.*

 ... AND srw/dd=###.*

 ... AND srw/dd=?3.*





 do not work and result in the following error:

 Diagnostics

 Identifier:

 info:srw/diagnostic/1/9

 Meaning:



 Details:



 Message:

 Not enough chars in truncated term:Truncated words too short(9)





 Thanks,

 Arash



 

 From: Houghton,Andrew [mailto:hough...@oclc.org]
 Sent: 16 May 2012 11:58
 To: Arash.Joorabchi
 Subject: Re: [CODE4LIB] WorldCat SRU queries - elimination of records
 without a DDC no from the result set



 I'm not an SRU guru, but is it possible to do a scan and look for a
 postings of zero?



 Andy.

 On May 16, 2012, at 6:39, Arash.Joorabchi arash.joorab...@ul.ie
 wrote:

        Hi mark,

        Srw.dd=* does not work either:

        Identifier:     info:srw/diagnostic/1/27
        Meaning:
        Details:        srw.dd
        Message:        The index [srw.dd] did not include a searchable
 value

        I suppose the only option left is to retrieve everything and
 filter the results on the client side.

        Thanks for your quick reply.
        Arash


        -Original Message-
        From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On
 Behalf Of Mike Taylor
        Sent: 16 May 2012 10:43
        To: CODE4LIB@LISTSERV.ND.EDU
        Subject: Re: [CODE4LIB] WorldCat SRU queries - elimination of
 records without a DDC no from the result set

        There is no standard way in CQL to express field X is not
 empty.
        Depending on implementations, NOT srw.dd= might work (but
 evidently
        doesn't in this case).  Another possibility is srw.dd=*, but
 again
        that may or may not work, and might be appallingly inefficient
 if it
        does.  NOT srw.dd=null will definitely not work: null is not a
        special word in CQL.

        -- Mike.


        On 16 May 2012 10:32, Arash.Joorabchi arash.joorab...@ul.ie
 wrote:
          Hi all,
        
         I am sending SRU queries to the WorldCat in the following
 form:
        
        
                        String host =
         http://worldcat.org/webservices/catalog/search/;;
                    String query = sru?query=srw.kw=\ + keyword +
 \
                                        +  AND srw.ln exact \eng\
                                        +  AND srw.mt all \bks\
                                        +  AND srw.nt=\ + keyword +
 \
                                        + servicelevel=full
                                        + maximumRecords=100
                                      + sortKeys=relevance,,0