Developers at this weekend's GLAMCamp NYC

http://meta.wikimedia.org/wiki/GLAMcamp_NYC

are developing a data-munging tool, based on pywikipediabot, to aid in 
mass uploads.  They'll be hacking on it in sprints this weekend, 
starting 11am-12:30pm NYC time tomorrow, Saturday the 21st.  Join them 
in person, or in #glamwiki on Freenode.

See notes from today's preliminary session:

http://etherpad.wikimedia.org/GLAMcampNYC-ut

Summary, I believe by Maarten Zeinstra:

> There is a Python library (pywikipediabot) where many of Maarten's 
> bots are derived from[1]. The desired outcome of this session is that 
> we're turning ita library that functions as a black box for uploading 
> data to Wikimedia  Commons. This library will have one external 
> function "put(metadata,  configuration). It also needs to include a 
> function to check for duplicates.
>
> configuration wll be a dictionary that holds the following keys:
> - configurationTemplate, holds URL to configuration template
> - configurationTitleTemplate, holds configuration of Title Template
> - sourceKey, holds the key of the metadata dict that indicate the url 
> of the source
>
> An  extra module will be written to ingest different formats and offer 
> its  metadata as dictionary in key-value format (metadata in put(); ). 
> This module can be GUI-ed. Written as a base class with can be 
> subclassed or extended, to enable different standards.
>
>
> [1] 
> https://fisheye.toolserver.org/browse/multichill/bot/Europeana/Europeana_uploader.py?hb=true

The Etherpad also describes the work they're going to do:

> We are going to make 3 modules
>
> 1. Upload module
> MaartenD is going to make a function
> put(metadata, configuration);
> with metadata as a dict (python for associative array)
> add duplicate checker
>
> 2. Conversion module / interface module
> Make a metadata conversion module that ingests CVSs/OAI-PMH and 
> converts to internal dict format add a GUI that create the 2 dicts 
> necessary (metadata, configuration)
>
> 3. Develop a configuration standard as an array of keys for a dict.
>
> draft of standard:
> configurationTemplate: holds template url
> ConfigurationTitleTemplate

Hope this is a productive weekend and we come out of it with something 
useful!

Sumana Harihareswara
Wikimedia Foundation
Volunteer Development Coordinator

_______________________________________________
Wikitech-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikitech-l

Reply via email to