Hi,

I have been tasked with figuring out a way to automatically add items to
our repository from a variety of web-based resources, and I wanted to see
if you all, with more experience in this, would possibly have a
better/simpler approach.

Many of our data sources come from third-party applications, via APIs or
other downloads. For example, one site has a csv report that is updated
daily. We want to download that and add it to our repository, which will
serve as the reference point for "official" data dumps or reports. We want
it to be completely automated, as limited staffing is often a bottleneck in
these kinds of routine data management processes.

My idea was, via a script with cron, to:

   1. Use wget or cURL (or an API) to download the files to the server
   where DSpace is being hosted
   2. Generate a SAF CSV with the expected metadata (most fields remain the
   same, other than perhaps filename/date).
   3. Run SAFBuilder to create the SAF packages
   4. Run a Dspace import command to add these items to the repository

Has anyone done something like this before, and is there a better way to do
this? Also, I wasn't sure if import would allow for new item version
creation; I don't see it in the command-line reference, but does anyone
know?

Many thanks,
Charlene

-- 
Charlene Barina, MPH
Research Coordinator, US IMPACT Study
The Information School
303-359-6347 | Skype: cbarina
facebook.com/ImpactSurvey | twitter.com/impactsurvey
------------------------------------------------------------------------------
Open source business process management suite built on Java and Eclipse
Turn processes into business applications with Bonita BPM Community Edition
Quickly connect people, data, and systems into organized workflows
Winner of BOSSIE, CODIE, OW2 and Gartner awards
http://p.sf.net/sfu/Bonitasoft
_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

Reply via email to