Hi,

I do batch imports by converting our metadata folks' spreadsheets into a
"Simple Archive Format" package.
I built a standalone Java tool for this:
https://wiki.duraspace.org/display/DSPACE/Simple+Archive+Format+Packager

Then we directly import that into DSpace via /dspace/bin/dspace import
https://wiki.duraspace.org/display/DSDOC/System+Administration#SystemAdministration-ImportingItems
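In case the shape of the package is unclear, here is a minimal Python sketch of the idea (not my Java tool). It assumes each spreadsheet row has already been read into a dict, for example with csv.DictReader. The column names ("dc.title", "dc.contributor.author") and the "filename" column are hypothetical conventions chosen for this example, and only the dc schema is handled. Each row becomes one item directory holding a dublin_core.xml and a contents file.

```python
import xml.etree.ElementTree as ET
from pathlib import Path


def write_saf_package(rows, out_dir):
    """Write one Simple Archive Format item directory per metadata row.

    `rows` is a list of dicts. Column names such as "dc.title" and the
    "filename" column are assumptions made for this sketch.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    item_dirs = []
    for n, row in enumerate(rows):
        item_dir = out / f"item_{n:04d}"
        item_dir.mkdir(exist_ok=True)
        root = ET.Element("dublin_core")
        filenames = []
        for column, value in row.items():
            if not isinstance(column, str) or not isinstance(value, str):
                continue  # skip ragged rows (missing or surplus cells)
            value = value.strip()
            if not value:
                continue  # empty cells produce no metadata element
            if column == "filename":
                filenames.append(value)
                continue
            parts = column.split(".")
            if len(parts) < 2 or parts[0] != "dc":
                continue  # this sketch only handles the dc schema
            qualifier = parts[2] if len(parts) > 2 else "none"
            node = ET.SubElement(root, "dcvalue",
                                 element=parts[1], qualifier=qualifier)
            node.text = value
        ET.ElementTree(root).write(str(item_dir / "dublin_core.xml"),
                                   encoding="utf-8", xml_declaration=True)
        # "contents" lists one bitstream filename per line; the files
        # themselves still have to be copied into item_dir separately.
        (item_dir / "contents").write_text(
            "".join(f"{name}\n" for name in filenames), encoding="utf-8")
        item_dirs.append(item_dir)
    return item_dirs
```

The output directory is then what you hand to the import command as its source directory.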
--
Peter Dietz
Systems Developer/Engineer
Ohio State University Libraries



On Thu, Feb 17, 2011 at 9:14 AM, Joseph Rhoads <[email protected]> wrote:

> I’ve used the tools described by Odu and Emilio (the CSV import scripts and
> Batch Metadata Editing, respectively).  Both work well.
>
> If you have bitstreams to import, the CSV import scripts are the best way
> to go.
>
> If you have pure metadata, both work equally well.
>
>
>
> You said you have about 5000 entries in your Excel file, so here are
> a couple of things to be aware of when using the Batch Metadata Editing
> tool:
>
>
>
> 1)      If you are using the GUI, you will be limited in the number of
> entries you can upload at one time.  This is controlled in the dspace.cfg
> file.
>
> a.       The parameter that controls this is “bulkedit.gui-item-limit”.
>
> 2)      I’ve had experiences with the command-line tool where loading
> large collections (12,000 items or so) resulted in an out-of-memory error
> from Java.  After posting on this forum, I found that a general rule of
> thumb is to break the items up into groups of 1000 for importing.
>
> 3)      Even if you use the GUI and increase the bulkedit.gui-item-limit
> number, I think you may still run into the same memory limitation.
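On the groups-of-1000 rule of thumb: a rough Python sketch of splitting one large CSV into smaller files, each repeating the header row so it can be loaded on its own. The function name, the batch_NNN.csv naming and the UTF-8 encoding are assumptions made for this example, and 1000 is just the figure mentioned above, not a hard DSpace limit.

```python
import csv
from pathlib import Path


def split_csv(src, out_dir, batch_size=1000):
    """Split a CSV into files of at most batch_size data rows each.

    Every output file starts with a copy of the original header row.
    Returns the list of files written, in order.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    written = []

    def flush(header, batch):
        # Number the batch files sequentially: batch_001.csv, batch_002.csv, ...
        path = out / f"batch_{len(written) + 1:03d}.csv"
        with open(path, "w", newline="", encoding="utf-8") as fh:
            writer = csv.writer(fh)
            writer.writerow(header)
            writer.writerows(batch)
        written.append(path)

    with open(src, newline="", encoding="utf-8") as fh:
        reader = csv.reader(fh)
        header = next(reader, None)
        if header is None:
            return written  # empty input: nothing to split
        batch = []
        for row in reader:
            batch.append(row)
            if len(batch) == batch_size:
                flush(header, batch)
                batch = []
        if batch:
            flush(header, batch)  # final, possibly shorter batch
    return written
```

Each resulting file can then be imported as a separate run, which keeps the number of items held in memory per run small.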
>
>
>
> This is based on my experience; your mileage may vary.
>
> -Joseph
>
>
> ----------------------------------------------------------------------------
>
> Joseph Rhoads
>
> Digital Curator
>
> Antonio J. Waring, Jr. Archaeology Laboratory
>
>
>
> Department of Anthropology
> University of West Georgia
> 1601 Maple St.
> Carrollton, Ga. 30118
>
>
>
> *From:* Shivraj Thorat [mailto:[email protected]]
> *Sent:* Thursday, February 17, 2011 6:05 AM
> *To:* [email protected]
> *Subject:* [Dspace-tech] Excel to Dspace
>
>
>
> Respected professionals
>
> I am using DSpace 1.7.0 for our IR and I have 5000 records in an Excel
> file to be added to DSpace. Entering them through the entry form would take
> a lot of time, so I need your help in this regard.
>
> --
> Regards
>
>
> Mr. Shivraj Thorat
> Librarian
> BVDU, Institute of Management & Entrepreneurship Development
> Paud Rd, Kothrud,
> Pune-38
> ------------------------------
>
>
>
>
> ------------------------------------------------------------------------------
> The ultimate all-in-one performance toolkit: Intel(R) Parallel Studio XE:
> Pinpoint memory and threading errors before they happen.
> Find and fix more than 250 security defects in the development cycle.
> Locate bottlenecks in serial and parallel code that limit performance.
> http://p.sf.net/sfu/intel-dev2devfeb
> _______________________________________________
> DSpace-tech mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/dspace-tech
>
>
