Hello 

Details on generating Submission Information Packages for DSpace batch
import are available at
http://wiki.dspace.org/index.php/Google_Summer_of_Code_2008_Batch_Import

This is being done as part of DSpace GSoC 2008.

Although this has not been tested on DSpace 1.5, it appears to be
compatible, based on Ilan's comments below. Please email me if you
encounter issues using this program.

Regards,
Jayan

-----Original Message-----
From: Shomorony, Ilan [EMAIL PROTECTED]
Sent: Saturday, July 12, 2008 8:45 AM
To: #BLOOMA MOHAN JOHN#
Cc: Jayan Chirayath Kurian
Subject: RE: DSpace Submission Information Package generation

Hi Jayan and Blooma,

I'm writing to you to let you know that I finally managed to get my
DSpace to work properly, and so I started making attempts at importing
the metadata!
And I have to say that your instructions have been extremely helpful and
it all worked very nicely! I think one of the Dublin Core fields changed
name (maybe with DSpace 1.5), so I was getting an error at first, but I
fixed it. I'm also making a few changes to the .java file to better suit
the needs of our DSpace, but with your template everything became a
piece of cake!
Also, I followed your suggestion of how to do it for a Linux server, and
it worked.
I'm now playing around with different metadata fields, and making a
bunch of SQL queries to make sure that our data is in a good format to
be imported, but that shouldn't be bad.

Thank you very much,

Ilan

________________________________________
From: #BLOOMA MOHAN JOHN# [EMAIL PROTECTED]
Sent: Monday, June 16, 2008 5:51 AM
To: Shomorony, Ilan
Cc: Jayan Chirayath Kurian
Subject: DSpace Submission Information Package generation

Hi Ilan,

Please find attached detailed instructions for preparing the DSpace
submission information package (Dublin Core file, contents file and
pdf/doc/txt/jpg files) from a metadata-encoded Excel sheet for
subsequent import into DSpace.

You may use any editor (e.g. JCreator) to compile the attached Java
program. Before running the program, set up the attached Excel sheet as
the data source for the metadata; the steps are given below. Also create
three files on the C: drive (paper1.pdf, paper2.pdf, paper3.pdf), as
specified in the ResourceLocation column of the Excel sheet. These are
the items/records that we are importing into DSpace.

The Excel file is named c:\qas.xls, and its worksheet is also named qas
(attached). Microsoft's ODBC driver treats the first row in a
spreadsheet as the column names and the worksheet name as the table
name. (Reference: Java World)
To access a spreadsheet with JDBC, you need to create a new ODBC data
source.

To create one in Windows 2000/XP:
1. Go to "Control Panel".
2. Go to "Administrative Tools".
3. Go to "Data Sources (ODBC)".
4. Select "Add".
5. Choose "Microsoft Excel Driver" and click "Finish".

Figure 1. Create new data source

Then enter qalist as the "Data Source Name" and select the workbook, as
shown in Figure 2: browse to and select the Excel file qas.xls.

Figure 2. ODBC Microsoft Excel Setup

When you are done, you should see your new qalist data source name
listed:

Figure 3. New listing of user data sources
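
For readers who want to see what reading the sheet through the qalist
data source might look like, here is a minimal sketch. It is not the
attached program. The class and method names are made up for
illustration; only the data source name (qalist), the worksheet name
(qas) and the ResourceLocation column come from the message above. It
assumes the JDBC-ODBC bridge, which shipped with Java up to version 7
and was removed in Java 8, so it only connects on a Windows machine with
an older JVM.

```java
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.ResultSet;
import java.sql.SQLException;
import java.sql.Statement;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not the attached program.
class ExcelOdbcSketch {

    // The Excel ODBC driver exposes a worksheet as a table named [sheet$].
    static String sheetQuery(String sheetName) {
        return "SELECT * FROM [" + sheetName + "$]";
    }

    // Reads one column from every data row of the worksheet.
    // The first spreadsheet row supplies the column names.
    static List<String> readColumn(String dsn, String sheetName, String column)
            throws SQLException {
        List<String> values = new ArrayList<>();
        try (Connection con = DriverManager.getConnection("jdbc:odbc:" + dsn);
             Statement st = con.createStatement();
             ResultSet rs = st.executeQuery(sheetQuery(sheetName))) {
            while (rs.next()) {
                values.add(rs.getString(column));
            }
        }
        return values;
    }

    public static void main(String[] args) throws Exception {
        // Assumption: the JDBC-ODBC bridge driver is available (Java 7 or earlier).
        Class.forName("sun.jdbc.odbc.JdbcOdbcDriver");
        for (String path : readColumn("qalist", "qas", "ResourceLocation")) {
            System.out.println(path);
        }
    }
}
```

On a newer JVM the same idea would need a different way of reading the
spreadsheet, since the bridge driver is no longer present.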

After this, when you compile and run the Java program, it asks for the
name of the main archive folder. Enter TESTIMPORT. After execution,
check the TESTIMPORT folder for the metadata archive folders
(TESTIMPORT_1, TESTIMPORT_2, TESTIMPORT_3) generated in DSpace
submission information package format. Each folder contains three
files: the Dublin Core file, the actual resource (the file to be
imported), and a contents file giving the name of the record/item to be
imported. Then run the DSpace ItemImport command at the DSpace\bin>
prompt to import the items into a specific DSpace collection. Please let
me know if you encounter problems.
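
To make the folder layout concrete, here is a rough sketch of how one
item folder could be written. It is not the attached program. The class
and method names, and the choice of title and author as the only
metadata fields, are assumptions for illustration; the folder naming
(TESTIMPORT_1 and so on) follows the description above, and
dublin_core.xml and contents are the file names the DSpace item importer
expects.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.nio.file.StandardCopyOption;

// Hypothetical helper, not the attached program.
class SipItemSketch {

    // Escapes the characters that cannot appear verbatim in XML text.
    static String escapeXml(String s) {
        return s.replace("&", "&amp;").replace("<", "&lt;").replace(">", "&gt;");
    }

    // Builds a dublin_core.xml body with two illustrative fields.
    static String dublinCoreXml(String title, String author) {
        return "<?xml version=\"1.0\" encoding=\"UTF-8\"?>\n"
             + "<dublin_core>\n"
             + "  <dcvalue element=\"title\" qualifier=\"none\">"
             + escapeXml(title) + "</dcvalue>\n"
             + "  <dcvalue element=\"contributor\" qualifier=\"author\">"
             + escapeXml(author) + "</dcvalue>\n"
             + "</dublin_core>\n";
    }

    // Creates <archive>/<archive>_<n> containing dublin_core.xml,
    // the contents file and a copy of the resource.
    static Path writeItem(Path archive, int n, String title, String author,
                          Path resource) throws IOException {
        Path itemDir = archive.resolve(archive.getFileName() + "_" + n);
        Files.createDirectories(itemDir);
        String fileName = resource.getFileName().toString();
        Files.copy(resource, itemDir.resolve(fileName),
                StandardCopyOption.REPLACE_EXISTING);
        Files.write(itemDir.resolve("dublin_core.xml"),
                dublinCoreXml(title, author).getBytes(StandardCharsets.UTF_8));
        // The contents file lists the file name(s) belonging to the item.
        Files.write(itemDir.resolve("contents"),
                (fileName + "\n").getBytes(StandardCharsets.UTF_8));
        return itemDir;
    }

    public static void main(String[] args) throws IOException {
        // Placeholder metadata; the real values would come from the spreadsheet.
        Path resource = Paths.get(args.length > 0 ? args[0] : "paper1.pdf");
        Path item = writeItem(Paths.get("TESTIMPORT"), 1,
                "Sample title", "Sample, Author", resource);
        System.out.println("Wrote " + item);
    }
}
```

Calling writeItem once per spreadsheet row, with n counting up from 1,
would give the TESTIMPORT_1, TESTIMPORT_2, ... layout described above.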

As of now, the program is not compatible with Linux, since it relies on
MS Excel and its ODBC driver on a Windows machine. New York University
had the same platform, and since I don't have a Linux installation, I
suggested the following alternative. Please use a Windows machine (an
ordinary desktop is enough) to install and run the program. After
running it, transfer the generated submission information package (the
folders that contain dublin_core.xml, the contents file and the pdf
file) to your Linux machine and then run the ItemImport command there.
Generating the submission information package folders takes only a few
seconds for around 300-400 pdf files.
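
Before copying the folders to the Linux machine, it may be worth
checking that each item folder is complete. The sketch below is one
possible check and is not part of the attached program. The class and
method names are made up for illustration. It assumes each item folder
should hold dublin_core.xml, a contents file, and every file named in
contents; it reads only the text before the first tab on each line, in
case the contents file carries extra tab-separated options.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not the attached program.
class SipCheckSketch {

    // Returns the problems found; an empty list means every item folder
    // under the archive folder looks complete.
    static List<String> check(Path archive) throws IOException {
        List<String> problems = new ArrayList<>();
        try (DirectoryStream<Path> items = Files.newDirectoryStream(archive)) {
            for (Path item : items) {
                if (!Files.isDirectory(item)) {
                    continue; // only item folders are inspected
                }
                if (!Files.isRegularFile(item.resolve("dublin_core.xml"))) {
                    problems.add(item.getFileName() + ": missing dublin_core.xml");
                }
                Path contents = item.resolve("contents");
                if (!Files.isRegularFile(contents)) {
                    problems.add(item.getFileName() + ": missing contents file");
                    continue;
                }
                for (String line : Files.readAllLines(contents, StandardCharsets.UTF_8)) {
                    // Keep only the file name, dropping any tab-separated options.
                    String name = line.split("\t", 2)[0].trim();
                    if (!name.isEmpty() && !Files.isRegularFile(item.resolve(name))) {
                        problems.add(item.getFileName() + ": missing file " + name);
                    }
                }
            }
        }
        return problems;
    }

    public static void main(String[] args) throws IOException {
        Path archive = Paths.get(args.length > 0 ? args[0] : "TESTIMPORT");
        List<String> problems = check(archive);
        if (problems.isEmpty()) {
            System.out.println("All item folders look complete.");
        } else {
            for (String p : problems) {
                System.out.println(p);
            }
        }
    }
}
```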

Please feel free to ask if you encounter any errors.

Regards,
Blooma


==========================================
Blooma Mohan John
Research Scholar
Division of Information Studies
School of Communication and Information
Nanyang Technological University
31 Nanyang Link
Singapore 637718

Phone: (65) 67262428
Email: [EMAIL PROTECTED]
==========================================

_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech