Hello Details regarding Submission Information Package generation to facilitate DSpace batch import is accessible at http://wiki.dspace.org/index.php/Google_Summer_of_Code_2008_Batch_Import
This is being done as part of DSpace - GSOC2008. Although this has not been tested on dspace 1.5, it looks like it is compatible based on Ilan's comments. Please email me if you encounter issues in using this program. Regards, Jayan -----Original Message----- From: Shomorony, Ilan [mailto:[EMAIL PROTECTED] Sent: Saturday, July 12, 2008 8:45 AM To: #BLOOMA MOHAN JOHN# Cc: Jayan Chirayath Kurian Subject: RE: DSpace Submission Information Package generation Hi Jayan and Blooma, I'm writing to you to let you know that I finally managed to get my DSpace to work properly, and so I started making attempts at importing the metadata! And I have to say that your instructions have been extremely helpful and it all worked very nicely! I think one of the dublin core fields changed name (maybe with DSpace 1.5) so I was getting an error at first, but I fixed it. I'm also making a few changes to the .java file to better suit the needs of our DSpace, but with your template everything became a piece of cake! Also, I followed your suggestion of how to do it for a Linux server, and it worked. I'm now playing around with different metadata fields, and making a bunch of SQL queries to make sure that our data is in a good format to be imported, but that shouldn't be bad. Thank you very much, Ilan ________________________________________ From: #BLOOMA MOHAN JOHN# [EMAIL PROTECTED] Sent: Monday, June 16, 2008 5:51 AM To: Shomorony, Ilan Cc: Jayan Chirayath Kurian Subject: DSpace Submission Information Package generation Hi Ian, Please find attached detailed instruction to prepare the dspace submission information package (Dublin core, contents file and pdf/doc/txt/jpg files) from a metadata encoded excel sheet for subsequent import into dspace. You may use any editor (e.g. JCreator) to compile the attached Java program. Before executing the Java program use the attached excel sheet to act as a data source for metadata. The steps for making Excel as a data source are given below. In C drive, create 3 files (i.e. paper1.pdf, paper2.pdf, paper3.pdf) as specified in the ResourceLocation column of the excel sheet which are the items/records that we are importing into DSpace. The Excel file is named c:\qas.xls and the worksheet is also named as qas (attached). Microsoft's ODBC driver treats the first row in a spreadsheet as the column names and the worksheet name as the database name. (Reference: Java World) To access a spreadsheet with JDBC, you need to create a new ODBC data source. To create one in Windows 2000/XP 1. Go to "Control Panel" 2. Go to "Administrative Tools" 3. Go to "Data Sources" 4. Select "Add" 5. Choose "Microsoft Excel Driver" and "Finish,". Figure 1. Create new data source Then give the "Data Source Name" qalist and select the workbook, shown in Figure 2. Browse and select the excel sheet qas.xls. Figure 2. ODBC Microsoft Excel Setup When you are done you should see your new qalist data source name: Figure 3. New listing of user data sources After this when you compile and execute the Java program, it asks to give a main archive folder name. Give TESTIMPORT as the main archive folder name. After execution, check the TESTIMPORT folder to see the metadata archive folders (TESTIMPORT_1, TESTIMPORT_2, TESTIMPORT_3) generated in DSpace submission information package format. In each folder you will find 3 files one is the Dublin core file, next is the actual resource or the file to be imported and third is a file (contents file) with the name of the record/item to be imported. After this give the Dspace ItemImport command at DSpace\bin> prompt to start importing items into specific DSpace Collection. Please let me know if you encounter problems. As of now it's not compatible with Linux system since it uses MS Excel and its driver on a windows machine. The New York University also had the same platform and since I don't have a Linux installation I suggested the following alternative. Please use a windows machine (ordinary desktop would be enough) to install the program. After executing the program transfer the generated submission information package (folders that contain dubline core.XML, contents file and pdf file) into your Linux machine and then run the ItemImport command. Generating submission information package folder hardly takes few seconds for around 300-400 pdf files. Please feel free to ask if you encounter any errors. Regards, Blooma ========================================== Blooma Mohan John Research Scholar Division of Information Studies School of Communication and Information Nanyang Technological University 31 Nanyang Link Singapore 637718 Phone: (65) 67262428 Email: [EMAIL PROTECTED]<mailto:[EMAIL PROTECTED]> ========================================== ------------------------------------------------------------------------- This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK & win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100&url=/ _______________________________________________ DSpace-tech mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/dspace-tech

