Edit your Contents file to look like this:

benitezjm_image06.jpg   bundle:ORIGINAL
benitezjm_thesis.pdf    bundle:ORIGINAL
benitezjm_image04.jpg   bundle:ORIGINAL
benitezjm_image02.jpg   bundle:ORIGINAL
benitezjm_image05.jpg   bundle:ORIGINAL
benitezjm_image01.jpg   bundle:ORIGINAL
benitezjm_image03.jpg   bundle:ORIGINAL

regards
Graeme Fox


-----Original Message-----
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED] On Behalf Of
[EMAIL PROTECTED]
Sent: Tuesday, 8 May 2007 8:28 a.m.
To: [email protected]
Subject: DSpace-tech Digest, Vol 13, Issue 12

Send DSpace-tech mailing list submissions to
        [email protected]

To subscribe or unsubscribe via the World Wide Web, visit
        https://lists.sourceforge.net/lists/listinfo/dspace-tech
or, via email, send a message with subject or body 'help' to
        [EMAIL PROTECTED]

You can reach the person managing the list at
        [EMAIL PROTECTED]

When replying, please edit your Subject line so it is more specific
than "Re: Contents of DSpace-tech digest..."


Today's Topics:

   1. Dspace/SRW question (orlando carvalho)
   2. help with import - bitstream error (susan rector)
   3. Re: help with import - bitstream error (susan rector)
   4. Re: help with import - bitstream error (George Kozak)
   5. Re: help with import - bitstream error (susan rector)
   6. Re: How to Configure a port number in DSpace (Kyle Brentnell)


----------------------------------------------------------------------

Message: 1
Date: Mon, 7 May 2007 15:41:07 +0100
From: "orlando carvalho" <[EMAIL PROTECTED]>
Subject: [Dspace-tech] Dspace/SRW question
To: [email protected]
Message-ID:
        <[EMAIL PROTECTED]>
Content-Type: text/plain; charset="iso-8859-1"

Hi there!
We have a question and we think you can help us!
We need to make searches into Dspace with SRW using the e-mail that a
user
uses when submitting an item but we don't know how to search for items
like
this!
Is it possible?how?

Thanks one more time for your precious help!

-- 
Orlando Carvalho
-------------- next part --------------
An HTML attachment was scrubbed...

------------------------------

Message: 2
Date: Mon, 07 May 2007 11:38:37 -0400
From: susan rector <[EMAIL PROTECTED]>
Subject: [Dspace-tech] help with import - bitstream error
To: [email protected]
Message-ID: <[EMAIL PROTECTED]>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed

Hi all,
I fixed the last error with the ^M - thanks for all your help

Now, I'm running into another strange import error. I run the dspace 
importer in test mode and it works fine and imports 466 distinct 
directories with metadata and files.
Test mode: /dspace/bin/dsrun org.dspace.app.itemimport.ItemImport --test

--add [EMAIL PROTECTED] --collection=5 
--source=/usr/local/archive_directory --mapfile mapfile.txt

When I run the same command just not in test mode, I get the following 
error stating that the contents file isn't there. In fact, it is there 
and has a listing of the files in that directory
Add: /dspace/bin/dsrun org.dspace.app.itemimport.ItemImport --add 
[EMAIL PROTECTED] --collection=5 
--source=/usr/local/archive_directory --mapfile mapfile.txt

I've changed all the file permissions thinking that might be it - didn't

work. It seems to get stuck on the jpeg's.

Thanks for any suggestions!
Susan Rector
[EMAIL PROTECTED]


Error:
 Processing contents file: 
/usr/local/archive_directory/etd-05152006-125030/contents
        Bitstream: benitezjm_image06.jpg
        Bitstream: benitezjm_thesis.pdf
        Bitstream: benitezjm_image04.jpg
        Bitstream: benitezjm_image02.jpg
java.io.IOException: No such file or directory
        at java.io.UnixFileSystem.createFileExclusively(Native Method)
        at java.io.File.createNewFile(File.java:883)
        at 
edu.sdsc.grid.io.local.LocalFile.createNewFile(LocalFile.java:486)
        at 
org.dspace.storage.bitstore.BitstreamStorageManager.store(BitstreamStora
geManager.java:332)
        at org.dspace.content.Bitstream.create(Bitstream.java:184)
        at org.dspace.content.Bundle.createBitstream(Bundle.java:362)
        at 
org.dspace.app.itemimport.ItemImport.processContentFileEntry(ItemImport.
java:1103)
        at 
org.dspace.app.itemimport.ItemImport.processContentsFile(ItemImport.java
:1030)
        at
org.dspace.app.itemimport.ItemImport.addItem(ItemImport.java:631)
        at 
org.dspace.app.itemimport.ItemImport.addItems(ItemImport.java:498)
        at
org.dspace.app.itemimport.ItemImport.main(ItemImport.java:407)
java.io.IOException: No such file or directory



Contents file:
benitezjm_image06.jpg
benitezjm_thesis.pdf
benitezjm_image04.jpg
benitezjm_image02.jpg
benitezjm_image05.jpg
benitezjm_image01.jpg
benitezjm_image03.jpg




------------------------------

Message: 3
Date: Mon, 07 May 2007 11:42:33 -0400
From: susan rector <[EMAIL PROTECTED]>
Subject: Re: [Dspace-tech] help with import - bitstream error
To: susan rector <[EMAIL PROTECTED]>
Cc: [email protected]
Message-ID: <[EMAIL PROTECTED]>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed

A footnote:
It is working now but bombs out every so often - If I just resume the 
add command again, it works.
Thanks,
Susan

susan rector wrote:
> Hi all,
> I fixed the last error with the ^M - thanks for all your help
>
> Now, I'm running into another strange import error. I run the dspace 
> importer in test mode and it works fine and imports 466 distinct 
> directories with metadata and files.
> Test mode: /dspace/bin/dsrun org.dspace.app.itemimport.ItemImport
--test 
> --add [EMAIL PROTECTED] --collection=5 
> --source=/usr/local/archive_directory --mapfile mapfile.txt
>
> When I run the same command just not in test mode, I get the following

> error stating that the contents file isn't there. In fact, it is there

> and has a listing of the files in that directory
> Add: /dspace/bin/dsrun org.dspace.app.itemimport.ItemImport --add 
> [EMAIL PROTECTED] --collection=5 
> --source=/usr/local/archive_directory --mapfile mapfile.txt
>
> I've changed all the file permissions thinking that might be it -
didn't 
> work. It seems to get stuck on the jpeg's.
>
> Thanks for any suggestions!
> Susan Rector
> [EMAIL PROTECTED]
>
>
> Error:
>  Processing contents file: 
> /usr/local/archive_directory/etd-05152006-125030/contents
>         Bitstream: benitezjm_image06.jpg
>         Bitstream: benitezjm_thesis.pdf
>         Bitstream: benitezjm_image04.jpg
>         Bitstream: benitezjm_image02.jpg
> java.io.IOException: No such file or directory
>         at java.io.UnixFileSystem.createFileExclusively(Native Method)
>         at java.io.File.createNewFile(File.java:883)
>         at 
> edu.sdsc.grid.io.local.LocalFile.createNewFile(LocalFile.java:486)
>         at 
>
org.dspace.storage.bitstore.BitstreamStorageManager.store(BitstreamStora
geManager.java:332)
>         at org.dspace.content.Bitstream.create(Bitstream.java:184)
>         at org.dspace.content.Bundle.createBitstream(Bundle.java:362)
>         at 
>
org.dspace.app.itemimport.ItemImport.processContentFileEntry(ItemImport.
java:1103)
>         at 
>
org.dspace.app.itemimport.ItemImport.processContentsFile(ItemImport.java
:1030)
>         at
org.dspace.app.itemimport.ItemImport.addItem(ItemImport.java:631)
>         at 
> org.dspace.app.itemimport.ItemImport.addItems(ItemImport.java:498)
>         at
org.dspace.app.itemimport.ItemImport.main(ItemImport.java:407)
> java.io.IOException: No such file or directory
>
>
>
> Contents file:
> benitezjm_image06.jpg
> benitezjm_thesis.pdf
> benitezjm_image04.jpg
> benitezjm_image02.jpg
> benitezjm_image05.jpg
> benitezjm_image01.jpg
> benitezjm_image03.jpg
>
>
>
------------------------------------------------------------------------
-
> This SF.net email is sponsored by DB2 Express
> Download DB2 Express C - the FREE version of DB2 express and take
> control of your XML. No limits. Just data. Click to get it now.
> http://sourceforge.net/powerbar/db2/
> _______________________________________________
> DSpace-tech mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/dspace-tech
>   




------------------------------

Message: 4
Date: Mon, 07 May 2007 12:25:32 -0400
From: George Kozak <[EMAIL PROTECTED]>
Subject: Re: [Dspace-tech] help with import - bitstream error
To: susan rector <[EMAIL PROTECTED]>, [email protected]
Message-ID:
        <[EMAIL PROTECTED]>
Content-Type: text/plain; charset="us-ascii"; format=flowed

Susan:

I have run into this problem running big batches.  Every so often, the 
batch importer comes up with the error (at least for me).  If I then
batch 
upload with smaller groups of items (without making other changes) 
everything runs fine.

At 11:38 AM 5/7/2007, susan rector wrote:
>Hi all,
>I fixed the last error with the ^M - thanks for all your help
>
>Now, I'm running into another strange import error. I run the dspace
>importer in test mode and it works fine and imports 466 distinct
>directories with metadata and files.
>Test mode: /dspace/bin/dsrun org.dspace.app.itemimport.ItemImport
--test
>--add [EMAIL PROTECTED] --collection=5
>--source=/usr/local/archive_directory --mapfile mapfile.txt
>
>When I run the same command just not in test mode, I get the following
>error stating that the contents file isn't there. In fact, it is there
>and has a listing of the files in that directory
>Add: /dspace/bin/dsrun org.dspace.app.itemimport.ItemImport --add
>[EMAIL PROTECTED] --collection=5
>--source=/usr/local/archive_directory --mapfile mapfile.txt
>
>I've changed all the file permissions thinking that might be it -
didn't
>work. It seems to get stuck on the jpeg's.
>
>Thanks for any suggestions!
>Susan Rector
>[EMAIL PROTECTED]
>
>
>Error:
>  Processing contents file:
>/usr/local/archive_directory/etd-05152006-125030/contents
>         Bitstream: benitezjm_image06.jpg
>         Bitstream: benitezjm_thesis.pdf
>         Bitstream: benitezjm_image04.jpg
>         Bitstream: benitezjm_image02.jpg
>java.io.IOException: No such file or directory
>         at java.io.UnixFileSystem.createFileExclusively(Native Method)
>         at java.io.File.createNewFile(File.java:883)
>         at
>edu.sdsc.grid.io.local.LocalFile.createNewFile(LocalFile.java:486)
>         at
>org.dspace.storage.bitstore.BitstreamStorageManager.store(BitstreamStor
ageManager.java:332)
>         at org.dspace.content.Bitstream.create(Bitstream.java:184)
>         at org.dspace.content.Bundle.createBitstream(Bundle.java:362)
>         at
>org.dspace.app.itemimport.ItemImport.processContentFileEntry(ItemImport
.java:1103)
>         at
>org.dspace.app.itemimport.ItemImport.processContentsFile(ItemImport.jav
a:1030)
>         at
org.dspace.app.itemimport.ItemImport.addItem(ItemImport.java:631)
>         at
>org.dspace.app.itemimport.ItemImport.addItems(ItemImport.java:498)
>         at
org.dspace.app.itemimport.ItemImport.main(ItemImport.java:407)
>java.io.IOException: No such file or directory
>
>
>
>Contents file:
>benitezjm_image06.jpg
>benitezjm_thesis.pdf
>benitezjm_image04.jpg
>benitezjm_image02.jpg
>benitezjm_image05.jpg
>benitezjm_image01.jpg
>benitezjm_image03.jpg
>
>
>-----------------------------------------------------------------------
--
>This SF.net email is sponsored by DB2 Express
>Download DB2 Express C - the FREE version of DB2 express and take
>control of your XML. No limits. Just data. Click to get it now.
>http://sourceforge.net/powerbar/db2/
>_______________________________________________
>DSpace-tech mailing list
>[email protected]
>https://lists.sourceforge.net/lists/listinfo/dspace-tech

***************************
George Kozak
Coordinator
Web Development and Management
Digital Media Group
501 Olin Library
Cornell University
607-255-8924
***************************
[EMAIL PROTECTED] 




------------------------------

Message: 5
Date: Mon, 07 May 2007 12:29:30 -0400
From: susan rector <[EMAIL PROTECTED]>
Subject: Re: [Dspace-tech] help with import - bitstream error
To: George Kozak <[EMAIL PROTECTED]>
Cc: [email protected]
Message-ID: <[EMAIL PROTECTED]>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed

Thanks George,
I just got them all in the system (467). It seemed to bomb out when the 
directory had a lot of images or .mov files. I think for our production 
version, I'm going to just import by year from etd-db so that I can 
avoid the continuous restart of of the program

thanks again,
Susan

George Kozak wrote:
> Susan:
>
> I have run into this problem running big batches.  Every so often, the

> batch importer comes up with the error (at least for me).  If I then 
> batch upload with smaller groups of items (without making other 
> changes) everything runs fine.
>
> At 11:38 AM 5/7/2007, susan rector wrote:
>> Hi all,
>> I fixed the last error with the ^M - thanks for all your help
>>
>> Now, I'm running into another strange import error. I run the dspace
>> importer in test mode and it works fine and imports 466 distinct
>> directories with metadata and files.
>> Test mode: /dspace/bin/dsrun org.dspace.app.itemimport.ItemImport
--test
>> --add [EMAIL PROTECTED] --collection=5
>> --source=/usr/local/archive_directory --mapfile mapfile.txt
>>
>> When I run the same command just not in test mode, I get the
following
>> error stating that the contents file isn't there. In fact, it is
there
>> and has a listing of the files in that directory
>> Add: /dspace/bin/dsrun org.dspace.app.itemimport.ItemImport --add
>> [EMAIL PROTECTED] --collection=5
>> --source=/usr/local/archive_directory --mapfile mapfile.txt
>>
>> I've changed all the file permissions thinking that might be it -
didn't
>> work. It seems to get stuck on the jpeg's.
>>
>> Thanks for any suggestions!
>> Susan Rector
>> [EMAIL PROTECTED]
>>
>>
>> Error:
>>  Processing contents file:
>> /usr/local/archive_directory/etd-05152006-125030/contents
>>         Bitstream: benitezjm_image06.jpg
>>         Bitstream: benitezjm_thesis.pdf
>>         Bitstream: benitezjm_image04.jpg
>>         Bitstream: benitezjm_image02.jpg
>> java.io.IOException: No such file or directory
>>         at java.io.UnixFileSystem.createFileExclusively(Native
Method)
>>         at java.io.File.createNewFile(File.java:883)
>>         at
>> edu.sdsc.grid.io.local.LocalFile.createNewFile(LocalFile.java:486)
>>         at
>>
org.dspace.storage.bitstore.BitstreamStorageManager.store(BitstreamStora
geManager.java:332) 
>>
>>         at org.dspace.content.Bitstream.create(Bitstream.java:184)
>>         at org.dspace.content.Bundle.createBitstream(Bundle.java:362)
>>         at
>>
org.dspace.app.itemimport.ItemImport.processContentFileEntry(ItemImport.
java:1103) 
>>
>>         at
>>
org.dspace.app.itemimport.ItemImport.processContentsFile(ItemImport.java
:1030) 
>>
>>         at 
>> org.dspace.app.itemimport.ItemImport.addItem(ItemImport.java:631)
>>         at
>> org.dspace.app.itemimport.ItemImport.addItems(ItemImport.java:498)
>>         at 
>> org.dspace.app.itemimport.ItemImport.main(ItemImport.java:407)
>> java.io.IOException: No such file or directory
>>
>>
>>
>> Contents file:
>> benitezjm_image06.jpg
>> benitezjm_thesis.pdf
>> benitezjm_image04.jpg
>> benitezjm_image02.jpg
>> benitezjm_image05.jpg
>> benitezjm_image01.jpg
>> benitezjm_image03.jpg
>>
>>
>>
------------------------------------------------------------------------
- 
>>
>> This SF.net email is sponsored by DB2 Express
>> Download DB2 Express C - the FREE version of DB2 express and take
>> control of your XML. No limits. Just data. Click to get it now.
>> http://sourceforge.net/powerbar/db2/
>> _______________________________________________
>> DSpace-tech mailing list
>> [email protected]
>> https://lists.sourceforge.net/lists/listinfo/dspace-tech
>
> ***************************
> George Kozak
> Coordinator
> Web Development and Management
> Digital Media Group
> 501 Olin Library
> Cornell University
> 607-255-8924
> ***************************
> [EMAIL PROTECTED] 




------------------------------

Message: 6
Date: Mon, 07 May 2007 16:28:03 -0400
From: Kyle Brentnell <[EMAIL PROTECTED]>
Subject: Re: [Dspace-tech] How to Configure a port number in DSpace
To: [email protected]
Message-ID: <[EMAIL PROTECTED]>
Content-Type: text/plain; charset="iso-8859-1"

Hi Thriu,

Before I send a reply, please tell me if I understand your question
correctly.

You want DSpace to connect to a mail server using port 587 for SMTP.  Is

that correct?

Thank you,
Kyle


At / ? 07:10 AM 07/05/2007, thirunavukarasu selvam wrote / a ?crit:
>Hello all,
>
>Dspace by default is listening to port 25 always.
>I have configured the mail.server in dspace.cfg to my own mail server.
>I am not allowing my mail server to listen to port 25. ie i am dropping

>the packets coming to that port.
>I am allowing the server to listen to port 587.
>
>So I want to make change in DSpace so that it also sends and listens to

>port 587.
>
>ie How to configure DSpace to listen to port 587. (or to any other port
no.)
>
>Thanks for ur reply to be done.
>
>Thanks and regards,
>Thiru....
>
>
>On 5/4/07, 
><mailto:[EMAIL PROTECTED]>[EMAIL PROTECTED]
ists.sourceforge.net 
><<mailto:[EMAIL PROTECTED]> 
>[EMAIL PROTECTED]> wrote:
>>Send DSpace-tech mailing list submissions to
>> 
>><mailto:[email protected]>[EMAIL PROTECTED]
e.net
>>
>>To subscribe or unsubscribe via the World Wide Web, visit
>>          https://lists.sourceforge.net/lists/listinfo/dspace-tech
>>or, via email, send a message with subject or body 'help' to
>> 
>><mailto:[EMAIL PROTECTED]>dspace-tech-request@
lists.sourceforge.net 
>>
>>
>>You can reach the person managing the list at
>> 
>><mailto:[EMAIL PROTECTED]>[EMAIL PROTECTED]
s.sourceforge.net
>>
>>When replying, please edit your Subject line so it is more specific
>>than "Re: Contents of DSpace-tech digest..."
>>
>>
>>Today's Topics:
>>
>>    1. Re: srb/s3/etc and lucene (Cory Snavely)
>>    2. Re: srb/s3/etc and lucene (Mark Diggory)
>>    3. Re: srb/s3/etc and lucene (Mark H. Wood)
>>    4. Re: srb/s3/etc and lucene (Cory Snavely)
>>    5. Re: srb/s3/etc and lucene (Richard Rodgers)
>>    6. Configuring Port number in DSpace (thirunavukarasu selvam)
>>
>>
>>----------------------------------------------------------------------
>>
>>Message: 1
>>Date: Fri, 04 May 2007 09:07:36 -0400
>>From: Cory Snavely <<mailto:[EMAIL PROTECTED]>[EMAIL PROTECTED]>
>>Subject: Re: [Dspace-tech] srb/s3/etc and lucene
>>To: Mark Diggory < <mailto:[EMAIL PROTECTED]>[EMAIL PROTECTED]>
>>Cc: 
>><mailto:[email protected]>[EMAIL PROTECTED]
e.net, 
>>Richard Rodgers
>>         <<mailto:[EMAIL PROTECTED]> [EMAIL PROTECTED]>
>>Message-ID: 
>><<mailto:[EMAIL PROTECTED]>1178284056.2
[EMAIL PROTECTED]>
>>Content-Type: text/plain
>>
>>Thanks, but when you say assetstore, I'm not sure if you are referring
>>to the object-based storage in all cases. I will assume that you are
>>because of the the parenthetical "(s3)".
>>
>>So, this is what I believe you are saying: When filter-media runs, it
>>extracts text for formats such as PDF that Lucene can't directly
parse,
>>and places, using the object-based storage API those text bitstreams
>>alongside the originals, then again uses the object-based storage API
to
>>fetch the text back out and feed it to Lucene.
>>
>>Consequently, nothing is stored in the filesystem except for the
>>resulting index?
>>
>>Thanks,
>>Cory
>>
>>On Fri, 2007-05-04 at 00:10 -0400, Mark Diggory wrote:
>> > >
>> > > On 5/4/07, Cory Snavely 
>> <<mailto:[EMAIL PROTECTED]>[EMAIL PROTECTED]> wrote:
>> > >         Well, I'm just wondering, in specific terms, if we use an
>> > >         object-based
>> > >         storage system as an assetstore rather than a filesystem,
>> > >         where the
>> > >         files that Lucene indexes actually sit.
>> >
>> >
>> > Its tricky, this is what FilterMedia is for, it actually extracts
the
>> > text and places it as a bitstream in the assetstore. Lucene full
text
>> > indexing is done against the assetstore bitstreams in all cases
(well
>> > accept for the metadata table in the database). So ultimately your
>> > pushing the text bitstreams into the assetstore (s3) in FilterMedia
>> > and pulling it back out on Lucene indexing, a double-whammy.
>> >
>> >
>> > Cheers,
>> > Mark
>> >
>> > >
>> > >         It's my understanding that in a filesystem-based
assetstore,
>> > >         for
>> > >         example, text is extracted from PDFs and stored in a
>> > >         separate file
>> > >         *within the assetstore directory* that Lucene crawls. I
just
>> > >         don't know
>> > >         how that sort of thing is handled when using object-based
>> > >         storage.
>> > >
>> > >         On Thu, 2007-05-03 at 13:28 -0400, Richard Rodgers wrote:
>> > >         > Hi Cory:
>> > >         >
>> > >         > Not sure about the limits of Lucene, but I think the
>> > >         larger point is
>> > >         > that the back-ends are expected only to hold the real
>> > >         content or assets.
>> > >         > Everything else (full-text indices and the like) are
>> > >         *artifacts* (can be
>> > >         > recreated from the assets) that we don't need to manage
in
>> > >         the same way.
>> > >         > If for performance reasons we want to put them where
the
>> > >         assets are we
>> > >         > can, but there is really no connection between the two
>> > >         that the system
>> > >         > imposes.
>> > >         >
>> > >         > Does this get at your question, or did I miss the
point?
>> > >         >
>> > >         > Thanks,
>> > >         >
>> > >         > Richard R
>> > >         >
>> > >         > On Thu, 2007-05-03 at 12:13 -0400, Cory Snavely wrote:
>> > >         > > (Apologies if this has been discussed to resolution;
>> > >         after a few
>> > >         > > attempts to search the archives, I concluded they are
>> > >         really broken. 500
>> > >         > > errors, bad links, etc.)
>> > >         > >
>> > >         > > For those using, interested in, or knowledgeable
about
>> > >         using API-based
>> > >         > > storage (SRB, S3) as a backend for DSpace: how does
>> > >         doing so affect
>> > >         > > full-text indexing? Can anyone describe how, in such
a
>> > >         setup, full text
>> > >         > > is stored and indexed?
>> > >         > >
>> > >         > > My uneducated impression is that Lucene would want to
>> > >         work only against
>> > >         > > a filesystem.
>> > >         > >
>> > >         > > Thanks,
>> > >         > > Cory Snavely
>> > >         > > University of Michigan Library IT Core Services
>> > >         > >
>> > >         > >
>> > >         > >
>> > >         > >
>> > > 
>>
------------------------------------------------------------------------
-
>> > >         > > This SF.net email is sponsored by DB2 Express
>> > >         > > Download DB2 Express C - the FREE version of DB2
express
>> > >         and take
>> > >         > > control of your XML. No limits. Just data. Click to
get
>> > >         it now.
>> > >         > > 
>>
<http://sourceforge.net/powerbar/db2/>http://sourceforge.net/powerbar/db
2/
>> > >         > > _______________________________________________
>> > >         > > DSpace-tech mailing list
>> > >         > > 
>>
<mailto:[email protected]>[EMAIL PROTECTED]
net
>> > >         > >
https://lists.sourceforge.net/lists/listinfo/dspace-tech
>> > >         >
>> > >
>> > >
>> > > 
>>
------------------------------------------------------------------------
-
>> > >         This SF.net email is sponsored by DB2 Express
>> > >         Download DB2 Express C - the FREE version of DB2 express
and
>> > >         take
>> > >         control of your XML. No limits. Just data. Click to get
it
>> > >         now.
>> > > 
>>
<http://sourceforge.net/powerbar/db2/>http://sourceforge.net/powerbar/db
2/
>> > >         _______________________________________________
>> > >         DSpace-tech mailing list
>> > > 
>>
<mailto:[email protected]>[EMAIL PROTECTED]
net
>> > >         https://lists.sourceforge.net/lists/listinfo/dspace-tech
>> > >
>> > > 
>>
------------------------------------------------------------------------
-
>> > > This SF.net email is sponsored by DB2 Express
>> > > Download DB2 Express C - the FREE version of DB2 express and take
>> > > control of your XML. No limits. Just data. Click to get it now.
>> > > 
>>
<http://sourceforge.net/powerbar/db2/___________________________________
____________>http://sourceforge.net/powerbar/db2/_______________________
________________________ 
>>
>> > > DSpace-tech mailing list
>> > > 
>>
<mailto:[email protected]>[EMAIL PROTECTED]
net
>> > > https://lists.sourceforge.net/lists/listinfo/dspace-tech
>> >
>> > ~~~~~~~~~~~~~
>> > Mark R. Diggory - DSpace Systems Manager
>> > MIT Libraries, Systems and Technology Services
>> > Massachusetts Institute of Technology
>> >
>> >
>> >
>>
>>
>>
>>
>>------------------------------
>>
>>Message: 2
>>Date: Fri, 4 May 2007 09:09:10 -0400
>>From: Mark Diggory <<mailto:[EMAIL PROTECTED]>[EMAIL PROTECTED]>
>>Subject: Re: [Dspace-tech] srb/s3/etc and lucene
>>To: DSpace Tech 
>><<mailto:[email protected]>[EMAIL PROTECTED]
ge.net>
>>Message-ID: <<mailto:[EMAIL PROTECTED]> 
>>[EMAIL PROTECTED]>
>>Content-Type: text/plain; charset=US-ASCII; delsp=yes; format=flowed
>>
>>Well, the good news is that DSIndexer is now much more capable in
>>terms of its capability to update the index incrementally and avoid
>>corruption. So adding the "-u" option to index-all would alleviate
>>most of that issue.
>>
>>-Cheers,
>>Mark
>>
>>On May 4, 2007, at 9:03 AM, Richard Rodgers wrote:
>>
>> > Well if by whammy you mean a read access, yes. But my point was
>> > that the Lucene
>> > indexing is done (absent corruption) only once - the exploded text
>> > asset file
>> > is not needed for a Lucene lookup - it consults its own constructed
>> > index file.
>> > So the performance - i.e. routine use of the index for look-ups -
>> > is completely
>> > independent of the asset store.
>> >
>> > If there is a read performance problem with a given store back-end,
>> > that's
>> > surely a concern, but Lucene doesn't add any specially onerous
>> > overhead to it.
>> >
>> > Having said all this, it is true that 'index-alls' are run fairly
>> > cavalierly,
>> > and it it worth noting this dependency.
>> >
>> > Richard
>> >
>> > Quoting Mark Diggory < <mailto:[EMAIL PROTECTED]>[EMAIL PROTECTED]>:
>> >
>> >>>
>> >>> On 5/4/07, Cory Snavely < 
>> <mailto:[EMAIL PROTECTED]>[EMAIL PROTECTED]> wrote:
>> >>> Well, I'm just wondering, in specific terms, if we use an object-
>> >>> based
>> >>> storage system as an assetstore rather than a filesystem, where
the
>> >>> files that Lucene indexes actually sit.
>> >>
>> >> Its tricky, this is what FilterMedia is for, it actually extracts
>> >> the  text and places it as a bitstream in the assetstore. Lucene
>> >> full text  indexing is done against the assetstore bitstreams in
>> >> all cases (well  accept for the metadata table in the database).
>> >> So ultimately your  pushing the text bitstreams into the
>> >> assetstore (s3) in FilterMedia  and pulling it back out on Lucene
>> >> indexing, a double-whammy.
>> >>
>> >> Cheers,
>> >> Mark
>> >>
>> >>>
>> >>> It's my understanding that in a filesystem-based assetstore, for
>> >>> example, text is extracted from PDFs and stored in a separate
file
>> >>> *within the assetstore directory* that Lucene crawls. I just
>> >>> don't  know
>> >>> how that sort of thing is handled when using object-based
storage.
>> >>>
>> >>> On Thu, 2007-05-03 at 13:28 -0400, Richard Rodgers wrote:
>> >>> > Hi Cory:
>> >>> >
>> >>> > Not sure about the limits of Lucene, but I think the larger
>> >>> point is
>> >>> > that the back-ends are expected only to hold the real content
>> >>> or  assets.
>> >>> > Everything else (full-text indices and the like) are
>> >>> *artifacts*  (can be
>> >>> > recreated from the assets) that we don't need to manage in the
>> >>> same way.
>> >>> > If for performance reasons we want to put them where the
>> >>> assets  are we
>> >>> > can, but there is really no connection between the two that
>> >>> the  system
>> >>> > imposes.
>> >>> >
>> >>> > Does this get at your question, or did I miss the point?
>> >>> >
>> >>> > Thanks,
>> >>> >
>> >>> > Richard R
>> >>> >
>> >>> > On Thu, 2007-05-03 at 12:13 -0400, Cory Snavely wrote:
>> >>> > > (Apologies if this has been discussed to resolution; after a
few
>> >>> > > attempts to search the archives, I concluded they are really
>> >>> broken. 500
>> >>> > > errors, bad links, etc.)
>> >>> > >
>> >>> > > For those using, interested in, or knowledgeable about using
>> >>> API-based
>> >>> > > storage (SRB, S3) as a backend for DSpace: how does doing so
>> >>> affect
>> >>> > > full-text indexing? Can anyone describe how, in such a
>> >>> setup,  full text
>> >>> > > is stored and indexed?
>> >>> > >
>> >>> > > My uneducated impression is that Lucene would want to work
>> >>> only  against
>> >>> > > a filesystem.
>> >>> > >
>> >>> > > Thanks,
>> >>> > > Cory Snavely
>> >>> > > University of Michigan Library IT Core Services
>> >>> > >
>> >>> > >
>> >>> > >
>> >>> > >
>> >>>
--------------------------------------------------------------------
>> >>> -- ---
>> >>> > > This SF.net email is sponsored by DB2 Express
>> >>> > > Download DB2 Express C - the FREE version of DB2 express and
>> >>> take
>> >>> > > control of your XML. No limits. Just data. Click to get it
now.
>> >>> > > 
>>
<http://sourceforge.net/powerbar/db2/>http://sourceforge.net/powerbar/db
2/
>> >>> > > _______________________________________________
>> >>> > > DSpace-tech mailing list
>> >>> > > 
>>
<mailto:[email protected]>[EMAIL PROTECTED]
net
>> >>> > > https://lists.sourceforge.net/lists/listinfo/dspace-tech
>> >>> >
>> >>>
>> >>>
>> >>>
--------------------------------------------------------------------
>> >>> -- ---
>> >>> This SF.net email is sponsored by DB2 Express
>> >>> Download DB2 Express C - the FREE version of DB2 express and take
>> >>> control of your XML. No limits. Just data. Click to get it now.
>> >>> 
>>
<http://sourceforge.net/powerbar/db2/>http://sourceforge.net/powerbar/db
2/
>> >>> _______________________________________________
>> >>> DSpace-tech mailing list
>> >>> 
>>
<mailto:[email protected]>[EMAIL PROTECTED]
net
>> >>> https://lists.sourceforge.net/lists/listinfo/dspace-tech
>> >>>
>> >>>
--------------------------------------------------------------------
>> >>> -- ---
>> >>> This SF.net email is sponsored by DB2 Express
>> >>> Download DB2 Express C - the FREE version of DB2 express and take
>> >>> control of your XML. No limits. Just data. Click to get it now.
>> >>> 
>>
<http://sourceforge.net/powerbar/db2/>http://sourceforge.net/powerbar/db
2/
>> >>> _______________________________________________
>> >>> DSpace-tech mailing list
>> >>> 
>>
<mailto:[email protected]>[EMAIL PROTECTED]
net
>> >>> https://lists.sourceforge.net/lists/listinfo/dspace-tech
>> >>
>> >> ~~~~~~~~~~~~~
>> >> Mark R. Diggory - DSpace Systems Manager
>> >> MIT Libraries, Systems and Technology Services
>> >> Massachusetts Institute of Technology
>> >>
>> >>
>> >>
>> >
>> >
>>
>>~~~~~~~~~~~~~
>>Mark R. Diggory - DSpace Systems Manager
>>MIT Libraries, Systems and Technology Services
>>Massachusetts Institute of Technology
>>Office: E25-131
>>Phone: (617) 253-1096
>>
>>
>>
>>
>>
>>------------------------------
>>
>>Message: 3
>>Date: Fri, 4 May 2007 09:23:56 -0400
>>From: "Mark H. Wood" <<mailto:[EMAIL PROTECTED]>[EMAIL PROTECTED]>
>>Subject: Re: [Dspace-tech] srb/s3/etc and lucene
>>To: 
>><mailto:[email protected]>[EMAIL PROTECTED]
e.net
>>Message-ID: 
>><<mailto:[EMAIL PROTECTED]>[EMAIL PROTECTED]
du>
>>Content-Type: text/plain; charset="us-ascii"
>>
>>There are two questions here:
>>
>>1)  Does the use of a non-filesystem asset store backend affect
Lucene's
>>     output?  One would guess, no, since it doesn't do output to the
>>     asset store.
>>
>>2)  Does the use of a non-filesystem asset store backend affect
>>     Lucene's input?  IOW how does Lucene, as used in DSpace, locate
>>     and gain access to the files it indexes?  If it doesn't go
through
>>     the DSpace storage layer or something equivalent then indexing is
>>     screwed.
>>
>>Ouch!  I hadn't thought about these at all.
>>
>>--
>>Mark H. Wood, Lead System 
>>Programmer   <mailto:[EMAIL PROTECTED]>[EMAIL PROTECTED]
>>Typically when a software vendor says that a product is "intuitive" he
>>means the exact opposite.
>>
>>-------------- next part --------------
>>A non-text attachment was scrubbed...
>>Name: not available
>>Type: application/pgp-signature
>>Size: 189 bytes
>>Desc: not available
>>
>>------------------------------
>>
>>Message: 4
>>Date: Fri, 04 May 2007 09:39:41 -0400
>>From: Cory Snavely <<mailto:[EMAIL PROTECTED]>[EMAIL PROTECTED]>
>>Subject: Re: [Dspace-tech] srb/s3/etc and lucene
>>To: "Mark H. Wood" <<mailto:[EMAIL PROTECTED]>[EMAIL PROTECTED]>
>>Cc: 
>><mailto:[email protected]>[EMAIL PROTECTED]
e.net
>>Message-ID:
<<mailto:[EMAIL PROTECTED]> 
>>[EMAIL PROTECTED]>
>>Content-Type: text/plain
>>
>>Right--I am trying to get an understand of all this in very specific
>>terms.
>>
>>On Fri, 2007-05-04 at 09:23 -0400, Mark H. Wood wrote:
>> > There are two questions here:
>> >
>> > 1)  Does the use of a non-filesystem asset store backend affect
Lucene's
>> >     output?  One would guess, no, since it doesn't do output to the
>> >     asset store.
>> >
>> > 2)  Does the use of a non-filesystem asset store backend affect
>> >     Lucene's input?  IOW how does Lucene, as used in DSpace, locate
>> >     and gain access to the files it indexes?  If it doesn't go
through
>> >     the DSpace storage layer or something equivalent then indexing
is
>> >     screwed.
>> >
>> > Ouch!  I hadn't thought about these at all.
>> >
>> >
------------------------------------------------------------------------
-
>> > This SF.net email is sponsored by DB2 Express
>> > Download DB2 Express C - the FREE version of DB2 express and take
>> > control of your XML. No limits. Just data. Click to get it now.
>> >
<http://sourceforge.net/powerbar/db2/>http://sourceforge.net/powerbar/db
2/
>> > _______________________________________________ DSpace-tech mailing

>> list 
>>
<mailto:[email protected]>[EMAIL PROTECTED]
n 
>> et https://lists.sourceforge.net/lists/listinfo/dspace-tech
>>
>>
>>
>>
>>------------------------------
>>
>>Message: 5
>>Date: Fri, 04 May 2007 09:46:39 -0400
>>From: Richard Rodgers <<mailto:[EMAIL PROTECTED]> [EMAIL PROTECTED]>
>>Subject: Re: [Dspace-tech] srb/s3/etc and lucene
>>To: Cory Snavely <<mailto:[EMAIL PROTECTED]>[EMAIL PROTECTED]>
>>Cc: 
>><mailto:[email protected]>[EMAIL PROTECTED]
e.net, 
>>"Mark H. Wood"
>>         <<mailto:[EMAIL PROTECTED]>[EMAIL PROTECTED]>
>>Message-ID: 
>><<mailto:[EMAIL PROTECTED]>2007050409463
[EMAIL PROTECTED]  
>> >
>>Content-Type: text/plain;       charset=ISO-8859-1;
format="flowed"
>>
>>See notes:
>>
>>Quoting Cory Snavely <<mailto:[EMAIL PROTECTED]>[EMAIL PROTECTED]>:
>>
>> > Right--I am trying to get an understand of all this in very
specific
>> > terms.
>> >
>> > On Fri, 2007-05-04 at 09:23 -0400, Mark H. Wood wrote:
>> >> There are two questions here:
>> >>
>> >> 1)  Does the use of a non-filesystem asset store backend affect
Lucene's
>> >>     output?  One would guess, no, since it doesn't do output to
the
>> >>     asset store.
>>Correct - no. Lucene reads the file for indexing through the storage
API - it
>>therefore has a BitStream, not a location on a storage device.
>> >>
>> >> 2)  Does the use of a non-filesystem asset store backend affect
>> >>     Lucene's input?  IOW how does Lucene, as used in DSpace,
locate
>> >>     and gain access to the files it indexes?  If it doesn't go
through
>> >>     the DSpace storage layer or something equivalent then indexing
is
>> >>     screwed.
>>No - for the same reason. It does not circumvent the storage API or
make
>>any assumptions about where the files with the text to index lives
>> >>
>> >> Ouch!  I hadn't thought about these at all.
>> >>
>>Remember, we already support SRB, (a non-local filesystem option), and
>>indexing
>>works fine.
>>
>>Thanks,
>>
>>Richard R
>> >>
------------------------------------------------------------------------
-
>> >> This SF.net email is sponsored by DB2 Express
>> >> Download DB2 Express C - the FREE version of DB2 express and take
>> >> control of your XML. No limits. Just data. Click to get it now.
>> >> 
>>
<http://sourceforge.net/powerbar/db2/>http://sourceforge.net/powerbar/db
2/
>> >> _______________________________________________ DSpace-tech
mailing
>> >> list 
>>
<mailto:[email protected]>[EMAIL PROTECTED]
net
>> >> https://lists.sourceforge.net/lists/listinfo/dspace-tech
>> >
>> >
>> >
------------------------------------------------------------------------
-
>> > This SF.net email is sponsored by DB2 Express
>> > Download DB2 Express C - the FREE version of DB2 express and take
>> > control of your XML. No limits. Just data. Click to get it now.
>> >
<http://sourceforge.net/powerbar/db2/>http://sourceforge.net/powerbar/db
2/
>> > _______________________________________________
>> > DSpace-tech mailing list
>> > 
>>
<mailto:[email protected]>[EMAIL PROTECTED]
net
>> > https://lists.sourceforge.net/lists/listinfo/dspace-tech
>> >
>>
>>
>>
>>
>>
>>------------------------------
>>
>>Message: 6
>>Date: Fri, 4 May 2007 21:44:42 +0530
>>From: "thirunavukarasu selvam" 
>><<mailto:[EMAIL PROTECTED]>[EMAIL PROTECTED] >
>>Subject: [Dspace-tech] Configuring Port number in DSpace
>>To: 
>><mailto:[email protected]>[EMAIL PROTECTED]
e.net
>>Message-ID:
>> 
>><<mailto:[EMAIL PROTECTED]> 
>>[EMAIL PROTECTED]>
>>Content-Type: text/plain; charset="iso-8859-1"
>>
>>Hello all,
>>
>>Dspace by default is listening to port 25 always.
>>I have configured the mail.server in dspace.cfg to my own mail server.
>>I am not allowing my mail server to listen to port 25. ie i am
dropping the
>>packets coming to that port.
>>I am allowing the server to listen to port 587.
>>
>>So I want to make change in DSpace so that it also sends and listens
to port
>>587.
>>
>>ie How to configure DSpace to listen to port 587. (or to any other
port no.)
>>
>>Thanks for ur reply to be done.
>>
>>Thanks and regards,
>>Thiru....
>>-------------- next part --------------
>>An HTML attachment was scrubbed...
>>
>>------------------------------
>>
>>----------------------------------------------------------------------
---
>>This SF.net email is sponsored by DB2 Express
>>Download DB2 Express C - the FREE version of DB2 express and take
>>control of your XML. No limits. Just data. Click to get it now.
>><http://sourceforge.net/powerbar/db2/>http://sourceforge.net/powerbar/
db2/
>>
>>------------------------------
>>
>>_______________________________________________
>>DSpace-tech mailing list
>><mailto:[email protected]>[EMAIL PROTECTED]
e.net
>>https://lists.sourceforge.net/lists/listinfo/dspace-tech
>>
>>
>>End of DSpace-tech Digest, Vol 13, Issue 7
>>******************************************
>
>-----------------------------------------------------------------------
--
>This SF.net email is sponsored by DB2 Express
>Download DB2 Express C - the FREE version of DB2 express and take
>control of your XML. No limits. Just data. Click to get it now.
>http://sourceforge.net/powerbar/db2/
>_______________________________________________
>DSpace-tech mailing list
>[email protected]
>https://lists.sourceforge.net/lists/listinfo/dspace-tech
-------------- next part --------------
An HTML attachment was scrubbed...

------------------------------

------------------------------------------------------------------------
-
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/

------------------------------

_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech


End of DSpace-tech Digest, Vol 13, Issue 12
*******************************************


-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech

Reply via email to