Edit your Contents file to look like this:

benitezjm_image06.jpg bundle:ORIGINAL
benitezjm_thesis.pdf bundle:ORIGINAL
benitezjm_image04.jpg bundle:ORIGINAL
benitezjm_image02.jpg bundle:ORIGINAL
benitezjm_image05.jpg bundle:ORIGINAL
benitezjm_image01.jpg bundle:ORIGINAL
benitezjm_image03.jpg bundle:ORIGINAL
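
With several hundred item directories, editing each contents file by hand gets tedious, so here is a rough Python sketch of the same edit. It rests on some assumptions: the function name add_bundle_to_contents is made up for illustration, the filename and the bundle are assumed to be separated by a tab (as far as I know that is what the DSpace Simple Archive Format expects; the listing above may have lost its tabs in the mail), and every entry without a bundle is assumed to belong in ORIGINAL. It is not part of DSpace, so try it on a copy first.

```python
from pathlib import Path


def add_bundle_to_contents(contents_path, bundle="ORIGINAL"):
    """Append '<TAB>bundle:<bundle>' to each entry that names no bundle yet.

    Blank lines are dropped and surrounding whitespace, including stray
    carriage returns (^M), is stripped from every entry.
    Returns the list of lines written back to the file.
    """
    path = Path(contents_path)
    updated = []
    for raw in path.read_text().splitlines():
        line = raw.strip()
        if not line:
            continue  # skip empty lines
        if "bundle:" not in line:
            # Assumption: a tab separates the filename from the bundle.
            line = f"{line}\tbundle:{bundle}"
        updated.append(line)
    # Write Unix line endings regardless of the platform.
    with open(path, "w", newline="\n") as handle:
        handle.write("\n".join(updated) + "\n")
    return updated
```

To cover a whole archive you could call it once per item directory, for example on every file matched by Path("/usr/local/archive_directory").glob("*/contents").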

regards
Graeme Fox

-----Original Message-----
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]] On Behalf Of [EMAIL PROTECTED]
Sent: Tuesday, 8 May 2007 8:28 a.m.
To: [email protected]
Subject: DSpace-tech Digest, Vol 13, Issue 12

Send DSpace-tech mailing list submissions to
        [email protected]

To subscribe or unsubscribe via the World Wide Web, visit
        https://lists.sourceforge.net/lists/listinfo/dspace-tech
or, via email, send a message with subject or body 'help' to
        [EMAIL PROTECTED]

You can reach the person managing the list at
        [EMAIL PROTECTED]

When replying, please edit your Subject line so it is more specific
than "Re: Contents of DSpace-tech digest..."

Today's Topics:

   1. Dspace/SRW question (orlando carvalho)
   2. help with import - bitstream error (susan rector)
   3. Re: help with import - bitstream error (susan rector)
   4. Re: help with import - bitstream error (George Kozak)
   5. Re: help with import - bitstream error (susan rector)
   6. Re: How to Configure a port number in DSpace (Kyle Brentnell)

----------------------------------------------------------------------

Message: 1
Date: Mon, 7 May 2007 15:41:07 +0100
From: "orlando carvalho" <[EMAIL PROTECTED]>
Subject: [Dspace-tech] Dspace/SRW question
To: [email protected]
Message-ID: <[EMAIL PROTECTED]>
Content-Type: text/plain; charset="iso-8859-1"

Hi there!

We have a question and we think you can help us!
We need to make searches into Dspace with SRW using the e-mail that a user
uses when submitting an item but we don't know how to search for items
like this! Is it possible? How?

Thanks one more time for your precious help!

--
Orlando Carvalho

------------------------------

Message: 2
Date: Mon, 07 May 2007 11:38:37 -0400
From: susan rector <[EMAIL PROTECTED]>
Subject: [Dspace-tech] help with import - bitstream error
To: [email protected]
Message-ID: <[EMAIL PROTECTED]>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed

Hi all,
I fixed the last error with the ^M - thanks for all your help

Now, I'm running into another strange import error. I run the dspace
importer in test mode and it works fine and imports 466 distinct
directories with metadata and files.
Test mode: /dspace/bin/dsrun org.dspace.app.itemimport.ItemImport --test --add [EMAIL PROTECTED] --collection=5 --source=/usr/local/archive_directory --mapfile mapfile.txt

When I run the same command just not in test mode, I get the following
error stating that the contents file isn't there. In fact, it is there
and has a listing of the files in that directory
Add: /dspace/bin/dsrun org.dspace.app.itemimport.ItemImport --add [EMAIL PROTECTED] --collection=5 --source=/usr/local/archive_directory --mapfile mapfile.txt

I've changed all the file permissions thinking that might be it - didn't
work. It seems to get stuck on the jpeg's.

Thanks for any suggestions!
Susan Rector
[EMAIL PROTECTED]


Error:
        Processing contents file: /usr/local/archive_directory/etd-05152006-125030/contents
        Bitstream: benitezjm_image06.jpg
        Bitstream: benitezjm_thesis.pdf
        Bitstream: benitezjm_image04.jpg
        Bitstream: benitezjm_image02.jpg
java.io.IOException: No such file or directory
        at java.io.UnixFileSystem.createFileExclusively(Native Method)
        at java.io.File.createNewFile(File.java:883)
        at edu.sdsc.grid.io.local.LocalFile.createNewFile(LocalFile.java:486)
        at org.dspace.storage.bitstore.BitstreamStorageManager.store(BitstreamStorageManager.java:332)
        at org.dspace.content.Bitstream.create(Bitstream.java:184)
        at org.dspace.content.Bundle.createBitstream(Bundle.java:362)
        at org.dspace.app.itemimport.ItemImport.processContentFileEntry(ItemImport.java:1103)
        at org.dspace.app.itemimport.ItemImport.processContentsFile(ItemImport.java:1030)
        at org.dspace.app.itemimport.ItemImport.addItem(ItemImport.java:631)
        at org.dspace.app.itemimport.ItemImport.addItems(ItemImport.java:498)
        at org.dspace.app.itemimport.ItemImport.main(ItemImport.java:407)
java.io.IOException: No such file or directory


Contents file:
benitezjm_image06.jpg
benitezjm_thesis.pdf
benitezjm_image04.jpg
benitezjm_image02.jpg
benitezjm_image05.jpg
benitezjm_image01.jpg
benitezjm_image03.jpg

------------------------------

Message: 3
Date: Mon, 07 May 2007 11:42:33 -0400
From: susan rector <[EMAIL PROTECTED]>
Subject: Re: [Dspace-tech] help with import - bitstream error
To: susan rector <[EMAIL PROTECTED]>
Cc: [email protected]
Message-ID: <[EMAIL PROTECTED]>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed

A footnote: It is working now but bombs out every so often - If I just
resume the add command again, it works.

Thanks,
Susan

------------------------------

Message: 4
Date: Mon, 07 May 2007 12:25:32 -0400
From: George Kozak <[EMAIL PROTECTED]>
Subject: Re: [Dspace-tech] help with import - bitstream error
To: susan rector <[EMAIL PROTECTED]>, [email protected]
Message-ID: <[EMAIL PROTECTED]>
Content-Type: text/plain; charset="us-ascii"; format=flowed

Susan:

I have run into this problem running big batches. Every so often, the
batch importer comes up with the error (at least for me). If I then
batch upload with smaller groups of items (without making other
changes) everything runs fine.

***************************
George Kozak
Coordinator
Web Development and Management
Digital Media Group
501 Olin Library
Cornell University
607-255-8924
***************************
[EMAIL PROTECTED]

------------------------------

Message: 5
Date: Mon, 07 May 2007 12:29:30 -0400
From: susan rector <[EMAIL PROTECTED]>
Subject: Re: [Dspace-tech] help with import - bitstream error
To: George Kozak <[EMAIL PROTECTED]>
Cc: [email protected]
Message-ID: <[EMAIL PROTECTED]>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed

Thanks George,
I just got them all in the system (467). It seemed to bomb out when the
directory had a lot of images or .mov files. I think for our production
version, I'm going to just import by year from etd-db so that I can
avoid the continuous restart of the program

thanks again,
Susan

------------------------------

Message: 6
Date: Mon, 07 May 2007 16:28:03 -0400
From: Kyle Brentnell <[EMAIL PROTECTED]>
Subject: Re: [Dspace-tech] How to Configure a port number in DSpace
To: [email protected]
Message-ID: <[EMAIL PROTECTED]>
Content-Type: text/plain; charset="iso-8859-1"

Hi Thiru,

Before I send a reply, please tell me if I understand your question
correctly.

You want DSpace to connect to a mail server using port 587 for SMTP. Is
that correct?

Thank you,
Kyle

At / À 07:10 AM 07/05/2007, thirunavukarasu selvam wrote / a écrit:
>Hello all,
>
>Dspace by default is listening to port 25 always.
>I have configured the mail.server in dspace.cfg to my own mail server.
>I am not allowing my mail server to listen to port 25. ie i am dropping >the packets coming to that port. >I am allowing the server to listen to port 587. > >So I want to make change in DSpace so that it also sends and listens to >port 587. > >ie How to configure DSpace to listen to port 587. (or to any other port no.) > >Thanks for ur reply to be done. > >Thanks and regards, >Thiru.... > > >On 5/4/07, ><mailto:[EMAIL PROTECTED]>[EMAIL PROTECTED] ists.sourceforge.net ><<mailto:[EMAIL PROTECTED]> >[EMAIL PROTECTED]> wrote: >>Send DSpace-tech mailing list submissions to >> >><mailto:[email protected]>[EMAIL PROTECTED] e.net >> >>To subscribe or unsubscribe via the World Wide Web, visit >> https://lists.sourceforge.net/lists/listinfo/dspace-tech >>or, via email, send a message with subject or body 'help' to >> >><mailto:[EMAIL PROTECTED]>dspace-tech-request@ lists.sourceforge.net >> >> >>You can reach the person managing the list at >> >><mailto:[EMAIL PROTECTED]>[EMAIL PROTECTED] s.sourceforge.net >> >>When replying, please edit your Subject line so it is more specific >>than "Re: Contents of DSpace-tech digest..." >> >> >>Today's Topics: >> >> 1. Re: srb/s3/etc and lucene (Cory Snavely) >> 2. Re: srb/s3/etc and lucene (Mark Diggory) >> 3. Re: srb/s3/etc and lucene (Mark H. Wood) >> 4. Re: srb/s3/etc and lucene (Cory Snavely) >> 5. Re: srb/s3/etc and lucene (Richard Rodgers) >> 6. 
Configuring Port number in DSpace (thirunavukarasu selvam) >> >> >>---------------------------------------------------------------------- >> >>Message: 1 >>Date: Fri, 04 May 2007 09:07:36 -0400 >>From: Cory Snavely <<mailto:[EMAIL PROTECTED]>[EMAIL PROTECTED]> >>Subject: Re: [Dspace-tech] srb/s3/etc and lucene >>To: Mark Diggory < <mailto:[EMAIL PROTECTED]>[EMAIL PROTECTED]> >>Cc: >><mailto:[email protected]>[EMAIL PROTECTED] e.net, >>Richard Rodgers >> <<mailto:[EMAIL PROTECTED]> [EMAIL PROTECTED]> >>Message-ID: >><<mailto:[EMAIL PROTECTED]>1178284056.2 [EMAIL PROTECTED]> >>Content-Type: text/plain >> >>Thanks, but when you say assetstore, I'm not sure if you are referring >>to the object-based storage in all cases. I will assume that you are >>because of the the parenthetical "(s3)". >> >>So, this is what I believe you are saying: When filter-media runs, it >>extracts text for formats such as PDF that Lucene can't directly parse, >>and places, using the object-based storage API those text bitstreams >>alongside the originals, then again uses the object-based storage API to >>fetch the text back out and feed it to Lucene. >> >>Consequently, nothing is stored in the filesystem except for the >>resulting index? >> >>Thanks, >>Cory >> >>On Fri, 2007-05-04 at 00:10 -0400, Mark Diggory wrote: >> > > >> > > On 5/4/07, Cory Snavely >> <<mailto:[EMAIL PROTECTED]>[EMAIL PROTECTED]> wrote: >> > > Well, I'm just wondering, in specific terms, if we use an >> > > object-based >> > > storage system as an assetstore rather than a filesystem, >> > > where the >> > > files that Lucene indexes actually sit. >> > >> > >> > Its tricky, this is what FilterMedia is for, it actually extracts the >> > text and places it as a bitstream in the assetstore. Lucene full text >> > indexing is done against the assetstore bitstreams in all cases (well >> > accept for the metadata table in the database). 
So ultimately your >> > pushing the text bitstreams into the assetstore (s3) in FilterMedia >> > and pulling it back out on Lucene indexing, a double-whammy. >> > >> > >> > Cheers, >> > Mark >> > >> > > >> > > It's my understanding that in a filesystem-based assetstore, >> > > for >> > > example, text is extracted from PDFs and stored in a >> > > separate file >> > > *within the assetstore directory* that Lucene crawls. I just >> > > don't know >> > > how that sort of thing is handled when using object-based >> > > storage. >> > > >> > > On Thu, 2007-05-03 at 13:28 -0400, Richard Rodgers wrote: >> > > > Hi Cory: >> > > > >> > > > Not sure about the limits of Lucene, but I think the >> > > larger point is >> > > > that the back-ends are expected only to hold the real >> > > content or assets. >> > > > Everything else (full-text indices and the like) are >> > > *artifacts* (can be >> > > > recreated from the assets) that we don't need to manage in >> > > the same way. >> > > > If for performance reasons we want to put them where the >> > > assets are we >> > > > can, but there is really no connection between the two >> > > that the system >> > > > imposes. >> > > > >> > > > Does this get at your question, or did I miss the point? >> > > > >> > > > Thanks, >> > > > >> > > > Richard R >> > > > >> > > > On Thu, 2007-05-03 at 12:13 -0400, Cory Snavely wrote: >> > > > > (Apologies if this has been discussed to resolution; >> > > after a few >> > > > > attempts to search the archives, I concluded they are >> > > really broken. 500 >> > > > > errors, bad links, etc.) >> > > > > >> > > > > For those using, interested in, or knowledgeable about >> > > using API-based >> > > > > storage (SRB, S3) as a backend for DSpace: how does >> > > doing so affect >> > > > > full-text indexing? Can anyone describe how, in such a >> > > setup, full text >> > > > > is stored and indexed? 
>> > > > > >> > > > > My uneducated impression is that Lucene would want to >> > > work only against >> > > > > a filesystem. >> > > > > >> > > > > Thanks, >> > > > > Cory Snavely >> > > > > University of Michigan Library IT Core Services >> > > > > >> > > > > >> > > > > >> > > > > >> > > >> ------------------------------------------------------------------------ - >> > > > > This SF.net email is sponsored by DB2 Express >> > > > > Download DB2 Express C - the FREE version of DB2 express >> > > and take >> > > > > control of your XML. No limits. Just data. Click to get >> > > it now. >> > > > > >> <http://sourceforge.net/powerbar/db2/>http://sourceforge.net/powerbar/db 2/ >> > > > > _______________________________________________ >> > > > > DSpace-tech mailing list >> > > > > >> <mailto:[email protected]>[EMAIL PROTECTED] net >> > > > > https://lists.sourceforge.net/lists/listinfo/dspace-tech >> > > > >> > > >> > > >> > > >> ------------------------------------------------------------------------ - >> > > This SF.net email is sponsored by DB2 Express >> > > Download DB2 Express C - the FREE version of DB2 express and >> > > take >> > > control of your XML. No limits. Just data. Click to get it >> > > now. >> > > >> <http://sourceforge.net/powerbar/db2/>http://sourceforge.net/powerbar/db 2/ >> > > _______________________________________________ >> > > DSpace-tech mailing list >> > > >> <mailto:[email protected]>[EMAIL PROTECTED] net >> > > https://lists.sourceforge.net/lists/listinfo/dspace-tech >> > > >> > > >> ------------------------------------------------------------------------ - >> > > This SF.net email is sponsored by DB2 Express >> > > Download DB2 Express C - the FREE version of DB2 express and take >> > > control of your XML. No limits. Just data. Click to get it now. 
>> > > >> <http://sourceforge.net/powerbar/db2/___________________________________ ____________>http://sourceforge.net/powerbar/db2/_______________________ ________________________ >> >> > > DSpace-tech mailing list >> > > >> <mailto:[email protected]>[EMAIL PROTECTED] net >> > > https://lists.sourceforge.net/lists/listinfo/dspace-tech >> > >> > ~~~~~~~~~~~~~ >> > Mark R. Diggory - DSpace Systems Manager >> > MIT Libraries, Systems and Technology Services >> > Massachusetts Institute of Technology >> > >> > >> > >> >> >> >> >>------------------------------ >> >>Message: 2 >>Date: Fri, 4 May 2007 09:09:10 -0400 >>From: Mark Diggory <<mailto:[EMAIL PROTECTED]>[EMAIL PROTECTED]> >>Subject: Re: [Dspace-tech] srb/s3/etc and lucene >>To: DSpace Tech >><<mailto:[email protected]>[EMAIL PROTECTED] ge.net> >>Message-ID: <<mailto:[EMAIL PROTECTED]> >>[EMAIL PROTECTED]> >>Content-Type: text/plain; charset=US-ASCII; delsp=yes; format=flowed >> >>Well, the good news is that DSIndexer is now much more capable in >>terms of its capability to update the index incrementally and avoid >>corruption. So adding the "-u" option to index-all would alleviate >>most of that issue. >> >>-Cheers, >>Mark >> >>On May 4, 2007, at 9:03 AM, Richard Rodgers wrote: >> >> > Well if by whammy you mean a read access, yes. But my point was >> > that the Lucene >> > indexing is done (absent corruption) only once - the exploded text >> > asset file >> > is not needed for a Lucene lookup - it consults its own constructed >> > index file. >> > So the performance - i.e. routine use of the index for look-ups - >> > is completely >> > independent of the asset store. >> > >> > If there is a read performance problem with a given store back-end, >> > that's >> > surely a concern, but Lucene doesn't add any specially onerous >> > overhead to it. >> > >> > Having said all this, it is true that 'index-alls' are run fairly >> > cavalierly, >> > and it it worth noting this dependency. 
>> > >> > Richard >> > >> > Quoting Mark Diggory < <mailto:[EMAIL PROTECTED]>[EMAIL PROTECTED]>: >> > >> >>> >> >>> On 5/4/07, Cory Snavely < >> <mailto:[EMAIL PROTECTED]>[EMAIL PROTECTED]> wrote: >> >>> Well, I'm just wondering, in specific terms, if we use an object- >> >>> based >> >>> storage system as an assetstore rather than a filesystem, where the >> >>> files that Lucene indexes actually sit. >> >> >> >> Its tricky, this is what FilterMedia is for, it actually extracts >> >> the text and places it as a bitstream in the assetstore. Lucene >> >> full text indexing is done against the assetstore bitstreams in >> >> all cases (well accept for the metadata table in the database). >> >> So ultimately your pushing the text bitstreams into the >> >> assetstore (s3) in FilterMedia and pulling it back out on Lucene >> >> indexing, a double-whammy. >> >> >> >> Cheers, >> >> Mark >> >> >> >>> >> >>> It's my understanding that in a filesystem-based assetstore, for >> >>> example, text is extracted from PDFs and stored in a separate file >> >>> *within the assetstore directory* that Lucene crawls. I just >> >>> don't know >> >>> how that sort of thing is handled when using object-based storage. >> >>> >> >>> On Thu, 2007-05-03 at 13:28 -0400, Richard Rodgers wrote: >> >>> > Hi Cory: >> >>> > >> >>> > Not sure about the limits of Lucene, but I think the larger >> >>> point is >> >>> > that the back-ends are expected only to hold the real content >> >>> or assets. >> >>> > Everything else (full-text indices and the like) are >> >>> *artifacts* (can be >> >>> > recreated from the assets) that we don't need to manage in the >> >>> same way. >> >>> > If for performance reasons we want to put them where the >> >>> assets are we >> >>> > can, but there is really no connection between the two that >> >>> the system >> >>> > imposes. >> >>> > >> >>> > Does this get at your question, or did I miss the point? 
>> >>> > >> >>> > Thanks, >> >>> > >> >>> > Richard R >> >>> > >> >>> > On Thu, 2007-05-03 at 12:13 -0400, Cory Snavely wrote: >> >>> > > (Apologies if this has been discussed to resolution; after a few >> >>> > > attempts to search the archives, I concluded they are really >> >>> broken. 500 >> >>> > > errors, bad links, etc.) >> >>> > > >> >>> > > For those using, interested in, or knowledgeable about using >> >>> API-based >> >>> > > storage (SRB, S3) as a backend for DSpace: how does doing so >> >>> affect >> >>> > > full-text indexing? Can anyone describe how, in such a >> >>> setup, full text >> >>> > > is stored and indexed? >> >>> > > >> >>> > > My uneducated impression is that Lucene would want to work >> >>> only against >> >>> > > a filesystem. >> >>> > > >> >>> > > Thanks, >> >>> > > Cory Snavely >> >>> > > University of Michigan Library IT Core Services >> >>> > > >> >>> > > >> >>> > > >> >>> > > >> >>> -------------------------------------------------------------------- >> >>> -- --- >> >>> > > This SF.net email is sponsored by DB2 Express >> >>> > > Download DB2 Express C - the FREE version of DB2 express and >> >>> take >> >>> > > control of your XML. No limits. Just data. Click to get it now. >> >>> > > >> <http://sourceforge.net/powerbar/db2/>http://sourceforge.net/powerbar/db 2/ >> >>> > > _______________________________________________ >> >>> > > DSpace-tech mailing list >> >>> > > >> <mailto:[email protected]>[EMAIL PROTECTED] net >> >>> > > https://lists.sourceforge.net/lists/listinfo/dspace-tech >> >>> > >> >>> >> >>> >> >>> -------------------------------------------------------------------- >> >>> -- --- >> >>> This SF.net email is sponsored by DB2 Express >> >>> Download DB2 Express C - the FREE version of DB2 express and take >> >>> control of your XML. No limits. Just data. Click to get it now. 
>> >>> >> <http://sourceforge.net/powerbar/db2/>http://sourceforge.net/powerbar/db 2/ >> >>> _______________________________________________ >> >>> DSpace-tech mailing list >> >>> >> <mailto:[email protected]>[EMAIL PROTECTED] net >> >>> https://lists.sourceforge.net/lists/listinfo/dspace-tech >> >>> >> >>> -------------------------------------------------------------------- >> >>> -- --- >> >>> This SF.net email is sponsored by DB2 Express >> >>> Download DB2 Express C - the FREE version of DB2 express and take >> >>> control of your XML. No limits. Just data. Click to get it now. >> >>> >> <http://sourceforge.net/powerbar/db2/>http://sourceforge.net/powerbar/db 2/ >> >>> _______________________________________________ >> >>> DSpace-tech mailing list >> >>> >> <mailto:[email protected]>[EMAIL PROTECTED] net >> >>> https://lists.sourceforge.net/lists/listinfo/dspace-tech >> >> >> >> ~~~~~~~~~~~~~ >> >> Mark R. Diggory - DSpace Systems Manager >> >> MIT Libraries, Systems and Technology Services >> >> Massachusetts Institute of Technology >> >> >> >> >> >> >> > >> > >> >>~~~~~~~~~~~~~ >>Mark R. Diggory - DSpace Systems Manager >>MIT Libraries, Systems and Technology Services >>Massachusetts Institute of Technology >>Office: E25-131 >>Phone: (617) 253-1096 >> >> >> >> >> >>------------------------------ >> >>Message: 3 >>Date: Fri, 4 May 2007 09:23:56 -0400 >>From: "Mark H. Wood" <<mailto:[EMAIL PROTECTED]>[EMAIL PROTECTED]> >>Subject: Re: [Dspace-tech] srb/s3/etc and lucene >>To: >><mailto:[email protected]>[EMAIL PROTECTED] e.net >>Message-ID: >><<mailto:[EMAIL PROTECTED]>[EMAIL PROTECTED] du> >>Content-Type: text/plain; charset="us-ascii" >> >>There are two questions here: >> >>1) Does the use of a non-filesystem asset store backend affect Lucene's >> output? One would guess, no, since it doesn't do output to the >> asset store. >> >>2) Does the use of a non-filesystem asset store backend affect >> Lucene's input? 
>>   IOW how does Lucene, as used in DSpace, locate
>>   and gain access to the files it indexes? If it doesn't go through
>>   the DSpace storage layer or something equivalent then indexing is
>>   screwed.
>>
>>Ouch! I hadn't thought about these at all.
>>
>>--
>>Mark H. Wood, Lead System Programmer [EMAIL PROTECTED]
>>Typically when a software vendor says that a product is "intuitive" he
>>means the exact opposite.
>>
>>-------------- next part --------------
>>A non-text attachment was scrubbed...
>>Name: not available
>>Type: application/pgp-signature
>>Size: 189 bytes
>>Desc: not available
>>
>>------------------------------
>>
>>Message: 4
>>Date: Fri, 04 May 2007 09:39:41 -0400
>>From: Cory Snavely <[EMAIL PROTECTED]>
>>Subject: Re: [Dspace-tech] srb/s3/etc and lucene
>>To: "Mark H. Wood" <[EMAIL PROTECTED]>
>>Cc: [EMAIL PROTECTED]
>>Message-ID: <[EMAIL PROTECTED]>
>>Content-Type: text/plain
>>
>>Right--I am trying to get an understanding of all this in very specific
>>terms.
>>
>>On Fri, 2007-05-04 at 09:23 -0400, Mark H. Wood wrote:
>> > There are two questions here:
>> >
>> > 1) Does the use of a non-filesystem asset store backend affect Lucene's
>> >    output? One would guess, no, since it doesn't do output to the
>> >    asset store.
>> >
>> > 2) Does the use of a non-filesystem asset store backend affect
>> >    Lucene's input? IOW how does Lucene, as used in DSpace, locate
>> >    and gain access to the files it indexes? If it doesn't go through
>> >    the DSpace storage layer or something equivalent then indexing is
>> >    screwed.
>> >
>> > Ouch! I hadn't thought about these at all.
>>
>>------------------------------
>>
>>Message: 5
>>Date: Fri, 04 May 2007 09:46:39 -0400
>>From: Richard Rodgers <[EMAIL PROTECTED]>
>>Subject: Re: [Dspace-tech] srb/s3/etc and lucene
>>To: Cory Snavely <[EMAIL PROTECTED]>
>>Cc: [EMAIL PROTECTED], "Mark H. Wood" <[EMAIL PROTECTED]>
>>Message-ID: <2007050409463[EMAIL PROTECTED]>
>>Content-Type: text/plain; charset=ISO-8859-1; format="flowed"
>>
>>See notes:
>>
>>Quoting Cory Snavely <[EMAIL PROTECTED]>:
>>
>> > Right--I am trying to get an understanding of all this in very specific
>> > terms.
>> >
>> > On Fri, 2007-05-04 at 09:23 -0400, Mark H. Wood wrote:
>> >> There are two questions here:
>> >>
>> >> 1) Does the use of a non-filesystem asset store backend affect Lucene's
>> >>    output? One would guess, no, since it doesn't do output to the
>> >>    asset store.
>>Correct - no. Lucene reads the file for indexing through the storage API - it
>>therefore has a BitStream, not a location on a storage device.
>> >>
>> >> 2) Does the use of a non-filesystem asset store backend affect
>> >>    Lucene's input? IOW how does Lucene, as used in DSpace, locate
>> >>    and gain access to the files it indexes? If it doesn't go through
>> >>    the DSpace storage layer or something equivalent then indexing is
>> >>    screwed.
>>No - for the same reason. It does not circumvent the storage API or make
>>any assumptions about where the files with the text to index live.
>> >>
>> >> Ouch! I hadn't thought about these at all.
>>
>>Remember, we already support SRB (a non-local filesystem option), and
>>indexing works fine.
>>
>>Thanks,
>>
>>Richard R
>>
>>------------------------------
>>
>>Message: 6
>>Date: Fri, 4 May 2007 21:44:42 +0530
>>From: "thirunavukarasu selvam" <[EMAIL PROTECTED]>
>>Subject: [Dspace-tech] Configuring Port number in DSpace
>>To: [EMAIL PROTECTED]
>>Message-ID: <[EMAIL PROTECTED]>
>>Content-Type: text/plain; charset="iso-8859-1"
>>
>>Hello all,
>>
>>Dspace by default is listening to port 25 always.
>>I have configured the mail.server in dspace.cfg to my own mail server.
>>I am not allowing my mail server to listen to port 25, i.e. I am dropping the
>>packets coming to that port.
>>I am allowing the server to listen to port 587.
>>
>>So I want to change DSpace so that it also sends to and listens on port
>>587.
>>
>>i.e. how do I configure DSpace to listen on port 587 (or any other port number)?
>>
>>Thanks in advance for your reply.
>>
>>Thanks and regards,
>>Thiru....
>>-------------- next part --------------
>>An HTML attachment was scrubbed...
>>
>>------------------------------
>>
>>End of DSpace-tech Digest, Vol 13, Issue 7
>>******************************************
-------------- next part --------------
An HTML attachment was scrubbed...

------------------------------

_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech

End of DSpace-tech Digest, Vol 13, Issue 12
*******************************************
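
On the srb/s3/etc and lucene thread quoted above: Richard Rodgers's reply says that indexing reads each file through the storage API, so the indexer holds a bitstream rather than a location on a storage device. A minimal Python sketch of that idea, under loud assumptions: `AssetStore`, `LocalFileStore`, `InMemoryRemoteStore`, `index_bitstream` and `build_index` are hypothetical names invented for illustration, not DSpace or Lucene APIs, and the whitespace tokenizer only stands in for Lucene's real text analysis.

```python
import io
import os
import tempfile
from typing import BinaryIO, Dict, Iterable, Protocol, Set


class AssetStore(Protocol):
    """Hypothetical storage interface: callers get a stream, never a path."""

    def retrieve(self, bitstream_id: str) -> BinaryIO: ...


class LocalFileStore:
    """Hypothetical backend that keeps bitstreams as files under a directory."""

    def __init__(self, root: str) -> None:
        self.root = root

    def retrieve(self, bitstream_id: str) -> BinaryIO:
        return open(os.path.join(self.root, bitstream_id), "rb")


class InMemoryRemoteStore:
    """Hypothetical stand-in for a non-filesystem backend (e.g. SRB or S3)."""

    def __init__(self, blobs: Dict[str, bytes]) -> None:
        self.blobs = blobs

    def retrieve(self, bitstream_id: str) -> BinaryIO:
        return io.BytesIO(self.blobs[bitstream_id])


def index_bitstream(
    store: AssetStore, bitstream_id: str, index: Dict[str, Set[str]]
) -> None:
    """Read one bitstream through the store and add its tokens to the index."""
    with store.retrieve(bitstream_id) as stream:
        text = stream.read().decode("utf-8")
    # Naive whitespace tokenizer; an assumption, not how Lucene analyzes text.
    for token in text.lower().split():
        index.setdefault(token, set()).add(bitstream_id)


def build_index(store: AssetStore, bitstream_ids: Iterable[str]) -> Dict[str, Set[str]]:
    """Build a token -> bitstream-id index without knowing where bytes live."""
    index: Dict[str, Set[str]] = {}
    for bitstream_id in bitstream_ids:
        index_bitstream(store, bitstream_id, index)
    return index
```

Calling `build_index` with a `LocalFileStore` or an `InMemoryRemoteStore` holding the same bytes should give the same index, because the indexing code only ever sees the stream returned by `retrieve`. That is the property the reply describes, shown here on toy classes rather than on DSpace itself.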

