Re: [Dspace-tech] Wiki Security and spam
FYI 2) was exactly how the previous MoinMoin-based Wiki was set up (a few key locked pages, content-based blacklist) and that worked fine, we had little or no spam after those measures were in place. Given that, I'd suggest 1) is overkill. At the least, 2) is worth trying before resorting to 1). My tuppence worth anyway. Actually, by the time I started writing for the MoinMoin wiki, it worked the same way as the new changes to the mediawiki -- you had to be in the WikiUsersGroup to edit pages, and some pages (like the front page) had more restrictive access controls. Any heuristic technique of spam control, that allows open access while attempting to detect spam after the fact, puts the implementer in an arms race with the spammers. It requires some constant administrative effort to keep up with tuning and changes in the spam-detection software. Does anyone have the time to host the wiki and administer it that actively? -- Larry - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Can I use HTTPS for just login
The ability to mediate your user requests in Apache httpd (both bots and regular users) far outweighs any performance hit, which in my experience it is miniscule in comparison to the memory and CPU loads that tomcat itself produces being a multi-threaded Java application. Apache is a very efficient and optimized http service, the mod_jk connector simply streams the request/response cycle to tomcat. I'd like to see a scientific comparison.. not enough to do one, but someone should :-). I'll grant that apache itself doesn't add much overhead, but there's still the latency and extra context switches of piping all the bits through another process. The comparsion may become more interesting when DSpace is serving an archived website so a page load spawns many DSpace servlet requests for stylesheets, images, etc, and the overhead for each transaction adds up. Tomcat still does about the same amount of work for each request. For sites with a sever that is _only_ running DSpace, and no other web applications, and especially without anyone who already knows how to manage Apache Httpd, using Tomcat by itself seems a reasonable choice. -- Larry - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Setting up DSpace
There are instructions on the wiki -- http://wiki.dspace.org/index.php/DspaceOnStandardPorts All you really have to do is basic Tomcat configuration since is already (by default) a web server as well as a servlet container, so consult the doc for your version of Tomcat too. -- Larry Can anyone give me any help setting up DSpace using only Tomcat? I have Tomcat up and running and PostgreSQL installed and running. I just need a few pointers on getting the rest working. We don't want to do it with Apache. - Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier. Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642 ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Dspace and Oracle Application Server 10g
You should be able to do this quite easily by adding the following to: [dspace-src]/etc/dspace-web.xml When merging Servlet webapps, one thing to watch out for is how they handle errors. The DSpace webapp has special error handlers that substitute its own pages. I think it also transforms some status codes. This may cause problems with the OAI-PMH protocol; as I recall, it did when I experimented with including WebDAV protocol servlets in the same webapp as the web UI. Aside from that, though, it was no problem to merge the OAI, WebUI, and WebDAV servlets into one webapp. -- Larry - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys-and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Are you using the DSpace History System?
Have a look at the wiki page about the new history system: http://wiki.dspace.org/index.php/HistorySystemPrototype It's already written, and in fact the project is over so I won't be doing any more work on it. If the Event patch gets adopted, it will remove the old history system, but open the way to let sites install this one as an add-on. Right now it only exists in the wiki and on the patch queue. The old history system wrote some very dodgy records; some outright errors, other records that cannot be correlated to actual events, and it used database IDs as identifiers instead of anything persistent. (iirc, one of its worst sins was using Handle-like identifiers with database-IDs in the suffix). So, if you're actually going to use the data you're better off with the new system. It also includes an RDF database (Sesame2) so you can make custom queries, which sounds like it would help your use case. -- Larry I am not currently using it, but bringing it up is timely because I'm reaching a point where what I am being drawn towards is the necessity of an audit tool for certain system activities. I haven't had time to evaluate what the history system can do for me in that regard, but if anyone is planning on making changes to it, I'd be interested in being involved in some way, shape, or form. Let me give you one or two examples of the kind of auditing that I need: as users add/remove files over time from their item as they prepare it, I need to track what was added/removed and by whom when (multiple users can work on a single item in our system). Similarly for licences. Also, administrators perform many tasks on items before they hit the public repository, and a navigable audit trail on item activities which can actually be interacted with would be of great benefit. Cheers, -- Richard Richard Jones| t: +44 (0)20 759 [48614 / 41815] Web Database | e: [EMAIL PROTECTED] Technology Specialist | b: http://chronicles-of-richard.blogspot.com/ Imperial College London | - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys-and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys-and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Sequence ID generation
First, it is assigned sequentially and IDs are not reused if a bitstream is deleted. There is no magic ordering, and it was *not* intended for organizing a set of bitstreams into a meaningful sequence (e.g. PDF chapters of a book). Its sole purpose is to provide a *durable* unique ID for a bitstream - think of it as a 'sub-handle' ID - modulo an item There's actually a bug in the data model, then. It's possible to get the same sequence ID reused, because when adding a Bitstream, the code only looks for the highest existing SequenceID and increments that. 1. Take an existing Item, go into the Edit Item admin page (/dspace/tools/edit-item), and add a new Bitstream with a distinctive name. Say, foo.pdf. 2. Determine its Sequence ID. Go to the Item page /dspace/handle/my-handle and observe the View/Open link next to your bitstream, the path element after its handle is the SequenceID. It should be the highest SequenecID there since it was most recently added. There are some invisible Bitstreams (like licenses) that also take up SIDs. 3. Go back to the Edit page and delete that newest bitstream. 4. Add a different bitstream with a different name, say, bar.pdf. 5. Go to a freshly-loaded copy of the Item page, and observe that bar.pdf has the same SequenceID that foo.pdf had before. I'll submit this as a bug on sourceforge too. -- Larry - This SF.net email is sponsored by DB2 Express Download DB2 Express C - the FREE version of DB2 express and take control of your XML. No limits. Just data. Click to get it now. http://sourceforge.net/powerbar/db2/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Need Sample for LNI
I'm trying to get up an running on LNI, but can't seem to get the syntax for PUT just right (I keep getting either 400 or 500 http errors). Is there anybody out there who can send me a syntax sample along with a sample package? I'm working with the LNISmokeTest class. The smoke test has a method doPut() which ought to be a viable example. Is that what you're using? It takes the Handle of a collection which you have write access too. The Handle should be a string like 123456789/241; the sample code uses the Lookup operation to get its LNI resource URI. Check that the packager value names a PackgeIngester plugin configured in dspace.cfg. It may help to crank up the log level to DEBUG and look at the server logs. -- Larry - This SF.net email is sponsored by DB2 Express Download DB2 Express C - the FREE version of DB2 express and take control of your XML. No limits. Just data. Click to get it now. http://sourceforge.net/powerbar/db2/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Persistent identifiers in DSpace -- thoughts please
On Wed, May 30, 2007 at 02:01:50AM -0400, Larry Stone wrote: How about the word resource to introduce the URI, since it is, after all, a reference to a resource -- the R in URI. It'd be: prefix/resource/encoded-URI e.g. http://dspace.me.ac.uk/resource/hdl/1234/56 This follows the proposal to encode the URI by tearing off the scheme and putting it in a separate pathname element to avoid issues over quoting the :. Note that I propose using the actual scheme label in the URL rather than a user-friendly label, e.g. hdl rather than handle. This sounds like some reasonable middle-ground. The only issue I can see here is that this mechanism only allows us to refer to objects that have persistent identifiers. Of course, we could still use an internal form of identifier for objects without actual persistent identifiers, but then if we have an internal format, should we not use that everywhere? Aside from consistency, Mark made the observation that including the persistent identifier in the URL is, to a certain extent, bogus. Perhaps we could just provide the ability to resolve URLs of the above form, but for making links, etc, we use an internal identifier format. That's a good point -- DSpace is taking on the function of resolving persistent identifiers like Handles and DOIs when there is no need, since Handles, at least, already have a Web proxy server. I wasn't counting on the add our own flavor of PIDs to DSpace getting resolved favorably.. It _does_ have to allow data model objects to be referenced (through Web interfaces) by an URL that includes a _persistent_ identifier (as opposed to, say, a database-ID). That's the URL that will get used in links and citations despite our best efforts to promote Handles, so it needs to be reasonably permanent. Given a DSpace-specific persistent identifier (e.g. the UUID scheme), I see two options: 1. Give every content-model object a DSpace-type PID, no matter what. External references are URLs including the DSpace PID. Other PID schemes (e.g. Handle) resolve to those URLs. Allow plugins to register other PIDs when an object is created. 2. Make the DSpace PID into a PersistentIdentifier plugin so it is a peer with the Handle or DOI plugins. The administrator chooses to support one or more, and the canonical external reference to an object becomes whichever kind of PID is configured to be canonical. Choice (1) is simpler and seems more sensible, but (2) could be completely backward-compatible. Note that some ingested objects will already have PIDs, e.g. if they are AIPs being re-ingested to reconstruct an archive after catastropic failure, or DIPs (AIPs) mirrored from another repository. If _all_ DSpaces have the same PID scheme as in (1), there's no problem ingesting and accessing another archive's objects. Under (2), you could end up generating new PIDs for old objects because your archive doesn't understand the kind of PID they already have. I think the UUID scheme (or something like it) makes a whole lot of sense, but it is a rather significant change. Re special characters and quoting: I agree with James' original point that the HTTP URL spec has quoting rules for just this reason, but from a practical point of view, the client and server implementations have a lot of bugs in this area. That's what I discovered implementing WebDAV for the LNI: it wasn't worth trying to encode a slash (/) in a URL, e.g. within a Handle, because it would just get stomped on differently by the different clients. Better to let it get used literally as a path element separator and make the servlet clever enough to figure it out. Also, construct the servlet's URL so the whole path after a certain point is part of the object URI, e.g. the Handle. Again, this sounds fine. The only reason this doesn't work with the current implementation with Handles is for referencing bitstreams -- we are forced to make assumptions about the structure of the persistent identifiers because we use the (arbitrary and unpredictable) filename as part of the URL. This must be avoided, whichever scheme we eventually use. Do you mean the way Bitstreams are referenced in a /bitstream/ servlet URL? I thought the path actually doesn't matter there -- it can be anything, the servlet only looks at the sequence ID, because the URL follows the pattern: prefix/bitstream/handle/SequenceID/path e.g. http://dspace.mit.edu/bitstream/1721.1/35700/2/60504128-MIT.pdf ...hmm, it didn't _used_ to care what the path was at all, it would retrieve the bitstream referenced by the Sequence ID. Now, at least on the 1.4.1 system I checked, both SID and path have to match. But it doesn't have to be implemetned that way. Since Sequence IDs are the ONLY Bitstream metadata which must be unique within an Item, the servlet might as well just ignore the path. (Of course, this ignores the necessity of the /html/ servlet which
Re: [Dspace-tech] Setting Primary Bitstream
While uploading multiple files (html files and other links files) using batch import facilty, DSplace displays all the files. While going through archive it is found in the mail That's still true for the batch importer AFAIK, but there is now another way to ingest an Item that will let you set the primary bitstream. If you make a METS package, which means creating a METS manifest and putting it into a Zip file with the content files, then you can use the package ingester. You can declare the primary bitstream in the METS manifest, see: http://wiki.dspace.org/index.php/DSpaceMETSSIPProfile The package ingester can be run from the command line like the batch importer, although it only handles one package at a time. For details about the package ingester, see the system doc: http://dspace.svn.sourceforge.net/viewvc/*checkout*/dspace/trunk/dspace/docs/application.html#packager The easiest way to get an example is to take an item alraedy in the archive which is similar to what you want, and export it as a METS package using the command-line packager, then look at its METS manifest. -- Larry Message: 3 Date: Mon, 11 Jul 2005 18:15:59 EDT From: Larry Stone [EMAIL PROTECTED] Reply-To: Larry Stone [EMAIL PROTECTED] To: Jim Downing [EMAIL PROTECTED] Cc: PRAT [EMAIL PROTECTED], dspace-tech dspace-tech@lists.sourceforge.net Subject: RE: [Dspace-tech] (no subject) You have to set the primary bitstream for the item, manually, to the top-level web page -- e.g. usually index.html. The ItemImporter doesn't have any way to set a primary bitstream, although this seems like a very good thing to add. The only way I know to do this is in the administrative GUI for the Item. Whether DSpace has any plan to solve this problem in coming version?. Surinder Kumar Gaba Technical Director Bibliographic Informatics Division National Informatics Centre A-Block, CGO Complex, Lodhi Road New Delhi-110 003 Tel : 011-24362359 (o) : 011-27865224 (r) - This SF.net email is sponsored by DB2 Express Download DB2 Express C - the FREE version of DB2 express and take control of your XML. No limits. Just data. Click to get it now. http://sourceforge.net/powerbar/db2/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech - This SF.net email is sponsored by DB2 Express Download DB2 Express C - the FREE version of DB2 express and take control of your XML. No limits. Just data. Click to get it now. http://sourceforge.net/powerbar/db2/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] How to get ingested pdfs to keep their names?
Right, that wasn't ever an option on the PDF packager. That class was really written as a demonstration of the package ingester and a proof-of-concept, but it was allowed to escape. The package ingester never sees the original filename; it is given an open InputStream created by the Packager application. Package ingesters run from a stream so they don't have to care if their input is from a file or a network connection, that way remote ingestion just works. However, you've already figured out most of what is needed to add the option you want. Just add some code to look for the name option and if found, set the bitstream's name to that. Make sure the disseminator will find custom-named PDFs, perhaps by looking for the first bitstream with a name ending in .pdf if there is no package.pdf. Please submit the change as a patch through Sourceforge, so it can get adopted into the source, see the wiki page How to contribute. -- Larry I'm trying to ingest a whole bunch of pdf's using the packager. The syntax used is /dspace/bin/dsrun org.dspace.app.packager.Packager -c hp/548 -e b [EMAIL PROTECTED] -w -t PDF /home/dspace/HPL-2007-2.pdf The problem is that the files get renamed to package.pdf (even if I add -o name=blah.pdf to the command.). This seems to happen in the class org.dspace.content.packager.PDFPackager bs = original.createBitstream(pkg); pkg.close(); bs.setName(package.pdf); Does anyone have a workaround that keeps the original pdf file name? is there something I'm not doing quite right? Thanks, Ben Weinberg HP Digital Library - This SF.net email is sponsored by DB2 Express Download DB2 Express C - the FREE version of DB2 express and take control of your XML. No limits. Just data. Click to get it now. http://sourceforge.net/powerbar/db2/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech - This SF.net email is sponsored by DB2 Express Download DB2 Express C - the FREE version of DB2 express and take control of your XML. No limits. Just data. Click to get it now. http://sourceforge.net/powerbar/db2/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] jhove / preingest / techMDExtractor
What'd like to do is just validate that the extension of the bitstreams submitted during an item deposit correspond to the actual format(s) of the files using the output of jhove. Now, I don't know how difficult this will be, and if you are working on it, and it's going to be coming out in 6 months, then perhaps my time would be better spent doing something else. So, what do you think about this. Do you see this sort of functionality realistically being available in the near future ( version 1.5, perhaps? ), or is what I want to do not that difficult, and only a small subset of what you're working on, so why not just do it? The work I'm doing might help somewhat, since it will include more sophisticated and accurate format identification, plus a measurement of confidence in the identification -- but I can't make any promises whether it will get into 1.5. Full details should be available on the wiki within a few weeks, I'll anounce it on the dspace-tech and dspace-devel lists so the community can comment on my proposal. It's a whole framework for integrating external data format registries (like the GDFR), as well as format-identifying applications. It does not include format validators but they do have a place in the overall design. I'm not sure JHOVE version 1 will be much help, either -- I recommend taking a hard look at its limitations before spending any time on it. Its repertoire of formats is somewhat limited, and the output is not trivial to interpret. Also, we found it gave a significant number of false negatives when validating. What's left? Tools like DROID (droid.sourceforge.net), perhaps, although it has some difficulty integrating with DSpace. It just identifies formats, but you could use that as a quasi-validation. Also, the field of data format representation, identification, and validation is in great flux right now, so there will be improvements. Thats's why I'm designing a very flexible framework to let DSpace make use of external resources. We heard the JHOVE 2 project just got funded, so that will be worth watching. See http://fileformats.blogspot.com/search/label/JHOVE for some early hints. -- Larry - This SF.net email is sponsored by DB2 Express Download DB2 Express C - the FREE version of DB2 express and take control of your XML. No limits. Just data. Click to get it now. http://sourceforge.net/powerbar/db2/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Xml file with style sheet
I have an xml file that references a style sheet, but I'm not able to get the xml file to render correctly using the style sheet when I deposit this file into deep blue. I realize that I need to create a servlet to handle xml files, just like there is one for html files. Before I do this, I wanted to check if any one out there has already done this, or if there is in fact some other quicker way of doing this. You should just be able to use the HTMLServlet itself on the XML file; it works on any kind of content type, although the MIME-type it sends your browser is whatever the bitstream's BitstreamFormat dictates. So, if your bitstream has the name foo.xml in item 1721.1/99404, try the URL http://dspace.mit.edu/html/1721.1/99404/foo.xml (substitute appropriately of course..) If that XML file makes relative-URL references to the XSL stylesheets, then it ought to just work. You may have to tweak the BitstreamFormat and MIME type of the stylesheet bitstreams if the browser is fussy about their MIME types. It should just be a matter of getting the Item display page to use the HTMLServlet for your XML bitstreams the way it does for HTML, which is purely a kludge. -- Larry - This SF.net email is sponsored by DB2 Express Download DB2 Express C - the FREE version of DB2 express and take control of your XML. No limits. Just data. Click to get it now. http://sourceforge.net/powerbar/db2/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] [vote] Do we want to assign external identifiers (Handles) to files?
+1 assign external identifiers to files - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2005. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Running DSpace in read-only mode.
On Thu, Jul 19, 2007 at 05:59:39PM -0400, Kyle Brentnell wrote: 2) Change the authenticateInternal method in the AuthenticationManager.java class to always return BAD_ARGS or NO_SUCH_USER, etc. to prevent users from logging into Dspace. To allow access again, restore the original authenticateInternal code (and compile and deploy of course). An easier way to implement this is to create a new stubbed-out AuthenticationMethod plugin that always returns NO_SUCH_USER or some other fatal error, add that to the source, and then just change the DSpace Configuration to make it the only method on the authentication stack (commenting-out the original lines for easy restoration). Then, to restore normal operation, all you have to do is put the configuration back and restart the servlet container. It saves changing code and compiling a second time, and makes it easier to engage read only mode again in the future. -- Larry - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2005. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
[Dspace-tech] background on external/persistent identifiers
I ran across an excellent study published by the ECPA (European Commission on Preservation and Access) that is recommended background reading if you want to know more about the issues behind persistent identifiers. It doesn't recommend any one system, but describes all the leading ones and mentions pros and cons. Implementing Persistent Identifiers: overview of concepts, guidelines and recommendations (ECPA-report; 18) by Hans-Werner Hilse and Jochen Kothe http://www.knaw.nl/ecpa/publications.html http://www.knaw.nl/ecpa/publ/pdf/2732.pdf -- Larry - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now http://get.splunk.com/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Reply-To Header missing - list misconfigured?
FWIW, some of the lists to which I subscribe mung the reply-to header to force replies to go to the list, and some don't -- I'm invariably annoyed by the ones that _do_ mung headers. So take this as a vote NOT to change the behavior of the DSpace lists. My mail user agent is a certifiable antique, but it has simple, separate commands for reply to sender and reply to all. I prefer to have that option, but it goes away if teh Reply-To header forces a reply to the list even when I choose to write only to sender. I suspect we'll see a bunch of inadvertent posts to the whole lists if this gets changed, too. -- Larry - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now http://get.splunk.com/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Dspace Bulk Upload
On 9/7/07, Nandita Chaudhri [EMAIL PROTECTED] wrote: If one is not using the submission UI to load items one by one but is using the bulk upload facility is there a way to have two bitstreams with only one as the 'primary' one which will be displayed when the item is retrived using a search? I knoe with the submission UI it is possible. Regards, The METS package ingester (as of 1.4.2, at least) can set the primary bitstream. It must be mentioned in the METS manifest; the profile document says how to do this, search for Primary bitstream in: http://wiki.dspace.org/index.php/DSpaceMETSSIPProfile You can ingest packages through the LNI or the prototype command-line class, org.dspace.app.packager.Packager - the system documentation has all the details. -- Larry - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2005. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Help with METS Ingest Import
See the Application Layer page in the DSpace system documentation (docs/application.html in the source), the section Package Importer and Exporter. Briefly, the [dspace]/bin/packager command ingests packages as well, you have to give it a -c collection-handle option and can specify -s for submit, although i believe this is the default. In any case, run [dspace]/bin/packager --help for help with all options. -- Larry I've been struggling trying to figure how to import METS ITEMS into DSPACE as in SIP. There's an export utility (dsrun) for exporting DSPACE items into METS Formats (DIP). How would I go about doing the reverse, importing METS Items into a Collection. - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now http://get.splunk.com/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Viruses and DSpace
Has any thought been given to how Dspace might handle the remote ( hopefully ) possibility of a file containing a virus being deposited into a repository? It seems like jhove might be the kind of tool that could check for this. I believe there is some work going on to incorporate jhove into Dspace, how is that coming along? It's not part of of 1.5, but what about for the following release? The BitstreamFormat renovation (see http://wiki.dspace.org/index.php/BitstreamFormat_Renovation ) doesn't address this directly, but will make it much easier to integrate tools because file formats will be identified more effectively and precisely. Once the format is known you can add a mechanism like the mediafilters, perhaps integrated with workflow, to run specific checks depending on the format type. JHOVE version 1 is just a format validator and technical-metadata extractor, it isn't subtle enough to look for viruses. There _are_ tools in the email filtering domain which detect malicious MS Office files; I've heard of them but don't remember specifics. You could start by looking around the SpamAssassin software and ClamAV (see http://www.clamav.net/ ) However, be aware that any virus-checking software needs constant updating since you're essentially in an arms race. -- Larry (a recovering postmaster) - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2005. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Reading other files in Manakin (was: Re: Desired hierarchical display of communities and collection list)
I haven't looked at the Manakin code, but the MIME media-type of a Bitstream must be coming from its associated BitstreamFormat -- so why not get the human-readable name from the BitstreamFormat as well? There is no need to establish a separate map of MIME-type to user-friendly name when it already exists in teh BitstreamFormat registry. String friendly = bitstream.getFormat().getShortDescription(); One complication, or perhaps advantage, of using BSFs directly is that some of them have the same MIME-type, so getting the friendly name from the BSF actually identifies the format more precisely -- e.g. XML-based formats might all have the MIME-type text/xml, but distinct friendly names. Thus, you should go to the Bitstream's BSF to get the friendly name rather than attempt to use the BSF registry as a map, because it might have multiple matches for one MIME-type. -- Larry On Nov 21, 2007 10:06 PM, Conal Tuohy [EMAIL PROTECTED] wrote: On Wed, 2007-11-21 at 16:43 -0600, Dorothea Salo wrote: The mapping between media-types and friendly names could be introduced into the pipeline using a Manakin Aspect, and then utilised in a View, via XSLT. Aha. I can try to tackle this. What would be the closest existing code? Alternatively, perhaps this is really just a case of i18n? I thought about that, but I can't quite make it work happily. Every time an administrator adds a new bitstream format (something I assume Manakin still has UI for?), DSpace itself would have to make an automatic change to messages.xml, which is under most circumstances a human-authored and source-controlled file. Automagically changing it will make a mess of any installation that keeps its source in source control, I would think. Dorothea -- Dorothea Salo[EMAIL PROTECTED] Digital Repository Librarian AIM: mindsatuw University of Wisconsin Rm 218, Memorial Library (608) 262-5493 - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2005. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2005. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Reading other files in Manakin (was: Re: Desired hierarchical display of communities and collection list)
I agree that adding techMD to the METS is architecturally the right thing, but there is already a better (or at least, adequate..) technical metadata standard format in PREMIS. There is already a PREMIS crosswalk for the import/export METS packager which you can cannibalize, see org.dspace.content.crosswalk.PREMISCrosswalk You could put the format name in the element: objectCharacteristics/format/formatDesignation/formatName The crosswalk puts the MIME type there right now, but it would make more sense to use it for the name. (At the time I wrote the crosswalk, I didn't feel it was correct to dignify the chaos that is MIME types by calling it a registry of format identifiers, although we currently use it as such.) Then you can use PREMIS for other bits of technical metadata of use to Manakin, instead of perverting DIM. DIM was never supposed to escape beyond the internal use in XSLT-driven packager crosswalks! There's more about PREMIS here: http://www.loc.gov/standards/premis/ -- Larry The METS file element lets you have an admid attribute that points to administrative metadata about the file elsewhere in the METS document. For our purposes, we would probably end up creating something like this (assume the new metadata is in a ds schema) for each bitstream (or perhaps for each format in an item? one could point more than one bitstream to the same amdSec): mets:amdSec id=bitstream_1 mets:techMD id=bitstream_1_techmd mets:mdWrap OTHERMDTYPE=DIM MDTYPE=other mets:xmlData dim:dim dspaceType=BITSTREAM dim:field element=formatName mdschema=dsHappyFile version 1.01/dim:field /dim:dim /mets:xmlData /mets:mdWrap /mets:techMD /mets:amdSec The place to pop this in, I *think*, is somewhere in xmlui.objectmanager.ItemAdapter.java. If what I just suggested doesn't seem outrageous, I'll take a whack at coding it up. Dorothea -- Dorothea Salo[EMAIL PROTECTED] Digital Repository Librarian AIM: mindsatuw University of Wisconsin Rm 218, Memorial Library (608) 262-5493 - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2005. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech - SF.Net email is sponsored by: The Future of Linux Business White Paper from Novell. From the desktop to the data center, Linux is going mainstream. Let it simplify your IT future. http://altfarm.mediaplex.com/ad/ck/8857-50307-18918-4 ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] why vanilla?
I don't visit the wiki often enough, but my impression has been that work-in-progress and completed projects are jumbled together, probably because when projects are completed they just stay on the page and stop getting updates. Regularly moving released mod.s to a released projects page would help to make it clear what you can just plug in now and what you may be able to help develop, which is an important distinction depending on your needs and resources. This is a good application of Categories in the wiki -- brand project pages with either Project-In Progress or Project-Released, e.g. Then the summary pages would be automatically generated. See http://meta.wikimedia.org/wiki/Help:Category to learn all about categories. The use of categories looks somewhat haphazard right now; we really need a meta-page explaining how the tags are supposed to be used and offering guidelines to creating such things as project pages. See: http://wiki.dspace.org/index.php/Special:Categories -- Larry - SF.Net email is sponsored by: Check out the new SourceForge.net Marketplace. It's the best place to buy or sell services for just about anything Open Source. http://ad.doubleclick.net/clk;164216239;13503038;w?http://sf.net/marketplace ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Uploading a 10Gig File. (Blanco, Jose)
Check the configurations of the Web server (e.g. apache) and/or servlet container (tomcat, if using mod_jk) for limits on the largest upload allowed. Start by looking at their error logs and transaction logs to see what happened to your HTTP request. There is theoretically no reason you shouldn't be able to do this, but it probably exceeds all sorts of default limits in the software stack. For example, see the Apache httpd LimitRequestBody directive. (Hmm, apache 2.2 seems to have a hard limit of 2 Gb in request bodies.) I have experimented with 20Gb uploads using a bare Tomcat 5.0 server and the HTTP PUT protocol in the LNI, so it ought to be possible. -- Larry The file is uncompressed. It is a mov file. The tmp dir has enough space for it. I am able to cp it there. What I have found is that if the /tmp dir does not have enough space for it, it starts to write it there and then things stop running once the dir fills up, but in this, it never tries to write it there. -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Michael Faraci Sent: Wednesday, January 09, 2008 4:36 PM To: dspace-tech@lists.sourceforge.net Subject: [Dspace-tech] Uploading a 10Gig File. (Blanco, Jose) Message 4. Have you checked tmp directory size. Is the file compressed (zip/tar)? I would be concerned with the uncompressed size if it is. I have never dealt with submitting a file this large, so these are just some things that come to mind. Michael Faraci Network/Systems Administrator ALC 3305 N. 3rd St. Suite 301 Abilene, Texas (325)672-7081 x4 fax (325)672-7082 -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of [EMAIL PROTECTED] Sent: Wednesday, January 09, 2008 11:14 AM To: dspace-tech@lists.sourceforge.net Subject: DSpace-tech Digest, Vol 21, Issue 20 Send DSpace-tech mailing list submissions to dspace-tech@lists.sourceforge.net To subscribe or unsubscribe via the World Wide Web, visit https://lists.sourceforge.net/lists/listinfo/dspace-tech or, via email, send a message with subject or body 'help' to [EMAIL PROTECTED] You can reach the person managing the list at [EMAIL PROTECTED] When replying, please edit your Subject line so it is more specific than Re: Contents of DSpace-tech digest... Today's Topics: 1. Re: Cannot delete collections from Dspace (Claudia J?rgen) 2. Re: Cannot delete collections from Dspace (Graham Triggs) 3. Re: Creative Commons Enigma (Richard Rodgers) 4. Uploading a 10Gig File. (Blanco, Jose) 5. Re: Getting image thumbnails (Keith Gilbertson) 6. Collection Item Mapper Search (Kyle Kaliebe) -- Message: 1 Date: Wed, 09 Jan 2008 15:02:30 +0100 From: Claudia J?rgen [EMAIL PROTECTED] Subject: Re: [Dspace-tech] Cannot delete collections from Dspace To: Filippos Kolovos [EMAIL PROTECTED] Cc: Dspace-Tech-List dspace-tech@lists.sourceforge.net, Dspace-general-list [EMAIL PROTECTED] Message-ID: [EMAIL PROTECTED] Content-Type: text/plain; charset=UTF-8; format=flowed Hi Filippos, are you sure you're running 1.4.2? The table dcvalue does not exist anymore in 1.4.2. It has been replaced in 1.4. In 1.3.2 there has been a known bug http://sourceforge.net/tracker/index.php?func=detailaid=1333269group_i d=19984atid=119984 on the deletion of collections/communities containing withdrawn items. cheers Claudia Filippos Kolovos schrieb: Dear all, Recently I am dealing with a problem with my Dspace installation (1.4.2). When I try to delete a collection and/or community from Dspace the system returns an Internal Server Error. The explanation of this error is shown below: Exception: org.postgresql.util.PSQLException: ERROR: update or delete on item violates foreign key constraint $1 on dcvalue at org.postgresql.util.PSQLException.parseServerError(PSQLException.java:13 9) at org.postgresql.core.QueryExecutor.executeV3(QueryExecutor.java:152) at org.postgresql.core.QueryExecutor.execute(QueryExecutor.java:100) at org.postgresql.core.QueryExecutor.execute(QueryExecutor.java:43) at org.postgresql.jdbc1.AbstractJdbc1Statement.execute(AbstractJdbc1Stateme nt.java:517) at org.postgresql.jdbc2.AbstractJdbc2Statement.execute(AbstractJdbc2Stateme nt.java:50) at org.postgresql.jdbc1.AbstractJdbc1Statement.executeUpdate(AbstractJdbc1S tatement.java:273) at org.apache.commons.dbcp.DelegatingPreparedStatement.executeUpdate(Delega tingPreparedStatement.java:101) at org.apache.commons.dbcp.DelegatingPreparedStatement.executeUpdate(Delega tingPreparedStatement.java:101) at org.dspace.storage.rdbms.DatabaseManager.updateQuery(DatabaseManager.jav a:519) etc, etc .. . The exact reason for this error seems to be that the collection id is referenced from another table,
Re: [Dspace-tech] remote Client Program for accessing and managing theDSpace
There are already several existing network interfaces -- are you sure none of them give the access you need? See the wiki: http://wiki.dspace.org/index.php/NetworkInterfaces In particular, the LNI -- see: http://wiki.dspace.org/index.php/LightweightNetworkInterface was designed to be extensible and to eventually support a full administrative interface, if needed. It's based on WebDAV, but it also contains an alternate SOAP interface for _some_ (not all) functions. I recommend reading the original preserved wiki page at: http://web.mit.edu/lcs/www/lni/ because the one in the current wiki still contains a lot of damage from an inaccurate translation to MediaWiki. The source code is part of DSpace 1.5, so you can refer to it as a SOAP example. hope this helps, -- Larry I am going to write a client program to manage and access the DSpace repository remote. It should look like a normal application. So I think that I should use the some communication protocol to communicate with the DSpace servlet web services. For example: SOAP is a very gut one. But the question is, whether the DSpace supports it? And how many DSpace Websites use the SOAP? I tried to search an answer from the System Document online in DSpace. You know, that I didn't any answer. DSpace is written with Java. So I think it should support the SOAP with RPC. The DSpace Mailinglist is very hot. So i hope, someone can help me! Thanks and regards Feng Wang -- GMX startet ShortView.de. Hier findest Du Leute mit Deinen Interessen! Jetzt dabei sein: http://www.shortview.de/[EMAIL PROTECTED] - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javao ne ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] bitstreams and file locations
The Bitstream's name field is also significant for Items taht are archived websites with internal links to other Bitstreams in the item. The UI (the JSP UI, at least) has a special html servlet that resolves references to the *name* of the Bitstream instead of its Sequence ID (like the normal handle servlet). For example, this Item is an archived OCW website: http://dspace.mit.edu/html/1721.1/41055/12-820Spring-2005/OcwWeb/Earth--Atmospheric--and-Planetary-Sciences/12-820Spring-2005/CourseHome/index.htm and it refers to an image Bitstream with the URL http://dspace.mit.edu/html/1721.1/41055/12-820Spring-2005/NR/rdonlyres/Earth--Atmospheric--and-Planetary-Sciences/12-820Spring-2005/3B9453A1-78D4-49B4-9DBD-B1F39C78CB29/0/chp_turbulent.jpg The Item handle is 1721.1/41055 and the Bitstream's name field is 12-820Spring-2005/NR/rdonlyres/Earth--Atmospheric--and-Planetary-Sciences/12-820Spring-2005/3B9453A1-78D4-49B4-9DBD-B1F39C78CB29/0/chp_turbulent.jpg ...so, in a case like this, it's _essential_ to preserve the names of the Bitstreams or your web page will not work. -- Larry I have a simple question about the name column in the bitstream table. We're using 1.3.1 and migrating to a new server (but keeping 1.3.1 for now) Does this field have any value other than provenance information about the source of the ingested bitstream. The actual bitstreams are being stored in the assetstore and referenced differently, so I'm not sure about the reason for the name column? In a test migration the lack of a physical file referenced in the name column doesn't appear to have any consequence? Thanks Eric American Museum of Natural History - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] File Descriptions in Import
So, my question is: Can File descriptions for multiple bitstreams be added via the importer? Remember, the Packager plugins are another way to ingest Items -- SWORD, the LNI, and a command-line client (/dspace/bin/packager) can all invoke any of the package ingester plugins. Unfortunately, the most sophisticated packager that comes in the release, the METS (DSpace SIP profile) packager, does not recognize Bitstream descriptions. I *think* this is because there was no good place to put that metadata in the PREMIS object schema that we use for Bitstream-level metadata. There is a package-based solution that would let you ingest every detail of an Item -- the AIP prototype, see http://wiki.dspace.org/index.php/AipPrototype The whole nature of an AIP is that is a *complete* description of the DSpace object, down to every detail, so you don't lose anything when e.g. migrating Items between archives through AIPs. It's probably prohibitively hard to resuscitate this code, but perhaps you can just spin off your own customized METS ingester. Look at the AIP as a model if you want to adapt the existing METS packager code. You can subclass it pretty easily and just change the Bitstream crosswalk plugin it calls, perhaps to a modified PREMIS crosswalk, or use DC or something like that for Bitstream metadata. -- Larry - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] File Descriptions in Import
Never mind; answered my own question. This DOES NOT work for imported rather than registered bitstreams. Ah, well. I can edit bitstream descriptions manually for 300+ items; why not? You can update Bitstream description fields under program control with the LNI. It may even be possible to do this pretty easily with a scriptable WebDAV client like cadaver, or by writing a very simple LNI client. See http://wiki.dspace.org/index.php/LightweightNetworkInterface#PROPFIND_and_PROPPATCH_XML_elements Bitstream resources have a dspace:description property that is writable. Please refer to http://web.mit.edu/lcs/www/lni/ for a cleaner version of the LNI API document -- the one on the wiki is still very chewed-up from the conversion. -- Larry - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] LNI - no index.html; The requested resource (/lni/) is not available
Actually, it responds to GET on some URIs, but the LNI is *not* meant to be used as an interactive web site; it is a WebDAV server. WebDAV happens to use the HTTP protocol but not in a way that gets along with the subset (and perversions) of HTTP most browsers speak. It also has some ability to converse in SOAP (for a subset of its functions) but that is even less browser-friendly. For all the details and leads, see the LNI documentation at http://web.mit.edu/lcs/www/lni/ You can also download a sample SOAP client there. For some reason, the client utilities and sample client were not included in 1.5. -- Larry not elegant or intuitive... http://host/lni/lni And you need the client to interact because it requires POST. -Mark On May 16, 2008, at 12:52 PM, Rick Runyan wrote: lni is right there in the Tomcat webapps folder right beside xmlui, = which works like a champ. I=92m not experienced with web apps, and = even less so with java web apps, so I=92m at something of a loss = here. I=92ve spent a while reading and trying to figure out the = problem, with not much success. Mostly, it seems like there should = be an index.html file in the WEB-INF directory, but there=92s not one. Anybody got lni working who=92d like to give me a hint? - Rick -- = --- This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ = ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech - This SF.net email is sponsored by: Microsoft = Defy all challenges. Microsoft(R) Visual Studio 2008. = http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Creating a WSDL file for LNI
There was a WSDL file in the original source; it was generated automatically, by an ant target that does not seem to have survived into the brave new world of maven. (You could look at the old LNI patches, see http://web.mit.edu/lcs/www/lni ) However, teh WSDL is still in the DSpace 1.5 source distribution, its path (relative to the tar file downloaded from sourceforge): dspace-lni/dspace-lni-client/src/main/resources/dspace-lni.wsdl This was built automatically by what is now a very old version of Axis, so good luck with it.. What I *really* recommend is to forget about SOAP entirely. You need WebDAV anyway for the GET and PUT methods (package export and ingest), and all of the rest of the functionality is availble through WebDAV too. The SOAP calls are just a thin veneer over WebDAV, only included because of contractual obligations. You may find it much easier to simply use a generic WebDAV client. -- Larry I have been fighting with this for about three days now, so I'm giving up the thought that I can figure it out on my own. =20 I want to call LNI from a .NET client, so I need a WSDL file (at least, I think I do). I've been trying to create one using the Axis JAVA2WSDL command, but I'm always getting a Class not found error. Speaking of which, I assume that the class I'd want to do this to is LNISoapServlet.java... =20 Has anybody done this successfully and, if so, can you tell me how you did it? =20 - Rick =20 =20 --_=_NextPart_001_01C8BC3C.DBC95B09 Content-Type: text/html; charset=us-ascii Content-Transfer-Encoding: quoted-printable html xmlns:o=3Durn:schemas-microsoft-com:office:office = xmlns:w=3Durn:schemas-microsoft-com:office:word = xmlns=3Dhttp://www.w3.org/TR/REC-html40; head meta http-equiv=3DContent-Type content=3Dtext/html; = charset=3Dus-ascii meta name=3DGenerator content=3DMicrosoft Word 11 (filtered medium) style !-- /* Font Definitions */ @font-face {font-family:Californian FB; panose-1:2 7 4 3 6 8 11 3 2 4;} /* Style Definitions */ p.MsoNormal, li.MsoNormal, div.MsoNormal {margin:0in; margin-bottom:.0001pt; font-size:12.0pt; font-family:Times New Roman;} a:link, span.MsoHyperlink {color:blue; text-decoration:underline;} a:visited, span.MsoHyperlinkFollowed {color:purple; text-decoration:underline;} span.EmailStyle17 {mso-style-type:personal-compose; font-family:Californian FB; color:windowtext; font-weight:normal; font-style:normal; text-decoration:none none;} @page Section1 {size:8.5in 11.0in; margin:1.0in 1.25in 1.0in 1.25in;} div.Section1 {page:Section1;} -- /style /head body lang=3DEN-US link=3Dblue vlink=3Dpurple div class=3DSection1 p class=3DMsoNormalfont size=3D2 face=3DCalifornian FBspan = style=3D'font-size: 11.0pt;font-family:Californian FB'Hi = #8211;o:p/o:p/span/font/p p class=3DMsoNormalfont size=3D2 face=3DCalifornian FBspan = style=3D'font-size: 11.0pt;font-family:Californian FB'o:pnbsp;/o:p/span/font/p p class=3DMsoNormalfont size=3D2 face=3DCalifornian FBspan = style=3D'font-size: 11.0pt;font-family:Californian FB'I have been fighting with this for = about three days now, so I#8217;m giving up the thought that I can figure it = out on my own.o:p/o:p/span/font/p p class=3DMsoNormalfont size=3D2 face=3DCalifornian FBspan = style=3D'font-size: 11.0pt;font-family:Californian FB'o:pnbsp;/o:p/span/font/p p class=3DMsoNormalfont size=3D2 face=3DCalifornian FBspan = style=3D'font-size: 11.0pt;font-family:Californian FB'I want to call LNI from a .NET = client, so I need a WSDL file (at least, I think I do).nbsp; I#8217;ve been trying = to create one using the Axis JAVA2WSDL command, but I#8217;m always = getting a #8220;Class not found#8221; error.nbsp; Speaking of which, I assume that the class = I#8217;d want to do this to is = LNISoapServlet.java#8230;o:p/o:p/span/font/p p class=3DMsoNormalfont size=3D2 face=3DCalifornian FBspan = style=3D'font-size: 11.0pt;font-family:Californian FB'o:pnbsp;/o:p/span/font/p p class=3DMsoNormalfont size=3D2 face=3DCalifornian FBspan = style=3D'font-size: 11.0pt;font-family:Californian FB'Has anybody done this successfully = and, if so, can you tell me how you did it?o:p/o:p/span/font/p p class=3DMsoNormalfont size=3D2 face=3DCalifornian FBspan = style=3D'font-size: 11.0pt;font-family:Californian FB'o:pnbsp;/o:p/span/font/p p class=3DMsoNormalfont size=3D3 face=3DCalifornian FBspan = style=3D'font-size: 12.0pt;font-family:Californian FB'- Rick/span/fontfont size=3D2 face=3DArialspan = style=3D'font-size:10.0pt;font-family:Arial'o:p/o:p/span/font/p= p class=3DMsoNormalfont size=3D3 face=3DTimes New Romanspan = style=3D'font-size: 12.0pt'nbsp;/spano:p/o:p/font/p p class=3DMsoNormalfont size=3D3 face=3DTimes New Romanspan = style=3D'font-size:
Re: [Dspace-tech] LNI/WebDAV : GET returns 501
GET only works on Items and Bitstreams, so if you get the resource path to a Bitstream from PROPFIND (depth=1) on an Item, then you ought to be able to do a GET on it and receive its contents. This path looks like you've traversed some communities and collections: http://[server]:8080/lni/dav/dso_123456789%241/dso_123456789%242/dso_123456789%243 You can also get a resource URL by doing a lookup on a Handle, e.g. GET on http://[server]:8080/lni/dav/lookup/handle/123456789/42 returns a Location: header with the URL. I don't think it works on Bitstreams, though. Anyway, if you do a PROPFIND on that resource you're trying to GET, look at the value for type (in the http://www.dspace.org/xmlns/dspace; namespace). It'll be an empty element for the DSpace object type, either collection/, community/, item/, bitstream/ Typically, the Bitstream resources have a path ending in /bitstream_{n}.{suffix}, e.g. .../bitstream_3.pdf So, I think the problem here is that you've got a resource for which GET isn't implemented. For some sketchy doc which is unfortunately the best available, see http://web.mit.edu/lcs/www/lni/ -- Larry I've finally figured out how to hook up a .NET app to LNI and am running some exploratory tests, using straight HTTP/DAV rather than SOAP. =20 Alas, GET doesn't seem to work. I've done a propfind call to get URIs for the bitstreams in a collection, but when I try to GET one of them, I get =20 System.ApplicationException: Error trying to GET http://[server]:8080/lni/dav/dso_123456789%241/dso_123456789%242/dso_123 456789%243 ---=20 System.Net.WebException: The remote server returned an error: (501) Not Implemented. at System.Net.HttpWebRequest.GetResponse() =20 I am unfamiliar with Axis web services, so I'm having to learn everything as I go, which slows me down a lot. This entry in server-config.wsdd listing allowedMethods looks suspicious to me service name=3DDSpaceLNI provider=3Djava:RPC parameter name=3DallowedMethods value=3Dpropfind proppatch = lookup copy/ parameter name=3DclassName value=3Dorg.dspace.app.dav.LNISoapServlet/ /service But the web.xml file has a different class (org.dspace.app.dav.DAVServlet) set up to respond to /dav URIs, so I suspect that's not my problem. =20 - Rick =20 =20 --_=_NextPart_001_01C8C190.AFE31B16 Content-Type: text/html; charset=us-ascii Content-Transfer-Encoding: quoted-printable html xmlns:o=3Durn:schemas-microsoft-com:office:office = xmlns:w=3Durn:schemas-microsoft-com:office:word = xmlns=3Dhttp://www.w3.org/TR/REC-html40; head meta http-equiv=3DContent-Type content=3Dtext/html; = charset=3Dus-ascii meta name=3DGenerator content=3DMicrosoft Word 11 (filtered medium) style !-- /* Font Definitions */ @font-face {font-family:Californian FB; panose-1:2 7 4 3 6 8 11 3 2 4;} /* Style Definitions */ p.MsoNormal, li.MsoNormal, div.MsoNormal {margin:0in; margin-bottom:.0001pt; font-size:12.0pt; font-family:Times New Roman;} a:link, span.MsoHyperlink {color:blue; text-decoration:underline;} a:visited, span.MsoHyperlinkFollowed {color:purple; text-decoration:underline;} span.EmailStyle17 {mso-style-type:personal-compose; font-family:Californian FB; color:windowtext; font-weight:normal; font-style:normal; text-decoration:none none;} @page Section1 {size:8.5in 11.0in; margin:1.0in 1.25in 1.0in 1.25in;} div.Section1 {page:Section1;} -- /style /head body lang=3DEN-US link=3Dblue vlink=3Dpurple div class=3DSection1 p class=3DMsoNormalfont size=3D2 face=3DCalifornian FBspan = style=3D'font-size: 11.0pt;font-family:Californian FB'I#8217;ve finally figured out how = to hook up a .NET app to LNI and am running some exploratory tests, using = straight HTTP/DAV rather than SOAP. nbsp;o:p/o:p/span/font/p p class=3DMsoNormalfont size=3D2 face=3DCalifornian FBspan = style=3D'font-size: 11.0pt;font-family:Californian FB'Alas, GET doesn#8217;t seem to work.nbsp; I#8217;ve done a propfind call to get URIs for the = bitstreams in a collection, but when I try to GET one of them, I = geto:p/o:p/span/font/p p class=3DMsoNormalfont size=3D2 face=3DCalifornian FBspan = style=3D'font-size: 11.0pt;font-family:Californian FB'o:pnbsp;/o:p/span/font/p p class=3DMsoNormalfont size=3D2 face=3DCalifornian FBspan = style=3D'font-size: 11.0pt;font-family:Californian FB'System.ApplicationException: Error = trying to GET = http://[server]:8080/lni/dav/dso_123456789%241/dso_123456789%242/dso_1234= 56789%243 ---gt; o:p/o:p/span/font/p p class=3DMsoNormalfont size=3D2 face=3DCalifornian FBspan = style=3D'font-size: 11.0pt;font-family:Californian FB'System.Net.WebException: The remote = server returned an error: (501) Not Implemented.o:p/o:p/span/font/p p class=3DMsoNormalfont size=3D2 face=3DCalifornian
Re: [Dspace-tech] LNI/WebDAV : GET returns 501
You can see how the lookup feature works by using a simple HTTP client like the GNU wget application to do a GET, and reading its diagnostic output.. for example, this does a lookup of handle 123456789/1 using HTTP Basic authentication to login (assuming you have username/password login on your server): wget [EMAIL PROTECTED] --http-password=PASSWORD -S http://MYDSPACE.mit.edu:8080/lni/dav/lookup/handle/123456789/1 Now, watch the headers -- the request as a whole will fail because wget is aggressive about responding to the HTTP 302 status with another GET on the real resource, but that GET probably isn't allowed or is missing some option. Anyway, just look for the Location: header in the first response: Connecting to rotarran.mit.edu|18.42.4.43|:8080... connected. HTTP request sent, awaiting response... HTTP/1.1 302 These are not the droids you are looking for. Set-Cookie: DSpaceDavAuth=1212468843785!1!18.42.4.8!ac52bf780d4bb6eb486b810a7d 5c4f24; Path=/lni Location: http://rotarran.mit.edu:8080/lni/dav/dso_123456789%241 ^^ Content-Length: 0 Date: Tue, 03 Jun 2008 04:54:03 GMT Server: Apache-Coyote/1.1 Connection: Keep-Alive {Here's the second try, which fails.. ignore it, we've got what we need.} Location: http://rotarran.mit.edu:8080/lni/dav/dso_123456789%241 [following] --00:54:03-- http://rotarran.mit.edu:8080/lni/dav/dso_123456789%241 = `dso_123456789$1' Reusing existing connection to rotarran.mit.edu:8080. HTTP request sent, awaiting response... HTTP/1.1 501 GET method not implemented for Community. Content-Type: text/html;charset=utf-8 Content-Length: 1112 Date: Tue, 03 Jun 2008 04:54:03 GMT Server: Apache-Coyote/1.1 Connection: close 00:54:03 ERROR 501: GET method not implemented for Community.. This ought to show how to implement what you need with Apache HTTPClient or something like that. You're not using Perl, are you? The WebDAV library on CPAN was horribly broken last I looked, not really usable for this at all. Also, note that you don't *need* to go through the hierarchy to get at any Handle in the archive. If you already have the Handle of the resource you're looking for, just do a lookup to get its URL and use that. Traversing the hierarchy is just for exploring, and because it's the model of HTTP and WebDAV. -- Larry Thanks for the helpful response. I've still been unable to do a lookup or a get, but I'm concentrating right now on walking through the hierarchy to learn what turns up and what kind of properties I'll find. When I get comfortable with that, I'll re-read your message and try again. - Rick - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
[Dspace-tech] build cookbook
I've added a page of cookbook-style instructions for integrating local modificatiosn with the new build system, see: http://wiki.dspace.org/index.php/BuildCookbook it builds on the OR08 presentation Customizing DSpace 1.5 with Basic Overlays; in fact, that has other useful examples which might well be added to the wiki page for easier access. -- Larry - Check out the new SourceForge.net Marketplace. It's the best place to buy or sell services for just about anything Open Source. http://sourceforge.net/services/buy/index.php ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] DSpace-tech Digest, Vol 27, Issue 3
See http://wiki.dspace.org/index.php/Simple_LNI_Client for an example of the same idea in Java. It uses only the WebDAV protocol. Perhaps you can use that code as a model for your C# application. When you're done, please consider posting your client on the wiki too. -- Larry I'm working on a project that has to be able to insert items on DSpace thro= ugh a C# application... I am going to use LNI (LightWeightNetworkInterface) and a WebDAV C# library= that has been realesed. =20 Does anyone has done this before ? I think i will need a little help here..= .=20 =20 What do you think the software that i'm going to use? Is it the correct one= for this ? =20 Best regards, Guilherme Saraiva=20 _ Cansado de espa=E7o para s=F3 50 fotos? Conhe=E7a o Spaces, o site de relac= ionamentos com at=E9 6,000 fotos! http://www.amigosdomessenger.com.br= --_414b706f-7072-413e-8b54-b083041eedf5_ Content-Type: text/html; charset=iso-8859-1 Content-Transfer-Encoding: quoted-printable html head style .hmmessage P { margin:0px; padding:0px } body.hmmessage { FONT-SIZE: 10pt; FONT-FAMILY:Tahoma } /style /head body class=3D'hmmessage' BRHello again.BR nbsp;BR I'm working on a project that has to be able to insert items on DSpace thro= ugh a C# application...BR I am going to use LNI (LightWeightNetworkInterface) and a WebDAV C# library= that has been realesed.BR nbsp;BR Does anyone has done this before ? I think i will need a little help here..= . BR nbsp;BR What do you think the software that i'm going to use? Is it the correct one= for this ?BR nbsp;BR Best regards,BR Guilherme Saraivanbsp;BRbr /hr /Instale a Barra de Ferramentas com D= esktop Search e ganhe EMOTICONS para o Messenger! a href=3D'http://www.msn= .com.br/emoticonpack' target=3D'_new'=C9 GR=C1TIS!/a/body /html= --_414b706f-7072-413e-8b54-b083041eedf5_-- --===1635202196== Content-Type: text/plain; charset=us-ascii MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Content-Disposition: inline - Sponsored by: SourceForge.net Community Choice Awards: VOTE NOW! Studies have shown that voting for your favorite open source project, along with a healthy diet, reduces your potential for chronic lameness and boredom. Vote Now at http://www.sourceforge.net/community/cca08 --===1635202196== Content-Type: text/plain; charset=us-ascii MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Content-Disposition: inline ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech --===1635202196==-- - Sponsored by: SourceForge.net Community Choice Awards: VOTE NOW! Studies have shown that voting for your favorite open source project, along with a healthy diet, reduces your potential for chronic lameness and boredom. Vote Now at http://www.sourceforge.net/community/cca08 ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Filter-media in 15
It _should_ never get an NPE over a missing configuration key. Given our sparse testing resources it behooves us to make the code as resilient as possible. Could you submit a bug report about this (or, even better, a patch) -- ideally it could would *log an error* if the configuration property is missing and set filternames to some default that may be illogical (i.e. empty list of plugins) but at least won't blow up. The error log entry explains the exact problem to the admin, who may not have known about the new configuration key, but can now tell exactly what to look for. See org.dspace.core.PluginManager for an example; it has a lot of protection against configuration errors because those are to be expected when plugin configurations get changed at every site. -- Larry I see that it blows up on: String enabledPlugins = ConfigurationManager.getProperty(MEDIA_FILTER_PLUGINS_KEY); filterNames = enabledPlugins.split(,\\s*); But when I check the dspace.cfg file I don't see a MEDIA_FILTER_PLUGINS_KEY Filter-media will need to be run as a cron job in 1.5, right? Thanks! Jose -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Blanco, Jose Sent: Friday, July 11, 2008 11:14 AM To: dspace-tech@lists.sourceforge.net Subject: [Dspace-tech] Filter-media in 15 When I run filter-media in 1.5, I'm getting: -bash-3.2$ ./filter-media Applying Media Filters Exception in thread main java.lang.NullPointerException at org.dspace.app.mediafilter.MediaFilterManager.main(MediaFilterManager.ja va:240) - Sponsored by: SourceForge.net Community Choice Awards: VOTE NOW! Studies have shown that voting for your favorite open source project, along with a healthy diet, reduces your potential for chronic lameness and boredom. Vote Now at http://www.sourceforge.net/community/cca08 ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech - Sponsored by: SourceForge.net Community Choice Awards: VOTE NOW! Studies have shown that voting for your favorite open source project, along with a healthy diet, reduces your potential for chronic lameness and boredom. Vote Now at http://www.sourceforge.net/community/cca08 ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech - Sponsored by: SourceForge.net Community Choice Awards: VOTE NOW! Studies have shown that voting for your favorite open source project, along with a healthy diet, reduces your potential for chronic lameness and boredom. Vote Now at http://www.sourceforge.net/community/cca08 ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] [Dspace-devel] Batch Import document
There _was_ documentation for the venerable batch importer in the HTML pages in the docs subdirectory; in a source checkout, it's at the relative path: dspace/trunk/dspace/docs/application.html#itemimporter (I can't get the location in the Subversion repository because Sourceforge seems to be having a bad hair day..) Your new PDF document is better -- could you at least put it up on the wiki for how? The default METS SIP (Submission Information Package) accepted by the package ingester is also mentioned in the HTML docs: dspace/trunk/dspace/docs/application.html#packager but the link to a document describing the METS SIP profile is broken: http://www.dspace.org/standards/METS/SIP/profilev1p0/metsipv1p0.pdf There _is_ a page about it on the WIKI, although it could use some cleanup: http://wiki.dspace.org/index.php/DSpaceMETSSIPProfile It could use an annotated example of a complete submission package, I think that would be the most helpful part for anyone putting their own SIPs together. (They should also see the DSpace SIP Toolkit wiki page) -- Larry It's a funny thing, the DSpace Batch Import Format (of which there are two) are not documented anywhere other than looking at the source code. We've run across the situation where a department has some content to put into the repository and the resources to create a batch import, but just don't know how to do it. To solve this problem, we've created a simple hand out to provide to a programmer or technical person in another department or group when they want to develop their own batch import script. While it doesn't cover the advanced topics, like registering bitstreams etc, nor dose it go over the package importer, it should solve the problem for the vast majority of cases where someone needs to create a batch import. I thought this might be useful to others out there, the only other resource on creating batch imports I was able to find is Dorothea's blog: http://cavlec.yarinareth.net/2008/01/07/the-dspace-batch- importer/. Here is our handout: https://www.tdl.org/documents/DSpaceBatchImportFormat.pdf Scott-- - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100url=/ ___ Dspace-devel mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/dspace-devel - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100url=/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] [Dspace-devel] Batch Import document
While we're talking about the fate of DSpace documentation, I'd like to put in a plea to maintain a Web-friendly version *as well*. When consulting the doc for other open-source projects (usually because they are components of DSpace..) I find the online HTML version of the doc a lot more handy than PDF: it's easier to navigate, faster to get to, easier to read on a computer display. PDF is a whole lot better suited to printing than to viewing online. The PostgreSQL site is an excellent example of an online manual -- see http://www.postgresql.org/docs/8.2/interactive/index.html Note that it also has a place to attach user comments (along the lines of a blog). These often have very useful hints and tidbits. -- Larry - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100url=/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Reusing bitstream sequence number
What is it that you need to accomplish? I need to be able to have a stable reference for a bitstream which might undergo a number of version updates. I do not expect to save the earlier versions at this time. Mark is correct about overloading the semantics here. Note, We adjusted the behavior behind the dspace 1.5 XMLUI (but not the JSPUI) to allow for unsequenced name resolution of the bitstreams. For instance: http://dspace.mit.edu/bitstream/handle/1721.1/39126/ womenpolicymakers_census_dta.tab http://dspace.mit.edu/bitstream/handle/1721.1/39126/ womenpolicymakers_census_dta.tab?sequence=3 http://dspace.mit.edu/bitstream/handle/1721.1/39126/3/ womenpolicymakers_census_dta.tab Are now all valid references the bitstream at this location. In the case where the sequence number is absent, the first bitstream encountered in the Item with that name is returned. I think this is what I need. Was this done via xslt or in servlet code. This is also possible in the JSPUI with the html servlet; it was intended to let DSpace serve an archived website. In certain circumstances (at least in 1.4, it was when the primary bitstream appeared to be an HTML file) the item page uses the html servlet instead of the bitstream servlet in the links to bitstreams, and only shows the primary bitstream. For example, these are equivalent references to bitstreams, but note how the second one doesn't have the Sequence ID: http://mydemo/jspui/bitstream/123456789/454/4/2dExports/stata/dxf/a3_6-10.dxf http://mydemo/jspui/html/123456789/454/2dExports/stata/dxf/a3_6-10.dxf There is another problem with Sequence IDs, however. None of the current item migration/transfer tools preserve them. The AIP packager (see the wiki page about the PLEDGE prototype) managed it. Currently, if an Item gets moved from one DSpace to another, the Sequence IDs can get scrambled. As Richard mentions, bitstream names are not necessarily unique. Hopefully the 2.0 data model will include some sort of relationship metadata so the code can do better than randomly choosing the first bitstream matching the correct name. -- Larry - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100url=/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Adding your own customizations into DSpace 1.5.1
Can someone give me the short answer about where to put our local customizations for DSpace 1.5.1 and exactly what procedures we need to do to have them end up in our application? This wiki page: http://wiki.dspace.org/index.php/BuildCookbook was intended to cover this question -- please consider adding notes about the jspui and updating it for 1.5.1 if necessary -- Larry -- This SF.net email is sponsored by: SourcForge Community SourceForge wants to tell your story. http://p.sf.net/sfu/sf-spreadtheword ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Installing LNI in DSpace 1.5.1
The LNI is already included in the 1.5.1 release as a maven package. There is very scant mention of it in the documentation, nothing about how to install it and access the webapp. It's probably already installed on your server. Is there a webapp named dspace-lni next to dspace-jspui and the others? If not, I can't help, you'll need a maven expert to figure it out. The wiki page is out of date and incomplete -- it was badly mangled in the translation from the old wiki. I recommend you read my preserved copy of the original doc: http://web.mit.edu/lcs/www/lni Just ignore the install instructions; those are for 1.4. Also search the DSpace wiki for the simple LNI client. When you get it working, please consider updating the wiki page to prevent others from being mislead. -- Larry I have successfully installed DSpace 1.5.1 in an Ubuntu 8.10 machine but no= w want to use LNI to integrate a PHP application with a DSpace repository via SOAP. I read instructions in http://wiki.dspace.org/index.php/LightweightNetworkInterface/DownloadsAndCl= ients#Install_Serverbut I could neither find nor deduce a way to install LNI server component in my DSpace. Could anybody provide some guidance to do this? The page says it can be installed as a plugin, but I cannot find a relationship between that statement and the code referred there ( http://dspace.svn.sourceforge.net/svnroot/dspace/branches/dspace-1_5_x/dspa= ce-lni/). I did a checkout of that folder and run mvn package which generated a targe= t folder in earch directory (core, client and webapp). By reading instruction= s for 1.4 (even though it is stated the installation is quite different) I tried to deploy the war generated in dspace-lni-webapp/target folder but n= o success when trying to start the application. Any clue about the right way of installing LNI in DSpace 1.5.x is really appreciate. Thanks in advance. Best regards, Luis Gal=E1rraga -- Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA -OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise -Strategies to boost innovation and cut costs with open source participation -Receive a $600 discount off the registration fee with the source code: SFAD http://p.sf.net/sfu/XcvMzF8H ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Installing LNI in DSpace 1.5.1
Furthermore, my aim is to implement a kind of integration between DSpace and J!Research (http://joomla-research.com). That is why it is important for me interact with DSpace via SOAP or WebDAV. Thanks a lot. If you have a choice, I recommend sticking with WebDAV (since you'll need it anyway for import and export), it avoids all of the compatibility misery of SOAP. For an example, see http://wiki.dspace.org/index.php/Simple_LNI_Client -- Larry -- Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA -OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise -Strategies to boost innovation and cut costs with open source participation -Receive a $600 discount off the registration fee with the source code: SFAD http://p.sf.net/sfu/XcvMzF8H ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] script to validate all PDFs ?
Does anyone have a script that checks all of the previously uploaded PDFs and find ones that are malformed and reports their URLs/record IDs? I think it's most appropriate to do this with the MediaFilter mechanism. The default DSpace (1.5.1) distribution includes the plugin: org.dspace.app.mediafilter.PDFFilter which extracts text from a PDF. To do that, it interprets the PDF contents with PDFBox, which is your asthmatic canary in the gassy coal mine that is PDF. You can count on it to keel over even on some files that are roughly legal and can be rendered by xpdf and Adobe Acrobat Reader. Running media-filter will log the Handle of failed Items in the DSpace log. See the manual for more info. It's a lot easier, and sounder practice, to leverage the existing media filter infrastructure than to go digging into the database and assetstore -- that implementation may change even in minor releases and configuration changes. If you want to get more aggressive and precise about validating the PDF, rather than just ensuring it is probably not corrupt, look into JHOVE at http://hul.harvard.edu/jhove/ and keep an eye on JHOVE2 http://confluence.ucop.edu/display/JHOVE2Info/Home -- Larry I can see how to write a script that uses the unix command line 'file' and 'pdftops' tools to check that every file that looks like a PDF is a good and valid PDF. Going from a file on the disk to a database record I'm not too sure of. cheers stuart -- Stuart Yeates http://www.nzetc.org/ New Zealand Electronic Text Centre http://researcharchive.vuw.ac.nz/ Institutional Repository -- Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA -OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise -Strategies to boost innovation and cut costs with open source participation -Receive a $600 discount off the registration fee with the source code: SFAD http://p.sf.net/sfu/XcvMzF8H ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech -- Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA -OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise -Strategies to boost innovation and cut costs with open source participation -Receive a $600 discount off the registration fee with the source code: SFAD http://p.sf.net/sfu/XcvMzF8H ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Java Heap dumps during Filter-Media
The PDFBox library is _always_ going to be a problem because of its architecture. It insists on reading the entire PDF document, images included, into memory. This is not necessary, PDF was explicitly designed to let renderers process a page at a time in limited memory. Perhaps it could gain a lot by adding a mode where it ignores images (e.g. for text extraction, it is a complete waste of time to even read them into memory since it won't be getting any text out of them). I took a different approach that may be helpful to sites with a lot of PDF content that is pathological to PDFBox. I wrote a couple of filters that invoke the XPDF utilities as external OS-level command processes to do the dirty work. They are a bit more complicated to maintain since they rely on outside programs that have to be installed, but I've found the xpdf tools to be simple to install and maintain. The XPDF-based text extractor is about three times as fast as PDFBox and the only inputs it failed on PDFs were corrupt. There were also no issues with heap space since it runs outside of the JVM. See patch #2745393 for the code: https://sourceforge.net/tracker/?func=detailaid=2745393group_id=19984atid=319984 -- Larry -- This SF.net email is sponsored by: High Quality Requirements in a Collaborative Environment. Download a free trial of Rational Requirements Composer Now! http://p.sf.net/sfu/www-ibm-com ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Java Heap dumps during Filter-Media
Larry, I assume this is a donation to DSpace? If so I'll commit it so its available for testing/use in the 1.5.2 release. Sure, go ahead, although I won't have time to provide better documentation (for a while at least, maybe ever). My time on the FACADE project which produced this code is ending tomorrow, april 10; that's also the end of my time at MIT. I'm working desperately to finish other parts of the project and do not have any time to spend on this, that's why I just threw it over the wall because it looked like it could be useful right now. Eventually all of the code I produced for FACADE will be made available as open source; keep an eye on http://facade.mit.edu/ .. Not sure when this will happen, though. I'm not looking at any of the JIRA stuff (don't even have access yet) so if there's anything there that needs my attention, please send me personal mail -- I'm deleting anything with JIRA in the subject. Thanks, and enjoy.. -- Larry On Thu, Apr 9, 2009 at 10:56 AM, Graham Triggs gra...@biomedcentral.comwrote: Nice work Larry, I've replaced our PDF text extraction and thumbnail generation with this code. Thankfully, running on Debian, adding the third party tools was as hard as apt-get install xpdf ;) I actually ran into a few more difficulties with the ImageIO libraries - it's a pity that you don't get a simple ClassNotFoundException to be able to report this more clearly. But aside from that, my limited tests seem to work quite well. G -Original Message- From: Larry Stone [mailto:l...@mit.edu] Sent: 08 April 2009 22:21 To: Tim Donohue Cc: DSpace Tech; Jeffrey Trimble Subject: Re: [Dspace-tech] Java Heap dumps during Filter-Media The PDFBox library is _always_ going to be a problem because of its architecture. It insists on reading the entire PDF document, images included, into memory. This is not necessary, PDF was explicitly designed to let renderers process a page at a time in limited memory. Perhaps it could gain a lot by adding a mode where it ignores images (e.g. for text extraction, it is a complete waste of time to even read them into memory since it won't be getting any text out of them). I took a different approach that may be helpful to sites with a lot of PDF content that is pathological to PDFBox. I wrote a couple of filters that invoke the XPDF utilities as external OS-level command processes to do the dirty work. They are a bit more complicated to maintain since they rely on outside programs that have to be installed, but I've found the xpdf tools to be simple to install and maintain. The XPDF-based text extractor is about three times as fast as PDFBox and the only inputs it failed on PDFs were corrupt. There were also no issues with heap space since it runs outside of the JVM. See patch #2745393 for the code: https://sourceforge.net/tracker/?func=detailaid=2745393group_id=19984atid=319984 -- Larry -- This SF.net email is sponsored by: High Quality Requirements in a Collaborative Environment. Download a free trial of Rational Requirements Composer Now! http://p.sf.net/sfu/www-ibm-com ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech -- This SF.net email is sponsored by: High Quality Requirements in a Collaborative Environment. Download a free trial of Rational Requirements Composer Now! http://p.sf.net/sfu/www-ibm-com ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech -- Mark R. Diggory http://purl.org/net/mdiggory/homepage - Bio http://www.atmire.com - Institutional Repository Solutions http://www.togather.eu - Before getting together, get t...@ther --001636c5b1fac033c2046723300c Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Larry, I assume this is a donation to DSpace? If so I#39;ll commit it so i= ts available for testing/use in the 1.5.2 release.brbrMarkbrbrbr= div class=3Dgmail_quoteOn Thu, Apr 9, 2009 at 10:56 AM, Graham Triggs s= pan dir=3Dltrlt;a href=3Dmailto:gra...@biomedcentral.com;gra...@biom= edcentral.com/agt;/span wrote:br blockquote class=3Dgmail_quote style=3Dborder-left: 1px solid rgb(204, = 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;Nice work Larry,= br br I#39;ve replaced our PDF text extraction and thumbnail generation with thi= s code.br br Thankfully, running on Debian, adding the third party tools was as hard as = quot;apt-get install xpdfquot; ;)br br I actually ran into a few more difficulties with the ImageIO libraries - it= #39
[Dspace-tech] authority control proposal
I have to add an authority control mechanism to DSpace for an institutional repository, so I'm doing it as modification to the 1.5.2 source in the hope it will get adopted into 1.6. To begin discussion, I put up a wiki page about the design: http://wiki.dspace.org/index.php/Authority_Control_of_Metadata_Values Since I have to get this into production locally in the fairly near future, please read it and respond promptly so there is time to consider your comments. There are also a few opportunities to fill in work I will not have time to do (JSPUI support, for example) so let me know if you're interested in volunteering to help. -- Larry -- The NEW KODAK i700 Series Scanners deliver under ANY circumstances! Your production scanning environment may not be a perfect world - but thanks to Kodak, there's a perfect scanner to get the job done! With the NEW KODAK i700 Series Scanner you'll get full speed at 300 dpi even with all image processing features enabled. http://p.sf.net/sfu/kodak-com ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] [Dspace-general] Converte the Bibtex into qualified DC
If the bibtex metadata is already in XML, I recommend putting it right in the METS manifest in your SIP, and then add a crosswalk to DSpace to interpret it. This way you don't lose any information before getting into DSpace, and the crosswalk can translate Bibtex directly into the exact metadata fields your DSpace is using. To do this: 1. make up a different name for the MDTYPE attribute in the METS, e.g. bibtex, so the manifst looks like: dmdSec ... mdWrap ... MDTYPE=bibtex ...the Bibtex metadata 2. add line to your DSpace configuration to tell METS ingester what crosswalk to use for MDTYPE=bibtex: mets.submission.crosswalk.bibtex = bibtex2dspace 3. configure the bibtex2dspace crosswalk, the easiest way is to implement it as an XSLT stylesheet, e.g. add this to configuration: crosswalk.submission.bibtex2dspace.stylesheet = crosswalks/bibtex.xsl ...then just write the stylesheet; see the wiki and manual for examples. hope that helps, -- Larry On May 26, 2009, at 1:51 PM, Robin Wang wrote: Hi everyone, I wanna use the DSpaceSIP from Larry Stone and converte my bibtex xml file for original pdf file into qualified DC Record that i can insert the DC Record into a Mets Record for DSpace later. i know that i should use the XSLT to do it. but does anyone here know,what kind of a qualified DC should i build into METS dmdSec ID=DMD DSpaceID mdWrapQualified Dublin Core metadata /mdWrap /dmdSec dmdSec ID=dmd002 mdWrap MIMETYPE=text/xml MDTYPE=DC LABEL=Dublin Core Metadata xmlData dc:titleAlice's Adventures in Wonderland/dc:title dc:creatorLewis Carroll/ dc:creator dc:datebetween 1872 and 1890/ dc:date dc:publisherMcCloughlin Brothers/dc:publisher dc:typetext/dc:type /xmlData /mdWrap /dmdSec I do know that i should add an exetension schema for the DC, but can anyone tell me, what kind of an exetension schema for qualified DC in dspace should i give?? Or should i use the mods (http://www.loc.gov/standards/mods/v3/mods-3-1.xsd ) xsi:schemaLocation=http://www.loc.gov/mods/v3http://www.loc.gov/standards/mods/v3/mods-3-3.xsd http://www.loc.gov/standards/mods/v3/mods-userguide-examples.html#digitized_book Which one will be better for LNIClient??? Thanks Robin Posteingang immer voll? Der erste Speicher, der mitwächst: Unbegrenzter Speicher bei Windows Live Hotmail! ___ Dspace-general mailing list dspace-gene...@mit.edu http://mailman.mit.edu/mailman/listinfo/dspace-general -- Register Now for Creativity and Technology (CaT), June 3rd, NYC. CaT is a gathering of tech-side developers brand creativity professionals. Meet the minds behind Google Creative Lab, Visual Complexity, Processing, iPhoneDevCamp as they present alongside digital heavyweights like Barbarian Group, R/GA, Big Spaceship. http://p.sf.net/sfu/creativitycat-com ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
[Dspace-tech] Looking for Autocomplete for the XMLUI
Has anyone tried to add an autocomplete feature to the Manakin UI (XMLUI)? By autocomplete, I mean the Web UI trick where a text field is checked, as you type, against a list of possible completions and automatically filled in, or completed, by inserting text into the field or posting a dropdown menu of possible completions. For example, the AjaxTags library has it under Autocomplete, see http://ajaxtags.sourceforge.net/ -- but that's specifically for a JSP-based UI. Autocomplete usually seems to be implemented as part of a comprehensive Javascript UI toolkit, and most of those are large, complex, standalone, and/or do not play well when integrated with another vertical-web-UI-solution like Cocoon. So, do you know of any such toolkit that _is_ easy to integrate with Cocoon and/or Manakin? Has anyone developed their own autocomplete? I've been experimenting with the Yahoo UI widgets (http://developer.yahoo.com/yui/ ); it's well-documented and seems reasonably lightweight and modular, but it has some drawbacks. I'm hoping there is somebody already at the top of this learning curve who can see over this swamp from such a lofty vantage. thanks, -- Larry -- OpenSolaris 2009.06 is a cutting edge operating system for enterprises looking to deploy the next generation of Solaris that includes the latest innovations from Sun and the OpenSource community. Download a copy and enjoy capabilities such as Networking, Storage and Virtualization. Go to: http://p.sf.net/sfu/opensolaris-get ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Looking for Autocomplete for the XMLUI
Thanks -- between this and AjaxTags for JSP, there is something of a consensus on script.aculo.us -- I'd passed over it at first since it looked infernally complicated but on second glance it's no worse than YUI, and perhaps more flexible. It certainly has a different idea of AJAX data, it can apparently work from an XHTML fragment. Thanks also to George, but since this is also for an authority-control prototype I'm hoping to use something already supported and maintained for the popular browser platforms, so that makes a homebrew solution less desireable. It is actually not too difficult to produce the AJAX responses through a special Cocoon pipeline. I wrote a simple generator that emits SAX events and added a pipline that essentially leaves it alone; might be easier than adding a servlet. I'll have that in the prototype. This way, users of other AJAX libraries can just add transformation steps to the pipeline to get the XML schema they need. The XHTML-XHTML translation is a neat trick. For the prototype I'm working with a slightly extended DRI (to add metadata authority values since they have to get pre-loaded into forms for the round trip), so I'll stick with DRI-XHTML for now. OBTW, I did notice the Cocoon Forms stuff but figured since the DRI/ Wing architecture sidesteps it completely, it would not be productive to try to use it. thanks, -- Larry On Jun 4, 2009, at 12:27 PM, Tim Donohue wrote: At Illinois, we are using Scriptaculous (http://script.aculo.us/) with the XMLUI to perform autocomplete on our Subject, Publisher and Author fields. (For Author, instead of having separate lastname and firstname fields, we combine it all in one field which expects 'last, first m' format). We actually add in the necessary Javascript code for each of these fields in our Theme by using an XSL which transforms XHTML - XHTML. So, we have our theme's sitemap.xmap setup to do the following: (1) Run through our normal DRI - XHTML conversion via XSLT (2) If the path matches */*/*/submit/*.continue (i.e. in Submission process), run the resulting XHTML through another XSLT to add in the autocomplete javascript for the necessary form fields. On the server side of things, we did have to create a custom Java Servlet (which we registered in the web.xml alongside Cocoon), to process those AJAX requests and return the properly formatted results. So, essentially, our autocomplete functionality is running *outside* of Cocoon. There might be an easier way to do all of this...but, this is essentially what we came up with as a relatively quick solution. If it is of interest, I am willing to share the code we've written for this. - Tim -- OpenSolaris 2009.06 is a cutting edge operating system for enterprises looking to deploy the next generation of Solaris that includes the latest innovations from Sun and the OpenSource community. Download a copy and enjoy capabilities such as Networking, Storage and Virtualization. Go to: http://p.sf.net/sfu/opensolaris-get ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Producing mets.xml for SWORD
The Harvard METS toolkit gives you an convenient (though DOM-like) API for the METS data model, takes care of some of the housekeeping, and it has logic to prevent schema mistakes: http://hul.harvard.edu/mets/ There's also a toolkit that helps you build a whole submission package, including the METS. It ought to work with SWORD too; see: http://wiki.dspace.org/index.php/DSpace_SIP_Toolkit (If you develop any tricks or procedures for using it with SWORD, please add them to that wiki page!) -- Larry On Jun 5, 2009, at 5:25 AM, mikan.d.dspace listmail wrote: Im hoping to use SWORD in order to deposit items to DSpace. What would be the best way to produce mets-metadata for the items to be submitted? Should I write the entire metadata file myself or is there templates / frameworks available for this? Thanks, Mika -- OpenSolaris 2009.06 is a cutting edge operating system for enterprises looking to deploy the next generation of Solaris that includes the latest innovations from Sun and the OpenSource community. Download a copy and enjoy capabilities such as Networking, Storage and Virtualization. Go to: http://p.sf.net/sfu/opensolaris-get ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Export/import of noncontent objects (Epersons, etc.)?
I recommend looking at the old AIP prototype (see http://wiki.dspace.org/index.php/AipPrototype for doc and fossilized code) for a start on this. It extended the METS implementation to cover Communities, Collections, and every aspect of Items, including most administrative metadata. Although it stopped short of fully representing the EPerson, Group and Policy objects those would be straightforward to add. It does show some of the issues involved in building a copy of an archive from scratch. Migrating and mirroring content between repositories was one of the use cases for AIPs. *Every* existing interchange mechanism (batch import, packager, etc) loses *some* details of the Item and its child objects. Only the AIP was complete, and iirc it also had a little bug or two (e.g. bitstream sequence IDs weren't always restored perfectly). good luck! -- Larry On Jun 17, 2009, at 4:56 PM, Mark H. Wood wrote: We're working with a partner who want to keep a separate test instance with content tracking the (sizable) live repository fairly closely. The requirement that I've been given is to entirely replace the content from live every week or two. Deleting 17,000 items (and over 20,000 bitstreams) is an all-day operation, and then comes the loading phase. It would save a lot of time if I could export the Community/Collection structure, EPerson and Group objects, registries, and anything else that's not an Item, Bundle, or Bitstream; drop and recreate the database; empty the assetstore and history; reload the noncontent tables; and then begin loading. So I'm looking at adding export/import for all of those objects, probably to XML. In the case of Community and Collection I guess the best thing would be to just do a single exporter producing the same XML dialect consumed by the existing Community and Collection Structure Importer. Likewise for the registries, it seems. The other classes would need importers built as well as exporters. Comments? Or is there a smarter way to make a consistent clone of a DSpace instance, with its own Handles, that is writable but doesn't affect the original? (The Handle business, plus the need to quiesce the production site to ensure consistency across database and assetstore, is why I don't just use tar and pg_dump.) -- Mark H. Wood, Lead System Programmer mw...@iupui.edu Friends don't let friends publish revisable-form documents. -- Crystal Reports - New Free Runtime and 30 Day Trial Check out the new simplified licensing option that enables unlimited royalty-free distribution of the report engine for externally facing server and web deployment. http://p.sf.net/sfu/businessobjects___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech -- Crystal Reports - New Free Runtime and 30 Day Trial Check out the new simplified licensing option that enables unlimited royalty-free distribution of the report engine for externally facing server and web deployment. http://p.sf.net/sfu/businessobjects ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Question regarding LNI ingest of Packages and workflow
See these pages for an easy way to get started making SIPs. A package is just a manifest plus content files, so it can contain anything so long as the METS manifest describes it accurately. http://wiki.dspace.org/index.php/DSpace_SIP_Toolkit http://wiki.dspace.org/index.php/Simple_LNI_Client As for the LNI, when originally written it supported workflow and it's in the architecture but I don't know how recently it has been tested. Submitting to a collection with workflow is no different, but there are also some URLs to access workflow objects. See the original documentation here: http://web.mit.edu/lcs/www/lni/ The version on the wiki is badly mutilated and hard to read, it never got repaired after a failed wiki format conversion. /lcs On Jul 14, 2009, at 8:01 AM, Vidya R wrote: I am testing with DSpace version - 1.5.1 and am working to integrate our LMS to the DSpace repository through LNI. I waould want add different types of content through the interface. I have successfully run the LNI with the smoke test and have been successful in ingesting a few contents of PDF type into the repository. I find that in order to ingest other contents through LNI, they need to be converted to SIP - conforming to METS / MODs package format. Could anyone explain if I can ingest any type of content through METS package format? Also I would want to know if I can use LNI to ingest items to collections configured with workflows. I get an error : the item is in state = 3. unknown. If I alter the existing LNI code will it be possible for me to ingest items with workflow? Workflow of contents is one of our basic requirements. Please reply. Thanks Vidya -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] ORACLE - FATAL org.dspace.storage.rdbms.InitializeDatabase @ Caught Exception
This message means there is a problem in your JDBC configuration: probably db.url has the wrong hostname or port number. Also check db.username and db.password. Make sure the same values work with a simpler Oracle client like SQLPlus (although you'll have to rearrange the URL, it has a different pattern). /lcs On Sep 2, 2009, at 12:41 PM, José Eslava wrote: java] 2009-09-02 10:25:08,232 FATAL org.dspace.storage.rdbms.InitializeDat abase @ Caught exception: [java] java.sql.SQLException: Excepci¾n de E/S: The Network Adapter could n ot establish the connection -- Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] work around authentication mechanism
If you just want to deny all access based on the requestor's IP address, that is best done in the web server or servlet container. If you're using naked Tomcat, see the doc for org.apache.catalina.valves.RemoteAddrValve. If you're using Apache httpd it's very easy to configure, just see the server docs. -- Larry On Sep 10, 2009, at 9:50 PM, Van Ly wrote: Hi, I may have a situation where one of the items in the list for `plugin.sequence.org.dspace.eperson.AuthenticationMethod' isn't behaving as expected. To work around, if I need to put up a firewall to restrict access based on ip-address and bypass the authentication mechanism entirely, what would be a way? Thanks in advance. Van Ly vly at usyd dot edu dot au -- Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech smime.p7s Description: S/MIME cryptographic signature -- Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] SMTP server for mail question
Like a lot of things, the answer is that depends. The SMTP server is usually an MTA (Mail Transfer Agent), or post-office server, that is expected to be able to receive mail as well as send it. However, when you're part of a big site like my.edu, you can often get away with setting up an MTA on your local machine that sends out mail with a return address of u...@my.edu, so it does _not_ have to be open to listening to the world at large. It sends the mail out, and if it bounces, those bounce messages go to the MTA for my.edu. I like to set up a local service on the same machine hosting DSpace just so that outgoing mail never gets lost because the other machine or the network is down. It can be a simple outgoing-only Sendmail (or equivalent Postfix, exim, etc) that stores and forwards outbound messages. It fixes up headers so the mail appears to come from my.edu, bounces go back to my.edu, and thus it wouldn't need to have port 25 open because nobody should ever be talking to it. In fact, that's probably a good idea, since if you did have it open you'd have to know how to configure it so as not to relay SPAM. -- Larry On Oct 6, 2009, at 4:06 PM, Jeffrey Trimble wrote: Does the SMTP server need to be publicly available for DSpace to work correctly? Our networking engineers are opening a firewall for another server and have questioned the opening of port 25. TIA, Jeff Jeffrey Trimble System LIbrarian William F. Maag Library Youngstown State University 330.941.2483 (Office) jtrim...@cc.ysu.edu http://www.maag.ysu.edu http://digital.maag.ysu.edu I must not fear. Fear is the mind-killer. I will permit it to pass over me and through me... --Litany against fear -- Come build with us! The BlackBerry(R) Developer Conference in SF, CA is the only developer event you need to attend this year. Jumpstart your developing skills, take BlackBerry mobile applications to market and stay ahead of the curve. Join us from November 9 - 12, 2009. Register now! http://p.sf.net/sfu/devconference___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech -- Come build with us! The BlackBerry(R) Developer Conference in SF, CA is the only developer event you need to attend this year. Jumpstart your developing skills, take BlackBerry mobile applications to market and stay ahead of the curve. Join us from November 9 - 12, 2009. Register now! http://p.sf.net/sfu/devconference ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Filtering errors with XPDF Tools
These errors imply that the JVM cannot access the executable file. Since the file exists, make sure the user under whose UID the JVM is running has read and execute access to the file /usr/bin/pdftoppm (as well as the /usr and /usr/bin directories, although those are usually world-rx). Better yet, use su or sudo to assume the UID under which the JVM is running and make sure you can actually run that command. Java IOExceptions have a way of conflating all file-access errors to no such file or directory, even when there's e.g. a permission problem, which makes it more of a challenge to discover what is really wrong. -- Larry On Oct 20, 2009, at 4:54 PM, Kurzenberger, Eric wrote: Error: ERROR filtering, skipping bitstream: Item Handle: 10538/148 Bundle Name: ORIGINAL File Size: 6898424 Checksum: 4637dd47354393dc87f3d2b881ec311d (MD5) Asset Store: 0 java.io.IOException: Cannot run program /usr/bin/pdftoppm : java.io.IOException: error=2, No such file or directory java.io.IOException: Cannot run program /usr/bin/pdftoppm : java.io.IOException: error=2, No such file or directory at java.lang.ProcessBuilder.start(ProcessBuilder.java:459) at java.lang.Runtime.exec(Runtime.java:593) at java.lang.Runtime.exec(Runtime.java:466) at org.dspace.app.mediafilter.XPDF2Thumbnail.getDestinationStream smime.p7s Description: S/MIME cryptographic signature -- Come build with us! The BlackBerry(R) Developer Conference in SF, CA is the only developer event you need to attend this year. Jumpstart your developing skills, take BlackBerry mobile applications to market and stay ahead of the curve. Join us from November 9 - 12, 2009. Register now! http://p.sf.net/sfu/devconference___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Differences in db with upgrade and fresh install
I just noticed that too, while contemplating a switch from Oracle to PostgreSQL. The Oracle schema also still has the mets_bitstream_id column in the Bundle table, which is no longer used and has been dropped. I'll start a JIRA issue to fix this. (OBTW, before you switch over to Oracle, you ought to be aware that many DSpace tables use CLOBs, and those can make life very difficult because most of Oracle's own tools (e.g. SQL*Plus) don't work with them.) thanks, -- Larry On Dec 15, 2009, at 5:02 AM, Bram Luyten wrote: Hello Francis, these tables are indeed deprecated and not in use anymore. best regards, Bram @mire - http://www.atmire.com Technologielaan 9 - 3001 Heverlee - Belgium 533 2nd Street - Encinitas, CA 92024 - USA http://www.togather.eu - Before getting together, get t...@ther On Tue, Dec 15, 2009 at 10:50 AM, Brouns, Francis francis.bro...@ou.nl wrote: Hi, We are in the process of upgrading from Dspace 1.4.2 to 1.5.2, at the same time moving from Postgresql to Oracle. When I perform an upgrade from 1.4 to 1.5.2 and run the database_schema_142-15.sql on Postgresql the tables history and history_state are dropped. However, when I do a clean install of an Oracle database from database_schema.sql (1.5.2) the tables history and history_state are created. In the Postgresql database_schema.sql (1.5.2) those tables are not created. It is save to assume that both tables are no longer required? Best wishes, Francis Brouns Open Universiteit Nederland -- Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech -- Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech -- This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech