Re: [Dspace-tech] Wiki Security and spam

2007-01-20 Thread Larry Stone
FYI 2) was exactly how the previous MoinMoin-based Wiki was set up (a few key locked pages, content-based blacklist) and that worked fine, we had little or no spam after those measures were in place. Given that, I'd suggest 1) is overkill. At the least, 2) is worth trying before resorting

Re: [Dspace-tech] Can I use HTTPS for just login

2007-01-25 Thread Larry Stone
The ability to mediate your user requests in Apache httpd (both bots and regular users) far outweighs any performance hit, which in my experience it is miniscule in comparison to the memory and CPU loads that tomcat itself produces being a multi-threaded Java application. Apache is a very

Re: [Dspace-tech] Setting up DSpace

2007-02-13 Thread Larry Stone
There are instructions on the wiki -- http://wiki.dspace.org/index.php/DspaceOnStandardPorts All you really have to do is basic Tomcat configuration since is already (by default) a web server as well as a servlet container, so consult the doc for your version of Tomcat too. -- Larry Can

Re: [Dspace-tech] Dspace and Oracle Application Server 10g

2007-02-16 Thread Larry Stone
You should be able to do this quite easily by adding the following to: [dspace-src]/etc/dspace-web.xml When merging Servlet webapps, one thing to watch out for is how they handle errors. The DSpace webapp has special error handlers that substitute its own pages. I think it also transforms

Re: [Dspace-tech] Are you using the DSpace History System?

2007-03-07 Thread Larry Stone
Have a look at the wiki page about the new history system: http://wiki.dspace.org/index.php/HistorySystemPrototype It's already written, and in fact the project is over so I won't be doing any more work on it. If the Event patch gets adopted, it will remove the old history system, but open the

Re: [Dspace-tech] Sequence ID generation

2007-05-08 Thread Larry Stone
First, it is assigned sequentially and IDs are not reused if a bitstream is deleted. There is no magic ordering, and it was *not* intended for organizing a set of bitstreams into a meaningful sequence (e.g. PDF chapters of a book). Its sole purpose is to provide a *durable* unique ID for a

Re: [Dspace-tech] Need Sample for LNI

2007-05-29 Thread Larry Stone
I'm trying to get up an running on LNI, but can't seem to get the syntax for PUT just right (I keep getting either 400 or 500 http errors). Is there anybody out there who can send me a syntax sample along with a sample package? I'm working with the LNISmokeTest class. The smoke test has a

Re: [Dspace-tech] Persistent identifiers in DSpace -- thoughts please

2007-05-30 Thread Larry Stone
On Wed, May 30, 2007 at 02:01:50AM -0400, Larry Stone wrote: How about the word resource to introduce the URI, since it is, after all, a reference to a resource -- the R in URI. It'd be: prefix/resource/encoded-URI e.g. http://dspace.me.ac.uk/resource/hdl/1234/56 This follows

Re: [Dspace-tech] Setting Primary Bitstream

2007-06-12 Thread Larry Stone
The easiest way to get an example is to take an item alraedy in the archive which is similar to what you want, and export it as a METS package using the command-line packager, then look at its METS manifest. -- Larry Message: 3 Date: Mon, 11 Jul 2005 18:15:59 EDT From: Larry Stone [EMAIL

Re: [Dspace-tech] How to get ingested pdfs to keep their names?

2007-06-15 Thread Larry Stone
Right, that wasn't ever an option on the PDF packager. That class was really written as a demonstration of the package ingester and a proof-of-concept, but it was allowed to escape. The package ingester never sees the original filename; it is given an open InputStream created by the Packager

Re: [Dspace-tech] jhove / preingest / techMDExtractor

2007-06-21 Thread Larry Stone
What'd like to do is just validate that the extension of the bitstreams submitted during an item deposit correspond to the actual format(s) of the files using the output of jhove. Now, I don't know how difficult this will be, and if you are working on it, and it's going to be coming out in 6

Re: [Dspace-tech] Xml file with style sheet

2007-07-03 Thread Larry Stone
I have an xml file that references a style sheet, but I'm not able to get the xml file to render correctly using the style sheet when I deposit this file into deep blue. I realize that I need to create a servlet to handle xml files, just like there is one for html files. Before I do this, I

Re: [Dspace-tech] [vote] Do we want to assign external identifiers (Handles) to files?

2007-07-19 Thread Larry Stone
+1 assign external identifiers to files - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2005. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/

Re: [Dspace-tech] Running DSpace in read-only mode.

2007-07-20 Thread Larry Stone
On Thu, Jul 19, 2007 at 05:59:39PM -0400, Kyle Brentnell wrote: 2) Change the authenticateInternal method in the AuthenticationManager.java class to always return BAD_ARGS or NO_SUCH_USER, etc. to prevent users from logging into Dspace. To allow access again, restore the original

[Dspace-tech] background on external/persistent identifiers

2007-07-24 Thread Larry Stone
I ran across an excellent study published by the ECPA (European Commission on Preservation and Access) that is recommended background reading if you want to know more about the issues behind persistent identifiers. It doesn't recommend any one system, but describes all the leading ones and

Re: [Dspace-tech] Reply-To Header missing - list misconfigured?

2007-07-31 Thread Larry Stone
FWIW, some of the lists to which I subscribe mung the reply-to header to force replies to go to the list, and some don't -- I'm invariably annoyed by the ones that _do_ mung headers. So take this as a vote NOT to change the behavior of the DSpace lists. My mail user agent is a certifiable

Re: [Dspace-tech] Dspace Bulk Upload

2007-09-07 Thread Larry Stone
On 9/7/07, Nandita Chaudhri [EMAIL PROTECTED] wrote: If one is not using the submission UI to load items one by one but is using the bulk upload facility is there a way to have two bitstreams with only one as the 'primary' one which will be displayed when the item is retrived using a

Re: [Dspace-tech] Help with METS Ingest Import

2007-10-26 Thread Larry Stone
See the Application Layer page in the DSpace system documentation (docs/application.html in the source), the section Package Importer and Exporter. Briefly, the [dspace]/bin/packager command ingests packages as well, you have to give it a -c collection-handle option and can specify -s for submit,

Re: [Dspace-tech] Viruses and DSpace

2007-11-16 Thread Larry Stone
Has any thought been given to how Dspace might handle the remote ( hopefully ) possibility of a file containing a virus being deposited into a repository? It seems like jhove might be the kind of tool that could check for this. I believe there is some work going on to incorporate jhove into

Re: [Dspace-tech] Reading other files in Manakin (was: Re: Desired hierarchical display of communities and collection list)

2007-11-26 Thread Larry Stone
I haven't looked at the Manakin code, but the MIME media-type of a Bitstream must be coming from its associated BitstreamFormat -- so why not get the human-readable name from the BitstreamFormat as well? There is no need to establish a separate map of MIME-type to user-friendly name when it

Re: [Dspace-tech] Reading other files in Manakin (was: Re: Desired hierarchical display of communities and collection list)

2007-11-28 Thread Larry Stone
I agree that adding techMD to the METS is architecturally the right thing, but there is already a better (or at least, adequate..) technical metadata standard format in PREMIS. There is already a PREMIS crosswalk for the import/export METS packager which you can cannibalize, see

Re: [Dspace-tech] why vanilla?

2007-12-14 Thread Larry Stone
I don't visit the wiki often enough, but my impression has been that work-in-progress and completed projects are jumbled together, probably because when projects are completed they just stay on the page and stop getting updates. Regularly moving released mod.s to a released projects page

Re: [Dspace-tech] Uploading a 10Gig File. (Blanco, Jose)

2008-01-09 Thread Larry Stone
Check the configurations of the Web server (e.g. apache) and/or servlet container (tomcat, if using mod_jk) for limits on the largest upload allowed. Start by looking at their error logs and transaction logs to see what happened to your HTTP request. There is theoretically no reason you shouldn't

Re: [Dspace-tech] remote Client Program for accessing and managing theDSpace

2008-04-17 Thread Larry Stone
There are already several existing network interfaces -- are you sure none of them give the access you need? See the wiki: http://wiki.dspace.org/index.php/NetworkInterfaces In particular, the LNI -- see: http://wiki.dspace.org/index.php/LightweightNetworkInterface was designed to be

Re: [Dspace-tech] bitstreams and file locations

2008-05-05 Thread Larry Stone
The Bitstream's name field is also significant for Items taht are archived websites with internal links to other Bitstreams in the item. The UI (the JSP UI, at least) has a special html servlet that resolves references to the *name* of the Bitstream instead of its Sequence ID (like the normal

Re: [Dspace-tech] File Descriptions in Import

2008-05-09 Thread Larry Stone
So, my question is: Can File descriptions for multiple bitstreams be added via the importer? Remember, the Packager plugins are another way to ingest Items -- SWORD, the LNI, and a command-line client (/dspace/bin/packager) can all invoke any of the package ingester plugins. Unfortunately, the

Re: [Dspace-tech] File Descriptions in Import

2008-05-14 Thread Larry Stone
Never mind; answered my own question. This DOES NOT work for imported rather than registered bitstreams. Ah, well. I can edit bitstream descriptions manually for 300+ items; why not? You can update Bitstream description fields under program control with the LNI. It may even be possible to do

Re: [Dspace-tech] LNI - no index.html; The requested resource (/lni/) is not available

2008-05-16 Thread Larry Stone
Actually, it responds to GET on some URIs, but the LNI is *not* meant to be used as an interactive web site; it is a WebDAV server. WebDAV happens to use the HTTP protocol but not in a way that gets along with the subset (and perversions) of HTTP most browsers speak. It also has some ability to

Re: [Dspace-tech] Creating a WSDL file for LNI

2008-05-22 Thread Larry Stone
There was a WSDL file in the original source; it was generated automatically, by an ant target that does not seem to have survived into the brave new world of maven. (You could look at the old LNI patches, see http://web.mit.edu/lcs/www/lni ) However, teh WSDL is still in the DSpace 1.5 source

Re: [Dspace-tech] LNI/WebDAV : GET returns 501

2008-05-29 Thread Larry Stone
GET only works on Items and Bitstreams, so if you get the resource path to a Bitstream from PROPFIND (depth=1) on an Item, then you ought to be able to do a GET on it and receive its contents. This path looks like you've traversed some communities and collections:

Re: [Dspace-tech] LNI/WebDAV : GET returns 501

2008-06-02 Thread Larry Stone
You can see how the lookup feature works by using a simple HTTP client like the GNU wget application to do a GET, and reading its diagnostic output.. for example, this does a lookup of handle 123456789/1 using HTTP Basic authentication to login (assuming you have username/password login on your

[Dspace-tech] build cookbook

2008-06-20 Thread Larry Stone
I've added a page of cookbook-style instructions for integrating local modificatiosn with the new build system, see: http://wiki.dspace.org/index.php/BuildCookbook it builds on the OR08 presentation Customizing DSpace 1.5 with Basic Overlays; in fact, that has other useful examples which might

Re: [Dspace-tech] DSpace-tech Digest, Vol 27, Issue 3

2008-07-01 Thread Larry Stone
See http://wiki.dspace.org/index.php/Simple_LNI_Client for an example of the same idea in Java. It uses only the WebDAV protocol. Perhaps you can use that code as a model for your C# application. When you're done, please consider posting your client on the wiki too. -- Larry I'm working

Re: [Dspace-tech] Filter-media in 15

2008-07-11 Thread Larry Stone
It _should_ never get an NPE over a missing configuration key. Given our sparse testing resources it behooves us to make the code as resilient as possible. Could you submit a bug report about this (or, even better, a patch) -- ideally it could would *log an error* if the configuration property

Re: [Dspace-tech] [Dspace-devel] Batch Import document

2008-07-21 Thread Larry Stone
There _was_ documentation for the venerable batch importer in the HTML pages in the docs subdirectory; in a source checkout, it's at the relative path: dspace/trunk/dspace/docs/application.html#itemimporter (I can't get the location in the Subversion repository because Sourceforge seems to be

Re: [Dspace-tech] [Dspace-devel] Batch Import document

2008-07-25 Thread Larry Stone
While we're talking about the fate of DSpace documentation, I'd like to put in a plea to maintain a Web-friendly version *as well*. When consulting the doc for other open-source projects (usually because they are components of DSpace..) I find the online HTML version of the doc a lot more handy

Re: [Dspace-tech] Reusing bitstream sequence number

2008-08-15 Thread Larry Stone
What is it that you need to accomplish? I need to be able to have a stable reference for a bitstream which might undergo a number of version updates. I do not expect to save the earlier versions at this time. Mark is correct about overloading the semantics here. Note, We adjusted the

Re: [Dspace-tech] Adding your own customizations into DSpace 1.5.1

2009-01-27 Thread Larry Stone
Can someone give me the short answer about where to put our local customizations for DSpace 1.5.1 and exactly what procedures we need to do to have them end up in our application? This wiki page: http://wiki.dspace.org/index.php/BuildCookbook was intended to cover this question -- please

Re: [Dspace-tech] Installing LNI in DSpace 1.5.1

2009-02-17 Thread Larry Stone
The LNI is already included in the 1.5.1 release as a maven package. There is very scant mention of it in the documentation, nothing about how to install it and access the webapp. It's probably already installed on your server. Is there a webapp named dspace-lni next to dspace-jspui and the

Re: [Dspace-tech] Installing LNI in DSpace 1.5.1

2009-02-18 Thread Larry Stone
Furthermore, my aim is to implement a kind of integration between DSpace and J!Research (http://joomla-research.com). That is why it is important for me interact with DSpace via SOAP or WebDAV. Thanks a lot. If you have a choice, I recommend sticking with WebDAV (since you'll need it anyway

Re: [Dspace-tech] script to validate all PDFs ?

2009-02-24 Thread Larry Stone
Does anyone have a script that checks all of the previously uploaded PDFs and find ones that are malformed and reports their URLs/record IDs? I think it's most appropriate to do this with the MediaFilter mechanism. The default DSpace (1.5.1) distribution includes the plugin:

Re: [Dspace-tech] Java Heap dumps during Filter-Media

2009-04-08 Thread Larry Stone
The PDFBox library is _always_ going to be a problem because of its architecture. It insists on reading the entire PDF document, images included, into memory. This is not necessary, PDF was explicitly designed to let renderers process a page at a time in limited memory. Perhaps it could gain a

Re: [Dspace-tech] Java Heap dumps during Filter-Media

2009-04-09 Thread Larry Stone
to work quite well. G -Original Message- From: Larry Stone [mailto:l...@mit.edu] Sent: 08 April 2009 22:21 To: Tim Donohue Cc: DSpace Tech; Jeffrey Trimble Subject: Re: [Dspace-tech] Java Heap dumps during Filter-Media The PDFBox library is _always_ going to be a problem

[Dspace-tech] authority control proposal

2009-05-13 Thread Larry Stone
I have to add an authority control mechanism to DSpace for an institutional repository, so I'm doing it as modification to the 1.5.2 source in the hope it will get adopted into 1.6. To begin discussion, I put up a wiki page about the design:

Re: [Dspace-tech] [Dspace-general] Converte the Bibtex into qualified DC

2009-05-26 Thread Larry Stone
the stylesheet; see the wiki and manual for examples. hope that helps, -- Larry On May 26, 2009, at 1:51 PM, Robin Wang wrote: Hi everyone, I wanna use the DSpaceSIP from Larry Stone and converte my bibtex xml file for original pdf file into qualified DC Record that i can insert the DC

[Dspace-tech] Looking for Autocomplete for the XMLUI

2009-06-03 Thread Larry Stone
Has anyone tried to add an autocomplete feature to the Manakin UI (XMLUI)? By autocomplete, I mean the Web UI trick where a text field is checked, as you type, against a list of possible completions and automatically filled in, or completed, by inserting text into the field or posting a

Re: [Dspace-tech] Looking for Autocomplete for the XMLUI

2009-06-04 Thread Larry Stone
Thanks -- between this and AjaxTags for JSP, there is something of a consensus on script.aculo.us -- I'd passed over it at first since it looked infernally complicated but on second glance it's no worse than YUI, and perhaps more flexible. It certainly has a different idea of AJAX data,

Re: [Dspace-tech] Producing mets.xml for SWORD

2009-06-05 Thread Larry Stone
The Harvard METS toolkit gives you an convenient (though DOM-like) API for the METS data model, takes care of some of the housekeeping, and it has logic to prevent schema mistakes: http://hul.harvard.edu/mets/ There's also a toolkit that helps you build a whole submission package,

Re: [Dspace-tech] Export/import of noncontent objects (Epersons, etc.)?

2009-06-17 Thread Larry Stone
I recommend looking at the old AIP prototype (see http://wiki.dspace.org/index.php/AipPrototype for doc and fossilized code) for a start on this. It extended the METS implementation to cover Communities, Collections, and every aspect of Items, including most administrative metadata.

Re: [Dspace-tech] Question regarding LNI ingest of Packages and workflow

2009-07-14 Thread Larry Stone
See these pages for an easy way to get started making SIPs. A package is just a manifest plus content files, so it can contain anything so long as the METS manifest describes it accurately. http://wiki.dspace.org/index.php/DSpace_SIP_Toolkit http://wiki.dspace.org/index.php/Simple_LNI_Client

Re: [Dspace-tech] ORACLE - FATAL org.dspace.storage.rdbms.InitializeDatabase @ Caught Exception

2009-09-02 Thread Larry Stone
This message means there is a problem in your JDBC configuration: probably db.url has the wrong hostname or port number. Also check db.username and db.password. Make sure the same values work with a simpler Oracle client like SQLPlus (although you'll have to rearrange the URL, it has a

Re: [Dspace-tech] work around authentication mechanism

2009-09-10 Thread Larry Stone
If you just want to deny all access based on the requestor's IP address, that is best done in the web server or servlet container. If you're using naked Tomcat, see the doc for org.apache.catalina.valves.RemoteAddrValve. If you're using Apache httpd it's very easy to configure, just see

Re: [Dspace-tech] SMTP server for mail question

2009-10-06 Thread Larry Stone
Like a lot of things, the answer is that depends. The SMTP server is usually an MTA (Mail Transfer Agent), or post-office server, that is expected to be able to receive mail as well as send it. However, when you're part of a big site like my.edu, you can often get away with setting up an

Re: [Dspace-tech] Filtering errors with XPDF Tools

2009-10-20 Thread Larry Stone
These errors imply that the JVM cannot access the executable file. Since the file exists, make sure the user under whose UID the JVM is running has read and execute access to the file /usr/bin/pdftoppm (as well as the /usr and /usr/bin directories, although those are usually world-rx).

Re: [Dspace-tech] Differences in db with upgrade and fresh install

2009-12-15 Thread Larry Stone
I just noticed that too, while contemplating a switch from Oracle to PostgreSQL. The Oracle schema also still has the mets_bitstream_id column in the Bundle table, which is no longer used and has been dropped. I'll start a JIRA issue to fix this. (OBTW, before you switch over to Oracle,