FYI 2) was exactly how the previous MoinMoin-based Wiki was set up (a
few key locked pages, content-based blacklist) and that worked fine,
we had little or no spam after those measures were in place. Given
that, I'd suggest 1) is overkill. At the least, 2) is worth trying
before resorting
The ability to mediate your user requests in Apache httpd (both bots
and regular users) far outweighs any performance hit, which in my
experience it is miniscule in comparison to the memory and CPU loads
that tomcat itself produces being a multi-threaded Java application.
Apache is a very
There are instructions on the wiki --
http://wiki.dspace.org/index.php/DspaceOnStandardPorts
All you really have to do is basic Tomcat configuration since is already
(by default) a web server as well as a servlet container, so consult the
doc for your version of Tomcat too.
-- Larry
Can
You should be able to do this quite easily by adding the following to:
[dspace-src]/etc/dspace-web.xml
When merging Servlet webapps, one thing to watch out for is how they
handle errors. The DSpace webapp has special error handlers that
substitute its own pages. I think it also transforms
Have a look at the wiki page about the new history system:
http://wiki.dspace.org/index.php/HistorySystemPrototype
It's already written, and in fact the project is over so I won't be
doing any more work on it. If the Event patch gets adopted, it will
remove the old history system, but open the
First, it is assigned sequentially and IDs are not reused if a bitstream
is deleted. There is no magic ordering, and it was *not* intended for
organizing a set of bitstreams into a meaningful sequence (e.g. PDF
chapters of a book). Its sole purpose is to provide a *durable* unique
ID for a
I'm trying to get up an running on LNI, but can't seem to get the syntax
for PUT just right (I keep getting either 400 or 500 http errors).
Is there anybody out there who can send me a syntax sample along with a
sample package?
I'm working with the LNISmokeTest class.
The smoke test has a
On Wed, May 30, 2007 at 02:01:50AM -0400, Larry Stone wrote:
How about the word resource to introduce the URI, since it is, after
all, a reference to a resource -- the R in URI. It'd be:
prefix/resource/encoded-URI e.g.
http://dspace.me.ac.uk/resource/hdl/1234/56
This follows
The easiest way to get an example is to take an item alraedy in the
archive which is similar to what you want, and export it as a METS package
using the command-line packager, then look at its METS manifest.
-- Larry
Message: 3
Date: Mon, 11 Jul 2005 18:15:59 EDT
From: Larry Stone [EMAIL
Right, that wasn't ever an option on the PDF packager. That
class was really written as a demonstration of the package ingester
and a proof-of-concept, but it was allowed to escape.
The package ingester never sees the original filename; it is given
an open InputStream created by the Packager
What'd like to do is just validate that the extension of the bitstreams
submitted during an item deposit correspond to the actual format(s) of the
files using the output of jhove. Now, I don't know how difficult this will
be, and if you are working on it, and it's going to be coming out in 6
I have an xml file that references a style sheet, but I'm not able to
get the xml file to render correctly using the style sheet when I
deposit this file into deep blue. I realize that I need to create a
servlet to handle xml files, just like there is one for html files.
Before I do this, I
+1 assign external identifiers to files
-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2005.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
On Thu, Jul 19, 2007 at 05:59:39PM -0400, Kyle Brentnell wrote:
2) Change the authenticateInternal method in the AuthenticationManager.java
class to always return BAD_ARGS or NO_SUCH_USER, etc. to prevent users
from logging into Dspace. To allow access again, restore the original
I ran across an excellent study published by the ECPA (European Commission on
Preservation and Access) that is recommended background reading if
you want to know more about the issues behind persistent identifiers.
It doesn't recommend any one system, but describes all the leading ones
and
FWIW, some of the lists to which I subscribe mung the reply-to header
to force replies to go to the list, and some don't -- I'm invariably
annoyed by the ones that _do_ mung headers. So take this as a vote NOT to
change the behavior of the DSpace lists.
My mail user agent is a certifiable
On 9/7/07, Nandita Chaudhri [EMAIL PROTECTED] wrote:
If one is not using the submission UI to load items one by one but is using
the bulk upload facility is there a way to have two bitstreams with only one
as the 'primary' one which will be displayed when the item is retrived using
a
See the Application Layer page in the DSpace system documentation
(docs/application.html in the source), the section Package Importer and
Exporter. Briefly, the [dspace]/bin/packager command ingests packages
as well, you have to give it a -c collection-handle option and can
specify -s for submit,
Has any thought been given to how Dspace might handle the remote (
hopefully ) possibility of a file containing a virus being deposited
into a repository? It seems like jhove might be the kind of tool that
could check for this. I believe there is some work going on to
incorporate jhove into
I haven't looked at the Manakin code, but the MIME media-type of a Bitstream
must be coming from its associated BitstreamFormat -- so why not get
the human-readable name from the BitstreamFormat as well? There is no
need to establish a separate map of MIME-type to user-friendly name
when it
I agree that adding techMD to the METS is architecturally the right
thing, but there is already a better (or at least, adequate..) technical
metadata standard format in PREMIS. There is already a PREMIS crosswalk
for the import/export METS packager which you can cannibalize, see
I don't visit the wiki often enough, but my impression has been that
work-in-progress and completed projects are jumbled together, probably
because when projects are completed they just stay on the page and
stop getting updates. Regularly moving released mod.s to a released
projects page
Check the configurations of the Web server (e.g. apache) and/or servlet
container (tomcat, if using mod_jk) for limits on the largest upload allowed.
Start by looking at their error logs and transaction logs to see what
happened to your HTTP request. There is theoretically no reason you
shouldn't
There are already several existing network interfaces -- are you sure
none of them give the access you need? See the wiki:
http://wiki.dspace.org/index.php/NetworkInterfaces
In particular, the LNI -- see:
http://wiki.dspace.org/index.php/LightweightNetworkInterface
was designed to be
The Bitstream's name field is also significant for Items taht are
archived websites with internal links to other Bitstreams in the
item. The UI (the JSP UI, at least) has a special html servlet
that resolves references to the *name* of the Bitstream instead
of its Sequence ID (like the normal
So, my question is: Can File descriptions for multiple bitstreams be
added via the importer?
Remember, the Packager plugins are another way to ingest Items -- SWORD,
the LNI, and a command-line client (/dspace/bin/packager) can all invoke
any of the package ingester plugins.
Unfortunately, the
Never mind; answered my own question. This DOES NOT work for imported
rather than registered bitstreams. Ah, well. I can edit bitstream
descriptions manually for 300+ items; why not?
You can update Bitstream description fields under program control with
the LNI. It may even be possible to do
Actually, it responds to GET on some URIs, but the LNI is *not*
meant to be used as an interactive web site; it is a WebDAV server.
WebDAV happens to use the HTTP protocol but not in a way that gets along
with the subset (and perversions) of HTTP most browsers speak.
It also has some ability to
There was a WSDL file in the original source; it was generated automatically,
by an ant target that does not seem to have survived into the brave new world
of maven. (You could look at the old LNI patches, see
http://web.mit.edu/lcs/www/lni )
However, teh WSDL is still in the DSpace 1.5 source
GET only works on Items and Bitstreams, so if you get the resource path
to a Bitstream from PROPFIND (depth=1) on an Item, then you ought to be able
to do a GET on it and receive its contents.
This path looks like you've traversed some communities and collections:
You can see how the lookup feature works by using a simple HTTP client
like the GNU wget application to do a GET, and reading its diagnostic
output.. for example, this does a lookup of handle 123456789/1
using HTTP Basic authentication to login (assuming you have username/password
login on your
I've added a page of cookbook-style instructions for integrating
local modificatiosn with the new build system, see:
http://wiki.dspace.org/index.php/BuildCookbook
it builds on the OR08 presentation Customizing DSpace 1.5 with Basic
Overlays; in fact, that has other useful examples which might
See
http://wiki.dspace.org/index.php/Simple_LNI_Client
for an example of the same idea in Java. It uses only the WebDAV protocol.
Perhaps you can use that code as a model for your C# application.
When you're done, please consider posting your client on the wiki too.
-- Larry
I'm working
It _should_ never get an NPE over a missing configuration key. Given
our sparse testing resources it behooves us to make the code as
resilient as possible.
Could you submit a bug report about this (or, even better, a patch) --
ideally it could would *log an error* if the configuration property
There _was_ documentation for the venerable batch importer in the
HTML pages in the docs subdirectory; in a source checkout, it's at
the relative path:
dspace/trunk/dspace/docs/application.html#itemimporter
(I can't get the location in the Subversion repository because
Sourceforge seems to be
While we're talking about the fate of DSpace documentation,
I'd like to put in a plea to maintain a Web-friendly version *as well*.
When consulting the doc for other open-source projects (usually because
they are components of DSpace..) I find the online HTML version of
the doc a lot more handy
What is it that you need to accomplish?
I need to be able to have a stable reference for a bitstream which
might undergo a number of version updates. I do not expect to save the
earlier versions at this time.
Mark is correct about overloading the semantics here. Note, We
adjusted the
Can someone give me the short answer about where to put our local
customizations for DSpace 1.5.1 and exactly what procedures we need to
do to have them end up in our application?
This wiki page:
http://wiki.dspace.org/index.php/BuildCookbook
was intended to cover this question -- please
The LNI is already included in the 1.5.1 release as a maven package.
There is very scant mention of it in the documentation, nothing about
how to install it and access the webapp.
It's probably already installed on your server. Is there a webapp
named dspace-lni next to dspace-jspui and the
Furthermore, my aim is to implement a kind of integration
between DSpace and J!Research (http://joomla-research.com). That is why it
is important for me interact with DSpace via SOAP or WebDAV. Thanks a lot.
If you have a choice, I recommend sticking with WebDAV (since
you'll need it anyway
Does anyone have a script that checks all of the previously uploaded
PDFs and find ones that are malformed and reports their URLs/record IDs?
I think it's most appropriate to do this with the MediaFilter mechanism.
The default DSpace (1.5.1) distribution includes the plugin:
The PDFBox library is _always_ going to be a problem because of its
architecture. It insists on reading the entire PDF document, images
included, into memory. This is not necessary, PDF was explicitly
designed to let renderers process a page at a time in limited memory.
Perhaps it could gain a
to work quite well.
G
-Original Message-
From: Larry Stone [mailto:l...@mit.edu]
Sent: 08 April 2009 22:21
To: Tim Donohue
Cc: DSpace Tech; Jeffrey Trimble
Subject: Re: [Dspace-tech] Java Heap dumps during Filter-Media
The PDFBox library is _always_ going to be a problem
I have to add an authority control mechanism to DSpace for an
institutional repository, so I'm doing it as modification to the 1.5.2
source in the hope it will get adopted into 1.6.
To begin discussion, I put up a wiki page about the design:
the stylesheet; see the wiki and manual for examples.
hope that helps,
-- Larry
On May 26, 2009, at 1:51 PM, Robin Wang wrote:
Hi everyone,
I wanna use the DSpaceSIP from Larry Stone and converte my bibtex
xml file for original pdf file into qualified DC Record that i can
insert the DC
Has anyone tried to add an autocomplete feature to the Manakin UI
(XMLUI)? By autocomplete, I mean the Web UI trick where a text
field is checked, as you type, against a list of possible completions
and automatically filled in, or completed, by inserting text into the
field or posting a
Thanks -- between this and AjaxTags for JSP, there is something of a
consensus on script.aculo.us -- I'd passed over it at first since it
looked infernally complicated but on second glance it's no worse than
YUI, and perhaps more flexible. It certainly has a different idea of
AJAX data,
The Harvard METS toolkit gives you an convenient (though DOM-like) API
for the METS data model, takes care of some of the housekeeping, and
it has logic to prevent schema mistakes:
http://hul.harvard.edu/mets/
There's also a toolkit that helps you build a whole submission
package,
I recommend looking at the old AIP prototype (see
http://wiki.dspace.org/index.php/AipPrototype
for doc and fossilized code) for a start on this. It extended the
METS implementation to cover Communities, Collections, and every
aspect of Items, including most administrative metadata.
See these pages for an easy way to get started making SIPs. A package
is just a manifest plus content files, so it can contain anything so
long as the METS manifest describes it accurately.
http://wiki.dspace.org/index.php/DSpace_SIP_Toolkit
http://wiki.dspace.org/index.php/Simple_LNI_Client
This message means there is a problem in your JDBC configuration:
probably db.url has the wrong hostname or port number. Also check
db.username and db.password. Make sure the same values work with a
simpler Oracle client like SQLPlus (although you'll have to rearrange
the URL, it has a
If you just want to deny all access based on the requestor's IP
address, that is best done in the web server or servlet container. If
you're using naked Tomcat, see the doc for
org.apache.catalina.valves.RemoteAddrValve. If you're using Apache
httpd it's very easy to configure, just see
Like a lot of things, the answer is that depends. The SMTP server
is usually an MTA (Mail Transfer Agent), or post-office server, that
is expected to be able to receive mail as well as send it. However,
when you're part of a big site like my.edu, you can often get away
with setting up an
These errors imply that the JVM cannot access the executable file.
Since the file exists, make sure the user under whose UID the JVM is
running has read and execute access to the file /usr/bin/pdftoppm (as
well as the /usr and /usr/bin directories, although those are usually
world-rx).
I just noticed that too, while contemplating a switch from Oracle to
PostgreSQL. The Oracle schema also still has the mets_bitstream_id
column in the Bundle table, which is no longer used and has been
dropped.
I'll start a JIRA issue to fix this.
(OBTW, before you switch over to Oracle,
55 matches
Mail list logo