Re: [Dspace-tech] Wiki Security and spam

2007-01-20 Thread Larry Stone
 FYI 2) was exactly how the previous MoinMoin-based Wiki was set up (a
 few key locked pages, content-based blacklist) and that worked fine,
 we had little or no spam after those measures were in place.  Given
 that, I'd suggest 1) is overkill.  At the least, 2) is worth trying
 before resorting to 1).  My tuppence worth anyway.

Actually, by the time I started writing for the MoinMoin wiki,
it worked the same way as the new changes to the mediawiki -- you
had to be in the WikiUsersGroup to edit pages, and some pages (like
the front page) had more restrictive access controls.

Any heuristic technique of spam control, that allows open access while
attempting to detect spam after the fact, puts the implementer in an
arms race with the spammers.  It requires some constant administrative
effort to keep up with tuning and changes in the spam-detection
software.  Does anyone have the time to host the wiki and administer
it that actively?

-- Larry


-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Can I use HTTPS for just login

2007-01-25 Thread Larry Stone
 The ability to mediate your user requests in Apache httpd (both bots
 and regular users) far outweighs any performance hit, which in my
 experience it is miniscule in comparison to the memory and CPU loads
 that tomcat itself produces being a multi-threaded Java application.
 Apache is a very efficient and optimized http service, the mod_jk
 connector simply streams the request/response cycle to tomcat.

I'd like to see a scientific comparison.. not enough to do one, but
someone should :-).  I'll grant that apache itself doesn't add much
overhead, but there's still the latency and extra context switches
of piping all the bits through another process.  The comparsion
may become more interesting when DSpace is serving an archived website
so a page load spawns many DSpace servlet requests for stylesheets, images,
etc, and the overhead for each transaction adds up.

Tomcat still does about the same amount of work for each request.  For
sites with a sever that is _only_ running DSpace, and no other web
applications, and especially without anyone who already knows how to
manage Apache Httpd, using Tomcat by itself seems a reasonable choice.

  -- Larry



-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Setting up DSpace

2007-02-13 Thread Larry Stone
There are instructions on the wiki --
http://wiki.dspace.org/index.php/DspaceOnStandardPorts

All you really have to do is basic Tomcat configuration since is already
(by default) a web server as well as a servlet container, so consult the
doc for your version of Tomcat too.

-- Larry

 Can anyone give me any help setting up DSpace using only Tomcat?  I have
 Tomcat up and running and PostgreSQL installed and running.  I just need
 a few pointers on getting the rest working.  We don't want to do it with
 Apache.


-
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier.
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Dspace and Oracle Application Server 10g

2007-02-16 Thread Larry Stone
 You should be able to do this quite easily by adding the following to:
 [dspace-src]/etc/dspace-web.xml

When merging Servlet webapps, one thing to watch out for is how they
handle errors.  The DSpace webapp has special error handlers that
substitute its own pages.  I think it also transforms some status codes.
This may cause problems with the OAI-PMH protocol; as I recall, it
did when I experimented with including WebDAV protocol servlets in
the same webapp as the web UI.

Aside from that, though, it was no problem to merge the OAI, WebUI, and
WebDAV servlets into one webapp.

-- Larry



-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Are you using the DSpace History System?

2007-03-07 Thread Larry Stone
Have a look at the wiki page about the new history system:
http://wiki.dspace.org/index.php/HistorySystemPrototype

It's already written, and in fact the project is over so I won't be
doing any more work on it.  If the Event patch gets adopted, it will
remove the old history system, but open the way to let sites install
this one as an add-on.  Right now it only exists in the wiki and on the
patch queue.

The old history system wrote some very dodgy records; some outright errors,
other records that cannot be correlated to actual events, and it used
database IDs as identifiers instead of anything persistent.  (iirc, one of its
worst sins was using Handle-like identifiers with database-IDs in the suffix).
So, if you're actually going to use the data you're better off with the new
system.  It also includes an RDF database (Sesame2) so you can make
custom queries, which sounds like it would help your use case.

-- Larry

 I am not currently using it, but bringing it up is timely because I'm
 reaching a point where what I am being drawn towards is the necessity of
 an audit tool for certain system activities.  I haven't had time to
 evaluate what the history system can do for me in that regard, but if
 anyone is planning on making changes to it, I'd be interested in being
 involved in some way, shape, or form.

 Let me give you one or two examples of the kind of auditing that I need:
 as users add/remove files over time from their item as they prepare it,
 I need to track what was added/removed and by whom when (multiple users
 can work on a single item in our system).  Similarly for licences.
 Also, administrators perform many tasks on items before they hit the
 public repository, and a navigable audit trail on item activities which
 can actually be interacted with would be of great benefit.

 Cheers,

 --
 Richard
 
 Richard Jones| t: +44 (0)20 759 [48614 / 41815]
 Web  Database   | e: [EMAIL PROTECTED]
Technology Specialist  | b: http://chronicles-of-richard.blogspot.com/
 Imperial College London  |
 

 -
 Take Surveys. Earn Cash. Influence the Future of IT
 Join SourceForge.net's Techsay panel and you'll get the chance to share your
 opinions on IT  business topics through brief surveys-and earn cash
 http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
 ___
 DSpace-tech mailing list
 DSpace-tech@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/dspace-tech


-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Sequence ID generation

2007-05-08 Thread Larry Stone
 First, it is assigned sequentially and IDs are not reused if a bitstream
 is deleted. There is no magic ordering, and it was *not* intended for
 organizing a set of bitstreams into a meaningful sequence (e.g. PDF
 chapters of a book). Its sole purpose is to provide a *durable* unique
 ID for a bitstream - think of it as a 'sub-handle' ID - modulo an item

There's actually a bug in the data model, then.  It's possible to get
the same sequence ID reused, because when adding a Bitstream, the code
only looks for the highest existing SequenceID and increments that.

1. Take an existing Item, go into the Edit Item admin page
   (/dspace/tools/edit-item), and add a new Bitstream with a distinctive name.
   Say, foo.pdf.

2. Determine its Sequence ID.  Go to the Item page
/dspace/handle/my-handle and observe the View/Open link next
   to your bitstream, the path element after its handle is the SequenceID.
   It should be the highest SequenecID there since it was most recently added.
   There are some invisible Bitstreams (like licenses) that also take
   up SIDs.

3. Go back to the Edit page and delete that newest bitstream.

4. Add a different bitstream with a different name, say, bar.pdf.

5. Go to a freshly-loaded copy of the Item page, and observe that
   bar.pdf has the same SequenceID that foo.pdf had before.

I'll submit this as a bug on sourceforge too.

-- Larry


-
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Need Sample for LNI

2007-05-29 Thread Larry Stone
 I'm trying to get up an running on LNI, but can't seem to get the syntax
 for PUT just right (I keep getting either 400 or 500 http errors).
 Is there anybody out there who can send me a syntax sample along with a
 sample package?
 I'm working with the LNISmokeTest class.

The smoke test has a method doPut() which ought to be a viable example.
Is that what you're using?
It takes the Handle of a collection which you have write access too.
The Handle should be a string like 123456789/241; the sample code
uses the Lookup operation to get its LNI resource URI.

Check that the packager value names a PackgeIngester plugin configured
in dspace.cfg.  It may help to crank up the log level to DEBUG and
look at the server logs.

-- Larry


-
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Persistent identifiers in DSpace -- thoughts please

2007-05-30 Thread Larry Stone
 On Wed, May 30, 2007 at 02:01:50AM -0400, Larry Stone wrote:
  How about the word resource to introduce the URI, since it is, after
  all, a reference to a resource -- the R in URI.  It'd be:
 
   prefix/resource/encoded-URI  e.g.
 
   http://dspace.me.ac.uk/resource/hdl/1234/56
 
  This follows the proposal to encode the URI by tearing off the scheme
  and putting it in a separate pathname element to avoid issues over
  quoting the :.  Note that I propose using the actual scheme label
  in the URL rather than a user-friendly label, e.g. hdl rather than
  handle.

 This sounds like some reasonable middle-ground. The only issue I can see
 here is that this mechanism only allows us to refer to objects that have
 persistent identifiers. Of course, we could still use an internal form
 of identifier for objects without actual persistent identifiers, but
 then if we have an internal format, should we not use that everywhere?
 Aside from consistency, Mark made the observation that including the
 persistent identifier in the URL is, to a certain extent, bogus. Perhaps
 we could just provide the ability to resolve URLs of the above form, but
 for making links, etc, we use an internal identifier format.

That's a good point -- DSpace is taking on the function of resolving
persistent identifiers like Handles and DOIs when there is no need, since
Handles, at least, already have a Web proxy server.  I wasn't counting
on the add our own flavor of PIDs to DSpace getting resolved favorably..

It _does_ have to allow data model objects to be referenced (through Web
interfaces) by an URL that includes a _persistent_ identifier (as
opposed to, say, a database-ID).  That's the URL that will get used in
links and citations despite our best efforts to promote Handles, so it
needs to be reasonably permanent.

Given a DSpace-specific persistent identifier (e.g. the UUID scheme),
I see two options:

1. Give every content-model object a DSpace-type PID, no matter what.
   External references are URLs including the DSpace PID.
   Other PID schemes (e.g. Handle) resolve to those URLs.
   Allow plugins to register other PIDs when an object is created.

2. Make the DSpace PID into a PersistentIdentifier plugin so it is a
   peer with the Handle or DOI plugins.  The administrator chooses to
   support one or more, and the canonical external reference to an
   object becomes whichever kind of PID is configured to be canonical.

Choice (1) is simpler and seems more sensible, but (2) could be
completely backward-compatible.

Note that some ingested objects will already have PIDs, e.g. if they
are AIPs being re-ingested to reconstruct an archive after catastropic
failure, or DIPs (AIPs) mirrored from another repository.  If _all_
DSpaces have the same PID scheme as in (1), there's no problem ingesting
and accessing another archive's objects.  Under (2), you could end up
generating new PIDs for old objects because your archive doesn't
understand the kind of PID they already have.

I think the UUID scheme (or something like it) makes a whole lot of sense,
but it is a rather significant change.

  Re special characters and quoting: I agree with James' original point that
  the HTTP URL spec has quoting rules for just this reason, but from a
  practical point of view, the client and server implementations have a lot
  of bugs in this area.  That's what I discovered implementing WebDAV for
  the LNI: it wasn't worth trying to encode a slash (/) in a URL, e.g.
  within a Handle, because it would just get stomped on differently by
  the different clients.  Better to let it get used literally as a
  path element separator and make the servlet clever enough to figure it out.
  Also, construct the servlet's URL so the whole path after a certain
  point is part of the object URI, e.g. the Handle.

 Again, this sounds fine. The only reason this doesn't work with the
 current implementation with Handles is for referencing bitstreams -- we
 are forced to make assumptions about the structure of the persistent
 identifiers because we use the (arbitrary and unpredictable) filename as
 part of the URL. This must be avoided, whichever scheme we eventually
 use.

Do you mean the way Bitstreams are referenced in a /bitstream/ servlet
URL?  I thought the path actually doesn't matter there -- it can be
anything, the servlet only looks at the sequence ID, because the URL
follows the pattern:

 prefix/bitstream/handle/SequenceID/path
  
e.g.

 http://dspace.mit.edu/bitstream/1721.1/35700/2/60504128-MIT.pdf

...hmm, it didn't _used_ to care what the path was at all, it would
retrieve the bitstream referenced by the Sequence ID.  Now, at
least on the 1.4.1 system I checked, both SID and path have to match.

But it doesn't have to be implemetned that way.  Since Sequence IDs
are the ONLY Bitstream metadata which must be unique within an Item,
the servlet might as well just ignore the path.

(Of course, this ignores the necessity of the /html/ servlet which

Re: [Dspace-tech] Setting Primary Bitstream

2007-06-12 Thread Larry Stone
 While uploading multiple files (html files and other links files)
 using batch import facilty, DSplace displays all the files.  While
 going through archive it is found in the mail

That's still true for the batch importer AFAIK, but there is now another
way to ingest an Item that will let you set the primary bitstream.
If you make a METS package, which means creating a METS manifest
and putting it into a Zip file with the content files, then you can
use the package ingester.  You can declare the primary bitstream in the
METS manifest, see:

http://wiki.dspace.org/index.php/DSpaceMETSSIPProfile

The package ingester can be run from the command line like the batch
importer, although it only handles one package at a time.
For details about the package ingester, see the system doc:

http://dspace.svn.sourceforge.net/viewvc/*checkout*/dspace/trunk/dspace/docs/application.html#packager

The easiest way to get an example is to take an item alraedy in the
archive which is similar to what you want, and export it as a METS package
using the command-line packager, then look at its METS manifest.

  -- Larry

 Message: 3
 Date: Mon, 11 Jul 2005 18:15:59 EDT
 From: Larry Stone [EMAIL PROTECTED]
 Reply-To: Larry Stone [EMAIL PROTECTED]
 To: Jim Downing [EMAIL PROTECTED]
 Cc: PRAT [EMAIL PROTECTED], dspace-tech dspace-tech@lists.sourceforge.net
 Subject: RE: [Dspace-tech] (no subject)

 You have to set the primary bitstream for the item, manually, to
 the top-level web page -- e.g. usually index.html.  The ItemImporter
 doesn't have any way to set a primary bitstream, although this seems
 like a very good thing to add.  The only way I know to do this is in
 the administrative GUI for the Item.

 Whether DSpace has any plan to solve this problem in coming version?.

 Surinder Kumar Gaba
 Technical Director
 Bibliographic Informatics Division
 National Informatics Centre
 A-Block, CGO Complex, Lodhi Road
 New Delhi-110 003
 Tel : 011-24362359 (o)
   : 011-27865224 (r)

 -
 This SF.net email is sponsored by DB2 Express
 Download DB2 Express C - the FREE version of DB2 express and take
 control of your XML. No limits. Just data. Click to get it now.
 http://sourceforge.net/powerbar/db2/
 ___
 DSpace-tech mailing list
 DSpace-tech@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/dspace-tech


-
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] How to get ingested pdfs to keep their names?

2007-06-15 Thread Larry Stone
Right, that wasn't ever an option on the PDF packager.  That
class was really written as a demonstration of the package ingester
and a proof-of-concept, but it was allowed to escape.

The package ingester never sees the original filename; it is given
an open InputStream created by the Packager application.  Package
ingesters run from a stream so they don't have to care if their input
is from a file or a network connection, that way remote ingestion just works.

However, you've already figured out most of what is needed to add the
option you want.  Just add some code to look for the name option
and if found, set the bitstream's name to that.  Make sure the
disseminator will find custom-named PDFs, perhaps by looking for the
first bitstream with a name ending in .pdf if there is no
package.pdf.

Please submit the change as a patch through Sourceforge, so it can get
adopted into the source, see the wiki page How to contribute.

-- Larry

 I'm trying to ingest a whole bunch of pdf's using the packager.

 The syntax used is /dspace/bin/dsrun org.dspace.app.packager.Packager -c
 hp/548 -e b
 [EMAIL PROTECTED] -w  -t PDF  /home/dspace/HPL-2007-2.pdf

 The problem is that the files get renamed to package.pdf
 (even if I add -o name=blah.pdf to the command.).

 This seems to happen in the class
 org.dspace.content.packager.PDFPackager


   bs = original.createBitstream(pkg);
 pkg.close();
 bs.setName(package.pdf);


 Does anyone have a workaround that keeps the original pdf file name? is
 there something I'm not doing quite right?
 Thanks,
 Ben Weinberg
 HP Digital Library

 -
 This SF.net email is sponsored by DB2 Express
 Download DB2 Express C - the FREE version of DB2 express and take
 control of your XML. No limits. Just data. Click to get it now.
 http://sourceforge.net/powerbar/db2/
 ___
 DSpace-tech mailing list
 DSpace-tech@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/dspace-tech



-
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] jhove / preingest / techMDExtractor

2007-06-21 Thread Larry Stone
 What'd like to do is just validate that the extension of the bitstreams
 submitted during an item deposit correspond to the actual format(s) of the
 files using the output of jhove. Now, I don't know how difficult this will
 be, and if you are working on it, and it's going to be coming out in 6
 months, then perhaps my time would be better spent doing something else.

 So, what do you think about this.  Do you see this sort of functionality
 realistically being available in the near future ( version 1.5, perhaps? ),
 or is what I want to do not that difficult, and only a small subset of what
 you're working on, so why not just do it?

The work I'm doing might help somewhat, since it will include more
sophisticated and accurate format identification, plus a measurement of
confidence in the identification -- but I can't make any promises
whether it will get into 1.5.  Full details should be available on the
wiki within a few weeks, I'll anounce it on the dspace-tech and
dspace-devel lists so the community can comment on my proposal.  It's a
whole framework for integrating external data format registries (like
the GDFR), as well as format-identifying applications.  It does not
include format validators but they do have a place in the overall
design.

I'm not sure JHOVE version 1 will be much help, either -- I recommend
taking a hard look at its limitations before spending any time on it.
Its repertoire of formats is somewhat limited, and the output is not
trivial to interpret.  Also, we found it gave a significant number of
false negatives when validating.

What's left?  Tools like DROID (droid.sourceforge.net), perhaps, although
it has some difficulty integrating with DSpace.  It just identifies
formats, but you could use that as a quasi-validation.

Also, the field of data format representation, identification, and
validation is in great flux right now, so there will be improvements.
Thats's why I'm designing a very flexible framework to let DSpace make
use of external resources.

We heard the JHOVE 2 project just got funded, so that will be worth
watching.  See http://fileformats.blogspot.com/search/label/JHOVE
for some early hints.

-- Larry


-
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Xml file with style sheet

2007-07-03 Thread Larry Stone
 I have an xml file that references a style sheet, but I'm not able to
 get the xml file to render correctly using the style sheet when I
 deposit this file into deep blue.  I realize that I need to create a
 servlet to handle xml files, just like there is one for html files.
 Before I do this, I wanted to check if any one out there has already
 done this, or if there is in fact some other quicker way of doing this.

You should just be able to use the HTMLServlet itself on the XML file;
it works on any kind of content type, although the MIME-type it sends
your browser is whatever the bitstream's BitstreamFormat dictates.

So, if your bitstream has the name foo.xml in item 1721.1/99404,
try the URL

http://dspace.mit.edu/html/1721.1/99404/foo.xml

(substitute appropriately of course..)

If that XML file makes relative-URL references to the XSL stylesheets,
then it ought to just work.
You may have to tweak the BitstreamFormat and MIME type of the
stylesheet bitstreams if the browser is fussy about their MIME types.

It should just be a matter of getting the Item display page to use
the HTMLServlet for your XML bitstreams the way it does for HTML,
which is purely a kludge.

-- Larry


-
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] [vote] Do we want to assign external identifiers (Handles) to files?

2007-07-19 Thread Larry Stone
+1   assign external identifiers to files

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2005.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Running DSpace in read-only mode.

2007-07-20 Thread Larry Stone
 On Thu, Jul 19, 2007 at 05:59:39PM -0400, Kyle Brentnell wrote:
  2) Change the authenticateInternal method in the AuthenticationManager.java
  class to always return BAD_ARGS or NO_SUCH_USER, etc. to prevent users
  from logging into Dspace.  To allow access again, restore the original
  authenticateInternal code (and compile and deploy of course).

An easier way to implement this is to create a new stubbed-out
AuthenticationMethod plugin that always returns NO_SUCH_USER or some
other fatal error, add that to the source, and then just change the
DSpace Configuration to make it the only method on the authentication
stack (commenting-out the original lines for easy restoration).  Then,
to restore normal operation, all you have to do is put the configuration
back and restart the servlet container.  It saves changing code and
compiling a second time, and makes it easier to engage read only mode
again in the future.

-- Larry


-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2005.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


[Dspace-tech] background on external/persistent identifiers

2007-07-24 Thread Larry Stone
I ran across an excellent study published by the ECPA (European Commission on
Preservation and Access) that is recommended background reading if
you want to know more about the issues behind persistent identifiers.
It doesn't recommend any one system, but describes all the leading ones
and mentions pros and cons.  

  Implementing Persistent Identifiers: overview of concepts, guidelines and
recommendations (ECPA-report; 18) 
  by Hans-Werner Hilse and Jochen Kothe
  
http://www.knaw.nl/ecpa/publications.html
http://www.knaw.nl/ecpa/publ/pdf/2732.pdf

-- Larry

-
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems?  Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now   http://get.splunk.com/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Reply-To Header missing - list misconfigured?

2007-07-31 Thread Larry Stone
FWIW, some of the lists to which I subscribe mung the reply-to header
to force replies to go to the list, and some don't -- I'm invariably
annoyed by the ones that _do_ mung headers.  So take this as a vote NOT to
change the behavior of the DSpace lists.

My mail user agent is a certifiable antique, but it has simple, separate
commands for reply to sender and reply to all.  I prefer to have that
option, but it goes away if teh Reply-To header forces a reply to the
list even when I choose to write only to sender.  I suspect we'll see
a bunch of inadvertent posts to the whole lists if this gets changed, too.

-- Larry


-
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems?  Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now   http://get.splunk.com/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Dspace Bulk Upload

2007-09-07 Thread Larry Stone
 On 9/7/07, Nandita Chaudhri [EMAIL PROTECTED] wrote:
  If one is not using the submission UI to load items one by one but is using
  the bulk upload facility is there a way to have two bitstreams with only one
  as the 'primary' one which will be displayed when the item is retrived using
  a search? I knoe with the submission UI it is possible. Regards,

The METS package ingester (as of 1.4.2, at least) can set the primary
bitstream. It must be mentioned in the METS manifest;  the profile
document says how to do this, search for Primary bitstream in:
http://wiki.dspace.org/index.php/DSpaceMETSSIPProfile

You can ingest packages through the LNI or the prototype command-line
class, org.dspace.app.packager.Packager  - the system documentation has
all the details.

-- Larry


-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2005.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Help with METS Ingest Import

2007-10-26 Thread Larry Stone
See the Application Layer page in the DSpace system documentation
(docs/application.html in the source), the section Package Importer and
Exporter.  Briefly, the [dspace]/bin/packager command ingests packages
as well, you have to give it a -c collection-handle option and can
specify -s for submit, although i believe this is the default.

In any case, run [dspace]/bin/packager --help  for help with all options.

-- Larry

 I've been struggling trying to figure how to import METS ITEMS into
 DSPACE as in SIP. There's an export utility (dsrun) for exporting DSPACE
 items into METS Formats (DIP).  How would I go about doing the reverse,
 importing METS Items into a Collection.


-
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems?  Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now  http://get.splunk.com/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Viruses and DSpace

2007-11-16 Thread Larry Stone
 Has any thought been given to how Dspace might handle the remote (
 hopefully ) possibility of a file containing a virus being deposited
 into a repository?  It seems like jhove might be the kind of tool that
 could check for this.  I believe there is some work going on to
 incorporate jhove into Dspace, how is that coming along?  It's not part
 of of 1.5, but what about for the following release?

The BitstreamFormat renovation (see
http://wiki.dspace.org/index.php/BitstreamFormat_Renovation ) doesn't
address this directly, but will make it much easier to integrate tools
because file formats will be identified more effectively and precisely.

Once the format is known you can add a mechanism like the mediafilters,
perhaps integrated with workflow, to run specific checks depending on
the format type.

JHOVE version 1 is just a format validator and technical-metadata
extractor, it isn't subtle enough to look for viruses.

There _are_ tools in the email filtering domain which detect malicious
MS Office files; I've heard of them but don't remember specifics.  You
could start by looking around the SpamAssassin software and ClamAV
(see http://www.clamav.net/ )  However, be aware that any virus-checking
software needs constant updating since you're essentially in an arms race.

-- Larry (a recovering postmaster)


-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2005.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Reading other files in Manakin (was: Re: Desired hierarchical display of communities and collection list)

2007-11-26 Thread Larry Stone
I haven't looked at the Manakin code, but the MIME media-type of a Bitstream
must be coming from its associated BitstreamFormat -- so why not get
the human-readable name from the BitstreamFormat as well?  There is no
need to establish a separate map of MIME-type to user-friendly name
when it already exists in teh BitstreamFormat registry.

  String friendly = bitstream.getFormat().getShortDescription();

One complication, or perhaps advantage, of using BSFs directly is that
some of them have the same MIME-type,  so getting the friendly name
from the BSF actually identifies the format more precisely -- e.g.
XML-based formats might all have the MIME-type text/xml, but distinct
friendly names.  Thus, you should go to the Bitstream's BSF to get the
friendly name rather than attempt to use the BSF registry as a map,
because it might have multiple matches for one MIME-type.

-- Larry

 On Nov 21, 2007 10:06 PM, Conal Tuohy [EMAIL PROTECTED] wrote:
  On Wed, 2007-11-21 at 16:43 -0600, Dorothea Salo wrote:
  The mapping between media-types and friendly names could be introduced
  into the pipeline using a Manakin Aspect, and then utilised in a View,
  via XSLT.

 Aha. I can try to tackle this. What would be the closest existing code?

  Alternatively, perhaps this is really just a case of i18n?

 I thought about that, but I can't quite make it work happily. Every
 time an administrator adds a new bitstream format (something I assume
 Manakin still has UI for?), DSpace itself would have to make an
 automatic change to messages.xml, which is under most circumstances a
 human-authored and source-controlled file. Automagically changing it
 will make a mess of any installation that keeps its source in source
 control, I would think.

 Dorothea

 --
 Dorothea Salo[EMAIL PROTECTED]
 Digital Repository Librarian  AIM: mindsatuw
 University of Wisconsin
 Rm 218, Memorial Library
 (608) 262-5493

 -
 This SF.net email is sponsored by: Microsoft
 Defy all challenges. Microsoft(R) Visual Studio 2005.
 http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
 ___
 DSpace-tech mailing list
 DSpace-tech@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/dspace-tech


-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2005.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Reading other files in Manakin (was: Re: Desired hierarchical display of communities and collection list)

2007-11-28 Thread Larry Stone
I agree that adding techMD to the METS is architecturally the right
thing, but there is already a better (or at least, adequate..) technical
metadata standard format in PREMIS.  There is already a PREMIS crosswalk
for the import/export METS packager which you can cannibalize, see
org.dspace.content.crosswalk.PREMISCrosswalk


You could put the format name in the element:
  objectCharacteristics/format/formatDesignation/formatName
The crosswalk puts the MIME type there right now, but
it would make more sense to use it for the name.
(At the time I wrote the crosswalk, I didn't feel it was correct to
dignify the chaos that is MIME types by calling it a registry of format
identifiers, although we currently use it as such.)

Then you can use PREMIS for other bits of technical metadata of
use to Manakin, instead of perverting DIM.  DIM was never supposed to
escape beyond the internal use in XSLT-driven packager crosswalks!

There's more about PREMIS here: http://www.loc.gov/standards/premis/

-- Larry

 The METS file element lets you have an admid attribute that points
 to administrative metadata about the file elsewhere in the METS
 document. For our purposes, we would probably end up creating
 something like this (assume the new metadata is in a ds schema) for
 each bitstream (or perhaps for each format in an item? one could point
 more than one bitstream to the same amdSec):

 mets:amdSec id=bitstream_1
mets:techMD id=bitstream_1_techmd
   mets:mdWrap OTHERMDTYPE=DIM MDTYPE=other
  mets:xmlData
 dim:dim dspaceType=BITSTREAM
dim:field element=formatName mdschema=dsHappyFile
 version 1.01/dim:field
 /dim:dim
  /mets:xmlData
   /mets:mdWrap
/mets:techMD
 /mets:amdSec

 The place to pop this in, I *think*, is somewhere in
 xmlui.objectmanager.ItemAdapter.java. If what I just suggested doesn't
 seem outrageous, I'll take a whack at coding it up.

 Dorothea

 --
 Dorothea Salo[EMAIL PROTECTED]
 Digital Repository Librarian  AIM: mindsatuw
 University of Wisconsin
 Rm 218, Memorial Library
 (608) 262-5493

 -
 This SF.net email is sponsored by: Microsoft
 Defy all challenges. Microsoft(R) Visual Studio 2005.
 http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
 ___
 DSpace-tech mailing list
 DSpace-tech@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/dspace-tech


-
SF.Net email is sponsored by: The Future of Linux Business White Paper
from Novell.  From the desktop to the data center, Linux is going
mainstream.  Let it simplify your IT future.
http://altfarm.mediaplex.com/ad/ck/8857-50307-18918-4
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] why vanilla?

2007-12-14 Thread Larry Stone
 I don't visit the wiki often enough, but my impression has been that
 work-in-progress and completed projects are jumbled together, probably
 because when projects are completed they just stay on the page and
 stop getting updates.  Regularly moving released mod.s to a released
 projects page would help to make it clear what you can just plug in
 now and what you may be able to help develop, which is an important
 distinction depending on your needs and resources.

This is a good application of Categories in the wiki -- brand
project pages with either Project-In Progress or Project-Released, e.g.
Then the summary pages would be automatically generated.
See http://meta.wikimedia.org/wiki/Help:Category to learn all
about categories.

The use of categories looks somewhat haphazard right now; we really
need a meta-page explaining how the tags are supposed to be used and
offering guidelines to creating such things as project pages.

See: http://wiki.dspace.org/index.php/Special:Categories

-- Larry


-
SF.Net email is sponsored by:
Check out the new SourceForge.net Marketplace.
It's the best place to buy or sell services
for just about anything Open Source.
http://ad.doubleclick.net/clk;164216239;13503038;w?http://sf.net/marketplace
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Uploading a 10Gig File. (Blanco, Jose)

2008-01-09 Thread Larry Stone
Check the configurations of the Web server (e.g. apache) and/or servlet
container (tomcat, if using mod_jk) for limits on the largest upload allowed.
Start by looking at their error logs and transaction logs to see what
happened to your HTTP request.  There is theoretically no reason you
shouldn't be able to do this, but it probably exceeds all sorts of default
limits in the software stack.  For example, see the Apache httpd
LimitRequestBody directive.  (Hmm, apache 2.2 seems to have a hard
limit of 2 Gb in request bodies.)

I have experimented with 20Gb uploads using a bare Tomcat 5.0 server
and the HTTP PUT protocol in the LNI, so it ought to be possible.

-- Larry

 The file is uncompressed.  It is a mov file.  The tmp dir has enough
 space for it.  I am able to cp it there.  What I have found is that if
 the /tmp dir does not have enough space for it, it starts to write it
 there and then things stop running once the dir fills up, but in this,
 it never tries to write it there.

 -Original Message-
 From: [EMAIL PROTECTED]
 [mailto:[EMAIL PROTECTED] On Behalf Of Michael
 Faraci
 Sent: Wednesday, January 09, 2008 4:36 PM
 To: dspace-tech@lists.sourceforge.net
 Subject: [Dspace-tech] Uploading a 10Gig File. (Blanco, Jose)

 Message 4.

 Have you checked tmp directory size. Is the file compressed (zip/tar)? I
 would be concerned with the uncompressed size if it is. I have never
 dealt with submitting a file this large, so these are just some things
 that come to mind.

 Michael Faraci

 Network/Systems Administrator

 ALC

 3305 N. 3rd St.

 Suite 301

 Abilene, Texas

 (325)672-7081 x4

  fax (325)672-7082

 -Original Message-
 From: [EMAIL PROTECTED]
 [mailto:[EMAIL PROTECTED] On Behalf Of
 [EMAIL PROTECTED]
 Sent: Wednesday, January 09, 2008 11:14 AM
 To: dspace-tech@lists.sourceforge.net
 Subject: DSpace-tech Digest, Vol 21, Issue 20

 Send DSpace-tech mailing list submissions to
   dspace-tech@lists.sourceforge.net

 To subscribe or unsubscribe via the World Wide Web, visit
   https://lists.sourceforge.net/lists/listinfo/dspace-tech
 or, via email, send a message with subject or body 'help' to
   [EMAIL PROTECTED]

 You can reach the person managing the list at
   [EMAIL PROTECTED]

 When replying, please edit your Subject line so it is more specific than
 Re: Contents of DSpace-tech digest...


 Today's Topics:

1. Re: Cannot delete collections from Dspace (Claudia J?rgen)
2. Re: Cannot delete collections from Dspace (Graham Triggs)
3. Re: Creative Commons Enigma (Richard Rodgers)
4. Uploading a 10Gig File. (Blanco, Jose)
5. Re: Getting image thumbnails (Keith Gilbertson)
6. Collection Item Mapper Search (Kyle Kaliebe)


 --

 Message: 1
 Date: Wed, 09 Jan 2008 15:02:30 +0100
 From: Claudia J?rgen  [EMAIL PROTECTED]
 Subject: Re: [Dspace-tech] Cannot delete collections from Dspace
 To: Filippos Kolovos [EMAIL PROTECTED]
 Cc: Dspace-Tech-List dspace-tech@lists.sourceforge.net,
   Dspace-general-list [EMAIL PROTECTED]
 Message-ID: [EMAIL PROTECTED]
 Content-Type: text/plain; charset=UTF-8; format=flowed

 Hi Filippos,


 are you sure you're running 1.4.2?
 The table dcvalue does not exist anymore in 1.4.2. It has been
 replaced in 1.4.

 In 1.3.2 there has been a known bug
 http://sourceforge.net/tracker/index.php?func=detailaid=1333269group_i
 d=19984atid=119984
 on the deletion of collections/communities containing withdrawn items.

 cheers

 Claudia


 Filippos Kolovos schrieb:
  Dear all,
 
  Recently I am dealing with a problem with my Dspace installation
 (1.4.2).
  When I try to delete a collection and/or community from Dspace the
  system returns an Internal Server Error. The explanation of this
  error is shown below:
 
  Exception:
  org.postgresql.util.PSQLException: ERROR: update or delete on item
 violates foreign key constraint $1 on dcvalue
 
  at
 org.postgresql.util.PSQLException.parseServerError(PSQLException.java:13
 9)
  at org.postgresql.core.QueryExecutor.executeV3(QueryExecutor.java:152)
  at org.postgresql.core.QueryExecutor.execute(QueryExecutor.java:100)
  at org.postgresql.core.QueryExecutor.execute(QueryExecutor.java:43)
  at
 org.postgresql.jdbc1.AbstractJdbc1Statement.execute(AbstractJdbc1Stateme
 nt.java:517)
  at
 org.postgresql.jdbc2.AbstractJdbc2Statement.execute(AbstractJdbc2Stateme
 nt.java:50)
  at
 org.postgresql.jdbc1.AbstractJdbc1Statement.executeUpdate(AbstractJdbc1S
 tatement.java:273)
  at
 org.apache.commons.dbcp.DelegatingPreparedStatement.executeUpdate(Delega
 tingPreparedStatement.java:101)
  at
 org.apache.commons.dbcp.DelegatingPreparedStatement.executeUpdate(Delega
 tingPreparedStatement.java:101)
  at
 org.dspace.storage.rdbms.DatabaseManager.updateQuery(DatabaseManager.jav
 a:519)
  etc, etc
  ..
  .
 
  The exact reason for this error seems to be that the collection id is
 referenced from another table, 

Re: [Dspace-tech] remote Client Program for accessing and managing theDSpace

2008-04-17 Thread Larry Stone
There are already several existing network interfaces -- are you sure
none of them give the access you need?  See the wiki:

http://wiki.dspace.org/index.php/NetworkInterfaces

In particular, the LNI -- see:
http://wiki.dspace.org/index.php/LightweightNetworkInterface

was designed to be extensible and to eventually support a full
administrative interface, if needed.  It's based on WebDAV, but it
also contains an alternate SOAP interface for _some_ (not all)
functions.  I recommend reading the original preserved wiki page at:

  http://web.mit.edu/lcs/www/lni/

because the one in the current wiki still contains a lot of damage from
an inaccurate translation to MediaWiki.  The source code is part of DSpace
1.5, so you can refer to it as a SOAP example.

hope this helps,

-- Larry

 I am going to write a client program to manage and access the DSpace
 repository remote. It should look like a normal application. So I think that
 I should use the some communication protocol to communicate with the DSpace
 servlet web services. For example: SOAP is a very gut one. But the question
 is, whether the DSpace supports it? And how many DSpace Websites use the
 SOAP?

 I tried to search an answer from the System Document online in DSpace. You
 know, that I didn't any answer. DSpace is written with Java. So I think it
 should support the SOAP with RPC.

 The DSpace Mailinglist is very hot. So i hope, someone can help me!
 Thanks and regards
 Feng Wang


 --
 GMX startet ShortView.de. Hier findest Du Leute mit Deinen Interessen!
 Jetzt dabei sein: http://www.shortview.de/[EMAIL PROTECTED]

 -
 This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss
 this year's exciting event. There's still time to save $100.
 Use priority code J8TL2D2.
 http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javao
 ne
 ___
 DSpace-tech mailing list
 DSpace-tech@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/dspace-tech


 -
 This SF.net email is sponsored by the 2008 JavaOne(SM) Conference
 Don't miss this year's exciting event. There's still time to save $100.
 Use priority code J8TL2D2.
 http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone
 ___
 DSpace-tech mailing list
 DSpace-tech@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/dspace-tech


-
This SF.net email is sponsored by the 2008 JavaOne(SM) Conference 
Don't miss this year's exciting event. There's still time to save $100. 
Use priority code J8TL2D2. 
http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] bitstreams and file locations

2008-05-05 Thread Larry Stone
The Bitstream's name field is also significant for Items taht are
archived websites with internal links to other Bitstreams in the
item.  The UI (the JSP UI, at least) has a special html servlet
that resolves references to the *name* of the Bitstream instead
of its Sequence ID (like the normal handle servlet).  For example,
this Item is an archived OCW website:
http://dspace.mit.edu/html/1721.1/41055/12-820Spring-2005/OcwWeb/Earth--Atmospheric--and-Planetary-Sciences/12-820Spring-2005/CourseHome/index.htm

and it refers to an image Bitstream with the URL
http://dspace.mit.edu/html/1721.1/41055/12-820Spring-2005/NR/rdonlyres/Earth--Atmospheric--and-Planetary-Sciences/12-820Spring-2005/3B9453A1-78D4-49B4-9DBD-B1F39C78CB29/0/chp_turbulent.jpg

The Item handle is 1721.1/41055
and the Bitstream's name field is
12-820Spring-2005/NR/rdonlyres/Earth--Atmospheric--and-Planetary-Sciences/12-820Spring-2005/3B9453A1-78D4-49B4-9DBD-B1F39C78CB29/0/chp_turbulent.jpg

...so, in a case like this, it's _essential_ to preserve the
names of the Bitstreams or your web page will not work.

-- Larry

 I have a simple question about the name column in the bitstream table.
 We're using 1.3.1 and migrating to a new server (but keeping 1.3.1 for
 now)
 Does this field have any value other than provenance information about
 the source of the ingested bitstream.
 The actual bitstreams are being stored in the assetstore and
 referenced differently, so I'm not sure about the reason for the name
 column?
 In a test migration the lack of a physical file referenced in the name
 column doesn't appear to have any consequence?

 Thanks
 Eric
 American Museum of Natural History


-
This SF.net email is sponsored by the 2008 JavaOne(SM) Conference 
Don't miss this year's exciting event. There's still time to save $100. 
Use priority code J8TL2D2. 
http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] File Descriptions in Import

2008-05-09 Thread Larry Stone
 So, my question is: Can File descriptions for multiple bitstreams be
 added via the importer?

Remember, the Packager plugins are another way to ingest Items -- SWORD,
the LNI, and a command-line client (/dspace/bin/packager) can all invoke
any of the package ingester plugins.

Unfortunately, the most sophisticated packager that comes in the release,
the METS (DSpace SIP profile) packager, does not recognize Bitstream
descriptions.  I *think* this is because there was no good place to
put that metadata in the PREMIS object schema that we use for Bitstream-level
metadata.

There is a package-based solution that would let you ingest every
detail of an Item -- the AIP prototype, see
http://wiki.dspace.org/index.php/AipPrototype
The whole nature of an AIP is that is a *complete* description of the
DSpace object, down to every detail, so you don't lose anything
when e.g. migrating Items between archives through AIPs.

It's probably prohibitively hard to resuscitate this code, but perhaps
you can just spin off your own customized METS ingester.
Look at the AIP as a model if you want to adapt the existing METS packager code.
You can subclass it pretty easily and just change the Bitstream
crosswalk plugin it calls, perhaps to a modified PREMIS crosswalk, or
use DC or something like that for Bitstream metadata.

-- Larry


-
This SF.net email is sponsored by the 2008 JavaOne(SM) Conference 
Don't miss this year's exciting event. There's still time to save $100. 
Use priority code J8TL2D2. 
http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] File Descriptions in Import

2008-05-14 Thread Larry Stone
 Never mind; answered my own question. This DOES NOT work for imported
 rather than registered bitstreams. Ah, well. I can edit bitstream
 descriptions manually for 300+ items; why not?

You can update Bitstream description fields under program control with
the LNI.  It may even be possible to do this pretty easily with a
scriptable WebDAV client like cadaver, or by writing a very simple LNI
client.  See
http://wiki.dspace.org/index.php/LightweightNetworkInterface#PROPFIND_and_PROPPATCH_XML_elements

Bitstream resources have a dspace:description property that is writable.

Please refer to http://web.mit.edu/lcs/www/lni/ for a cleaner version
of the LNI API document -- the one on the wiki is still very chewed-up from
the conversion.

-- Larry


-
This SF.net email is sponsored by: Microsoft 
Defy all challenges. Microsoft(R) Visual Studio 2008. 
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] LNI - no index.html; The requested resource (/lni/) is not available

2008-05-16 Thread Larry Stone
Actually, it responds to GET on some URIs, but the LNI is *not*
meant to be used as an interactive web site; it is a WebDAV server.
WebDAV happens to use the HTTP protocol but not in a way that gets along
with the subset (and perversions) of HTTP most browsers speak.
It also has some ability to converse in SOAP (for a subset of its functions)
but that is even less browser-friendly.

For all the details and leads, see the LNI documentation at
http://web.mit.edu/lcs/www/lni/

You can also download a sample SOAP client there.  For some reason,
the client utilities and sample client were not included in 1.5.

-- Larry

 not elegant or intuitive...

 http://host/lni/lni

 And you need the client to interact because it requires POST.

 -Mark

 On May 16, 2008, at 12:52 PM, Rick Runyan wrote:

  lni is right there in the Tomcat webapps folder right beside xmlui,  =

  which works like a champ.  I=92m not experienced with web apps, and  =

  even less so with java web apps, so I=92m at something of a loss  =

  here.  I=92ve spent a while reading and trying to figure out the  =

  problem, with not much success.  Mostly, it seems like there should  =

  be an index.html file in the WEB-INF directory, but there=92s not one.
 
 
 
  Anybody got lni working who=92d like to give me a hint?
 
 
 
  - Rick
 
 
 
 
 
  -- =

  ---
  This SF.net email is sponsored by: Microsoft
  Defy all challenges. Microsoft(R) Visual Studio 2008.
  http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ =

  ___
  DSpace-tech mailing list
  DSpace-tech@lists.sourceforge.net
  https://lists.sourceforge.net/lists/listinfo/dspace-tech


 -
 This SF.net email is sponsored by: Microsoft =

 Defy all challenges. Microsoft(R) Visual Studio 2008. =

 http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
 ___
 DSpace-tech mailing list
 DSpace-tech@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/dspace-tech


-
This SF.net email is sponsored by: Microsoft 
Defy all challenges. Microsoft(R) Visual Studio 2008. 
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Creating a WSDL file for LNI

2008-05-22 Thread Larry Stone
There was a WSDL file in the original source; it was generated automatically,
by an ant target that does not seem to have survived into the brave new world
of maven.  (You could look at the old LNI patches, see
http://web.mit.edu/lcs/www/lni )

However, teh WSDL is still in the DSpace 1.5 source distribution,
its path (relative to the tar file downloaded from sourceforge):
dspace-lni/dspace-lni-client/src/main/resources/dspace-lni.wsdl

This was built automatically by what is now a very old version of Axis,
so good luck with it..

What I *really* recommend is to forget about SOAP entirely.  You need
WebDAV anyway for the GET and PUT methods (package export and ingest),
and all of the rest of the functionality is availble through WebDAV too.
The SOAP calls are just a thin veneer over WebDAV, only included because
of contractual obligations.  You may find it much easier to simply use
a generic WebDAV client.

-- Larry

 I have been fighting with this for about three days now, so I'm giving
 up the thought that I can figure it out on my own.

 =20

 I want to call LNI from a .NET client, so I need a WSDL file (at least,
 I think I do).  I've been trying to create one using the Axis JAVA2WSDL
 command, but I'm always getting a Class not found error.  Speaking of
 which, I assume that the class I'd want to do this to is
 LNISoapServlet.java...

 =20

 Has anybody done this successfully and, if so, can you tell me how you
 did it?

 =20

 - Rick

 =20

 =20


 --_=_NextPart_001_01C8BC3C.DBC95B09
 Content-Type: text/html;
   charset=us-ascii
 Content-Transfer-Encoding: quoted-printable

 html xmlns:o=3Durn:schemas-microsoft-com:office:office =
 xmlns:w=3Durn:schemas-microsoft-com:office:word =
 xmlns=3Dhttp://www.w3.org/TR/REC-html40;

 head
 meta http-equiv=3DContent-Type content=3Dtext/html; =
 charset=3Dus-ascii
 meta name=3DGenerator content=3DMicrosoft Word 11 (filtered medium)
 style
 !--
  /* Font Definitions */
  @font-face
   {font-family:Californian FB;
   panose-1:2 7 4 3 6 8 11 3 2 4;}
  /* Style Definitions */
  p.MsoNormal, li.MsoNormal, div.MsoNormal
   {margin:0in;
   margin-bottom:.0001pt;
   font-size:12.0pt;
   font-family:Times New Roman;}
 a:link, span.MsoHyperlink
   {color:blue;
   text-decoration:underline;}
 a:visited, span.MsoHyperlinkFollowed
   {color:purple;
   text-decoration:underline;}
 span.EmailStyle17
   {mso-style-type:personal-compose;
   font-family:Californian FB;
   color:windowtext;
   font-weight:normal;
   font-style:normal;
   text-decoration:none none;}
 @page Section1
   {size:8.5in 11.0in;
   margin:1.0in 1.25in 1.0in 1.25in;}
 div.Section1
   {page:Section1;}
 --
 /style

 /head

 body lang=3DEN-US link=3Dblue vlink=3Dpurple

 div class=3DSection1

 p class=3DMsoNormalfont size=3D2 face=3DCalifornian FBspan =
 style=3D'font-size:
 11.0pt;font-family:Californian FB'Hi =
 #8211;o:p/o:p/span/font/p

 p class=3DMsoNormalfont size=3D2 face=3DCalifornian FBspan =
 style=3D'font-size:
 11.0pt;font-family:Californian FB'o:pnbsp;/o:p/span/font/p

 p class=3DMsoNormalfont size=3D2 face=3DCalifornian FBspan =
 style=3D'font-size:
 11.0pt;font-family:Californian FB'I have been fighting with this for =
 about
 three days now, so I#8217;m giving up the thought that I can figure it =
 out on
 my own.o:p/o:p/span/font/p

 p class=3DMsoNormalfont size=3D2 face=3DCalifornian FBspan =
 style=3D'font-size:
 11.0pt;font-family:Californian FB'o:pnbsp;/o:p/span/font/p

 p class=3DMsoNormalfont size=3D2 face=3DCalifornian FBspan =
 style=3D'font-size:
 11.0pt;font-family:Californian FB'I want to call LNI from a .NET =
 client, so I
 need a WSDL file (at least, I think I do).nbsp; I#8217;ve been trying =
 to
 create one using the Axis JAVA2WSDL command, but I#8217;m always =
 getting a #8220;Class
 not found#8221; error.nbsp; Speaking of which, I assume that the class =
 I#8217;d
 want to do this to is =
 LNISoapServlet.java#8230;o:p/o:p/span/font/p

 p class=3DMsoNormalfont size=3D2 face=3DCalifornian FBspan =
 style=3D'font-size:
 11.0pt;font-family:Californian FB'o:pnbsp;/o:p/span/font/p

 p class=3DMsoNormalfont size=3D2 face=3DCalifornian FBspan =
 style=3D'font-size:
 11.0pt;font-family:Californian FB'Has anybody done this successfully =
 and, if
 so, can you tell me how you did it?o:p/o:p/span/font/p

 p class=3DMsoNormalfont size=3D2 face=3DCalifornian FBspan =
 style=3D'font-size:
 11.0pt;font-family:Californian FB'o:pnbsp;/o:p/span/font/p

 p class=3DMsoNormalfont size=3D3 face=3DCalifornian FBspan =
 style=3D'font-size:
 12.0pt;font-family:Californian FB'- Rick/span/fontfont size=3D2
 face=3DArialspan =
 style=3D'font-size:10.0pt;font-family:Arial'o:p/o:p/span/font/p=
 

 p class=3DMsoNormalfont size=3D3 face=3DTimes New Romanspan =
 style=3D'font-size:
 12.0pt'nbsp;/spano:p/o:p/font/p

 p class=3DMsoNormalfont size=3D3 face=3DTimes New Romanspan =
 style=3D'font-size:
 

Re: [Dspace-tech] LNI/WebDAV : GET returns 501

2008-05-29 Thread Larry Stone
GET only works on Items and Bitstreams, so if you get the resource path
to a Bitstream from PROPFIND (depth=1) on an Item, then you ought to be able
to do a GET on it and receive its contents.

This path looks like you've traversed some communities and collections:

 http://[server]:8080/lni/dav/dso_123456789%241/dso_123456789%242/dso_123456789%243

You can also get a resource URL by doing a lookup on a Handle, e.g. GET
on http://[server]:8080/lni/dav/lookup/handle/123456789/42
returns a Location: header with the URL.  I don't think it works on
Bitstreams, though.

Anyway, if you do a PROPFIND on that resource you're trying to GET, look
at the value for type (in the http://www.dspace.org/xmlns/dspace; namespace).
It'll be an empty element for the DSpace object type, either collection/,
community/, item/, bitstream/

Typically, the Bitstream resources have a path ending in
  /bitstream_{n}.{suffix}, e.g.  .../bitstream_3.pdf

So, I think the problem here is that you've got a resource for which
GET isn't implemented.

For some sketchy doc which is unfortunately the best available, see
http://web.mit.edu/lcs/www/lni/

-- Larry

 I've finally figured out how to hook up a .NET app to LNI and am running
 some exploratory tests, using straight HTTP/DAV rather than SOAP. =20

 Alas, GET doesn't seem to work.  I've done a propfind call to get URIs
 for the bitstreams in a collection, but when I try to GET one of them, I
 get

 =20

 System.ApplicationException: Error trying to GET
 http://[server]:8080/lni/dav/dso_123456789%241/dso_123456789%242/dso_123
 456789%243 ---=20

 System.Net.WebException: The remote server returned an error: (501) Not
 Implemented.

  at System.Net.HttpWebRequest.GetResponse()

 =20

 I am unfamiliar with Axis web services, so I'm having to learn
 everything as I go, which slows me down a lot.

 This entry in server-config.wsdd listing allowedMethods looks
 suspicious to me

  service name=3DDSpaceLNI provider=3Djava:RPC

 parameter name=3DallowedMethods value=3Dpropfind proppatch =
 lookup
 copy/

 parameter name=3DclassName
 value=3Dorg.dspace.app.dav.LNISoapServlet/

  /service

 But the web.xml file has a different class
 (org.dspace.app.dav.DAVServlet) set up to respond to /dav URIs, so I
 suspect that's not my problem.

 =20

 - Rick

 =20

 =20


 --_=_NextPart_001_01C8C190.AFE31B16
 Content-Type: text/html;
   charset=us-ascii
 Content-Transfer-Encoding: quoted-printable

 html xmlns:o=3Durn:schemas-microsoft-com:office:office =
 xmlns:w=3Durn:schemas-microsoft-com:office:word =
 xmlns=3Dhttp://www.w3.org/TR/REC-html40;

 head
 meta http-equiv=3DContent-Type content=3Dtext/html; =
 charset=3Dus-ascii
 meta name=3DGenerator content=3DMicrosoft Word 11 (filtered medium)
 style
 !--
  /* Font Definitions */
  @font-face
   {font-family:Californian FB;
   panose-1:2 7 4 3 6 8 11 3 2 4;}
  /* Style Definitions */
  p.MsoNormal, li.MsoNormal, div.MsoNormal
   {margin:0in;
   margin-bottom:.0001pt;
   font-size:12.0pt;
   font-family:Times New Roman;}
 a:link, span.MsoHyperlink
   {color:blue;
   text-decoration:underline;}
 a:visited, span.MsoHyperlinkFollowed
   {color:purple;
   text-decoration:underline;}
 span.EmailStyle17
   {mso-style-type:personal-compose;
   font-family:Californian FB;
   color:windowtext;
   font-weight:normal;
   font-style:normal;
   text-decoration:none none;}
 @page Section1
   {size:8.5in 11.0in;
   margin:1.0in 1.25in 1.0in 1.25in;}
 div.Section1
   {page:Section1;}
 --
 /style

 /head

 body lang=3DEN-US link=3Dblue vlink=3Dpurple

 div class=3DSection1

 p class=3DMsoNormalfont size=3D2 face=3DCalifornian FBspan =
 style=3D'font-size:
 11.0pt;font-family:Californian FB'I#8217;ve finally figured out how =
 to hook
 up a .NET app to LNI and am running some exploratory tests, using =
 straight
 HTTP/DAV rather than SOAP. nbsp;o:p/o:p/span/font/p

 p class=3DMsoNormalfont size=3D2 face=3DCalifornian FBspan =
 style=3D'font-size:
 11.0pt;font-family:Californian FB'Alas, GET doesn#8217;t seem to
 work.nbsp; I#8217;ve done a propfind call to get URIs for the =
 bitstreams in a
 collection, but when I try to GET one of them, I =
 geto:p/o:p/span/font/p

 p class=3DMsoNormalfont size=3D2 face=3DCalifornian FBspan =
 style=3D'font-size:
 11.0pt;font-family:Californian FB'o:pnbsp;/o:p/span/font/p

 p class=3DMsoNormalfont size=3D2 face=3DCalifornian FBspan =
 style=3D'font-size:
 11.0pt;font-family:Californian FB'System.ApplicationException: Error =
 trying
 to GET =
 http://[server]:8080/lni/dav/dso_123456789%241/dso_123456789%242/dso_1234=
 56789%243
 ---gt; o:p/o:p/span/font/p

 p class=3DMsoNormalfont size=3D2 face=3DCalifornian FBspan =
 style=3D'font-size:
 11.0pt;font-family:Californian FB'System.Net.WebException: The remote =
 server
 returned an error: (501) Not Implemented.o:p/o:p/span/font/p

 p class=3DMsoNormalfont size=3D2 face=3DCalifornian 

Re: [Dspace-tech] LNI/WebDAV : GET returns 501

2008-06-02 Thread Larry Stone
You can see how the lookup feature works by using a simple HTTP client
like the GNU wget application to do a GET, and reading its diagnostic
output.. for example, this does a lookup of handle 123456789/1
using HTTP Basic authentication to login (assuming you have username/password
login on your server):

wget [EMAIL PROTECTED] --http-password=PASSWORD -S 
http://MYDSPACE.mit.edu:8080/lni/dav/lookup/handle/123456789/1

Now, watch the headers -- the request as a whole will fail because wget
is aggressive about responding to the HTTP 302 status with another GET on
the real resource, but that GET probably isn't allowed or is missing some
option.  Anyway, just look for the Location: header in the first response:

Connecting to rotarran.mit.edu|18.42.4.43|:8080... connected.
HTTP request sent, awaiting response...
  HTTP/1.1 302 These are not the droids you are looking for.
  Set-Cookie: DSpaceDavAuth=1212468843785!1!18.42.4.8!ac52bf780d4bb6eb486b810a7d
5c4f24; Path=/lni
  Location: http://rotarran.mit.edu:8080/lni/dav/dso_123456789%241
  ^^

  Content-Length: 0
  Date: Tue, 03 Jun 2008 04:54:03 GMT
  Server: Apache-Coyote/1.1
  Connection: Keep-Alive

{Here's the second try, which fails.. ignore it, we've got what we need.}

Location: http://rotarran.mit.edu:8080/lni/dav/dso_123456789%241 [following]
--00:54:03--  http://rotarran.mit.edu:8080/lni/dav/dso_123456789%241
   = `dso_123456789$1'
Reusing existing connection to rotarran.mit.edu:8080.
HTTP request sent, awaiting response...
  HTTP/1.1 501 GET method not implemented for Community.
  Content-Type: text/html;charset=utf-8
  Content-Length: 1112
  Date: Tue, 03 Jun 2008 04:54:03 GMT
  Server: Apache-Coyote/1.1
  Connection: close
00:54:03 ERROR 501: GET method not implemented for Community..

This ought to show how to implement what you need with Apache HTTPClient
or something like that.

You're not using Perl, are you?  The WebDAV library on CPAN was horribly
broken last I looked, not really usable for this at all.

Also, note that you don't *need* to go through the hierarchy to get at
any Handle in the archive.  If you already have the Handle of the resource
you're looking for, just do a lookup to get its URL and use that.
Traversing the hierarchy is just for exploring, and because it's the
model of HTTP and WebDAV.

-- Larry

 Thanks for the helpful response.  I've still been unable to do a lookup
 or a get, but I'm concentrating right now on walking through the
 hierarchy to learn what turns up and what kind of properties I'll find.
 When I get comfortable with that, I'll re-read your message and try
 again.

 - Rick



-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


[Dspace-tech] build cookbook

2008-06-20 Thread Larry Stone
I've added a page of cookbook-style instructions for integrating 
local modificatiosn with the new build system, see:

http://wiki.dspace.org/index.php/BuildCookbook

it builds on the OR08 presentation Customizing DSpace 1.5 with Basic
Overlays; in fact, that has other useful examples which might well
be added to the wiki page for easier access.

-- Larry

-
Check out the new SourceForge.net Marketplace.
It's the best place to buy or sell services for
just about anything Open Source.
http://sourceforge.net/services/buy/index.php
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] DSpace-tech Digest, Vol 27, Issue 3

2008-07-01 Thread Larry Stone
See
http://wiki.dspace.org/index.php/Simple_LNI_Client
for an example of the same idea in Java.  It uses only the WebDAV protocol.
Perhaps you can use that code as a model for your C# application.

When you're done, please consider posting your client on the wiki too.

-- Larry

 I'm working on a project that has to be able to insert items on DSpace thro=
 ugh a C# application...
 I am going to use LNI (LightWeightNetworkInterface) and a WebDAV C# library=
  that has been realesed.
 =20
 Does anyone has done this before ? I think i will need a little help here..=
 .=20
 =20
 What do you think the software that i'm going to use? Is it the correct one=
  for this ?
 =20
 Best regards,
 Guilherme Saraiva=20
 _
 Cansado de espa=E7o para s=F3 50 fotos? Conhe=E7a o Spaces, o site de relac=
 ionamentos com at=E9 6,000 fotos!
 http://www.amigosdomessenger.com.br=

 --_414b706f-7072-413e-8b54-b083041eedf5_
 Content-Type: text/html; charset=iso-8859-1
 Content-Transfer-Encoding: quoted-printable

 html
 head
 style
 .hmmessage P
 {
 margin:0px;
 padding:0px
 }
 body.hmmessage
 {
 FONT-SIZE: 10pt;
 FONT-FAMILY:Tahoma
 }
 /style
 /head
 body class=3D'hmmessage'
 BRHello again.BR
 nbsp;BR
 I'm working on a project that has to be able to insert items on DSpace thro=
 ugh a C# application...BR
 I am going to use LNI (LightWeightNetworkInterface) and a WebDAV C# library=
  that has been realesed.BR
 nbsp;BR
 Does anyone has done this before ? I think i will need a little help here..=
 . BR
 nbsp;BR
 What do you think the software that i'm going to use? Is it the correct one=
  for this ?BR
 nbsp;BR
 Best regards,BR
 Guilherme Saraivanbsp;BRbr /hr /Instale a Barra de Ferramentas com D=
 esktop Search e ganhe EMOTICONS para o Messenger! a href=3D'http://www.msn=
 .com.br/emoticonpack' target=3D'_new'=C9 GR=C1TIS!/a/body
 /html=

 --_414b706f-7072-413e-8b54-b083041eedf5_--


 --===1635202196==
 Content-Type: text/plain; charset=us-ascii
 MIME-Version: 1.0
 Content-Transfer-Encoding: 7bit
 Content-Disposition: inline

 -
 Sponsored by: SourceForge.net Community Choice Awards: VOTE NOW!
 Studies have shown that voting for your favorite open source project,
 along with a healthy diet, reduces your potential for chronic lameness
 and boredom. Vote Now at http://www.sourceforge.net/community/cca08
 --===1635202196==
 Content-Type: text/plain; charset=us-ascii
 MIME-Version: 1.0
 Content-Transfer-Encoding: 7bit
 Content-Disposition: inline

 ___
 DSpace-tech mailing list
 DSpace-tech@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/dspace-tech

 --===1635202196==--



-
Sponsored by: SourceForge.net Community Choice Awards: VOTE NOW!
Studies have shown that voting for your favorite open source project,
along with a healthy diet, reduces your potential for chronic lameness
and boredom. Vote Now at http://www.sourceforge.net/community/cca08
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Filter-media in 15

2008-07-11 Thread Larry Stone
It _should_ never get an NPE over a missing configuration key.  Given
our sparse testing resources it behooves us to make the code as
resilient as possible.

Could you submit a bug report about this (or, even better, a patch) --
ideally it could would *log an error* if the configuration property is
missing and set filternames to some default that may be illogical (i.e.
empty list of plugins) but at least won't blow up.

The error log entry explains the exact problem to the admin, who may not
have known about the new configuration key, but can now tell exactly
what to look for.

See org.dspace.core.PluginManager for an example; it has a lot of protection
against configuration errors because those are to be expected when
plugin configurations get changed at every site.

-- Larry

 I see that it blows up on:

 String enabledPlugins =
 ConfigurationManager.getProperty(MEDIA_FILTER_PLUGINS_KEY);
 filterNames = enabledPlugins.split(,\\s*);

 But when I check the dspace.cfg file I don't see a
 MEDIA_FILTER_PLUGINS_KEY

 Filter-media will need to be run as a cron job in 1.5, right?

 Thanks!
 Jose

 -Original Message-
 From: [EMAIL PROTECTED]
 [mailto:[EMAIL PROTECTED] On Behalf Of Blanco,
 Jose
 Sent: Friday, July 11, 2008 11:14 AM
 To: dspace-tech@lists.sourceforge.net
 Subject: [Dspace-tech] Filter-media in 15

 When I run filter-media in 1.5, I'm getting:

 -bash-3.2$ ./filter-media
 Applying Media Filters
 Exception in thread main java.lang.NullPointerException
 at
 org.dspace.app.mediafilter.MediaFilterManager.main(MediaFilterManager.ja
 va:240)

 
 -
 Sponsored by: SourceForge.net Community Choice Awards: VOTE NOW!
 Studies have shown that voting for your favorite open source project,
 along with a healthy diet, reduces your potential for chronic lameness
 and boredom. Vote Now at http://www.sourceforge.net/community/cca08
 ___
 DSpace-tech mailing list
 DSpace-tech@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/dspace-tech

 -
 Sponsored by: SourceForge.net Community Choice Awards: VOTE NOW!
 Studies have shown that voting for your favorite open source project,
 along with a healthy diet, reduces your potential for chronic lameness
 and boredom. Vote Now at http://www.sourceforge.net/community/cca08
 ___
 DSpace-tech mailing list
 DSpace-tech@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/dspace-tech


-
Sponsored by: SourceForge.net Community Choice Awards: VOTE NOW!
Studies have shown that voting for your favorite open source project,
along with a healthy diet, reduces your potential for chronic lameness
and boredom. Vote Now at http://www.sourceforge.net/community/cca08
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] [Dspace-devel] Batch Import document

2008-07-21 Thread Larry Stone
There _was_ documentation for the venerable batch importer in the
HTML pages in the docs subdirectory; in a source checkout, it's at
the relative path:
dspace/trunk/dspace/docs/application.html#itemimporter
(I can't get the location in the Subversion repository because
Sourceforge seems to be having a bad hair day..)

Your new PDF document is better -- could you at least put it up on the
wiki for how?

The default METS SIP (Submission Information Package) accepted by the package
ingester is also mentioned in the HTML docs:
dspace/trunk/dspace/docs/application.html#packager
but the link to a document describing the METS SIP profile is broken:
http://www.dspace.org/standards/METS/SIP/profilev1p0/metsipv1p0.pdf

There _is_ a page about it on the WIKI, although it could use some cleanup:
http://wiki.dspace.org/index.php/DSpaceMETSSIPProfile
It could use an annotated example of a complete submission package, I
think that would be the most helpful part for anyone putting their own
SIPs together.  (They should also see the DSpace SIP Toolkit wiki page)

-- Larry

 It's a funny thing, the DSpace Batch Import Format (of which there are
 two) are not documented anywhere other than looking at the source
 code. We've run across the situation where a department has some
 content to put into the repository and the resources to create a batch
 import, but just don't know how to do it. To solve this problem, we've
 created a simple hand out to provide to a programmer or technical
 person in another department or group when they want to develop their
 own batch import script. While it doesn't cover the advanced topics,
 like registering bitstreams etc, nor dose it go over the package
 importer, it should solve the problem for the vast majority of cases
 where someone needs to create a batch import.

 I thought this might be useful to others out there, the only other
 resource on creating batch imports I was able to find is Dorothea's
 blog: http://cavlec.yarinareth.net/2008/01/07/the-dspace-batch-
 importer/.

 Here is our handout:
 https://www.tdl.org/documents/DSpaceBatchImportFormat.pdf

 Scott--






 -
 This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
 Build the coolest Linux based applications with Moblin SDK  win great prizes
 Grand prize is a trip for two to an Open Source event anywhere in the world
 http://moblin-contest.org/redirect.php?banner_id=100url=/
 ___
 Dspace-devel mailing list
 [EMAIL PROTECTED]
 https://lists.sourceforge.net/lists/listinfo/dspace-devel


-
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK  win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100url=/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] [Dspace-devel] Batch Import document

2008-07-25 Thread Larry Stone
While we're talking about the fate of DSpace documentation,
I'd like to put in a plea to maintain a Web-friendly version *as well*.
When consulting the doc for other open-source projects (usually because
they are components of DSpace..) I find the online HTML version of
the doc a lot more handy than PDF: it's easier to navigate, faster
to get to, easier to read on a computer display.
PDF is a whole lot better suited to printing than to viewing online.

The PostgreSQL site is an excellent example of an online manual --
see http://www.postgresql.org/docs/8.2/interactive/index.html
Note that it also has a place to attach user comments (along the
lines of a blog). These often have very useful hints and tidbits.

-- Larry



-
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK  win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100url=/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Reusing bitstream sequence number

2008-08-15 Thread Larry Stone
  What is it that you need to accomplish?
 I need to be able to have a stable reference for a bitstream which
 might undergo a number of version updates. I do not expect to save the
 earlier versions at this time.

  Mark is correct about overloading the semantics here.  Note, We
  adjusted the behavior behind the dspace 1.5 XMLUI (but not the JSPUI)
  to allow for unsequenced name resolution of the bitstreams. For
  instance:
 
  http://dspace.mit.edu/bitstream/handle/1721.1/39126/
  womenpolicymakers_census_dta.tab
  http://dspace.mit.edu/bitstream/handle/1721.1/39126/
  womenpolicymakers_census_dta.tab?sequence=3
  http://dspace.mit.edu/bitstream/handle/1721.1/39126/3/
  womenpolicymakers_census_dta.tab
 
  Are now all valid references the bitstream at this location. In the
  case where the sequence number is absent, the first bitstream
  encountered in the Item with that name is returned.
 I think this is what I need. Was this done via xslt or in servlet code.

This is also possible in the JSPUI with the html servlet; it was
intended to let DSpace serve an archived website.  In certain circumstances
(at least in 1.4, it was when the primary bitstream appeared to be an HTML
file) the item page uses the html servlet instead of the bitstream
servlet in the links to bitstreams, and only shows the primary bitstream.

For example, these are equivalent references to bitstreams, but note
how the second one doesn't have the Sequence ID:

http://mydemo/jspui/bitstream/123456789/454/4/2dExports/stata/dxf/a3_6-10.dxf

http://mydemo/jspui/html/123456789/454/2dExports/stata/dxf/a3_6-10.dxf

There is another problem with Sequence IDs, however.  None of the current
item migration/transfer tools preserve them.  The AIP packager (see the
wiki page about the PLEDGE prototype) managed it.  Currently, if an Item
gets moved from one DSpace to another, the Sequence IDs can get scrambled.

As Richard mentions, bitstream names are not necessarily unique.  Hopefully
the 2.0 data model will include some sort of relationship metadata so
the code can do better than randomly choosing the first bitstream
matching the correct name.

-- Larry


-
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK  win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100url=/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Adding your own customizations into DSpace 1.5.1

2009-01-27 Thread Larry Stone
 Can someone give me the short answer about where to put our local
 customizations for DSpace 1.5.1 and exactly what procedures we need to
 do to have them end up in our application?

This wiki page:
http://wiki.dspace.org/index.php/BuildCookbook

was intended to cover this question -- please consider adding notes about
the jspui and updating it for 1.5.1 if necessary

-- Larry


--
This SF.net email is sponsored by:
SourcForge Community
SourceForge wants to tell your story.
http://p.sf.net/sfu/sf-spreadtheword
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Installing LNI in DSpace 1.5.1

2009-02-17 Thread Larry Stone
The LNI is already included in the 1.5.1 release as a maven package.
There is very scant mention of it in the documentation, nothing about
how to install it and access the webapp.

It's probably already installed on your server.  Is there a webapp
named dspace-lni next to dspace-jspui and the others?  If not, I
can't help, you'll need a maven expert to figure it out.

The wiki page is out of date and incomplete -- it was badly mangled in the
translation from the old wiki.  I recommend you read my preserved copy
of the original doc:

http://web.mit.edu/lcs/www/lni

Just ignore the install instructions; those are for 1.4.  Also search
the DSpace wiki for the simple LNI client.

When you get it working, please consider updating the wiki page to
prevent others from being mislead.

-- Larry


 I have successfully installed DSpace 1.5.1 in an Ubuntu 8.10 machine but no=
 w
 want to use LNI to integrate a PHP application with a DSpace repository via
 SOAP. I read instructions in
 http://wiki.dspace.org/index.php/LightweightNetworkInterface/DownloadsAndCl=
 ients#Install_Serverbut
 I could neither find nor deduce a way to install LNI server component
 in
 my DSpace. Could anybody provide some guidance to do this?

 The page says it can be installed as a plugin, but I cannot find a
 relationship between that statement and the code referred there (
 http://dspace.svn.sourceforge.net/svnroot/dspace/branches/dspace-1_5_x/dspa=
 ce-lni/).
 I did a checkout of that folder and run mvn package which generated a targe=
 t
 folder in earch directory (core, client and webapp). By reading instruction=
 s
 for 1.4 (even though it is stated the installation is quite different) I
 tried to deploy the  war generated in dspace-lni-webapp/target folder but n=
 o
 success when trying to start the application.

 Any clue about the right way of installing LNI in DSpace 1.5.x is really
 appreciate. Thanks in advance.

 Best regards,
 Luis Gal=E1rraga



--
Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA
-OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise
-Strategies to boost innovation and cut costs with open source participation
-Receive a $600 discount off the registration fee with the source code: SFAD
http://p.sf.net/sfu/XcvMzF8H
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Installing LNI in DSpace 1.5.1

2009-02-18 Thread Larry Stone
 Furthermore, my aim is to implement a kind of integration
 between DSpace and J!Research (http://joomla-research.com). That is why it
 is important for me interact with DSpace via SOAP or WebDAV. Thanks a lot.

If you have a choice, I recommend sticking with WebDAV (since
you'll need it anyway for import and export), it avoids all of
the compatibility misery of SOAP.  For an example, see
http://wiki.dspace.org/index.php/Simple_LNI_Client

-- Larry


--
Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA
-OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise
-Strategies to boost innovation and cut costs with open source participation
-Receive a $600 discount off the registration fee with the source code: SFAD
http://p.sf.net/sfu/XcvMzF8H
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] script to validate all PDFs ?

2009-02-24 Thread Larry Stone
 Does anyone have a script that checks all of the previously uploaded
 PDFs and find ones that are malformed and reports their URLs/record IDs?

I think it's most appropriate to do this with the MediaFilter mechanism.
The default DSpace (1.5.1) distribution includes  the plugin:
org.dspace.app.mediafilter.PDFFilter
which extracts text from a PDF.  To do that, it interprets the PDF
contents with PDFBox, which is your asthmatic canary in the gassy
coal mine that is PDF.  You can count on it to keel over even on some files
that are roughly legal and can be rendered by xpdf and Adobe Acrobat Reader.

Running media-filter will log the Handle of failed Items in the DSpace
log.  See the manual for more info.  It's a lot easier, and sounder
practice, to leverage the existing media filter infrastructure than to
go digging into the database and assetstore -- that implementation may
change even in minor releases and configuration changes.

If you want to get more aggressive and precise about validating the PDF,
rather than just ensuring it is probably not corrupt, look into JHOVE
at http://hul.harvard.edu/jhove/ and keep an eye on JHOVE2
http://confluence.ucop.edu/display/JHOVE2Info/Home

-- Larry

 I can see how to write a script that uses the unix command line 'file'
 and 'pdftops' tools to check that every file that looks like a PDF is a
 good and valid PDF. Going from a file on the disk to a database record
 I'm not too sure of.



















 cheers
 stuart
 --
 Stuart Yeates
 http://www.nzetc.org/   New Zealand Electronic Text Centre
 http://researcharchive.vuw.ac.nz/ Institutional Repository

 --
 Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA
 -OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise
 -Strategies to boost innovation and cut costs with open source participation
 -Receive a $600 discount off the registration fee with the source code: SFAD
 http://p.sf.net/sfu/XcvMzF8H
 ___
 DSpace-tech mailing list
 DSpace-tech@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/dspace-tech


--
Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA
-OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise
-Strategies to boost innovation and cut costs with open source participation
-Receive a $600 discount off the registration fee with the source code: SFAD
http://p.sf.net/sfu/XcvMzF8H
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Java Heap dumps during Filter-Media

2009-04-08 Thread Larry Stone
The PDFBox library is _always_ going to be a problem because of its
architecture.  It insists on reading the entire PDF document, images
included, into memory.  This is not necessary, PDF was explicitly
designed to let renderers process a page at a time in limited memory.
Perhaps it could gain a lot by adding a mode where it ignores images
(e.g. for text extraction, it is a complete waste of time to even
read them into memory since it won't be getting any text out of them).

I took a different approach that may be helpful to sites with a lot
of PDF content that is pathological to PDFBox.  I wrote a couple of
filters that invoke the XPDF utilities as external OS-level command
processes to do the dirty work.  They are a bit more complicated to
maintain since they rely on outside programs that have to be installed,
but I've found the xpdf tools to be simple to install and maintain.
The XPDF-based text extractor is about three times as fast as PDFBox and
the only inputs it failed on PDFs were corrupt.  There were also no
issues with heap space since it runs outside of the JVM.

See patch #2745393 for the code:
https://sourceforge.net/tracker/?func=detailaid=2745393group_id=19984atid=319984

-- Larry


--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Java Heap dumps during Filter-Media

2009-04-09 Thread Larry Stone
 Larry, I assume this is a donation to DSpace? If so I'll commit it so its
 available for testing/use in the 1.5.2 release.

Sure, go ahead, although I won't have time to provide better documentation
(for a while at least, maybe ever).  My time on the FACADE project which
produced this code is ending tomorrow, april 10; that's also the end of
my time at MIT.  I'm working desperately to finish other parts of the
project and do not have any time to spend on this, that's why I just
threw it over the wall because it looked like it could be useful right now.

Eventually all of the code I produced for FACADE will be made available
as open source; keep an eye on http://facade.mit.edu/  .. Not sure when
this will happen, though.

I'm not looking at any of the JIRA stuff (don't even have access yet)
so if there's anything there that needs my attention, please send me
personal mail -- I'm deleting anything with JIRA in the subject.
Thanks, and enjoy..

-- Larry

 On Thu, Apr 9, 2009 at 10:56 AM, Graham Triggs 
 gra...@biomedcentral.comwrote:
  Nice work Larry,
 
  I've replaced our PDF text extraction and thumbnail generation with this
  code.
 
  Thankfully, running on Debian, adding the third party tools was as hard as
  apt-get install xpdf ;)
 
  I actually ran into a few more difficulties with the ImageIO libraries -
  it's a pity that you don't get a simple ClassNotFoundException to be able to
  report this more clearly.
 
  But aside from that, my limited tests seem to work quite well.
 
  G
 
  -Original Message-
  From: Larry Stone [mailto:l...@mit.edu]
  Sent: 08 April 2009 22:21
  To: Tim Donohue
  Cc: DSpace Tech; Jeffrey Trimble
  Subject: Re: [Dspace-tech] Java Heap dumps during Filter-Media
 
  The PDFBox library is _always_ going to be a problem because of its
  architecture.  It insists on reading the entire PDF document, images
  included, into memory.  This is not necessary, PDF was explicitly designed
  to let renderers process a page at a time in limited memory.
  Perhaps it could gain a lot by adding a mode where it ignores images
  (e.g. for text extraction, it is a complete waste of time to even read them
  into memory since it won't be getting any text out of them).
 
  I took a different approach that may be helpful to sites with a lot of PDF
  content that is pathological to PDFBox.  I wrote a couple of filters that
  invoke the XPDF utilities as external OS-level command processes to do the
  dirty work.  They are a bit more complicated to maintain since they rely on
  outside programs that have to be installed, but I've found the xpdf tools to
  be simple to install and maintain.
  The XPDF-based text extractor is about three times as fast as PDFBox and
  the only inputs it failed on PDFs were corrupt.  There were also no issues
  with heap space since it runs outside of the JVM.
 
  See patch #2745393 for the code:
 
  https://sourceforge.net/tracker/?func=detailaid=2745393group_id=19984atid=319984
 
 -- Larry
 
 
 
  --
  This SF.net email is sponsored by:
  High Quality Requirements in a Collaborative Environment.
  Download a free trial of Rational Requirements Composer Now!
  http://p.sf.net/sfu/www-ibm-com
  ___
  DSpace-tech mailing list
  DSpace-tech@lists.sourceforge.net
  https://lists.sourceforge.net/lists/listinfo/dspace-tech
 
 
  --
  This SF.net email is sponsored by:
  High Quality Requirements in a Collaborative Environment.
  Download a free trial of Rational Requirements Composer Now!
  http://p.sf.net/sfu/www-ibm-com
  ___
  DSpace-tech mailing list
  DSpace-tech@lists.sourceforge.net
  https://lists.sourceforge.net/lists/listinfo/dspace-tech
 



 --
 Mark R. Diggory
 http://purl.org/net/mdiggory/homepage - Bio
 http://www.atmire.com - Institutional Repository Solutions
 http://www.togather.eu - Before getting together, get t...@ther

 --001636c5b1fac033c2046723300c
 Content-Type: text/html; charset=ISO-8859-1
 Content-Transfer-Encoding: quoted-printable

 Larry, I assume this is a donation to DSpace? If so I#39;ll commit it so i=
 ts available for testing/use in the 1.5.2 release.brbrMarkbrbrbr=
 div class=3Dgmail_quoteOn Thu, Apr 9, 2009 at 10:56 AM, Graham Triggs s=
 pan dir=3Dltrlt;a href=3Dmailto:gra...@biomedcentral.com;gra...@biom=
 edcentral.com/agt;/span wrote:br

 blockquote class=3Dgmail_quote style=3Dborder-left: 1px solid rgb(204, =
 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;Nice work Larry,=
 br
 br
 I#39;ve replaced our PDF text extraction and thumbnail generation with thi=
 s code.br
 br
 Thankfully, running on Debian, adding the third party tools was as hard as =
 quot;apt-get install xpdfquot; ;)br
 br
 I actually ran into a few more difficulties with the ImageIO libraries - it=
 #39

[Dspace-tech] authority control proposal

2009-05-13 Thread Larry Stone
I have to add an authority control mechanism to DSpace for an  
institutional repository, so I'm doing it as modification to the 1.5.2  
source in the hope it will get adopted into 1.6.

To begin discussion, I put up a wiki page about the design:
http://wiki.dspace.org/index.php/Authority_Control_of_Metadata_Values

Since I have to get this into production locally in the fairly near  
future, please read it and respond promptly so there is time to  
consider your comments.  There are also a few opportunities to fill in  
work I will not have time to do (JSPUI support, for example) so let me  
know if you're interested in volunteering to help.

  -- Larry


--
The NEW KODAK i700 Series Scanners deliver under ANY circumstances! Your
production scanning environment may not be a perfect world - but thanks to
Kodak, there's a perfect scanner to get the job done! With the NEW KODAK i700
Series Scanner you'll get full speed at 300 dpi even with all image 
processing features enabled. http://p.sf.net/sfu/kodak-com
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] [Dspace-general] Converte the Bibtex into qualified DC

2009-05-26 Thread Larry Stone
If the bibtex metadata is already in XML, I recommend putting it right  
in the METS manifest in your SIP, and then add a crosswalk to DSpace  
to interpret it.  This way you don't lose any information before  
getting into DSpace, and the crosswalk can translate Bibtex directly  
into the exact metadata fields your DSpace is using.  To do this:

1. make up a different name for the MDTYPE attribute in the METS, e.g.
bibtex, so the manifst looks like:

dmdSec ...
mdWrap ... MDTYPE=bibtex
...the Bibtex metadata

2. add line to your DSpace configuration to tell METS ingester what  
crosswalk to use for MDTYPE=bibtex:

mets.submission.crosswalk.bibtex = bibtex2dspace

3. configure the bibtex2dspace crosswalk, the easiest way is to
implement it as an XSLT stylesheet, e.g. add this to configuration:

crosswalk.submission.bibtex2dspace.stylesheet = crosswalks/bibtex.xsl

...then just write the stylesheet; see the wiki and manual for examples.


hope that helps,
   -- Larry

On May 26, 2009, at 1:51 PM, Robin Wang wrote:

 Hi everyone,
 I wanna use the DSpaceSIP from Larry Stone and converte my bibtex  
 xml file for original pdf file into qualified DC Record that i can  
 insert the DC Record into a Mets Record for DSpace later. i know  
 that i should use the XSLT to do it. but does anyone here know,what  
 kind of a qualified DC should i build into METS

 dmdSec ID=DMD DSpaceID
 mdWrapQualified Dublin Core metadata /mdWrap
 /dmdSec


 dmdSec ID=dmd002
  mdWrap MIMETYPE=text/xml MDTYPE=DC  
 LABEL=Dublin Core Metadata
   xmlData
 dc:titleAlice's Adventures in  
 Wonderland/dc:title
 dc:creatorLewis Carroll/ 
 dc:creator
 dc:datebetween 1872 and 1890/ 
 dc:date
 dc:publisherMcCloughlin  
 Brothers/dc:publisher
 dc:typetext/dc:type
   /xmlData
  /mdWrap
   /dmdSec

 I do know that i should add an exetension schema for the DC, but can  
 anyone tell me, what kind of an exetension schema for qualified DC  
 in dspace should i give?? Or should i use the mods 
 (http://www.loc.gov/standards/mods/v3/mods-3-1.xsd 
 ) 
 xsi:schemaLocation=http://www.loc.gov/mods/v3http://www.loc.gov/standards/mods/v3/mods-3-3.xsd

 http://www.loc.gov/standards/mods/v3/mods-userguide-examples.html#digitized_book

 Which one will be better for LNIClient???

 Thanks

 Robin


 Posteingang immer voll? Der erste Speicher, der mitwächst:  
 Unbegrenzter Speicher bei Windows Live Hotmail!  
 ___
 Dspace-general mailing list
 dspace-gene...@mit.edu
 http://mailman.mit.edu/mailman/listinfo/dspace-general


--
Register Now for Creativity and Technology (CaT), June 3rd, NYC. CaT 
is a gathering of tech-side developers  brand creativity professionals. Meet
the minds behind Google Creative Lab, Visual Complexity, Processing,  
iPhoneDevCamp as they present alongside digital heavyweights like Barbarian 
Group, R/GA,  Big Spaceship. http://p.sf.net/sfu/creativitycat-com 
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


[Dspace-tech] Looking for Autocomplete for the XMLUI

2009-06-03 Thread Larry Stone
Has anyone tried to add an autocomplete feature to the Manakin UI  
(XMLUI)?  By autocomplete, I mean the Web UI trick where a text  
field is checked, as you type, against a list of possible completions  
and automatically filled in, or completed, by inserting text into the  
field or posting a dropdown menu of possible completions.  For  
example, the AjaxTags library has it under Autocomplete, see 
http://ajaxtags.sourceforge.net/ 
  -- but that's specifically for a JSP-based UI.

Autocomplete usually seems to be implemented as part of a  
comprehensive Javascript UI toolkit, and most of those are large,  
complex, standalone, and/or do not play well when integrated with  
another vertical-web-UI-solution like Cocoon.  So, do you know of any  
such toolkit that _is_ easy to integrate with Cocoon and/or Manakin?   
Has anyone developed their own autocomplete?

I've been experimenting with the Yahoo UI widgets 
(http://developer.yahoo.com/yui/ 
); it's well-documented and seems reasonably lightweight and modular,  
but it has some drawbacks. I'm hoping there is somebody already at the  
top of this learning curve who can see over this swamp from such a  
lofty vantage.

thanks,

   -- Larry

--
OpenSolaris 2009.06 is a cutting edge operating system for enterprises 
looking to deploy the next generation of Solaris that includes the latest 
innovations from Sun and the OpenSource community. Download a copy and 
enjoy capabilities such as Networking, Storage and Virtualization. 
Go to: http://p.sf.net/sfu/opensolaris-get
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Looking for Autocomplete for the XMLUI

2009-06-04 Thread Larry Stone
Thanks -- between this and AjaxTags for JSP, there is something of a  
consensus on script.aculo.us -- I'd passed over it at first since it  
looked infernally complicated but on second glance it's no worse than  
YUI, and perhaps more flexible.  It certainly has a different idea of  
AJAX data, it can apparently work from an XHTML fragment.

Thanks also to George, but since this is also for an authority-control  
prototype I'm hoping to use something already supported and maintained  
for the popular browser platforms, so that makes a homebrew solution  
less desireable.

It is actually not too difficult to produce the AJAX responses through  
a special Cocoon pipeline.  I wrote a simple generator that emits SAX  
events and added a pipline that essentially leaves it alone; might be  
easier than adding a servlet.  I'll have that in the prototype.  This  
way, users of other AJAX libraries can just add transformation steps  
to the pipeline to get the XML schema they need.

The XHTML-XHTML translation is a neat trick.  For the prototype I'm  
working with a slightly extended DRI (to add metadata authority values  
since they have to get pre-loaded into forms for the round trip), so  
I'll stick with DRI-XHTML for now.

OBTW, I did notice the Cocoon Forms stuff but figured since the DRI/ 
Wing architecture sidesteps it completely, it would not be productive  
to try to use it.

thanks,

   -- Larry

On Jun 4, 2009, at 12:27 PM, Tim Donohue wrote:

 At Illinois, we are using Scriptaculous (http://script.aculo.us/)  
 with the XMLUI to perform autocomplete on our Subject, Publisher and  
 Author fields.  (For Author, instead of having separate lastname and  
 firstname fields, we combine it all in one field which expects  
 'last, first m' format).

 We actually add in the necessary Javascript code for each of these  
 fields in our Theme by using an XSL which transforms XHTML - XHTML.  
 So, we have our theme's sitemap.xmap setup to do the following:

 (1) Run through our normal DRI - XHTML conversion via XSLT
 (2) If the path matches */*/*/submit/*.continue (i.e. in Submission  
 process), run the resulting XHTML through another XSLT to add in the  
 autocomplete javascript for the necessary form fields.

 On the server side of things, we did have to create a custom Java  
 Servlet (which we registered in the web.xml alongside Cocoon), to  
 process those AJAX requests and return the properly formatted  
 results. So, essentially, our autocomplete functionality is running  
 *outside* of Cocoon.

 There might be an easier way to do all of this...but, this is  
 essentially what we came up with as a relatively quick solution.  If  
 it is of interest, I am willing to share the code we've written for  
 this.

 - Tim


--
OpenSolaris 2009.06 is a cutting edge operating system for enterprises 
looking to deploy the next generation of Solaris that includes the latest 
innovations from Sun and the OpenSource community. Download a copy and 
enjoy capabilities such as Networking, Storage and Virtualization. 
Go to: http://p.sf.net/sfu/opensolaris-get
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Producing mets.xml for SWORD

2009-06-05 Thread Larry Stone
The Harvard METS toolkit gives you an convenient (though DOM-like) API  
for the METS data model, takes care of some of the housekeeping, and  
it has logic to prevent schema mistakes:

http://hul.harvard.edu/mets/

There's also a toolkit that helps you build a whole submission  
package, including the METS. It ought to work with SWORD too; see:
http://wiki.dspace.org/index.php/DSpace_SIP_Toolkit

(If you develop any tricks or procedures for using it with SWORD,  
please add them to that wiki page!)

  -- Larry

On Jun 5, 2009, at 5:25 AM, mikan.d.dspace listmail wrote:

 Im hoping to use SWORD in order to deposit items to DSpace. What would
 be the best way to produce mets-metadata for the items to be
 submitted? Should I write the entire metadata file myself or is there
 templates / frameworks available for this?

 Thanks,
 Mika


--
OpenSolaris 2009.06 is a cutting edge operating system for enterprises 
looking to deploy the next generation of Solaris that includes the latest 
innovations from Sun and the OpenSource community. Download a copy and 
enjoy capabilities such as Networking, Storage and Virtualization. 
Go to: http://p.sf.net/sfu/opensolaris-get
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Export/import of noncontent objects (Epersons, etc.)?

2009-06-17 Thread Larry Stone
I recommend looking at the old AIP prototype (see 
http://wiki.dspace.org/index.php/AipPrototype 
  for doc and fossilized code) for a start on this.  It extended the  
METS implementation to cover Communities, Collections, and every  
aspect of Items, including most administrative metadata. Although it  
stopped short of fully representing the EPerson, Group and Policy  
objects  those would be straightforward to add.  It does show some of  
the issues involved in building a copy of an archive from scratch.

Migrating and mirroring content between repositories was one of the  
use cases for AIPs.  *Every* existing interchange mechanism (batch  
import, packager, etc) loses *some* details of the Item and its child  
objects.  Only the AIP was complete, and iirc it also had a little bug  
or two (e.g. bitstream sequence IDs weren't always restored perfectly).

good luck!

  -- Larry

On Jun 17, 2009, at 4:56 PM, Mark H. Wood wrote:

 We're working with a partner who want to keep a separate test instance
 with content tracking the (sizable) live repository fairly closely.
 The requirement that I've been given is to entirely replace the
 content from live every week or two.

 Deleting 17,000 items (and over 20,000 bitstreams) is an all-day
 operation, and then comes the loading phase.  It would save a lot of
 time if I could export the Community/Collection structure, EPerson and
 Group objects, registries, and anything else that's not an Item,
 Bundle, or Bitstream; drop and recreate the database; empty the
 assetstore and history; reload the noncontent tables; and then begin
 loading.

 So I'm looking at adding export/import for all of those objects,
 probably to XML.  In the case of Community and Collection I guess the
 best thing would be to just do a single exporter producing the same
 XML dialect consumed by the existing Community and Collection
 Structure Importer.  Likewise for the registries, it seems.  The other
 classes would need importers built as well as exporters.  Comments?

 Or is there a smarter way to make a consistent clone of a DSpace
 instance, with its own Handles, that is writable but doesn't affect
 the original?  (The Handle business, plus the need to quiesce the
 production site to ensure consistency across database and assetstore,
 is why I don't just use tar and pg_dump.)

 -- 
 Mark H. Wood, Lead System Programmer   mw...@iupui.edu
 Friends don't let friends publish revisable-form documents.
 --
 Crystal Reports - New Free Runtime and 30 Day Trial
 Check out the new simplified licensing option that enables unlimited
 royalty-free distribution of the report engine for externally facing
 server and web deployment.
 http://p.sf.net/sfu/businessobjects___
 DSpace-tech mailing list
 DSpace-tech@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/dspace-tech


--
Crystal Reports - New Free Runtime and 30 Day Trial
Check out the new simplified licensing option that enables unlimited
royalty-free distribution of the report engine for externally facing 
server and web deployment.
http://p.sf.net/sfu/businessobjects
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Question regarding LNI ingest of Packages and workflow

2009-07-14 Thread Larry Stone
See these pages for an easy way to get started making SIPs.  A package  
is just a manifest plus content files, so it can contain anything so  
long as the METS manifest describes it accurately.

http://wiki.dspace.org/index.php/DSpace_SIP_Toolkit
http://wiki.dspace.org/index.php/Simple_LNI_Client

As for the LNI, when originally written it supported workflow and it's  
in the architecture but I don't know how recently it has been tested.   
Submitting to a collection with workflow is no different, but there  
are also some URLs to access workflow objects. See the original  
documentation here:

http://web.mit.edu/lcs/www/lni/

The version on the wiki is badly mutilated and hard to read, it never  
got repaired after a failed wiki format conversion.

/lcs

On Jul 14, 2009, at 8:01 AM, Vidya R wrote:




 I am testing with DSpace version - 1.5.1 and am working to integrate  
 our LMS to the DSpace repository through LNI. I waould want add  
 different types of content through the interface.
  I have successfully run the LNI with the smoke test and have been  
 successful in ingesting a few contents of PDF type into the  
 repository.
 I find that in order to ingest other contents through LNI, they need  
 to be converted to SIP - conforming to METS / MODs package format.
 Could anyone explain if I can ingest any type of content through  
 METS package format?

 Also I would want to know if I can use LNI to ingest items to  
 collections configured with workflows. I get an error : the item is  
 in state = 3. unknown.
 If I alter the existing LNI code will it be possible for me to  
 ingest items with workflow?
 Workflow of contents is one of our basic requirements. Please reply.

 Thanks
 Vidya


 --
 Enter the BlackBerry Developer Challenge
 This is your chance to win up to $100,000 in prizes! For a limited  
 time,
 vendors submitting new applications to BlackBerry App World(TM) will  
 have
 the opportunity to enter the BlackBerry Developer Challenge. See  
 full prize
 details at: 
 http://p.sf.net/sfu/Challenge___
 DSpace-tech mailing list
 DSpace-tech@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/dspace-tech


--
Enter the BlackBerry Developer Challenge  
This is your chance to win up to $100,000 in prizes! For a limited time, 
vendors submitting new applications to BlackBerry App World(TM) will have
the opportunity to enter the BlackBerry Developer Challenge. See full prize  
details at: http://p.sf.net/sfu/Challenge
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] ORACLE - FATAL org.dspace.storage.rdbms.InitializeDatabase @ Caught Exception

2009-09-02 Thread Larry Stone
This message means there is a problem in your JDBC configuration:  
probably db.url has the wrong hostname or port number.  Also check  
db.username and db.password.  Make sure the same values work with a  
simpler Oracle client like SQLPlus (although you'll have to rearrange  
the URL, it has a different pattern).

/lcs

On Sep 2, 2009, at 12:41 PM, José Eslava wrote:

 java] 2009-09-02 10:25:08,232 FATAL  
 org.dspace.storage.rdbms.InitializeDat
 abase @ Caught exception:
 [java] java.sql.SQLException: Excepci¾n de E/S: The Network  
 Adapter
 could n
 ot establish the connection


--
Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day 
trial. Simplify your report design, integration and deployment - and focus on 
what you do best, core application coding. Discover what's new with 
Crystal Reports now.  http://p.sf.net/sfu/bobj-july
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] work around authentication mechanism

2009-09-10 Thread Larry Stone
If you just want to deny all access based on the requestor's IP  
address, that is best done in the web server or servlet container.  If  
you're using naked Tomcat, see the doc for  
org.apache.catalina.valves.RemoteAddrValve.  If you're using Apache  
httpd it's very easy to configure, just see the server docs.


  -- Larry

On Sep 10, 2009, at 9:50 PM, Van Ly wrote:



Hi,

I may have a situation where one of the items in the list for
`plugin.sequence.org.dspace.eperson.AuthenticationMethod' isn't
behaving as expected.

To work around, if I need to put up a firewall to restrict access
based on ip-address and bypass the authentication mechanism entirely,
what would be a way?

Thanks in advance.

Van Ly
vly at usyd dot edu dot au





--
Let Crystal Reports handle the reporting - Free Crystal Reports 2008  
30-Day
trial. Simplify your report design, integration and deployment - and  
focus on

what you do best, core application coding. Discover what's new with
Crystal Reports now.  http://p.sf.net/sfu/bobj-july
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech




smime.p7s
Description: S/MIME cryptographic signature
--
Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day 
trial. Simplify your report design, integration and deployment - and focus on 
what you do best, core application coding. Discover what's new with 
Crystal Reports now.  http://p.sf.net/sfu/bobj-july___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] SMTP server for mail question

2009-10-06 Thread Larry Stone
Like a lot of things, the answer is that depends.  The SMTP server  
is usually an MTA (Mail Transfer Agent), or post-office server, that  
is expected to be able to receive mail as well as send it.  However,  
when you're part of a big site like my.edu, you can often get away  
with setting up an MTA on your local machine that sends out mail with  
a return address of u...@my.edu, so it does _not_ have to be open to  
listening to the world at large.  It sends the mail out, and if it  
bounces, those bounce messages go to the MTA for my.edu.

I like to set up a local service on the same machine hosting DSpace  
just so that outgoing mail never gets lost because the other machine  
or the network is down.  It can be a simple outgoing-only Sendmail (or  
equivalent Postfix, exim, etc) that stores and forwards outbound  
messages.  It fixes up headers so the mail appears to come from  
my.edu, bounces go back to my.edu, and thus it wouldn't need to have  
port 25 open because nobody should ever be talking to it. In fact,  
that's probably a good idea, since if you did have it open you'd have  
to know how to configure it so as not to relay SPAM.

   -- Larry

On Oct 6, 2009, at 4:06 PM, Jeffrey Trimble wrote:
 Does the SMTP server need to be publicly available for DSpace to  
 work correctly?
 Our networking engineers are opening a firewall for another server  
 and have questioned
 the opening of port 25.

 TIA,

 Jeff

 Jeffrey Trimble
 System LIbrarian
 William F.  Maag Library
 Youngstown State University
 330.941.2483 (Office)
 jtrim...@cc.ysu.edu
 http://www.maag.ysu.edu
 http://digital.maag.ysu.edu
 I must not fear.  Fear is the mind-killer.
 I will permit it to pass over me and through me...
 --Litany against fear

 --
 Come build with us! The BlackBerry(R) Developer Conference in SF, CA
 is the only developer event you need to attend this year. Jumpstart  
 your
 developing skills, take BlackBerry mobile applications to market and  
 stay
 ahead of the curve. Join us from November 9 - 12, 2009. Register now!
 http://p.sf.net/sfu/devconference___
 DSpace-tech mailing list
 DSpace-tech@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/dspace-tech


--
Come build with us! The BlackBerry(R) Developer Conference in SF, CA
is the only developer event you need to attend this year. Jumpstart your
developing skills, take BlackBerry mobile applications to market and stay 
ahead of the curve. Join us from November 9 - 12, 2009. Register now!
http://p.sf.net/sfu/devconference
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Filtering errors with XPDF Tools

2009-10-20 Thread Larry Stone
These errors imply that the JVM cannot access the executable file.   
Since the file exists, make sure the user under whose UID the JVM is  
running has read and execute access to the file /usr/bin/pdftoppm (as  
well as the /usr and /usr/bin directories, although those are usually  
world-rx).  Better yet, use su or sudo to assume the UID under  
which the JVM is running and make sure you can actually run that  
command.


Java IOExceptions have a way of conflating all file-access errors to  
no such file or directory, even when there's e.g. a permission  
problem, which makes it more of a challenge to discover what is really  
wrong.


  -- Larry

On Oct 20, 2009, at 4:54 PM, Kurzenberger, Eric wrote:


Error:

ERROR filtering, skipping bitstream:

  Item Handle: 10538/148
  Bundle Name: ORIGINAL
  File Size: 6898424
  Checksum: 4637dd47354393dc87f3d2b881ec311d (MD5)
  Asset Store: 0
java.io.IOException: Cannot run program /usr/bin/pdftoppm :
java.io.IOException: error=2, No such file or directory
java.io.IOException: Cannot run program /usr/bin/pdftoppm :
java.io.IOException: error=2, No such file or directory
  at java.lang.ProcessBuilder.start(ProcessBuilder.java:459)
  at java.lang.Runtime.exec(Runtime.java:593)
  at java.lang.Runtime.exec(Runtime.java:466)
  at org.dspace.app.mediafilter.XPDF2Thumbnail.getDestinationStream




smime.p7s
Description: S/MIME cryptographic signature
--
Come build with us! The BlackBerry(R) Developer Conference in SF, CA
is the only developer event you need to attend this year. Jumpstart your
developing skills, take BlackBerry mobile applications to market and stay 
ahead of the curve. Join us from November 9 - 12, 2009. Register now!
http://p.sf.net/sfu/devconference___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Differences in db with upgrade and fresh install

2009-12-15 Thread Larry Stone
I just noticed that too, while contemplating a switch from Oracle to  
PostgreSQL.  The Oracle schema also still has the mets_bitstream_id  
column in the Bundle table, which is no longer used and has been  
dropped.

I'll start a JIRA issue to fix this.

(OBTW, before you switch over to Oracle, you ought to be aware that  
many DSpace tables use CLOBs, and those can make life very difficult  
because most of Oracle's own tools (e.g. SQL*Plus) don't work with  
them.)

thanks,

   -- Larry

On Dec 15, 2009, at 5:02 AM, Bram Luyten wrote:

 Hello Francis,

 these tables are indeed deprecated and not in use anymore.

 best regards,

 Bram

 @mire - http://www.atmire.com

 Technologielaan 9 - 3001 Heverlee - Belgium
 533 2nd Street - Encinitas, CA 92024 - USA

 http://www.togather.eu - Before getting together, get t...@ther


 On Tue, Dec 15, 2009 at 10:50 AM, Brouns, Francis francis.bro...@ou.nl 
  wrote:
 Hi,

 We are in the process of upgrading from Dspace 1.4.2 to 1.5.2, at  
 the same time moving from Postgresql to Oracle. When I perform an  
 upgrade from 1.4 to 1.5.2 and run the database_schema_142-15.sql on  
 Postgresql the tables history and history_state are dropped.  
 However, when I do a clean install of an Oracle database from  
 database_schema.sql (1.5.2) the tables history and history_state are  
 created. In the Postgresql database_schema.sql (1.5.2) those tables  
 are not created. It is save to assume that both tables are no longer  
 required?

 Best wishes,
 Francis Brouns
 Open Universiteit Nederland
 --
 Return on Information:
 Google Enterprise Search pays you back
 Get the facts.
 http://p.sf.net/sfu/google-dev2dev
 ___
 DSpace-tech mailing list
 DSpace-tech@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/dspace-tech

 --
 Return on Information:
 Google Enterprise Search pays you back
 Get the facts.
 http://p.sf.net/sfu/google-dev2dev
 ___
 DSpace-tech mailing list
 DSpace-tech@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/dspace-tech


--
This SF.Net email is sponsored by the Verizon Developer Community
Take advantage of Verizon's best-in-class app development support
A streamlined, 14 day to market process makes app distribution fast and easy
Join now and get one step closer to millions of Verizon customers
http://p.sf.net/sfu/verizon-dev2dev 
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech