Re: [Dspace-tech] Fuzzy search in DSpace
Dear Jayan, Thank you very much for this interesting reference on Lucene Search in DSpace! I forwarded it to my power users. Just one point: you can configure DSpace for either an implicit OR between search terms or an implicit AND. OR is the default. When no patch is applied to add sorting to DSpace, the search result is implicitly sorted from the most relevant (most search terms with most relative frequency) to the least relevant (for instance, one occurrence of one of the search terms). This is just nice with implicit OR and useful for many applications. When AND is choosen as the implicit operator, the relevancy sorting is less relevant (!) and sorting by date is often prefered. Have a nice day! Christophe Jayan Chirayath Kurian a écrit : Hello, It looks quite nice to experiment with different options. http://drtc.isibang.ac.in:8080/jspui/handle/1849/244 The link refers to an interesting write-up. Cheers! Jayan -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Vlastimil Krejcir Sent: Thursday, May 15, 2008 10:49 PM To: [EMAIL PROTECTED]; dspace-tech@lists.sourceforge.net; [EMAIL PROTECTED] Subject: [Dspace-tech] Fuzzy search in DSpace Hi all, maybe I've just discovered something that is well known in the whole DSpace community. I'm not sure if everybody knows that Lucene (and so the DSpace) has fuzzy search. In my opinion this feature is not promoted enough (or not promoted at all). You can use the fuzzy search by adding ~ to query. For example we have an item about the movie Spiderman. So the query spiterman doesn't give us any results whereas spiterman~ give us the right item about the movie (and maybe more items depends on the fuzzy search setting). This can be use also for the thing I personally call cutted of diacritics search. Because it also works for words with diacritics (so krejcir~ gives all items where I'm the author even if there is only my surname with diacritics (Krejčíř) stored. It's not exact because this gives also results which have nothing common with me. On the other hand why not to use it. For details you can consult the Lucene documentation. hope this post might help Vlastik Vlastimil Krejčíř Library and Information Centre, Institute of Computer Science Masaryk University in Brno, Czech Republic Email: krejcir (at) ics (dot) muni (dot) cz Phone: +420 549 49 3872 ICQ: 163963217 Jabber: [EMAIL PROTECTED] - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech begin:vcard fn:Christophe Dupriez n:Dupriez;Christophe org:DESTIN inc. SSEB adr;quoted-printable:;;rue des Palais 44, bo=C3=AEte 1;Bruxelles;;B-1030;Belgique email;internet:[EMAIL PROTECTED] title:Informaticien tel;work:+32/2/216.66.15 tel;fax:+32/2/242.97.25 tel;cell:+32/475.77.62.11 note;quoted-printable:D=C3=A9veloppement de Syst=C3=A8mes de Traitement de l'Information x-mozilla-html:TRUE url:http://www.destin.be version:2.1 end:vcard - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] SRW/U
Hi Claudia, Sometime back we did a test setup of SRW-2.0 on a Linux box. That time, we tried it with dspace 1.4.2. I believe dspace 1.4.2 should work fine with SRW 2.0. Thanks Regards, Karthik --- Claudia Jürgen [EMAIL PROTECTED] wrote: Hi all, can anyone tell me with which DSpace Version the SRW/U 2.0 see http://www.oclc.org/research/software/srw/default.htm is supposed to work? Just set up a test on a DSpace 1.4.1 test instance http://eldorado2.uni-dortmund.de:8080/SRW/search/DSpace which only partially works and does not pass the http://alcme.oclc.org/srw/test/SRUServerTester test. cheers Claudia - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
[Dspace-tech] virus scan for online submission of items in DSpace
Hi!, We have started with our online theses submission system using DSpace. We have on -access virus scanner software installed in our submission server. Since students use LDAP to authenticate and submit online, I was just curious to know while saving items into DSpace Assetstore, the online virus scanner would perform a check since it would be enabled on item access. Whether there is a possibility that the server would be already infected at the time of submission. Please suggest if other alternative exists or the best possible suggestion since the servers started accepting online submissions. The good news is that it has already surpassed 500 online submissions and waiting for approval in the workflow. Any suggestions are greatly appreciated. Thanks, Jayan - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
[Dspace-tech] Maven and multiple 1.5 source dirs
I'm planning on having the 1.5 dspace code for 1.5 installed in multiple directories pointing at a particular repository directory ( where dspace.cfg lives and the assetstore). For example: /dspace-dev-source using the /dsace-dev-repository /dspace-prod-source using the /dspace-prod-repository /dspace-blancoj-source uinsg the /dspace-dev-repositoyr This is the way I had things setup for 1.4.2. Now that in 1.5 we are using maven, if I want to build the code for any of these areas, I assume I just follow the instructions, and say I want to build /dspace-dev-source, I do this: cd /dspace-dev-source/dspace/; mvn package cd /dspace-dev-source/dspace/target/dspace-1.5-build.dir/; ant -Dconfig=/dspace-dev-repository/config/dspace.cfg update and then move the webapp dir to the appropriate tomcat dir etc I've done this just fine with one source directory, but when I have multiple ones, is this the way it will work? I also had to install a jar file for the the one source dir I worked on, and I will need this jar file in the other source dirs, so I assume I will have to do the same install for each source dir? I will look around for some documentation on maven to try to better understand it, but if any one knows of a good site, please let me know. Thank you! Jose - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Maven and multiple 1.5 source dirs
On May 16, 2008, at 7:26 AM, Blanco, Jose wrote: I'm planning on having the 1.5 dspace code for 1.5 installed in multiple directories pointing at a particular repository directory ( where dspace.cfg lives and the assetstore). For example: /dspace-dev-source using the /dsace-dev-repository /dspace-prod-source using the /dspace-prod-repository /dspace-blancoj-source uinsg the /dspace-dev-repositoyr This is the way I had things setup for 1.4.2. Now that in 1.5 we are using maven, if I want to build the code for any of these areas, I assume I just follow the instructions, and say I want to build /dspace-dev-source, I do this: cd /dspace-dev-source/dspace/; mvn package cd /dspace-dev-source/dspace/target/dspace-1.5-build.dir/; ant -Dconfig=/dspace-dev-repository/config/dspace.cfg update Yes, this will work initially... However, I recommend that you'll even want o be maintaining your configs changes in that original / dspace-stage-source and that this would include your /dspace- stage-source/configs/dspace.cfg, if you do it this way, then you don't need the -Dconfig and instead you would run init_configs instead... cd /dspace-dev-source/dspace/target/dspace-1.5-build.dir/; ant init_configs update and then move the webapp dir to the appropriate tomcat dir etc why not just add a Host entry in your tomcat server.xml for each of your /dspace-stage-repository/webapps? Then you don't need to move anything. I've done this just fine with one source directory, but when I have multiple ones, is this the way it will work? I also had to install a jar file for the the one source dir I worked on, and I will need this jar file in the other source dirs, so I assume I will have to do the same install for each source dir? The same maven repository would be used in each build, so I assume you will be fine installing it once. I will look around for some documentation on maven to try to better understand it, but if any one knows of a good site, please let me know. I think your approach is sound. But I do fear you may find yourself moving changes back and forth between your stages an awful lot, and this may turn out to be arduous... Using a svn repository, branching and tagging, might save you from that pain of tracking your customizations across the stages. Cheers, Mark ~ Mark R. Diggory - DSpace Developer and Systems Manager MIT Libraries, Systems and Technology Services Massachusetts Institute of Technology - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] virus scan for online submission of items in DSpace
Hi Jayan, We have started with our online theses submission system using DSpace. We have on -access virus scanner software installed in our submission server. Since students use LDAP to authenticate and submit online, I was just curious to know while saving items into DSpace Assetstore, the online virus scanner would perform a check since it would be enabled on item access. Whether there is a possibility that the server would be already infected at the time of submission. Please suggest if other alternative exists or the best possible suggestion since the servers started accepting online submissions. The good news is that it has already surpassed 500 online submissions and waiting for approval in the workflow. Any suggestions are greatly appreciated. I suspect you are OK with the way you are doing it. The file first gets uploaded to [dspace]/uploads/ temporarily until it is finally ingested into the assetstore. You might find enabling 'on write scanning' helps. If it doesn't get written there because your AV software catches it, you'll probably get a DSpace warning saying something along the lines of Upload failed as you would if for example your tomcat user couldn't write to that directory. Try using the EICAR test virus file ( http://www.eicar.org/anti_virus_test_file.htm ) to see how it performs. I'd be interested to hear your results. Thanks, Stuart _ Gwasanaethau Gwybodaeth Information Services Prifysgol Aberystwyth Aberystwyth University E-bost / E-mail: [EMAIL PROTECTED] Ffon / Tel: (01970) 622860 _ - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] SRW/U
Just to let you know I've not been ignoring this: Claudia and I have exchanged some emails. I tried just sending her an updated jar but with no success. I've installed a DSpace-1.4.2 on my machine and tested it with my copy of my SRU server. It seems to work just fine. I've created a war file with all my latest code (http://pubserv.oclc.org/srw/SRW.war) and asked her to try it out. I'll report back when I hear something from her. I've also installed DSpace-1.5 on my machine and noticed what look to be gratuitous class name changes (e.g. BrowseScope changed to BrowserScope), so I'm going to have to fork a branch in my code for 1.4.2 and make changes to support 1.5 on my trunk. I'll let you know when I have SRU running for 1.5. Ralph -Original Message- From: [EMAIL PROTECTED] [mailto:dspace-tech- [EMAIL PROTECTED] On Behalf Of Karthik Dathathri Sent: Friday, May 16, 2008 9:48 AM To: Claudia Jürgen Cc: dspace-tech@lists.sourceforge.net Subject: Re: [Dspace-tech] SRW/U Hi Claudia, Sometime back we did a test setup of SRW-2.0 on a Linux box. That time, we tried it with dspace 1.4.2. I believe dspace 1.4.2 should work fine with SRW 2.0. Thanks Regards, Karthik --- Claudia Jürgen [EMAIL PROTECTED] wrote: Hi all, can anyone tell me with which DSpace Version the SRW/U 2.0 see http://www.oclc.org/research/software/srw/default.htm is supposed to work? Just set up a test on a DSpace 1.4.1 test instance http://eldorado2.uni-dortmund.de:8080/SRW/search/DSpace which only partially works and does not pass the http://alcme.oclc.org/srw/test/SRUServerTester test. cheers Claudia --- -- This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/ javaone ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech --- -- This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] SRW/U
Hi Ralph, thanx for updated code, gonna try it next week, I'm allready in the weekend. sunny greetings Claudia Just to let you know I've not been ignoring this: Claudia and I have exchanged some emails. I tried just sending her an updated jar but with no success. I've installed a DSpace-1.4.2 on my machine and tested it with my copy of my SRU server. It seems to work just fine. I've created a war file with all my latest code (http://pubserv.oclc.org/srw/SRW.war) and asked her to try it out. I'll report back when I hear something from her. I've also installed DSpace-1.5 on my machine and noticed what look to be gratuitous class name changes (e.g. BrowseScope changed to BrowserScope), so I'm going to have to fork a branch in my code for 1.4.2 and make changes to support 1.5 on my trunk. I'll let you know when I have SRU running for 1.5. Ralph -Original Message- From: [EMAIL PROTECTED] [mailto:dspace-tech- [EMAIL PROTECTED] On Behalf Of Karthik Dathathri Sent: Friday, May 16, 2008 9:48 AM To: Claudia Jürgen Cc: dspace-tech@lists.sourceforge.net Subject: Re: [Dspace-tech] SRW/U Hi Claudia, Sometime back we did a test setup of SRW-2.0 on a Linux box. That time, we tried it with dspace 1.4.2. I believe dspace 1.4.2 should work fine with SRW 2.0. Thanks Regards, Karthik --- Claudia Jürgen [EMAIL PROTECTED] wrote: Hi all, can anyone tell me with which DSpace Version the SRW/U 2.0 see http://www.oclc.org/research/software/srw/default.htm is supposed to work? Just set up a test on a DSpace 1.4.1 test instance http://eldorado2.uni-dortmund.de:8080/SRW/search/DSpace which only partially works and does not pass the http://alcme.oclc.org/srw/test/SRUServerTester test. cheers Claudia --- -- This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/ javaone ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech --- -- This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
[Dspace-tech] LNI - no index.html; The requested resource (/lni/) is not available
lni is right there in the Tomcat webapps folder right beside xmlui, which works like a champ. I'm not experienced with web apps, and even less so with java web apps, so I'm at something of a loss here. I've spent a while reading and trying to figure out the problem, with not much success. Mostly, it seems like there should be an index.html file in the WEB-INF directory, but there's not one. Anybody got lni working who'd like to give me a hint? - Rick - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
[Dspace-tech] Usning tomcat 5
I'm trying out an installation of tomcat 5, and I'm getting INFO: Deploying web application archive dspace.war May 16, 2008 4:17:56 PM org.apache.catalina.core.StandardContext processTlds SEVERE: Error reading tld listeners javax.servlet.ServletException: Exception processing TLD at resource path /WEB-INF/fmt.tld in context /dspace javax.servlet.ServletException: Exception processing TLD at resource path /WEB-INF/fmt.tld in context /dspace at org.apache.catalina.startup.TldConfig.tldScanTld(TldConfig.java:555) at org.apache.catalina.startup.TldConfig.execute(TldConfig.java:301) at org.apache.catalina.core.StandardContext.processTlds(StandardContext.jav a:4307) at org.apache.catalina.core.StandardContext.start(StandardContext.java:4144 ) at org.apache.catalina.core.ContainerBase.addChildInternal(ContainerBase.ja va:760) This is with dsapce 1.4.2 Thanks! Jose - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] LNI - no index.html; The requested resource (/lni/) is not available
not elegant or intuitive... http://host/lni/lni And you need the client to interact because it requires POST. -Mark On May 16, 2008, at 12:52 PM, Rick Runyan wrote: lni is right there in the Tomcat webapps folder right beside xmlui, which works like a champ. I’m not experienced with web apps, and even less so with java web apps, so I’m at something of a loss here. I’ve spent a while reading and trying to figure out the problem, with not much success. Mostly, it seems like there should be an index.html file in the WEB-INF directory, but there’s not one. Anybody got lni working who’d like to give me a hint? - Rick -- --- This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] LNI - no index.html; The requested resource (/lni/) is not available
Actually, it responds to GET on some URIs, but the LNI is *not* meant to be used as an interactive web site; it is a WebDAV server. WebDAV happens to use the HTTP protocol but not in a way that gets along with the subset (and perversions) of HTTP most browsers speak. It also has some ability to converse in SOAP (for a subset of its functions) but that is even less browser-friendly. For all the details and leads, see the LNI documentation at http://web.mit.edu/lcs/www/lni/ You can also download a sample SOAP client there. For some reason, the client utilities and sample client were not included in 1.5. -- Larry not elegant or intuitive... http://host/lni/lni And you need the client to interact because it requires POST. -Mark On May 16, 2008, at 12:52 PM, Rick Runyan wrote: lni is right there in the Tomcat webapps folder right beside xmlui, = which works like a champ. I=92m not experienced with web apps, and = even less so with java web apps, so I=92m at something of a loss = here. I=92ve spent a while reading and trying to figure out the = problem, with not much success. Mostly, it seems like there should = be an index.html file in the WEB-INF directory, but there=92s not one. Anybody got lni working who=92d like to give me a hint? - Rick -- = --- This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ = ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech - This SF.net email is sponsored by: Microsoft = Defy all challenges. Microsoft(R) Visual Studio 2008. = http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech