[Dspace-tech] [SPAM] Identifying items with missing bitstreams in batch
Hi, I ran batch import via Simple Archive Format for my DSpace install since I had over 6000 items from other phased out repositories, however, most of the items lacked files attached to them. As you browse through the items in the collections you encounter There are no files linked to this item. What I require is a way in which to identify all the items that do not have bitstreams associated with them so that I can hand this list to our librarian for him to locate the necessary files. Does such a way exist? Thanks, Mark FAO Somalia -- View this message in context: http://dspace.2283337.n4.nabble.com/Identifying-items-with-missing-bitstreams-in-batch-tp4657146.html Sent from the DSpace - Tech mailing list archive at Nabble.com. -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] [SPAM] Identifying items with missing bitstreams in batch
On Wed, Aug 29, 2012 at 8:32 AM, Mark Kariuki mark.kari...@fao.org wrote: What I require is a way in which to identify all the items that do not have bitstreams associated with them so that I can hand this list to our librarian for him to locate the necessary files. Does such a way exist? Mark, in Discovery, this is a simple query: -fulltext:[* TO *] (without the quotes). The corresponding SQL query turned out to be a bit complicated. The inner query selects items which have bitstreams. The outer query negates it and removes special cases. The negation was simpler to construct than directly querying for items that have bitstreams. SELECT DISTINCT 'http://example.com/handle/'||handle.handle FROM handle,item WHERE handle.handle NOT IN ( -- select all items with bitstreams SELECT DISTINCT handle.handle FROM handle,item, item2bundle, bundle2bitstream WHERE handle.resource_id = item.item_id AND item.item_id = item2bundle.item_id AND item2bundle.bundle_id = bundle2bitstream.bundle_id AND bitstream_id IN (SELECT bitstream_id FROM bitstream) ) -- only handles which are items AND handle.resource_type_id=2 -- handles which belong to existing (not deleted) items AND handle.resource_id IS NOT NULL -- and item is not withdrawn AND handle.resource_id = item.item_id AND item.withdrawn = 'f' ORDER BY 'http://example.com/handle/'||handle.handle The numbers returned with the Solr and the SQL query may not be an exact match, but it should do. Regards, ~~helix84 -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Change text in messages.xml
Hi Helix, For this reason, I was just making changes directly in messages.xml in dspace dir only. But all these days it was working fine, only for the past 2 days its not. So will continue making changes in dspace dir instead of dspace-src dir. In the meanwhile, If you have any other idea to resolve this problem, please let me know. Thanks regards Priya On Wed, Aug 29, 2012 at 12:17 AM, helix84 heli...@centrum.sk wrote: On Wed, Aug 29, 2012 at 12:07 AM, Priya S priya579...@googlemail.com wrote: I made some changes to the text in messages.xml under ~/dspace/modules/xmlui/src/main/webapp/i18n and after mvn package ant update, the changes are not taken place in built/installed messages.xml under ~/dir/webapps/xmlui/i18/ Hi Priya, I'm guessing here because I haven't done it that way for a long time, but I think the dspace-xmlui-lang repository has a higher priority in Maven than the local modules overlay. To test this, try running Maven in offline mode mvn -o package, if I'm right, your changes wouldn't be overwritten. Of course, this may have other side-effects, so the solution right now is to copy the messages.xml files manually from [dspace-src] to [dspace] after ant is run. At least that's what I do. Regards, ~~helix84 -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Change text in messages.xml
Did you test with mvn -o to confirm that that is what causes the problem? Regards, ~~helix84 -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Change text in messages.xml
Hi Helix, Yes, I did test with mvn -o, but earlier also I used to run mvn in offline mode only which never created the problem. Rgds Priya On Wed, Aug 29, 2012 at 3:18 PM, helix84 heli...@centrum.sk wrote: Did you test with mvn -o to confirm that that is what causes the problem? Regards, ~~helix84 -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] data entry errors
I posted this a week ago and no one has responded yet, so I'm trying again: For input fields where it is not possible (or practical) to implement controlled vocabularies or drop down lists, is there a less labour-intensive way of preventing data entry errors? For example: The author of several documents is ABC Statistics Inc., but each document is added by a different ePerson,and each of these people makes a spelling error when filling out the AUTHOR field, so these items appear to have different authors. (ABC Statisitcs, Inc., ABC Statistics, Inc, ABC Statistics, etc.). Originally I thought that this would be a minor issue, easily correctable through raw SQL queries to update the offending fields. Unfortunately, my estimates as to the number of mistakes that would be made has proven to be extremely conservative. I do not want to be responsible for correcting so many entries myself, nor do I want to reject so many entries asking users to match the AUTHOR name that already exists. Does anyone have any ideas? -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
[Dspace-tech] recent submissions 1.8.2
Hi, After a recent upgrade from 1.6.2 to 1.8.2 we seem to have lost the display for the recent submissions at the collection level. The configuration is the same as it was before: Additional configuration for Recent Submissions code # the sort option name (from webui.itemlist.sort-option above) to use for # displaying recent submissions. (this # is used by the Recent Submissions system and any other time based # browse query such as FeedServlet) # recent.submissions.sort-option = dateaccessioned # how many recent submissions should be displayed at any one time recent.submissions.count = 500 # tell the community and collection pages that we are using the Recent # Submissions code plugin.sequence.org.dspace.plugin.CommunityHomeProcessor = \ org.dspace.app.webui.components.RecentCommunitySubmissions plugin.sequence.org.dspace.plugin.CollectionHomeProcessor = \ org.dspace.app.webui.components.RecentCollectionSubmissions but there are no listed items when we look at the collection page. Has anyone encountered this before? Thanks, Dan -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] data entry errors
Hi, i can only think of implementing an Authority Control for that. Anyway, deposit workflow is meant to accomplish that task (validate/correct metadata values). On 29 August 2012 16:22, Darren Arsenault arse...@algonquincollege.comwrote: I posted this a week ago and no one has responded yet, so I'm trying again: For input fields where it is not possible (or practical) to implement controlled vocabularies or drop down lists, is there a less labour-intensive way of preventing data entry errors? For example: The author of several documents is ABC Statistics Inc., but each document is added by a different ePerson,and each of these people makes a spelling error when filling out the AUTHOR field, so these items appear to have different authors. (ABC Statisitcs, Inc., ABC Statistics, Inc, ABC Statistics, etc.). Originally I thought that this would be a minor issue, easily correctable through raw SQL queries to update the offending fields. Unfortunately, my estimates as to the number of mistakes that would be made has proven to be extremely conservative. I do not want to be responsible for correcting so many entries myself, nor do I want to reject so many entries asking users to match the AUTHOR name that already exists. Does anyone have any ideas? -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech -- Thanks, DSpace Department *Lyncode*: Official websitehttp://www.google.com/url?q=http%3A%2F%2Fwww.lyncode.com%2Fsa=Dsntz=1usg=AFrqEzdV8iS6rMxflxnn138XReuRfUG3OQ [image: Follow us on Facebook]http://www.google.com/url?q=http%3A%2F%2Ftwitter.com%2Flyncodesa=Dsntz=1usg=AFrqEzeDuT3ZqMW5uVIA8AoxtTtAeiCX3Q http://www.google.com/url?q=http%3A%2F%2Fwww.facebook.com%2Flyncodesa=Dsntz=1usg=AFrqEzcWXjHa3gKBGLsNVxktapxkiWDnww -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] recent submissions 1.8.2
On Wed, Aug 29, 2012 at 5:53 PM, Daniel Sifton daniel.sif...@viu.ca wrote: After a recent upgrade from 1.6.2 to 1.8.2 we seem to have lost the display for the recent submissions at the collection level. The configuration is the same as it was before: Hi Daniel, did you happen to enable Discovery? Because in Discovery, this is done differently, as described here: https://wiki.duraspace.org/display/DSDOC18/Discovery#Discovery-EnablingDiscovery Regards, ~~helix84 -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Problem with im4java
On Wed, Aug 29, 2012 at 6:55 PM, Daniel Shin danielshin...@gmail.com wrote: The code create a newtest.png picture and save on the images directory. But, it doesn't work in my application. I don't have any java exception about these. What doesn't work? If you have the image saved in [dspace]/webapps/xmlui/themes/Mirage/images/newtest.png, displaying it is just a matter of writing some simple XSL. Do you need help with that? Regards, ~~helix84 -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
[Dspace-tech] Mirage: customising the summary item view
Hi, On the summary item view (actually even the detailed item view), I would like to modify the section that lists the collection(s) to which a given item belongs to. Among other things, I would like to also add the owning community information. which .xsl file should i look at that handles that? I have already looked at the item-view.xsl but it looks to me, it is not the place that holds that information. regards, mushashu -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] XPDF to Thumbnail Preview in DSpace 1.8.2
Just a follow up on my previous email, I ran the pdftoppm manually using this command and got the error below: pdftoppm -q -f 1 -l 1 -r 62 DevelopmentBulletin-73_2009.pdf bleg2 Bogus memory allocation size Link to the pdf file at: http://hdl.handle.net/1885/9207 Has anyone seen this error? Thanks On 29/08/2012, at 2:52 PM, Osama Alkadi wrote: Hi all, We are running dspace 1.8.2/Linux and having some issues with the pdftoppm tool when extracting some PDF's thumbnail. Some properties of the PDF's: - Encoding software includes: Adobe PDF Library, Acrobat Distiller, Acrobat PDFWriter. - Size: varies from 1 to 10 MB. In the logs (in debug mode) throws this after executing filter-media: INFO org.dspace.app.mediafilter.XPDF2Thumbnail @ XPDF2Thumbnail: outPrefix: /tmp/prevu1738144616485715914out ERROR org.dspace.app.mediafilter.XPDF2Thumbnail @ Unable to delete file ERROR org.dspace.app.mediafilter.XPDF2Thumbnail @ PDF conversion proc failed, exit status=1, file=/tmp/DSfilt2694438157933967840.pdf -- Full Filter Name: org.dspace.app.mediafilter.HTMLFilter org.dspace.app.mediafilter.HTMLFilter Full Filter Name: org.dspace.app.mediafilter.WordFilter org.dspace.app.mediafilter.WordFilter Full Filter Name: org.dspace.app.mediafilter.JPEGFilter org.dspace.app.mediafilter.JPEGFilter Full Filter Name: org.dspace.app.mediafilter.XPDF2Text org.dspace.app.mediafilter.XPDF2Text Full Filter Name: org.dspace.app.mediafilter.BrandedPreviewJPEGFilter org.dspace.app.mediafilter.BrandedPreviewJPEGFilter Full Filter Name: org.dspace.app.mediafilter.XPDF2Thumbnail org.dspace.app.mediafilter.XPDF2Thumbnail Full Filter Name: org.dspace.app.mediafilter.PowerPointFilter org.dspace.app.mediafilter.PowerPointFilter FILTERED: bitstream 38802 (item: 1885/8749) and created 'DevelopmentBulletin-73_2009.pdf.txt' ERROR filtering, skipping bitstream: Item Handle: 1885/8749 Bundle Name: ORIGINAL File Size: 1445348 Checksum: 1a1b0472e9361c4a4a00d30846f3e211 (MD5) Asset Store: 0 javax.imageio.IIOException: Can't read input file! javax.imageio.IIOException: Can't read input file! at javax.imageio.ImageIO.read(ImageIO.java:1275) at org.dspace.app.mediafilter.XPDF2Thumbnail.getDestinationStream(XPDF2Thumbnail.java:246) at org.dspace.app.mediafilter.MediaFilterManager.processBitstream(MediaFilterManager.java:746) at org.dspace.app.mediafilter.MediaFilterManager.filterBitstream(MediaFilterManager.java:561) at org.dspace.app.mediafilter.MediaFilterManager.filterItem(MediaFilterManager.java:511) at org.dspace.app.mediafilter.MediaFilterManager.applyFiltersItem(MediaFilterManager.java:479) at org.dspace.app.mediafilter.MediaFilterManager.main(MediaFilterManager.java:353) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) at java.lang.reflect.Method.invoke(Method.java:597) at org.dspace.app.launcher.ScriptLauncher.main(ScriptLauncher.java:183) FILTERED: bitstream 38805 (item: 1885/8749) and created '01whole_Grubb.pdf.txt' FILTERED: bitstream 38805 (item: 1885/8749) and created '01whole_Grubb.pdf.jpg' Updating search index: Strangely, even when running the pdftoppm tool manually I get Bogus memory allocation size error. My JAVA_OPTS is set to -Xmx1024M -Xms128M -XX:PermSize=192M -XX:MaxPermSize=384M Also someone on the mailing list suggested a solution to change a line in XPDF2Thumbnail.java near the line reporting the error . The line was File outf = new File(outPrefix+-01.ppm); and change to File outf = new File(outPrefix+-001.ppm); Unfortunately, this has not worked for me. Any help would be appreciated? Thanks -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/___ DSpace-tech mailing list