[Dspace-tech] [SPAM] Identifying items with missing bitstreams in batch

2012-08-29 Thread Mark Kariuki
Hi,

I ran a batch import via Simple Archive Format for my DSpace install, since I
had over 6000 items from other phased-out repositories. However, most of the
items have no files attached to them.
As you browse through the items in the collections, you encounter "There are
no files linked to this item."

What I require is a way in which to identify all the items that do not have
bitstreams associated with them so that I can hand this list to our
librarian for him to locate the necessary files. Does such a way exist?

Thanks,
Mark
FAO Somalia



--
View this message in context: 
http://dspace.2283337.n4.nabble.com/Identifying-items-with-missing-bitstreams-in-batch-tp4657146.html
Sent from the DSpace - Tech mailing list archive at Nabble.com.

--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] [SPAM] Identifying items with missing bitstreams in batch

2012-08-29 Thread helix84
On Wed, Aug 29, 2012 at 8:32 AM, Mark Kariuki mark.kari...@fao.org wrote:
 What I require is a way in which to identify all the items that do not have
 bitstreams associated with them so that I can hand this list to our
 librarian for him to locate the necessary files. Does such a way exist?

Mark, in Discovery, this is a simple query: "-fulltext:[* TO *]"
(without the quotes).

The corresponding SQL query turned out to be a bit complicated. The
inner query selects items which have bitstreams. The outer query
negates it and removes special cases. The negation was simpler to
construct than directly querying for items that have no bitstreams.

SELECT DISTINCT 'http://example.com/handle/'||handle.handle
FROM handle,item
WHERE handle.handle NOT IN (
  -- select all items with bitstreams
  SELECT DISTINCT handle.handle
  FROM handle,item, item2bundle, bundle2bitstream
  WHERE handle.resource_id = item.item_id
  AND item.item_id = item2bundle.item_id
  AND item2bundle.bundle_id = bundle2bitstream.bundle_id
  AND bitstream_id IN (SELECT bitstream_id FROM bitstream)
)
-- only handles which are items
AND handle.resource_type_id=2
-- handles which belong to existing (not deleted) items
AND handle.resource_id IS NOT NULL
-- and item is not withdrawn
AND handle.resource_id = item.item_id
AND item.withdrawn = 'f'
ORDER BY 'http://example.com/handle/'||handle.handle

The numbers returned with the Solr and the SQL query may not be an
exact match, but it should do.

Regards,
~~helix84
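
For anyone who wants to try the "negate the items that have bitstreams" idea without touching a live database, here is a minimal sketch in Python against an in-memory SQLite database. Everything in it is an assumption made for illustration: the toy schema only borrows the table and column names used in the query above, SQLite stands in for PostgreSQL (so withdrawn is 0/1 rather than 'f'/'t'), the handles are made up, and the check against the bitstream table is left out. It uses NOT EXISTS instead of NOT IN, which is just one other way to write the same negation.

```python
import sqlite3

# Toy schema: only the columns the query needs, named after the tables
# referenced in the SQL above. This is not the real DSpace schema.
SCHEMA = """
CREATE TABLE item (item_id INTEGER PRIMARY KEY, withdrawn INTEGER);
CREATE TABLE handle (handle TEXT, resource_type_id INTEGER, resource_id INTEGER);
CREATE TABLE item2bundle (item_id INTEGER, bundle_id INTEGER);
CREATE TABLE bundle2bitstream (bundle_id INTEGER, bitstream_id INTEGER);
"""

QUERY = """
SELECT handle.handle
FROM handle
JOIN item ON item.item_id = handle.resource_id
WHERE handle.resource_type_id = 2      -- only handles which are items
  AND item.withdrawn = 0               -- and the item is not withdrawn
  AND NOT EXISTS (                     -- no bundle of this item holds a bitstream
        SELECT 1
        FROM item2bundle
        JOIN bundle2bitstream
          ON bundle2bitstream.bundle_id = item2bundle.bundle_id
        WHERE item2bundle.item_id = item.item_id)
ORDER BY handle.handle
"""


def items_without_bitstreams(conn):
    """Return the handles of non-withdrawn items that have no bitstreams."""
    return [row[0] for row in conn.execute(QUERY)]


def build_demo_db():
    """Create an in-memory database with a few made-up rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    # item 1 has a file, item 2 has none, item 3 is withdrawn and has none
    conn.executemany("INSERT INTO item VALUES (?, ?)", [(1, 0), (2, 0), (3, 1)])
    conn.executemany(
        "INSERT INTO handle VALUES (?, ?, ?)",
        [("123/1", 2, 1), ("123/2", 2, 2), ("123/3", 2, 3),
         ("123/9", 3, 1)],  # type 3 here stands for a non-item handle
    )
    conn.executemany("INSERT INTO item2bundle VALUES (?, ?)", [(1, 10)])
    conn.executemany("INSERT INTO bundle2bitstream VALUES (?, ?)", [(10, 100)])
    return conn


if __name__ == "__main__":
    for handle in items_without_bitstreams(build_demo_db()):
        print("http://example.com/handle/" + handle)
```

On a real installation the SQL would be run in PostgreSQL directly; the Python wrapper is only there to make the sketch self-contained.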



Re: [Dspace-tech] Change text in messages.xml

2012-08-29 Thread Priya S
Hi Helix,

For this reason, I was making changes directly to messages.xml in the
dspace dir only. That worked fine all along; only for the past 2 days
it has not.

So I will continue making changes in the dspace dir instead of the
dspace-src dir.

In the meantime, if you have any other ideas for resolving this problem,
please let me know.

Thanks & regards
Priya

On Wed, Aug 29, 2012 at 12:17 AM, helix84 heli...@centrum.sk wrote:

 On Wed, Aug 29, 2012 at 12:07 AM, Priya S priya579...@googlemail.com
 wrote:
  I made some changes to the text in messages.xml under
  ~/dspace/modules/xmlui/src/main/webapp/i18n, and after mvn package and ant
  update the changes do not show up in the built/installed messages.xml
  under ~/dir/webapps/xmlui/i18/

 Hi Priya,

 I'm guessing here because I haven't done it that way for a long time,
 but I think the dspace-xmlui-lang repository has a higher priority in
 Maven than the local modules overlay. To test this, try running Maven
 in offline mode ("mvn -o package"). If I'm right, your changes won't
 be overwritten. Of course, this may have other side effects, so the
 solution right now is to copy the messages.xml files manually from
 [dspace-src] to [dspace] after ant is run. At least that's what I do.

 Regards,
 ~~helix84
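
The manual copy step described above could be scripted so it is not forgotten after each ant run. The following is only a rough sketch in Python: the two relative paths are assumptions pieced together from the directories mentioned in this thread (in particular, the target directory is assumed to be i18n), and copy_messages is a made-up helper name, not part of DSpace.

```python
import shutil
from pathlib import Path

# Assumed locations, relative to [dspace-src] and [dspace] respectively.
SOURCE_RELATIVE = Path("dspace/modules/xmlui/src/main/webapp/i18n/messages.xml")
TARGET_RELATIVE = Path("webapps/xmlui/i18n/messages.xml")


def copy_messages(src_root, install_root):
    """Copy the customised messages.xml over the installed one.

    Raises FileNotFoundError if the source file or the target directory
    does not exist, so a wrong path is noticed instead of silently ignored.
    """
    source = Path(src_root) / SOURCE_RELATIVE
    target = Path(install_root) / TARGET_RELATIVE
    shutil.copy2(source, target)  # copy2 also preserves the timestamps
    return target
```

It would be called right after ant update, for example copy_messages("/path/to/dspace-src", "/path/to/dspace"), with both paths adjusted to the local layout.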



Re: [Dspace-tech] Change text in messages.xml

2012-08-29 Thread helix84
Did you test with mvn -o to confirm that that is what causes the problem?

Regards,
~~helix84



Re: [Dspace-tech] Change text in messages.xml

2012-08-29 Thread Priya S
Hi Helix,

Yes, I did test with mvn -o. Even before this, I only ever ran mvn in
offline mode, and it never caused the problem.

Rgds
Priya

On Wed, Aug 29, 2012 at 3:18 PM, helix84 heli...@centrum.sk wrote:

 Did you test with mvn -o to confirm that that is what causes the problem?

 Regards,
 ~~helix84



Re: [Dspace-tech] data entry errors

2012-08-29 Thread Darren Arsenault
I posted this a week ago and no one has responded yet, so I'm trying again:

For input fields where it is not possible (or practical) to implement
controlled vocabularies or drop-down lists, is there a less labour-intensive
way of preventing data entry errors? For example: the author of several
documents is "ABC Statistics Inc.", but each document is added by a different
ePerson, and each of these people makes a spelling error when filling out the
AUTHOR field, so these items appear to have different authors ("ABC
Statisitcs, Inc.", "ABC Statistics, Inc", "ABC Statistics", etc.).

Originally I thought that this would be a minor issue, easily correctable
through raw SQL queries to update the offending fields. Unfortunately, my
estimate of the number of mistakes that would be made has proven to be
extremely conservative. I do not want to be responsible for correcting so many
entries myself, nor do I want to reject so many entries and ask users to match
the AUTHOR name that already exists.

Does anyone have any ideas?





[Dspace-tech] recent submissions 1.8.2

2012-08-29 Thread Daniel Sifton
Hi,

After a recent upgrade from 1.6.2 to 1.8.2 we seem to have lost the
display for the recent submissions at the collection level. The
configuration is the same as it was before:

# Additional configuration for Recent Submissions code

# the sort option name (from webui.itemlist.sort-option above) to use for
# displaying recent submissions.  (this
# is used by the Recent Submissions system and any other time based
# browse query such as FeedServlet)
#
recent.submissions.sort-option = dateaccessioned

# how many recent submissions should be displayed at any one time
recent.submissions.count = 500

# tell the community and collection pages that we are using the Recent
# Submissions code
plugin.sequence.org.dspace.plugin.CommunityHomeProcessor = \
        org.dspace.app.webui.components.RecentCommunitySubmissions

plugin.sequence.org.dspace.plugin.CollectionHomeProcessor = \
        org.dspace.app.webui.components.RecentCollectionSubmissions

but there are no listed items when we look at the collection page. Has
anyone encountered this before?

Thanks,

Dan


Re: [Dspace-tech] data entry errors

2012-08-29 Thread DSpace @ Lyncode
Hi,

I can only think of implementing authority control for that.
In any case, the deposit workflow is meant to accomplish that task
(validating and correcting metadata values).
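
Short of full authority control, a reviewer or a cleanup script could at least flag near-duplicate author names automatically. The sketch below is only an illustration in Python using the standard difflib module; it is not a DSpace feature, and the function names, the sample list of known authors, and the 0.8 similarity cutoff are all assumptions chosen for the example.

```python
import difflib
import re


def normalize(name):
    """Lowercase, drop punctuation and collapse whitespace."""
    cleaned = re.sub(r"[^\w\s]", "", name.lower())
    return " ".join(cleaned.split())


def suggest_canonical(entered, known_authors, cutoff=0.8):
    """Return the known author that most resembles `entered`, or None.

    Names are compared in normalized form, so differences in case,
    commas and full stops do not count as mismatches.
    """
    by_key = {normalize(author): author for author in known_authors}
    matches = difflib.get_close_matches(
        normalize(entered), list(by_key), n=1, cutoff=cutoff
    )
    return by_key[matches[0]] if matches else None


if __name__ == "__main__":
    known = ["ABC Statistics Inc.", "Algonquin College"]
    for typed in ["ABC Statisitcs, Inc.", "ABC Statistics, Inc", "ABC Statistics"]:
        print(typed, "->", suggest_canonical(typed, known))
```

A suggestion like this still needs a human to confirm it; a cutoff that is too low would merge authors that really are different.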

On 29 August 2012 16:22, Darren Arsenault arse...@algonquincollege.com wrote:

 I posted this a week ago and no one has responded yet, so I'm trying again:

 For input fields where it is not possible (or practical) to implement
 controlled vocabularies or drop-down lists, is there a less
 labour-intensive way of preventing data entry errors? For example: the
 author of several documents is "ABC Statistics Inc.", but each document is
 added by a different ePerson, and each of these people makes a spelling
 error when filling out the AUTHOR field, so these items appear to have
 different authors ("ABC Statisitcs, Inc.", "ABC Statistics, Inc", "ABC
 Statistics", etc.).

 Originally I thought that this would be a minor issue, easily correctable
 through raw SQL queries to update the offending fields. Unfortunately, my
 estimate of the number of mistakes that would be made has proven to be
 extremely conservative. I do not want to be responsible for correcting so
 many entries myself, nor do I want to reject so many entries and ask users
 to match the AUTHOR name that already exists.



 Does anyone have any ideas?








-- 
Thanks,
DSpace Department
Lyncode: http://www.lyncode.com/


Re: [Dspace-tech] recent submissions 1.8.2

2012-08-29 Thread helix84
On Wed, Aug 29, 2012 at 5:53 PM, Daniel Sifton daniel.sif...@viu.ca wrote:
 After a recent upgrade from 1.6.2 to 1.8.2 we seem to have lost the display
 for the recent submissions at the collection level. The configuration is the
 same as it was before:

Hi Daniel,

did you happen to enable Discovery? Because in Discovery, this is done
differently, as described here:

https://wiki.duraspace.org/display/DSDOC18/Discovery#Discovery-EnablingDiscovery

Regards,
~~helix84



Re: [Dspace-tech] Problem with im4java

2012-08-29 Thread helix84
On Wed, Aug 29, 2012 at 6:55 PM, Daniel Shin danielshin...@gmail.com wrote:
 The code creates a newtest.png picture and saves it in the images directory,
 but it doesn't work in my application.
 I don't get any Java exception about this.

What doesn't work? If you have the image saved in
[dspace]/webapps/xmlui/themes/Mirage/images/newtest.png, displaying it
is just a matter of writing some simple XSL. Do you need help with
that?

Regards,
~~helix84



[Dspace-tech] Mirage: customising the summary item view

2012-08-29 Thread Mushashu Mwansa Lumpa
Hi,

On the summary item view (and the detailed item view as well), I would
like to modify the section that lists the collection(s) to which a given
item belongs. Among other things, I would like to add the owning
community information. Which .xsl file handles that section?
I have already looked at item-view.xsl, but it does not seem to be
the place that holds that information.

regards,

mushashu


Re: [Dspace-tech] XPDF to Thumbnail Preview in DSpace 1.8.2

2012-08-29 Thread Osama Alkadi
Just a follow up on my previous email, I ran the pdftoppm manually using this 
command and got the error below:

pdftoppm -q -f 1 -l 1 -r 62 DevelopmentBulletin-73_2009.pdf bleg2
Bogus memory allocation size

Link to the pdf file at: http://hdl.handle.net/1885/9207

Has anyone seen this error?

Thanks


On 29/08/2012, at 2:52 PM, Osama Alkadi wrote:

 Hi all,
 
 We are running DSpace 1.8.2 on Linux and having some issues with the pdftoppm
 tool when extracting thumbnails from some PDFs.
 
 Some properties of the PDFs:
 
 - Encoding software includes: Adobe PDF Library, Acrobat Distiller, Acrobat
 PDFWriter.
 - Size: varies from 1 to 10 MB.
 
 The logs (in debug mode) show this after running filter-media:
 
 INFO  org.dspace.app.mediafilter.XPDF2Thumbnail @ XPDF2Thumbnail: outPrefix: /tmp/prevu1738144616485715914out
 ERROR org.dspace.app.mediafilter.XPDF2Thumbnail @ Unable to delete file
 ERROR org.dspace.app.mediafilter.XPDF2Thumbnail @ PDF conversion proc failed, exit status=1, file=/tmp/DSfilt2694438157933967840.pdf
 --
 Full Filter Name: org.dspace.app.mediafilter.HTMLFilter
 org.dspace.app.mediafilter.HTMLFilter
 Full Filter Name: org.dspace.app.mediafilter.WordFilter
 org.dspace.app.mediafilter.WordFilter
 Full Filter Name: org.dspace.app.mediafilter.JPEGFilter
 org.dspace.app.mediafilter.JPEGFilter
 Full Filter Name: org.dspace.app.mediafilter.XPDF2Text
 org.dspace.app.mediafilter.XPDF2Text
 Full Filter Name: org.dspace.app.mediafilter.BrandedPreviewJPEGFilter
 org.dspace.app.mediafilter.BrandedPreviewJPEGFilter
 Full Filter Name: org.dspace.app.mediafilter.XPDF2Thumbnail
 org.dspace.app.mediafilter.XPDF2Thumbnail
 Full Filter Name: org.dspace.app.mediafilter.PowerPointFilter
 org.dspace.app.mediafilter.PowerPointFilter
 FILTERED: bitstream 38802 (item: 1885/8749) and created 'DevelopmentBulletin-73_2009.pdf.txt'
 ERROR filtering, skipping bitstream:
 
   Item Handle: 1885/8749
   Bundle Name: ORIGINAL
   File Size: 1445348
   Checksum: 1a1b0472e9361c4a4a00d30846f3e211 (MD5)
   Asset Store: 0
 javax.imageio.IIOException: Can't read input file!
 javax.imageio.IIOException: Can't read input file!
   at javax.imageio.ImageIO.read(ImageIO.java:1275)
   at org.dspace.app.mediafilter.XPDF2Thumbnail.getDestinationStream(XPDF2Thumbnail.java:246)
   at org.dspace.app.mediafilter.MediaFilterManager.processBitstream(MediaFilterManager.java:746)
   at org.dspace.app.mediafilter.MediaFilterManager.filterBitstream(MediaFilterManager.java:561)
   at org.dspace.app.mediafilter.MediaFilterManager.filterItem(MediaFilterManager.java:511)
   at org.dspace.app.mediafilter.MediaFilterManager.applyFiltersItem(MediaFilterManager.java:479)
   at org.dspace.app.mediafilter.MediaFilterManager.main(MediaFilterManager.java:353)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at org.dspace.app.launcher.ScriptLauncher.main(ScriptLauncher.java:183)
 FILTERED: bitstream 38805 (item: 1885/8749) and created '01whole_Grubb.pdf.txt'
 FILTERED: bitstream 38805 (item: 1885/8749) and created '01whole_Grubb.pdf.jpg'
 Updating search index:
 
 Strangely, even when running the pdftoppm tool manually I get a "Bogus memory
 allocation size" error. My JAVA_OPTS is set to -Xmx1024M -Xms128M
 -XX:PermSize=192M -XX:MaxPermSize=384M
 
 Also, someone on the mailing list suggested changing a line in
 XPDF2Thumbnail.java near the line reporting the error. The line was
 
 File outf = new File(outPrefix+"-01.ppm");
 and the suggested change is
 File outf = new File(outPrefix+"-001.ppm");
 
 Unfortunately, this has not worked for me. Any help would be appreciated.
 
 Thanks
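
The suggested patch swaps one hard-coded suffix for another, which suggests the width of the page number that pdftoppm appends can differ between builds (that is an assumption drawn from the two suffixes quoted above, not something confirmed in this thread). One way to avoid guessing is to look for whatever .ppm file was actually written under the output prefix. The sketch below shows the idea in Python purely for illustration; the real change would have to be made in XPDF2Thumbnail.java, and find_ppm_output is a made-up helper name.

```python
import glob


def find_ppm_output(out_prefix):
    """Return the first '<out_prefix>-<page>.ppm' file on disk, or None.

    Matching on a wildcard avoids hard-coding how many digits the
    converter used for the page number.
    """
    pattern = glob.escape(out_prefix) + "-*.ppm"
    candidates = sorted(glob.glob(pattern))
    return candidates[0] if candidates else None
```

If pdftoppm itself exits with an error before writing anything, as the "Bogus memory allocation size" message suggests, there is no file to find and this returns None, so the failure would need to be handled before the image is read.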
