Yes.

Also, what version of DSpace are you running?


Sue Walker-Thornton
(w):  (757) 864-2368
(m):  (757) 506-9903

From: Paul Go [mailto:pgo...@gmail.com]
Sent: Friday, February 10, 2012 2:00 PM
To: Brian Freels-Stendel
Cc: Thornton, Susan M. (LARC-B702)[LITES]; DSpace General Mailing List; Dspace 
Tech list
Subject: Re: [Dspace-tech] Full text search reindexing

What would happen if I deleted all of the Extracted Text files for the items 
that have redacted material?  Will new extracted text files be generated the 
next time the jobs are run?

Paul Go

Systems Librarian /
Library Technology Manager
Paul V. Galvin Library
35 West 33rd Street
Chicago, IL  60616
312.567.7997
p...@iit.edu<mailto:p...@iit.edu>


On Fri, Feb 10, 2012 at 12:52 PM, Brian Freels-Stendel 
<bfre...@unm.edu<mailto:bfre...@unm.edu>> wrote:
Hi Paul,

I'm going to hope that DSIndexer can do the trick.

If not, though, there is still a last-resort, manual option.  Signed in
as an administrator, you can edit item bitstreams individually.  The
extracted texts will show up under the TEXT Bundle, under the name
[originalFileName].txt.  If your new files are named the same as the old
ones were, you may see two .txt files with the same name.  If so, you
can either view each to see which is the bad one, or you could delete
them both and re-index.

B--

>>> On 2/10/2012 at 11:41 AM, in message
<CAGD4+EtT=GRH+H3QX-EJEzmVh=Z=s960fkfyffujlykgaig...@mail.gmail.com<mailto:s960fkfyffujlykgaig...@mail.gmail.com>>,
Paul Go
<pgo...@gmail.com<mailto:pgo...@gmail.com>> wrote:
> Sad to say, this did not solve the problem.  We ran filter-media,
cleanup
> and index-init jobs.
>
> I have not looked into running DSIndexer directly yet but that is my
next
> option.
>
> Paul Go
>
> Systems Librarian /
> Library Technology Manager
> Paul V. Galvin Library
> 35 West 33rd Street
> Chicago, IL  60616
> 312.567.7997<tel:312.567.7997>
> p...@iit.edu<mailto:p...@iit.edu>
>
>
>
> On Fri, Feb 10, 2012 at 9:29 AM, Thornton, Susan M.
(LARC-B702)[LITES] <
> susan.m.thorn...@nasa.gov<mailto:susan.m.thorn...@nasa.gov>> wrote:
>
>> Please let us know if this solves your problem as I'm really
curious.****
>>
>> Thanks!****
>>
>> Sue****
>>
>> ** **
>>
>> ** **
>>
>> *Sue Walker-Thornton*
>>
>> *(w):  (757) 864-2368<tel:%28757%29%20864-2368>*
>>
>> *(m):  (757) 506-9903<tel:%28757%29%20506-9903>*
>>
>> ** **
>>
>> *From:* Paul Go [mailto:pgo...@gmail.com<mailto:pgo...@gmail.com>]
>> *Sent:* Friday, February 10, 2012 9:11 AM
>> *To:* Thornton, Susan M. (LARC-B702)[LITES]
>> *Cc:* Dspace Tech list
>> *Subject:* Re: [Dspace-tech] Full text search reindexing****
>>
>> ** **
>>
>> I ran index-init.  I will try the cleanup script as well.  Thank
you.
>>
>> Paul Go
>>
>> Systems Librarian /
>> Library Technology Manager
>> Paul V. Galvin Library
>> 35 West 33rd Street
>> Chicago, IL  60616
>> 312.567.7997<tel:312.567.7997>
>> p...@iit.edu<mailto:p...@iit.edu>
>>
>>
>> ****
>>
>> On Thu, Feb 9, 2012 at 4:36 PM, Thornton, Susan M.
(LARC-B702)[LITES] <
>> susan.m.thorn...@nasa.gov<mailto:susan.m.thorn...@nasa.gov>> wrote:****
>>
>> I'm not 100% sure, but try running the "cleanup" script.  This
removes
>> bitstreams where bitstream.deleted = true.****
>>
>>  ****
>>
>> Also, did you run "index-update" or "index-init" to rebuild
your indices?
>> I always run index-init after I've done something like that, just
to be
>> safe.****
>>
>>  ****
>>
>> Best regards,****
>>
>> Sue****
>>
>>  ****
>>
>>  ****
>>
>> *Sue Walker-Thornton*****
>>
>> *(w):  (757) 864-2368<tel:%28757%29%20864-2368>*****
>>
>> *(m):  (757) 506-9903<tel:%28757%29%20506-9903>*****
>>
>>  ****
>>
>> *From:* Paul Go [mailto:pgo...@gmail.com<mailto:pgo...@gmail.com>]
>> *Sent:* Thursday, February 09, 2012 4:24 PM
>> *To:* DSpace General Mailing List; Dspace Tech list
>> *Subject:* [Dspace-tech] Full text search reindexing****
>>
>>  ****
>>
>>  ****
>>
>> We have redacted some information from PDFs that are in our DSpace
>> instance.  This involved downloading, redacting, and re-ingesting
the
>> files, making sure the originals with the offending information
were
>> removed.  We've done a full reindexing (with Tomcat off) but the
redacted
>> material is still showing up in a full-text search (even though the
target
>> items no longer have the information).****
>>
>>  ****
>>
>> How can we force re-index the full-text search?  It was my
understanding
>> that reindexing would do the trick.****
>>
>>  ****
>>
>>
>>
>> Paul Go
>>
>> Systems Librarian /
>> Library Technology Manager
>> Paul V. Galvin Library
>> 35 West 33rd Street
>> Chicago, IL  60616
>> 312.567.7997<tel:312.567.7997>
>> p...@iit.edu<mailto:p...@iit.edu>****
>>
>> ** **
>>

------------------------------------------------------------------------------
Virtualization & Cloud Management Using Capacity Planning
Cloud computing makes use of virtualization - but cloud computing 
also focuses on allowing computing to be delivered as a service.
http://www.accelacomm.com/jaw/sfnl/114/51521223/
_______________________________________________
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech

Reply via email to