Hello Paul,

there is no need to manually delete files created by filter media.
Running filter media with the option -f (force) will force the creation 
of media filter files and delete existing ones.

Hope that helps

Claudia Jürgen


Am 10.02.2012 20:00, schrieb Paul Go:
> What would happen if I deleted all of the Extracted Text files for the
> items that have redacted material?  Will new extracted text files be
> generated the next time the jobs are run?
>
> Paul Go
>
> Systems Librarian /
> Library Technology Manager
> Paul V. Galvin Library
> 35 West 33rd Street
> Chicago, IL  60616
> 312.567.7997
> [email protected]
>
>
>
> On Fri, Feb 10, 2012 at 12:52 PM, Brian Freels-Stendel<[email protected]>wrote:
>
>> Hi Paul,
>>
>> I'm going to hope that DSIndexer can do the trick.
>>
>> If not, though, there is still a last-resort, manual option.  Signed in
>> as an administrator, you can edit item bitstreams individually.  The
>> extracted texts will show up under the TEXT Bundle, under the name
>> [originalFileName].txt.  If your new files are named the same as the old
>> ones were, you may see two .txt files with the same name.  If so, you
>> can either view each to see which is the bad one, or you could delete
>> them both and re-index.
>>
>> B--
>>
>>>>> On 2/10/2012 at 11:41 AM, in message
>> <CAGD4+EtT=GRH+H3QX-EJEzmVh=Z=s960fkfyffujlykgaig...@mail.gmail.com>,
>> Paul Go
>> <[email protected]>  wrote:
>>> Sad to say, this did not solve the problem.  We ran filter-media,
>> cleanup
>>> and index-init jobs.
>>>
>>> I have not looked into running DSIndexer directly yet but that is my
>> next
>>> option.
>>>
>>> Paul Go
>>>
>>> Systems Librarian /
>>> Library Technology Manager
>>> Paul V. Galvin Library
>>> 35 West 33rd Street
>>> Chicago, IL  60616
>>> 312.567.7997
>>> [email protected]
>>>
>>>
>>>
>>> On Fri, Feb 10, 2012 at 9:29 AM, Thornton, Susan M.
>> (LARC-B702)[LITES]<
>>> [email protected]>  wrote:
>>>
>>>> Please let us know if this solves your problem as I’m really
>> curious.****
>>>>
>>>> Thanks!****
>>>>
>>>> Sue****
>>>>
>>>> ** **
>>>>
>>>> ** **
>>>>
>>>> *Sue Walker-Thornton*
>>>>
>>>> *(w):  (757) 864-2368*
>>>>
>>>> *(m):  (757) 506-9903*
>>>>
>>>> ** **
>>>>
>>>> *From:* Paul Go [mailto:[email protected]]
>>>> *Sent:* Friday, February 10, 2012 9:11 AM
>>>> *To:* Thornton, Susan M. (LARC-B702)[LITES]
>>>> *Cc:* Dspace Tech list
>>>> *Subject:* Re: [Dspace-tech] Full text search reindexing****
>>>>
>>>> ** **
>>>>
>>>> I ran index-init.  I will try the cleanup script as well.  Thank
>> you.
>>>>
>>>> Paul Go
>>>>
>>>> Systems Librarian /
>>>> Library Technology Manager
>>>> Paul V. Galvin Library
>>>> 35 West 33rd Street
>>>> Chicago, IL  60616
>>>> 312.567.7997
>>>> [email protected]
>>>>
>>>>
>>>> ****
>>>>
>>>> On Thu, Feb 9, 2012 at 4:36 PM, Thornton, Susan M.
>> (LARC-B702)[LITES]<
>>>> [email protected]>  wrote:****
>>>>
>>>> I’m not 100% sure, but try running the “cleanup” script.  This
>> removes
>>>> bitstreams where bitstream.deleted = true.****
>>>>
>>>>   ****
>>>>
>>>> Also, did you run “index-update” or “index-init” to rebuild
>> your indices?
>>>> I always run index-init after I’ve done something like that, just
>> to be
>>>> safe.****
>>>>
>>>>   ****
>>>>
>>>> Best regards,****
>>>>
>>>> Sue****
>>>>
>>>>   ****
>>>>
>>>>   ****
>>>>
>>>> *Sue Walker-Thornton*****
>>>>
>>>> *(w):  (757) 864-2368*****
>>>>
>>>> *(m):  (757) 506-9903*****
>>>>
>>>>   ****
>>>>
>>>> *From:* Paul Go [mailto:[email protected]]
>>>> *Sent:* Thursday, February 09, 2012 4:24 PM
>>>> *To:* DSpace General Mailing List; Dspace Tech list
>>>> *Subject:* [Dspace-tech] Full text search reindexing****
>>>>
>>>>   ****
>>>>
>>>>   ****
>>>>
>>>> We have redacted some information from PDFs that are in our DSpace
>>>> instance.  This involved downloading, redacting, and re-ingesting
>> the
>>>> files, making sure the originals with the offending information
>> were
>>>> removed.  We've done a full reindexing (with Tomcat off) but the
>> redacted
>>>> material is still showing up in a full-text search (even though the
>> target
>>>> items no longer have the information).****
>>>>
>>>>   ****
>>>>
>>>> How can we force re-index the full-text search?  It was my
>> understanding
>>>> that reindexing would do the trick.****
>>>>
>>>>   ****
>>>>
>>>>
>>>>
>>>> Paul Go
>>>>
>>>> Systems Librarian /
>>>> Library Technology Manager
>>>> Paul V. Galvin Library
>>>> 35 West 33rd Street
>>>> Chicago, IL  60616
>>>> 312.567.7997
>>>> [email protected]****
>>>>
>>>> ** **
>>>>
>>
>
>
>
> ------------------------------------------------------------------------------
> Virtualization&  Cloud Management Using Capacity Planning
> Cloud computing makes use of virtualization - but cloud computing
> also focuses on allowing computing to be delivered as a service.
> http://www.accelacomm.com/jaw/sfnl/114/51521223/
>
>
>
> _______________________________________________
> DSpace-tech mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/dspace-tech

-- 
Claudia Juergen
Universitaetsbibliothek Dortmund
Eldorado
0231/755-4043
https://eldorado.tu-dortmund.de/

------------------------------------------------------------------------------
Try before you buy = See our experts in action!
The most comprehensive online learning library for Microsoft developers
is just $99.99! Visual Studio, SharePoint, SQL - plus HTML5, CSS3, MVC3,
Metro Style Apps, more. Free future releases when you subscribe now!
http://p.sf.net/sfu/learndevnow-dev2
_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech

Reply via email to