Re: [Zope] attribute used to index PDFs?

2006-02-24 Thread Andreas Jung
--On 12. Dezember 2005 14:54:09 -0500 Garth B. [EMAIL PROTECTED] wrote: On closer inspection, the Word docs aren't actually being indexed appropriately either. When I browse the vocabulary for these indexed Word docs, I happen to see textual content that can be seen by also cat'ing the

Re: [Zope] attribute used to index PDFs?

2006-02-24 Thread Garth B.
Hmm? I must have missed where it was suggested in this old thread to enter this issue into the bug tracker. At any rate, what I eventually concluded was that this really isn't an issue, just a misconception I had about what TXNG3 actually provides as native indexing support (given the

Re: [Zope] attribute used to index PDFs?

2005-12-12 Thread Andreas Jung
--On 12. Dezember 2005 11:33:13 -0500 Garth B. [EMAIL PROTECTED] wrote: TextIndexNG 3.1.1 Zope 2.8.0 Python 2.3.5 What attribute should be specified when indexing PDFs? I've been using data. Word docs are indexed properly, but the PDFs aren't. The PDFs are still found with the rest of the

Re: [Zope] attribute used to index PDFs?

2005-12-12 Thread Garth B.
Hi Andreas, Neither PrincipiaSearchSource nor SearchableText does anything for these File-type objects. I guess nothing for SearchableText is expected since these are not CMF or Plone-derived objects. The only way I've managed to get *anything* indexed for these File-type objects is by

Re: [Zope] attribute used to index PDFs?

2005-12-12 Thread Garth B.
On closer inspection, the Word docs aren't actually being indexed appropriately either. When I browse the vocabulary for these indexed Word docs, I happen to see textual content that can be seen by also cat'ing the document to the stdout. The vocab includes other strings that certainly are not

Re: [Zope] attribute used to index PDFs?

2005-12-12 Thread Andreas Jung
--On 12. Dezember 2005 14:54:09 -0500 Garth B. [EMAIL PROTECTED] wrote: - Digging further in this file, mimetype is only defined when extract_content() in content.py calls icc.addBinary(...). This only happens when the indexed object provides a txng_get() hook (or I suppose if an adapter