Re: [Zope] attribute used to index PDFs?

2006-02-24 Thread Garth B.
Hmm? I must have missed where it was suggested in this old thread to enter this "issue" into the bug tracker. At any rate, what I eventually concluded was that this really isn't an issue, just a misconception I had about what TXNG3 actually provides as native indexing support (given the appropria

Re: [Zope] attribute used to index PDFs?

2006-02-24 Thread Andreas Jung
--On 12. Dezember 2005 14:54:09 -0500 "Garth B." <[EMAIL PROTECTED]> wrote: On closer inspection, the Word docs aren't actually being indexed appropriately either. When I browse the vocabulary for these indexed Word docs, I happen to see textual content that can be seen by also cat'ing the do

Re: [Zope] attribute used to index PDFs?

2005-12-12 Thread Andreas Jung
--On 12. Dezember 2005 14:54:09 -0500 "Garth B." <[EMAIL PROTECTED]> wrote: - Digging further in this file, "mimetype" is only defined when extract_content() in content.py calls "icc.addBinary(...)". This only happens when the indexed object provides a txng_get() hook (or I suppose if an adap

Re: [Zope] attribute used to index PDFs?

2005-12-12 Thread Garth B.
On closer inspection, the Word docs aren't actually being indexed appropriately either. When I browse the vocabulary for these indexed Word docs, I happen to see textual content that can be seen by also cat'ing the document to the stdout. The vocab includes other strings that certainly are not co

Re: [Zope] attribute used to index PDFs?

2005-12-12 Thread Garth B.
Hi Andreas, Neither PrincipiaSearchSource nor SearchableText does anything for these File-type objects. I guess nothing for SearchableText is expected since these are not CMF or Plone-derived objects. The only way I've managed to get *anything* indexed for these File-type objects is by specifyin

Re: [Zope] attribute used to index PDFs?

2005-12-12 Thread Andreas Jung
--On 12. Dezember 2005 11:33:13 -0500 "Garth B." <[EMAIL PROTECTED]> wrote: TextIndexNG 3.1.1 Zope 2.8.0 Python 2.3.5 What attribute should be specified when indexing PDFs? I've been using "data". Word docs are indexed properly, but the PDFs aren't. The PDFs are still found with the rest of

[Zope] attribute used to index PDFs?

2005-12-12 Thread Garth B.
TextIndexNG 3.1.1 Zope 2.8.0 Python 2.3.5 What attribute should be specified when indexing PDFs? I've been using "data". Word docs are indexed properly, but the PDFs aren't. The PDFs are still found with the rest of the files, but the indexed content is not what I expected. To try narrow thing