> chad  wrote: 
> Verity is limited.  You cannot do Chinese characters and you cannot index 
> PDFs.
> 

I've been able to do both of those things on Solaris, and I would imagine
the functionality is virtually the same between Linux and Solaris.
All I can think of is that your PDFs are scanned (images) rather than
text, but you say you've indexed the same PDFs on Windows, so that
can't be the case.

In one of my applications I've indexed millions of pages of PDFs, with
some of the "books" containing 30,000+ pages.  Verity has worked fine
save for one problem: it doesn't seem to recognize special characters
in certain embedded fonts.  So, for example, if someone searches for
"32-11-22" they won't find it, but if they search for "321122" they
will.  It's as if Verity doesn't even see the dashes.  The work-around
I use is to have the users search for "32*11*22".  That only happens
with weird embedded fonts, not with all fonts.
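
If you'd rather not make users remember the wildcard trick, the rewrite
could be done on the search string before it is handed to Verity.  Here
is a minimal sketch in Python.  The function name is made up for
illustration, and this is plain string preprocessing, not a Verity API
call; the same substitution could be done in whatever language the
application uses.

```python
import re

# Hypothetical helper, not part of Verity: matches a dash that sits
# directly between two word characters (letters, digits, underscore).
_DASH_BETWEEN = re.compile(r"(?<=\w)-(?=\w)")


def wildcard_dashes(query: str) -> str:
    """Replace in-term dashes with the "*" wildcard, e.g. for part numbers."""
    return _DASH_BETWEEN.sub("*", query)


if __name__ == "__main__":
    print(wildcard_dashes("32-11-22"))
```

Dashes with a space on either side, or at the start of a term, are left
alone, so only terms like part numbers are affected.  Whether you want
this applied to every search or only to collections built from the
problem PDFs is a judgment call.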
