Hello,

I encountered such a problem. my goal is to extract links from a text using tsearch2. Everything seemed to be well, unless I got some youtube links - there are some small and big letters inside, and a tsearch parser is lowering everything (from http://youtube.com/Y6dsHDX I got http://youtube.com/y6dshdx, which is not working). I went through PostgreSQL docs, and it seem that each of default dictionaries (simple, ispell, snowball) are lowering lexems during normalization, and there is no option to disable it.

I started to look for some tutorials, how to create own dictionary, or modify existing one (I'm talking about dictionary like snowball, with my own source code - not just a dictionary created by 'CREATE DICTIONARY...' query), but all I found is really out-of-date, and uses some mechanisms that are deprecated in latest version of Postgres (I'm working on v 9.2) - like 'contrib/gendict' here: http://www.sai.msu.su/~megera/postgres/gist/tsearch/V2/docs/custom-dict.html <http://www.sai.msu.su/%7Emegera/postgres/gist/tsearch/V2/docs/custom-dict.html>

So now, I have no idea what to do with my case sensitivity problem... Is there any other way to overcome it, apart from creating own dictionary? If no - how to create one on the Postgres 9.2?

Regards,
xaru


--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general

Reply via email to