[DOCS] Example non-Latin words for text search parser docs?

2007-10-21 Thread Tom Lane
I'm afraid my English-centricity is showing, but I could use a little
help filling in the missing examples in the table here:
http://developer.postgresql.org/pgdocs/postgres/textsearch-parsers.html
I'm not sure of a suitable example all-non-ASCII-letters word, and
even less sure of how to represent it in SGML.  (I remember we had
quite a bit of trouble dealing with accented letters in people's names,
for instance.)

regards, tom lane

---(end of broadcast)---
TIP 9: In versions below 8.0, the planner will ignore your desire to
   choose an index scan if your joining column's datatypes do not
   match


Re: [DOCS] Example non-Latin words for text search parser docs?

2007-10-21 Thread Alvaro Herrera
Tom Lane wrote:
> I'm afraid my English-centricity is showing, but I could use a little
> help filling in the missing examples in the table here:
> http://developer.postgresql.org/pgdocs/postgres/textsearch-parsers.html
> I'm not sure of a suitable example all-non-ASCII-letters word,

It's easy to find an example -- I went to the english Wikipedia,
searched for "elephant", then clicked on the russian link at the left.
It gives you "Слоновые", which I see on my terminal as a series of black
squares :-) so there's not a single latin letter in it.

http://ru.wikipedia.org/wiki/%D0%A1%D0%BB%D0%BE%D0%BD%D0%BE%D0%B2%D1%8B%D0%B5

In that page they also mention the word "Слон" which looks like "Slon".

> and even less sure of how to represent it in SGML.  (I remember we had
> quite a bit of trouble dealing with accented letters in people's
> names, for instance.)

Yeah, that will prove difficult.

-- 
Alvaro Herrerahttp://www.CommandPrompt.com/
The PostgreSQL Company - Command Prompt, Inc.

---(end of broadcast)---
TIP 1: if posting/reading through Usenet, please send an appropriate
   subscribe-nomail command to [EMAIL PROTECTED] so that your
   message can get through to the mailing list cleanly


Re: [DOCS] Example non-Latin words for text search parser docs?

2007-10-21 Thread Tom Lane
Alvaro Herrera <[EMAIL PROTECTED]> writes:
> Tom Lane wrote:
>> and even less sure of how to represent it in SGML.  (I remember we had
>> quite a bit of trouble dealing with accented letters in people's
>> names, for instance.)

> Yeah, that will prove difficult.

This problem largely goes away if we redefine the word categories as
under discussion in the -hackers thread: with any of the proposed
alternatives it'd be pretty easy to make up real words that are easily
representable in SGML.

regards, tom lane

---(end of broadcast)---
TIP 6: explain analyze is your friend