[DOCS] Example non-Latin words for text search parser docs?
I'm afraid my English-centricity is showing, but I could use a little help filling in the missing examples in the table here: http://developer.postgresql.org/pgdocs/postgres/textsearch-parsers.html I'm not sure of a suitable example all-non-ASCII-letters word, and even less sure of how to represent it in SGML. (I remember we had quite a bit of trouble dealing with accented letters in people's names, for instance.) regards, tom lane ---(end of broadcast)--- TIP 9: In versions below 8.0, the planner will ignore your desire to choose an index scan if your joining column's datatypes do not match
Re: [DOCS] Example non-Latin words for text search parser docs?
Tom Lane wrote: > I'm afraid my English-centricity is showing, but I could use a little > help filling in the missing examples in the table here: > http://developer.postgresql.org/pgdocs/postgres/textsearch-parsers.html > I'm not sure of a suitable example all-non-ASCII-letters word, It's easy to find an example -- I went to the english Wikipedia, searched for "elephant", then clicked on the russian link at the left. It gives you "Слоновые", which I see on my terminal as a series of black squares :-) so there's not a single latin letter in it. http://ru.wikipedia.org/wiki/%D0%A1%D0%BB%D0%BE%D0%BD%D0%BE%D0%B2%D1%8B%D0%B5 In that page they also mention the word "Слон" which looks like "Slon". > and even less sure of how to represent it in SGML. (I remember we had > quite a bit of trouble dealing with accented letters in people's > names, for instance.) Yeah, that will prove difficult. -- Alvaro Herrerahttp://www.CommandPrompt.com/ The PostgreSQL Company - Command Prompt, Inc. ---(end of broadcast)--- TIP 1: if posting/reading through Usenet, please send an appropriate subscribe-nomail command to [EMAIL PROTECTED] so that your message can get through to the mailing list cleanly
Re: [DOCS] Example non-Latin words for text search parser docs?
Alvaro Herrera <[EMAIL PROTECTED]> writes: > Tom Lane wrote: >> and even less sure of how to represent it in SGML. (I remember we had >> quite a bit of trouble dealing with accented letters in people's >> names, for instance.) > Yeah, that will prove difficult. This problem largely goes away if we redefine the word categories as under discussion in the -hackers thread: with any of the proposed alternatives it'd be pretty easy to make up real words that are easily representable in SGML. regards, tom lane ---(end of broadcast)--- TIP 6: explain analyze is your friend
