Re: [DOCS] Example non-Latin words for text search parser docs?

2007-10-23 Thread Tom Lane
Alvaro Herrera <[EMAIL PROTECTED]> writes:
> Tom Lane wrote:
>> I'm afraid my English-centricity is showing, but I could use a little
>> help filling in the missing examples in the table here:
>> http://developer.postgresql.org/pgdocs/postgres/textsearch-parsers.html
>> I'm not sure of a suitable example all-non-ASCII-letters word,

> It's easy to find an example -- I went to the english Wikipedia,
> searched for "elephant", then clicked on the russian link at the left.
> It gives you "Слоновые", which I see on my terminal as a series of 
> black
> squares :-) so there's not a single latin letter in it.

Given the just-applied changes in the definition of a "word", we no
longer need a totally-not-ASCII sample word.  But I wonder if anyone
has a better idea than the føø that I made up on the
spot...

regards, tom lane

---(end of broadcast)---
TIP 7: You can help support the PostgreSQL project by donating at

http://www.postgresql.org/about/donate


Re: [DOCS] Example non-Latin words for text search parser docs?

2007-10-23 Thread Alvaro Herrera
Tom Lane wrote:
> Alvaro Herrera <[EMAIL PROTECTED]> writes:
> > Tom Lane wrote:
> >> I'm afraid my English-centricity is showing, but I could use a little
> >> help filling in the missing examples in the table here:
> >> http://developer.postgresql.org/pgdocs/postgres/textsearch-parsers.html
> >> I'm not sure of a suitable example all-non-ASCII-letters word,
> 
> > It's easy to find an example -- I went to the english Wikipedia,
> > searched for "elephant", then clicked on the russian link at the left.
> > It gives you "Слоновые", which I see on my terminal as a series of black
> > squares :-) so there's not a single latin letter in it.
> 
> Given the just-applied changes in the definition of a "word", we no
> longer need a totally-not-ASCII sample word.  But I wonder if anyone
> has a better idea than the føø that I made up on the
> spot...

Actually I was wondering if we should use actual words.  So instead of
"foo" we could use "elephant" for asciiword and "Éléphant" (french) for
word.  And for the hword, "sous-espèces" (which appears on the French
Wikipedia) would do.

-- 
Alvaro Herrera http://www.flickr.com/photos/alvherre/
"La espina, desde que nace, ya pincha" (Proverbio africano)

---(end of broadcast)---
TIP 2: Don't 'kill -9' the postmaster


Re: [DOCS] Example non-Latin words for text search parser docs?

2007-10-23 Thread Tom Lane
Alvaro Herrera <[EMAIL PROTECTED]> writes:
> Actually I was wondering if we should use actual words.  So instead of
> "foo" we could use "elephant" for asciiword and "Éléphant" (french) for
> word.  And for the hword, "sous-espèces" (which appears on the French
> Wikipedia) would do.

Hmm ... I see a potential problem with that, which is that if someone
happened to be viewing the page on something that dropped the accents,
or even just made them too small to be easily readable, the examples
wouldn't make any sense at all.

I have no problem with "elephant" as a sample asciiword, but for the
sample non-ascii word I'd suggest something that (a) is clearly not
English and (b) as much as possible, everybody knows has an accent.
At least in large parts of the US, something like "mañana" would
work nicely.

Anyway, feel free to hack on it --- I'm getting a bit weary of looking
at that chapter.

regards, tom lane

---(end of broadcast)---
TIP 7: You can help support the PostgreSQL project by donating at

http://www.postgresql.org/about/donate