Re: [HACKERS] Problem with site doc search

2008-04-16 Thread Cédric Villemain
Notice that :

http://search.postgresql.org/search?q=tom+lanem=1l=d=1s=r
and 
http://search.postgresql.org/search?q=tom+lanem=1l=d=1s=d

do not provide same result (3 results by date, 1 by rank) even if only the 
sorting is changed.

-- 
Cédric Villemain
Administrateur de Base de Données
Cel: +33 (0)6 74 15 56 53
http://dalibo.com - http://dalibo.org


signature.asc
Description: This is a digitally signed message part.


Re: [HACKERS] Problem with site doc search

2008-04-16 Thread Magnus Hagander
Oleg Bartunov wrote:
 On Tue, 15 Apr 2008, Magnus Hagander wrote:
 
  I didn't do anything, but possibly it got fixed by a different
  upgrade at some point, and the recrawling of the sites.
 
 Magnus, we have parser for indexing pgdocs, do you need it ?

Yes, please!

//Magnus

-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Problem with site doc search

2008-04-16 Thread Magnus Hagander
Cédric Villemain wrote:
 Notice that :
 
 http://search.postgresql.org/search?q=tom+lanem=1l=d=1s=r
 and 
 http://search.postgresql.org/search?q=tom+lanem=1l=d=1s=d
 
 do not provide same result (3 results by date, 1 by rank) even if
 only the sorting is changed.

Actually, I get 5 and 7, in the other order.

The reason for this is that Tom Lane is way too active. It's
gin_fuzzy_search_limit that's doing the restriction first, and the date
restriction comes in later.

//Magnus

-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Problem with site doc search

2008-04-16 Thread Cédric Villemain
Le Wednesday 16 April 2008, Magnus Hagander a écrit :
 Cédric Villemain wrote:
  Notice that :
 
  http://search.postgresql.org/search?q=tom+lanem=1l=d=1s=r
  and
  http://search.postgresql.org/search?q=tom+lanem=1l=d=1s=d
 
  do not provide same result (3 results by date, 1 by rank) even if
  only the sorting is changed.

 Actually, I get 5 and 7, in the other order.

 The reason for this is that Tom Lane is way too active. It's
 gin_fuzzy_search_limit that's doing the restriction first, and the date
 restriction comes in later.

Yes, you are perfectly right. Can I suggest to deactivate 
gin_fuzzy_search_limit (or increase the value) when one condition (and use 
the condition earlier) can considerably reduce the number of results (like 
the 'post date' here) ?


 //Magnus



-- 
Cédric Villemain
Administrateur de Base de Données
Cel: +33 (0)6 74 15 56 53
http://dalibo.com - http://dalibo.org


signature.asc
Description: This is a digitally signed message part.


Re: [HACKERS] Problem with site doc search

2008-04-16 Thread Magnus Hagander
Cédric Villemain wrote:
 Le Wednesday 16 April 2008, Magnus Hagander a écrit :
  Cédric Villemain wrote:
   Notice that :
  
   http://search.postgresql.org/search?q=tom+lanem=1l=d=1s=r
   and
   http://search.postgresql.org/search?q=tom+lanem=1l=d=1s=d
  
   do not provide same result (3 results by date, 1 by rank) even if
   only the sorting is changed.
 
  Actually, I get 5 and 7, in the other order.
 
  The reason for this is that Tom Lane is way too active. It's
  gin_fuzzy_search_limit that's doing the restriction first, and the
  date restriction comes in later.
 
 Yes, you are perfectly right. Can I suggest to deactivate 
 gin_fuzzy_search_limit (or increase the value) when one condition
 (and use the condition earlier) can considerably reduce the number of
 results (like the 'post date' here) ?

You'd have to convince the planner to actually not use an indexscan at
all on the tsvector. Normally it'll choose an index scan on each and
then a bitmap join, and we don't want to bring back so many rows...

//Magnus

-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Problem with site doc search

2008-04-15 Thread Bruce Momjian

I just searched for pg_standby and it looks like it is fixed now.

---

Gurjeet Singh wrote:
 On Tue, Mar 25, 2008 at 3:01 AM, Magnus Hagander [EMAIL PROTECTED]
 wrote:
 
  Gurjeet Singh wrote:
   On Thu, Mar 6, 2008 at 5:01 PM, Gevik Babakhani [EMAIL PROTECTED]
   mailto:[EMAIL PROTECTED] wrote:
  
 No, it's still on the TODO. Gevik has also been looking a bit
 at it (I think - at least he's indicated that he is), and he
 recently got some new parser code to look at to see if we can
 use to fix it.

  
   I have the new parser code. Next week I have some time reserved to
   look at
   it.
  
  
   For the last few days I have been noticing that the _ (underscore) is no
   longer being treated as a word boundary. Can you confirm this has been
   fixed?
 
  AFAIK, nothing has been done yet :-)
 
 
 You are right I was looking for pg_standby and another pg_something
 and since I found those results on top, I thought there was some action
 taken.
 
 The problem persists
 
 Best regards,
 -- 
 [EMAIL PROTECTED]
 [EMAIL PROTECTED] gmail | hotmail | indiatimes | yahoo }.com
 
 EnterpriseDB http://www.enterprisedb.com
 
 Mail sent from my BlackLaptop device

-- 
  Bruce Momjian  [EMAIL PROTECTED]http://momjian.us
  EnterpriseDB http://enterprisedb.com

  + If your life is a hard drive, Christ can be your backup. +

-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Problem with site doc search

2008-04-15 Thread Magnus Hagander
I didn't do anything, but possibly it got fixed by a different upgrade
at some point, and the recrawling of the sites.

//Magnus

Bruce Momjian wrote:
 
 I just searched for pg_standby and it looks like it is fixed now.
 
 ---
 
 Gurjeet Singh wrote:
  On Tue, Mar 25, 2008 at 3:01 AM, Magnus Hagander
  [EMAIL PROTECTED] wrote:
  
   Gurjeet Singh wrote:
On Thu, Mar 6, 2008 at 5:01 PM, Gevik Babakhani [EMAIL PROTECTED]
mailto:[EMAIL PROTECTED] wrote:
   
  No, it's still on the TODO. Gevik has also been looking
  a bit at it (I think - at least he's indicated that he
  is), and he recently got some new parser code to look at
  to see if we can use to fix it.
 
   
I have the new parser code. Next week I have some time
reserved to look at
it.
   
   
For the last few days I have been noticing that the _
(underscore) is no longer being treated as a word boundary. Can
you confirm this has been fixed?
  
   AFAIK, nothing has been done yet :-)
  
  
  You are right I was looking for pg_standby and another
  pg_something and since I found those results on top, I thought
  there was some action taken.
  
  The problem persists
  
  Best regards,
  -- 
  [EMAIL PROTECTED]
  [EMAIL PROTECTED] gmail | hotmail | indiatimes | yahoo }.com
  
  EnterpriseDB http://www.enterprisedb.com
  
  Mail sent from my BlackLaptop device
 
 -- 
   Bruce Momjian  [EMAIL PROTECTED]http://momjian.us
   EnterpriseDB http://enterprisedb.com
 
   + If your life is a hard drive, Christ can be your backup. +


-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Problem with site doc search

2008-04-15 Thread Bruce Momjian
Magnus Hagander wrote:
 I didn't do anything, but possibly it got fixed by a different upgrade
 at some point, and the recrawling of the sites.

Oops, maybe it isn't fixed. I tried pg_standby and it seemed to work
but pg and standby returns the same results.

Is that correct?  How do I test this?

---

 
 //Magnus
 
 Bruce Momjian wrote:
  
  I just searched for pg_standby and it looks like it is fixed now.
  
  ---
  
  Gurjeet Singh wrote:
   On Tue, Mar 25, 2008 at 3:01 AM, Magnus Hagander
   [EMAIL PROTECTED] wrote:
   
Gurjeet Singh wrote:
 On Thu, Mar 6, 2008 at 5:01 PM, Gevik Babakhani [EMAIL PROTECTED]
 mailto:[EMAIL PROTECTED] wrote:

   No, it's still on the TODO. Gevik has also been looking
   a bit at it (I think - at least he's indicated that he
   is), and he recently got some new parser code to look at
   to see if we can use to fix it.
  

 I have the new parser code. Next week I have some time
 reserved to look at
 it.


 For the last few days I have been noticing that the _
 (underscore) is no longer being treated as a word boundary. Can
 you confirm this has been fixed?
   
AFAIK, nothing has been done yet :-)
   
   
   You are right I was looking for pg_standby and another
   pg_something and since I found those results on top, I thought
   there was some action taken.
   
   The problem persists
   
   Best regards,
   -- 
   [EMAIL PROTECTED]
   [EMAIL PROTECTED] gmail | hotmail | indiatimes | yahoo }.com
   
   EnterpriseDB http://www.enterprisedb.com
   
   Mail sent from my BlackLaptop device
  
  -- 
Bruce Momjian  [EMAIL PROTECTED]http://momjian.us
EnterpriseDB http://enterprisedb.com
  
+ If your life is a hard drive, Christ can be your backup. +

-- 
  Bruce Momjian  [EMAIL PROTECTED]http://momjian.us
  EnterpriseDB http://enterprisedb.com

  + If your life is a hard drive, Christ can be your backup. +

-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Problem with site doc search

2008-04-15 Thread Richard Huxton

Bruce Momjian wrote:

Magnus Hagander wrote:

I didn't do anything, but possibly it got fixed by a different upgrade
at some point, and the recrawling of the sites.


Oops, maybe it isn't fixed. I tried pg_standby and it seemed to work
but pg and standby returns the same results.

Is that correct?  How do I test this?


The default is to split words on underscore, so it's probably doing what 
it always did. Try to_tsquery and you should see it matching tsquery 
(probably to is a stopword).


I did put together a custom parser that allowed underscore in words, but 
given my extensive C experience in the last decade (one tsearch 
parser) you don't want to just plug that into the live site. Someone 
(Gevik?) was going to have a look at it when they had the time, but I'd 
guess that's the one thing none of us have much of.


--
  Richard Huxton
  Archonet Ltd

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Problem with site doc search

2008-04-15 Thread Magnus Hagander
Richard Huxton wrote:
 Bruce Momjian wrote:
  Magnus Hagander wrote:
  I didn't do anything, but possibly it got fixed by a different
  upgrade at some point, and the recrawling of the sites.
  
  Oops, maybe it isn't fixed. I tried pg_standby and it seemed to
  work but pg and standby returns the same results.
  
  Is that correct?  How do I test this?
 
 The default is to split words on underscore, so it's probably doing
 what it always did. Try to_tsquery and you should see it matching
 tsquery (probably to is a stopword).
 
 I did put together a custom parser that allowed underscore in words,
 but given my extensive C experience in the last decade (one tsearch 
 parser) you don't want to just plug that into the live site. Someone 
 (Gevik?) was going to have a look at it when they had the time, but
 I'd guess that's the one thing none of us have much of.

Yeah.
Did you ever post the code to anybody other than Gevik? If not, please
send it to pgsql-www and someone can give it a quick look-over (perhaps
Oleg can help us there?)

//Magnus

-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Problem with site doc search

2008-04-15 Thread Richard Huxton

Magnus Hagander wrote:

Did you ever post the code to anybody other than Gevik? If not, please
send it to pgsql-www and someone can give it a quick look-over (perhaps
Oleg can help us there?)


Will do.

--
  Richard Huxton
  Archonet Ltd

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Problem with site doc search

2008-04-15 Thread Oleg Bartunov

On Tue, 15 Apr 2008, Magnus Hagander wrote:


I didn't do anything, but possibly it got fixed by a different upgrade
at some point, and the recrawling of the sites.


Magnus, we have parser for indexing pgdocs, do you need it ?



//Magnus

Bruce Momjian wrote:


I just searched for pg_standby and it looks like it is fixed now.

---

Gurjeet Singh wrote:

On Tue, Mar 25, 2008 at 3:01 AM, Magnus Hagander
[EMAIL PROTECTED] wrote:


Gurjeet Singh wrote:

On Thu, Mar 6, 2008 at 5:01 PM, Gevik Babakhani [EMAIL PROTECTED]
mailto:[EMAIL PROTECTED] wrote:

 No, it's still on the TODO. Gevik has also been looking
 a bit at it (I think - at least he's indicated that he
 is), and he recently got some new parser code to look at
 to see if we can use to fix it.


I have the new parser code. Next week I have some time
reserved to look at
it.


For the last few days I have been noticing that the _
(underscore) is no longer being treated as a word boundary. Can
you confirm this has been fixed?


AFAIK, nothing has been done yet :-)



You are right I was looking for pg_standby and another
pg_something and since I found those results on top, I thought
there was some action taken.

The problem persists

Best regards,
--
[EMAIL PROTECTED]
[EMAIL PROTECTED] gmail | hotmail | indiatimes | yahoo }.com

EnterpriseDB http://www.enterprisedb.com

Mail sent from my BlackLaptop device


--
  Bruce Momjian  [EMAIL PROTECTED]http://momjian.us
  EnterpriseDB http://enterprisedb.com

  + If your life is a hard drive, Christ can be your backup. +




Regards,
Oleg
_
Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru),
Sternberg Astronomical Institute, Moscow University, Russia
Internet: [EMAIL PROTECTED], http://www.sai.msu.su/~megera/
phone: +007(495)939-16-83, +007(495)939-23-83

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Problem with site doc search

2008-04-15 Thread Oleg Bartunov

Sergey Karpov prepared contrib/extend_parser, which we intend to use for
indexing pg-related documents. It handles '_' properly, so if anybody
interested, we could post it. Also, it can be useful  for playing, since
it's standalone contrib module.

Oleg

On Tue, 15 Apr 2008, Richard Huxton wrote:


Bruce Momjian wrote:

Magnus Hagander wrote:

I didn't do anything, but possibly it got fixed by a different upgrade
at some point, and the recrawling of the sites.


Oops, maybe it isn't fixed. I tried pg_standby and it seemed to work
but pg and standby returns the same results.

Is that correct?  How do I test this?


The default is to split words on underscore, so it's probably doing what it 
always did. Try to_tsquery and you should see it matching tsquery 
(probably to is a stopword).


I did put together a custom parser that allowed underscore in words, but 
given my extensive C experience in the last decade (one tsearch parser) you 
don't want to just plug that into the live site. Someone (Gevik?) was going 
to have a look at it when they had the time, but I'd guess that's the one 
thing none of us have much of.





Regards,
Oleg
_
Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru),
Sternberg Astronomical Institute, Moscow University, Russia
Internet: [EMAIL PROTECTED], http://www.sai.msu.su/~megera/
phone: +007(495)939-16-83, +007(495)939-23-83

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Problem with site doc search

2008-04-15 Thread Richard Huxton

Oleg Bartunov wrote:

Sergey Karpov prepared contrib/extend_parser, which we intend to use for
indexing pg-related documents. It handles '_' properly, so if anybody
interested, we could post it. Also, it can be useful  for playing, since
it's standalone contrib module.


Does it make sense to back-patch the default parser for 8.4? At present, 
it can't handle underscores in file-paths.


--
  Richard Huxton
  Archonet Ltd

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Problem with site doc search

2008-03-26 Thread Gurjeet Singh
On Tue, Mar 25, 2008 at 3:01 AM, Magnus Hagander [EMAIL PROTECTED]
wrote:

 Gurjeet Singh wrote:
  On Thu, Mar 6, 2008 at 5:01 PM, Gevik Babakhani [EMAIL PROTECTED]
  mailto:[EMAIL PROTECTED] wrote:
 
No, it's still on the TODO. Gevik has also been looking a bit
at it (I think - at least he's indicated that he is), and he
recently got some new parser code to look at to see if we can
use to fix it.
   
 
  I have the new parser code. Next week I have some time reserved to
  look at
  it.
 
 
  For the last few days I have been noticing that the _ (underscore) is no
  longer being treated as a word boundary. Can you confirm this has been
  fixed?

 AFAIK, nothing has been done yet :-)


You are right I was looking for pg_standby and another pg_something
and since I found those results on top, I thought there was some action
taken.

The problem persists

Best regards,
-- 
[EMAIL PROTECTED]
[EMAIL PROTECTED] gmail | hotmail | indiatimes | yahoo }.com

EnterpriseDB http://www.enterprisedb.com

Mail sent from my BlackLaptop device


Re: [HACKERS] Problem with site doc search

2008-03-24 Thread Gurjeet Singh
On Thu, Mar 6, 2008 at 5:01 PM, Gevik Babakhani [EMAIL PROTECTED] wrote:

  No, it's still on the TODO. Gevik has also been looking a bit
  at it (I think - at least he's indicated that he is), and he
  recently got some new parser code to look at to see if we can
  use to fix it.
 

 I have the new parser code. Next week I have some time reserved to look at
 it.


For the last few days I have been noticing that the _ (underscore) is no
longer being treated as a word boundary. Can you confirm this has been
fixed?

Best regards,

-- 
[EMAIL PROTECTED]
[EMAIL PROTECTED] gmail | hotmail | indiatimes | yahoo }.com

EnterpriseDB http://www.enterprisedb.com

Mail sent from my BlackLaptop device


Re: [HACKERS] Problem with site doc search

2008-03-24 Thread Magnus Hagander

Gurjeet Singh wrote:
On Thu, Mar 6, 2008 at 5:01 PM, Gevik Babakhani [EMAIL PROTECTED] 
mailto:[EMAIL PROTECTED] wrote:


  No, it's still on the TODO. Gevik has also been looking a bit
  at it (I think - at least he's indicated that he is), and he
  recently got some new parser code to look at to see if we can
  use to fix it.
 

I have the new parser code. Next week I have some time reserved to
look at
it.


For the last few days I have been noticing that the _ (underscore) is no 
longer being treated as a word boundary. Can you confirm this has been 
fixed?


AFAIK, nothing has been done yet :-)

//Magnus

-
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers


Re: [HACKERS] Problem with site doc search

2008-03-06 Thread Magnus Hagander

No, it's still on the TODO. Gevik has also been looking a bit at it (I
think - at least he's indicated that he is), and he recently got some nwe
parser code to look at to see if we can use to fix it.

//Magnus

On Wed, Mar 05, 2008 at 10:23:28PM -0500, Bruce Momjian wrote:
 
 Has this been addressed?
 
 ---
 
 Oleg Bartunov wrote:
  On Tue, 5 Feb 2008, Magnus Hagander wrote:
  
   No. It's on the list, but other things around the release haev priority.
  
  I just returned from my Europe trip and have many things to do :)
  
  
   //Magnus
  
   On Mon, Feb 04, 2008 at 06:43:09PM -0800, Gurjeet Singh wrote:
   Hi guys any updates on this? Pinging you just so that we do not 
   forget
   it in the heap of mails in our inboxes.
  
   Best regards,
  
   On Feb 3, 2008 8:40 AM, Magnus Hagander [EMAIL PROTECTED] wrote:
  
   Oleg Bartunov wrote:
   On Sat, 2 Feb 2008, Gurjeet Singh wrote:
  
   Hi All,
  
  I just noticed a minor bug in our search results. Searching for
   is_insteadbool in 8.3 docs returns the following page:
  
   http://www.postgresql.org/docs/8.3/static/catalog-pg-rewrite.html
  
  is_instead is a column, and bool is the datatype, both mentioned in
   different columns. I know it is based on postgres' own full text
   search, but
   am not sure about the method how docs are read by the search engine. 
   It
   seems that the problem lies in the the way the doc was read and fed to
   the
   index builder, because the initial search results show these two words
   combined.
  
  Also, is it possible to teach our search engine to *not* treat _
   (underscore) as a word separator? This would be great help and would
   result
   much better results.
  
  
   Sure, there are many ways to do this. Magnus ?
  
   Which way would you recommend? You're the tsearch master ;-)
  
   //Magnus
  
  
  
  
   --
   [EMAIL PROTECTED]
   [EMAIL PROTECTED] gmail | hotmail | indiatimes | yahoo }.com
  
   EnterpriseDB  http://www.enterprisedb.com
  
   17? 29' 34.37N,   78? 30' 59.76E - Hyderabad
   18? 32' 57.25N,   73? 56' 25.42E - Pune
   37? 47' 19.72N, 122? 24' 1.69 W - San Francisco *
  
   http://gurjeet.frihost.net
  
   Mail sent from my BlackLaptop device
  
   ---(end of broadcast)---
   TIP 7: You can help support the PostgreSQL project by donating at
  
  http://www.postgresql.org/about/donate
  
  
  Regards,
  Oleg
  _
  Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru),
  Sternberg Astronomical Institute, Moscow University, Russia
  Internet: [EMAIL PROTECTED], http://www.sai.msu.su/~megera/
  phone: +007(495)939-16-83, +007(495)939-23-83
  ---(end of broadcast)---
  TIP 1: if posting/reading through Usenet, please send an appropriate
 subscribe-nomail command to [EMAIL PROTECTED] so that your
 message can get through to the mailing list cleanly
 
 -- 
   Bruce Momjian  [EMAIL PROTECTED]http://momjian.us
   EnterpriseDB http://postgres.enterprisedb.com
 
   + If your life is a hard drive, Christ can be your backup. +

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://mail.postgresql.org/mj/mj_wwwusr?domain=postgresql.orgextra=pgsql-hackers


Re: [HACKERS] Problem with site doc search

2008-03-06 Thread Gevik Babakhani
 No, it's still on the TODO. Gevik has also been looking a bit 
 at it (I think - at least he's indicated that he is), and he 
 recently got some new parser code to look at to see if we can 
 use to fix it.
 

I have the new parser code. Next week I have some time reserved to look at
it.


--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://mail.postgresql.org/mj/mj_wwwusr?domain=postgresql.orgextra=pgsql-hackers


Re: [HACKERS] Problem with site doc search

2008-03-05 Thread Bruce Momjian

Has this been addressed?

---

Oleg Bartunov wrote:
 On Tue, 5 Feb 2008, Magnus Hagander wrote:
 
  No. It's on the list, but other things around the release haev priority.
 
 I just returned from my Europe trip and have many things to do :)
 
 
  //Magnus
 
  On Mon, Feb 04, 2008 at 06:43:09PM -0800, Gurjeet Singh wrote:
  Hi guys any updates on this? Pinging you just so that we do not forget
  it in the heap of mails in our inboxes.
 
  Best regards,
 
  On Feb 3, 2008 8:40 AM, Magnus Hagander [EMAIL PROTECTED] wrote:
 
  Oleg Bartunov wrote:
  On Sat, 2 Feb 2008, Gurjeet Singh wrote:
 
  Hi All,
 
 I just noticed a minor bug in our search results. Searching for
  is_insteadbool in 8.3 docs returns the following page:
 
  http://www.postgresql.org/docs/8.3/static/catalog-pg-rewrite.html
 
 is_instead is a column, and bool is the datatype, both mentioned in
  different columns. I know it is based on postgres' own full text
  search, but
  am not sure about the method how docs are read by the search engine. It
  seems that the problem lies in the the way the doc was read and fed to
  the
  index builder, because the initial search results show these two words
  combined.
 
 Also, is it possible to teach our search engine to *not* treat _
  (underscore) as a word separator? This would be great help and would
  result
  much better results.
 
 
  Sure, there are many ways to do this. Magnus ?
 
  Which way would you recommend? You're the tsearch master ;-)
 
  //Magnus
 
 
 
 
  --
  [EMAIL PROTECTED]
  [EMAIL PROTECTED] gmail | hotmail | indiatimes | yahoo }.com
 
  EnterpriseDB  http://www.enterprisedb.com
 
  17? 29' 34.37N,   78? 30' 59.76E - Hyderabad
  18? 32' 57.25N,   73? 56' 25.42E - Pune
  37? 47' 19.72N, 122? 24' 1.69 W - San Francisco *
 
  http://gurjeet.frihost.net
 
  Mail sent from my BlackLaptop device
 
  ---(end of broadcast)---
  TIP 7: You can help support the PostgreSQL project by donating at
 
 http://www.postgresql.org/about/donate
 
 
   Regards,
   Oleg
 _
 Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru),
 Sternberg Astronomical Institute, Moscow University, Russia
 Internet: [EMAIL PROTECTED], http://www.sai.msu.su/~megera/
 phone: +007(495)939-16-83, +007(495)939-23-83
 ---(end of broadcast)---
 TIP 1: if posting/reading through Usenet, please send an appropriate
subscribe-nomail command to [EMAIL PROTECTED] so that your
message can get through to the mailing list cleanly

-- 
  Bruce Momjian  [EMAIL PROTECTED]http://momjian.us
  EnterpriseDB http://postgres.enterprisedb.com

  + If your life is a hard drive, Christ can be your backup. +

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://mail.postgresql.org/mj/mj_wwwusr?domain=postgresql.orgextra=pgsql-hackers


Re: [HACKERS] Problem with site doc search

2008-02-07 Thread Oleg Bartunov

On Tue, 5 Feb 2008, Magnus Hagander wrote:


No. It's on the list, but other things around the release haev priority.


I just returned from my Europe trip and have many things to do :)



//Magnus

On Mon, Feb 04, 2008 at 06:43:09PM -0800, Gurjeet Singh wrote:

Hi guys any updates on this? Pinging you just so that we do not forget
it in the heap of mails in our inboxes.

Best regards,

On Feb 3, 2008 8:40 AM, Magnus Hagander [EMAIL PROTECTED] wrote:


Oleg Bartunov wrote:

On Sat, 2 Feb 2008, Gurjeet Singh wrote:


Hi All,

   I just noticed a minor bug in our search results. Searching for
is_insteadbool in 8.3 docs returns the following page:

http://www.postgresql.org/docs/8.3/static/catalog-pg-rewrite.html

   is_instead is a column, and bool is the datatype, both mentioned in
different columns. I know it is based on postgres' own full text
search, but
am not sure about the method how docs are read by the search engine. It
seems that the problem lies in the the way the doc was read and fed to
the
index builder, because the initial search results show these two words
combined.

   Also, is it possible to teach our search engine to *not* treat _
(underscore) as a word separator? This would be great help and would
result
much better results.



Sure, there are many ways to do this. Magnus ?


Which way would you recommend? You're the tsearch master ;-)

//Magnus





--
[EMAIL PROTECTED]
[EMAIL PROTECTED] gmail | hotmail | indiatimes | yahoo }.com

EnterpriseDB  http://www.enterprisedb.com

17° 29' 34.37N,   78° 30' 59.76E - Hyderabad
18° 32' 57.25N,   73° 56' 25.42E - Pune
37° 47' 19.72N, 122° 24' 1.69 W - San Francisco *

http://gurjeet.frihost.net

Mail sent from my BlackLaptop device


---(end of broadcast)---
TIP 7: You can help support the PostgreSQL project by donating at

   http://www.postgresql.org/about/donate



Regards,
Oleg
_
Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru),
Sternberg Astronomical Institute, Moscow University, Russia
Internet: [EMAIL PROTECTED], http://www.sai.msu.su/~megera/
phone: +007(495)939-16-83, +007(495)939-23-83
---(end of broadcast)---
TIP 1: if posting/reading through Usenet, please send an appropriate
  subscribe-nomail command to [EMAIL PROTECTED] so that your
  message can get through to the mailing list cleanly


Re: [HACKERS] Problem with site doc search

2008-02-05 Thread Magnus Hagander
No. It's on the list, but other things around the release haev priority.

//Magnus

On Mon, Feb 04, 2008 at 06:43:09PM -0800, Gurjeet Singh wrote:
 Hi guys any updates on this? Pinging you just so that we do not forget
 it in the heap of mails in our inboxes.
 
 Best regards,
 
 On Feb 3, 2008 8:40 AM, Magnus Hagander [EMAIL PROTECTED] wrote:
 
  Oleg Bartunov wrote:
   On Sat, 2 Feb 2008, Gurjeet Singh wrote:
  
   Hi All,
  
  I just noticed a minor bug in our search results. Searching for
   is_insteadbool in 8.3 docs returns the following page:
  
   http://www.postgresql.org/docs/8.3/static/catalog-pg-rewrite.html
  
  is_instead is a column, and bool is the datatype, both mentioned in
   different columns. I know it is based on postgres' own full text
   search, but
   am not sure about the method how docs are read by the search engine. It
   seems that the problem lies in the the way the doc was read and fed to
   the
   index builder, because the initial search results show these two words
   combined.
  
  Also, is it possible to teach our search engine to *not* treat _
   (underscore) as a word separator? This would be great help and would
   result
   much better results.
  
  
   Sure, there are many ways to do this. Magnus ?
 
  Which way would you recommend? You're the tsearch master ;-)
 
  //Magnus
 
 
 
 
 -- 
 [EMAIL PROTECTED]
 [EMAIL PROTECTED] gmail | hotmail | indiatimes | yahoo }.com
 
 EnterpriseDB  http://www.enterprisedb.com
 
 17° 29' 34.37N,   78° 30' 59.76E - Hyderabad
 18° 32' 57.25N,   73° 56' 25.42E - Pune
 37° 47' 19.72N, 122° 24' 1.69 W - San Francisco *
 
 http://gurjeet.frihost.net
 
 Mail sent from my BlackLaptop device

---(end of broadcast)---
TIP 7: You can help support the PostgreSQL project by donating at

http://www.postgresql.org/about/donate


Re: [HACKERS] Problem with site doc search

2008-02-04 Thread Gurjeet Singh
Hi guys any updates on this? Pinging you just so that we do not forget
it in the heap of mails in our inboxes.

Best regards,

On Feb 3, 2008 8:40 AM, Magnus Hagander [EMAIL PROTECTED] wrote:

 Oleg Bartunov wrote:
  On Sat, 2 Feb 2008, Gurjeet Singh wrote:
 
  Hi All,
 
 I just noticed a minor bug in our search results. Searching for
  is_insteadbool in 8.3 docs returns the following page:
 
  http://www.postgresql.org/docs/8.3/static/catalog-pg-rewrite.html
 
 is_instead is a column, and bool is the datatype, both mentioned in
  different columns. I know it is based on postgres' own full text
  search, but
  am not sure about the method how docs are read by the search engine. It
  seems that the problem lies in the the way the doc was read and fed to
  the
  index builder, because the initial search results show these two words
  combined.
 
 Also, is it possible to teach our search engine to *not* treat _
  (underscore) as a word separator? This would be great help and would
  result
  much better results.
 
 
  Sure, there are many ways to do this. Magnus ?

 Which way would you recommend? You're the tsearch master ;-)

 //Magnus




-- 
[EMAIL PROTECTED]
[EMAIL PROTECTED] gmail | hotmail | indiatimes | yahoo }.com

EnterpriseDB  http://www.enterprisedb.com

17° 29' 34.37N,   78° 30' 59.76E - Hyderabad
18° 32' 57.25N,   73° 56' 25.42E - Pune
37° 47' 19.72N, 122° 24' 1.69 W - San Francisco *

http://gurjeet.frihost.net

Mail sent from my BlackLaptop device


Re: [HACKERS] Problem with site doc search

2008-02-03 Thread Magnus Hagander

Oleg Bartunov wrote:

On Sat, 2 Feb 2008, Gurjeet Singh wrote:


Hi All,

   I just noticed a minor bug in our search results. Searching for
is_insteadbool in 8.3 docs returns the following page:

http://www.postgresql.org/docs/8.3/static/catalog-pg-rewrite.html

   is_instead is a column, and bool is the datatype, both mentioned in
different columns. I know it is based on postgres' own full text 
search, but

am not sure about the method how docs are read by the search engine. It
seems that the problem lies in the the way the doc was read and fed to 
the

index builder, because the initial search results show these two words
combined.

   Also, is it possible to teach our search engine to *not* treat _
(underscore) as a word separator? This would be great help and would 
result

much better results.



Sure, there are many ways to do this. Magnus ?


Which way would you recommend? You're the tsearch master ;-)

//Magnus

---(end of broadcast)---
TIP 9: In versions below 8.0, the planner will ignore your desire to
  choose an index scan if your joining column's datatypes do not
  match


[HACKERS] Problem with site doc search

2008-02-02 Thread Gurjeet Singh
Hi All,

I just noticed a minor bug in our search results. Searching for
is_insteadbool in 8.3 docs returns the following page:

http://www.postgresql.org/docs/8.3/static/catalog-pg-rewrite.html

is_instead is a column, and bool is the datatype, both mentioned in
different columns. I know it is based on postgres' own full text search, but
am not sure about the method how docs are read by the search engine. It
seems that the problem lies in the the way the doc was read and fed to the
index builder, because the initial search results show these two words
combined.

Also, is it possible to teach our search engine to *not* treat _
(underscore) as a word separator? This would be great help and would result
much better results.

Thanks and best regards,

-- 
[EMAIL PROTECTED]
[EMAIL PROTECTED] gmail | hotmail | indiatimes | yahoo }.com

EnterpriseDB  http://www.enterprisedb.com

17° 29' 34.37N,   78° 30' 59.76E - Hyderabad
18° 32' 57.25N,   73° 56' 25.42E - Pune
37° 47' 19.72N, 122° 24' 1.69 W - San Francisco *

http://gurjeet.frihost.net

Mail sent from my BlackLaptop device


Re: [HACKERS] Problem with site doc search

2008-02-02 Thread Oleg Bartunov

On Sat, 2 Feb 2008, Gurjeet Singh wrote:


Hi All,

   I just noticed a minor bug in our search results. Searching for
is_insteadbool in 8.3 docs returns the following page:

http://www.postgresql.org/docs/8.3/static/catalog-pg-rewrite.html

   is_instead is a column, and bool is the datatype, both mentioned in
different columns. I know it is based on postgres' own full text search, but
am not sure about the method how docs are read by the search engine. It
seems that the problem lies in the the way the doc was read and fed to the
index builder, because the initial search results show these two words
combined.

   Also, is it possible to teach our search engine to *not* treat _
(underscore) as a word separator? This would be great help and would result
much better results.



Sure, there are many ways to do this. Magnus ?


Thanks and best regards,




Regards,
Oleg
_
Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru),
Sternberg Astronomical Institute, Moscow University, Russia
Internet: [EMAIL PROTECTED], http://www.sai.msu.su/~megera/
phone: +007(495)939-16-83, +007(495)939-23-83

---(end of broadcast)---
TIP 6: explain analyze is your friend