Re: [HACKERS] Problem with site doc search
Notice that : http://search.postgresql.org/search?q=tom+lanem=1l=d=1s=r and http://search.postgresql.org/search?q=tom+lanem=1l=d=1s=d do not provide same result (3 results by date, 1 by rank) even if only the sorting is changed. -- Cédric Villemain Administrateur de Base de Données Cel: +33 (0)6 74 15 56 53 http://dalibo.com - http://dalibo.org signature.asc Description: This is a digitally signed message part.
Re: [HACKERS] Problem with site doc search
Oleg Bartunov wrote: On Tue, 15 Apr 2008, Magnus Hagander wrote: I didn't do anything, but possibly it got fixed by a different upgrade at some point, and the recrawling of the sites. Magnus, we have parser for indexing pgdocs, do you need it ? Yes, please! //Magnus -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] Problem with site doc search
Cédric Villemain wrote: Notice that : http://search.postgresql.org/search?q=tom+lanem=1l=d=1s=r and http://search.postgresql.org/search?q=tom+lanem=1l=d=1s=d do not provide same result (3 results by date, 1 by rank) even if only the sorting is changed. Actually, I get 5 and 7, in the other order. The reason for this is that Tom Lane is way too active. It's gin_fuzzy_search_limit that's doing the restriction first, and the date restriction comes in later. //Magnus -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] Problem with site doc search
Le Wednesday 16 April 2008, Magnus Hagander a écrit : Cédric Villemain wrote: Notice that : http://search.postgresql.org/search?q=tom+lanem=1l=d=1s=r and http://search.postgresql.org/search?q=tom+lanem=1l=d=1s=d do not provide same result (3 results by date, 1 by rank) even if only the sorting is changed. Actually, I get 5 and 7, in the other order. The reason for this is that Tom Lane is way too active. It's gin_fuzzy_search_limit that's doing the restriction first, and the date restriction comes in later. Yes, you are perfectly right. Can I suggest to deactivate gin_fuzzy_search_limit (or increase the value) when one condition (and use the condition earlier) can considerably reduce the number of results (like the 'post date' here) ? //Magnus -- Cédric Villemain Administrateur de Base de Données Cel: +33 (0)6 74 15 56 53 http://dalibo.com - http://dalibo.org signature.asc Description: This is a digitally signed message part.
Re: [HACKERS] Problem with site doc search
Cédric Villemain wrote: Le Wednesday 16 April 2008, Magnus Hagander a écrit : Cédric Villemain wrote: Notice that : http://search.postgresql.org/search?q=tom+lanem=1l=d=1s=r and http://search.postgresql.org/search?q=tom+lanem=1l=d=1s=d do not provide same result (3 results by date, 1 by rank) even if only the sorting is changed. Actually, I get 5 and 7, in the other order. The reason for this is that Tom Lane is way too active. It's gin_fuzzy_search_limit that's doing the restriction first, and the date restriction comes in later. Yes, you are perfectly right. Can I suggest to deactivate gin_fuzzy_search_limit (or increase the value) when one condition (and use the condition earlier) can considerably reduce the number of results (like the 'post date' here) ? You'd have to convince the planner to actually not use an indexscan at all on the tsvector. Normally it'll choose an index scan on each and then a bitmap join, and we don't want to bring back so many rows... //Magnus -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] Problem with site doc search
I just searched for pg_standby and it looks like it is fixed now. --- Gurjeet Singh wrote: On Tue, Mar 25, 2008 at 3:01 AM, Magnus Hagander [EMAIL PROTECTED] wrote: Gurjeet Singh wrote: On Thu, Mar 6, 2008 at 5:01 PM, Gevik Babakhani [EMAIL PROTECTED] mailto:[EMAIL PROTECTED] wrote: No, it's still on the TODO. Gevik has also been looking a bit at it (I think - at least he's indicated that he is), and he recently got some new parser code to look at to see if we can use to fix it. I have the new parser code. Next week I have some time reserved to look at it. For the last few days I have been noticing that the _ (underscore) is no longer being treated as a word boundary. Can you confirm this has been fixed? AFAIK, nothing has been done yet :-) You are right I was looking for pg_standby and another pg_something and since I found those results on top, I thought there was some action taken. The problem persists Best regards, -- [EMAIL PROTECTED] [EMAIL PROTECTED] gmail | hotmail | indiatimes | yahoo }.com EnterpriseDB http://www.enterprisedb.com Mail sent from my BlackLaptop device -- Bruce Momjian [EMAIL PROTECTED]http://momjian.us EnterpriseDB http://enterprisedb.com + If your life is a hard drive, Christ can be your backup. + -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] Problem with site doc search
I didn't do anything, but possibly it got fixed by a different upgrade at some point, and the recrawling of the sites. //Magnus Bruce Momjian wrote: I just searched for pg_standby and it looks like it is fixed now. --- Gurjeet Singh wrote: On Tue, Mar 25, 2008 at 3:01 AM, Magnus Hagander [EMAIL PROTECTED] wrote: Gurjeet Singh wrote: On Thu, Mar 6, 2008 at 5:01 PM, Gevik Babakhani [EMAIL PROTECTED] mailto:[EMAIL PROTECTED] wrote: No, it's still on the TODO. Gevik has also been looking a bit at it (I think - at least he's indicated that he is), and he recently got some new parser code to look at to see if we can use to fix it. I have the new parser code. Next week I have some time reserved to look at it. For the last few days I have been noticing that the _ (underscore) is no longer being treated as a word boundary. Can you confirm this has been fixed? AFAIK, nothing has been done yet :-) You are right I was looking for pg_standby and another pg_something and since I found those results on top, I thought there was some action taken. The problem persists Best regards, -- [EMAIL PROTECTED] [EMAIL PROTECTED] gmail | hotmail | indiatimes | yahoo }.com EnterpriseDB http://www.enterprisedb.com Mail sent from my BlackLaptop device -- Bruce Momjian [EMAIL PROTECTED]http://momjian.us EnterpriseDB http://enterprisedb.com + If your life is a hard drive, Christ can be your backup. + -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] Problem with site doc search
Magnus Hagander wrote: I didn't do anything, but possibly it got fixed by a different upgrade at some point, and the recrawling of the sites. Oops, maybe it isn't fixed. I tried pg_standby and it seemed to work but pg and standby returns the same results. Is that correct? How do I test this? --- //Magnus Bruce Momjian wrote: I just searched for pg_standby and it looks like it is fixed now. --- Gurjeet Singh wrote: On Tue, Mar 25, 2008 at 3:01 AM, Magnus Hagander [EMAIL PROTECTED] wrote: Gurjeet Singh wrote: On Thu, Mar 6, 2008 at 5:01 PM, Gevik Babakhani [EMAIL PROTECTED] mailto:[EMAIL PROTECTED] wrote: No, it's still on the TODO. Gevik has also been looking a bit at it (I think - at least he's indicated that he is), and he recently got some new parser code to look at to see if we can use to fix it. I have the new parser code. Next week I have some time reserved to look at it. For the last few days I have been noticing that the _ (underscore) is no longer being treated as a word boundary. Can you confirm this has been fixed? AFAIK, nothing has been done yet :-) You are right I was looking for pg_standby and another pg_something and since I found those results on top, I thought there was some action taken. The problem persists Best regards, -- [EMAIL PROTECTED] [EMAIL PROTECTED] gmail | hotmail | indiatimes | yahoo }.com EnterpriseDB http://www.enterprisedb.com Mail sent from my BlackLaptop device -- Bruce Momjian [EMAIL PROTECTED]http://momjian.us EnterpriseDB http://enterprisedb.com + If your life is a hard drive, Christ can be your backup. + -- Bruce Momjian [EMAIL PROTECTED]http://momjian.us EnterpriseDB http://enterprisedb.com + If your life is a hard drive, Christ can be your backup. + -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] Problem with site doc search
Bruce Momjian wrote: Magnus Hagander wrote: I didn't do anything, but possibly it got fixed by a different upgrade at some point, and the recrawling of the sites. Oops, maybe it isn't fixed. I tried pg_standby and it seemed to work but pg and standby returns the same results. Is that correct? How do I test this? The default is to split words on underscore, so it's probably doing what it always did. Try to_tsquery and you should see it matching tsquery (probably to is a stopword). I did put together a custom parser that allowed underscore in words, but given my extensive C experience in the last decade (one tsearch parser) you don't want to just plug that into the live site. Someone (Gevik?) was going to have a look at it when they had the time, but I'd guess that's the one thing none of us have much of. -- Richard Huxton Archonet Ltd -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] Problem with site doc search
Richard Huxton wrote: Bruce Momjian wrote: Magnus Hagander wrote: I didn't do anything, but possibly it got fixed by a different upgrade at some point, and the recrawling of the sites. Oops, maybe it isn't fixed. I tried pg_standby and it seemed to work but pg and standby returns the same results. Is that correct? How do I test this? The default is to split words on underscore, so it's probably doing what it always did. Try to_tsquery and you should see it matching tsquery (probably to is a stopword). I did put together a custom parser that allowed underscore in words, but given my extensive C experience in the last decade (one tsearch parser) you don't want to just plug that into the live site. Someone (Gevik?) was going to have a look at it when they had the time, but I'd guess that's the one thing none of us have much of. Yeah. Did you ever post the code to anybody other than Gevik? If not, please send it to pgsql-www and someone can give it a quick look-over (perhaps Oleg can help us there?) //Magnus -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] Problem with site doc search
Magnus Hagander wrote: Did you ever post the code to anybody other than Gevik? If not, please send it to pgsql-www and someone can give it a quick look-over (perhaps Oleg can help us there?) Will do. -- Richard Huxton Archonet Ltd -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] Problem with site doc search
On Tue, 15 Apr 2008, Magnus Hagander wrote: I didn't do anything, but possibly it got fixed by a different upgrade at some point, and the recrawling of the sites. Magnus, we have parser for indexing pgdocs, do you need it ? //Magnus Bruce Momjian wrote: I just searched for pg_standby and it looks like it is fixed now. --- Gurjeet Singh wrote: On Tue, Mar 25, 2008 at 3:01 AM, Magnus Hagander [EMAIL PROTECTED] wrote: Gurjeet Singh wrote: On Thu, Mar 6, 2008 at 5:01 PM, Gevik Babakhani [EMAIL PROTECTED] mailto:[EMAIL PROTECTED] wrote: No, it's still on the TODO. Gevik has also been looking a bit at it (I think - at least he's indicated that he is), and he recently got some new parser code to look at to see if we can use to fix it. I have the new parser code. Next week I have some time reserved to look at it. For the last few days I have been noticing that the _ (underscore) is no longer being treated as a word boundary. Can you confirm this has been fixed? AFAIK, nothing has been done yet :-) You are right I was looking for pg_standby and another pg_something and since I found those results on top, I thought there was some action taken. The problem persists Best regards, -- [EMAIL PROTECTED] [EMAIL PROTECTED] gmail | hotmail | indiatimes | yahoo }.com EnterpriseDB http://www.enterprisedb.com Mail sent from my BlackLaptop device -- Bruce Momjian [EMAIL PROTECTED]http://momjian.us EnterpriseDB http://enterprisedb.com + If your life is a hard drive, Christ can be your backup. + Regards, Oleg _ Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru), Sternberg Astronomical Institute, Moscow University, Russia Internet: [EMAIL PROTECTED], http://www.sai.msu.su/~megera/ phone: +007(495)939-16-83, +007(495)939-23-83 -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] Problem with site doc search
Sergey Karpov prepared contrib/extend_parser, which we intend to use for indexing pg-related documents. It handles '_' properly, so if anybody interested, we could post it. Also, it can be useful for playing, since it's standalone contrib module. Oleg On Tue, 15 Apr 2008, Richard Huxton wrote: Bruce Momjian wrote: Magnus Hagander wrote: I didn't do anything, but possibly it got fixed by a different upgrade at some point, and the recrawling of the sites. Oops, maybe it isn't fixed. I tried pg_standby and it seemed to work but pg and standby returns the same results. Is that correct? How do I test this? The default is to split words on underscore, so it's probably doing what it always did. Try to_tsquery and you should see it matching tsquery (probably to is a stopword). I did put together a custom parser that allowed underscore in words, but given my extensive C experience in the last decade (one tsearch parser) you don't want to just plug that into the live site. Someone (Gevik?) was going to have a look at it when they had the time, but I'd guess that's the one thing none of us have much of. Regards, Oleg _ Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru), Sternberg Astronomical Institute, Moscow University, Russia Internet: [EMAIL PROTECTED], http://www.sai.msu.su/~megera/ phone: +007(495)939-16-83, +007(495)939-23-83 -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] Problem with site doc search
Oleg Bartunov wrote: Sergey Karpov prepared contrib/extend_parser, which we intend to use for indexing pg-related documents. It handles '_' properly, so if anybody interested, we could post it. Also, it can be useful for playing, since it's standalone contrib module. Does it make sense to back-patch the default parser for 8.4? At present, it can't handle underscores in file-paths. -- Richard Huxton Archonet Ltd -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] Problem with site doc search
On Tue, Mar 25, 2008 at 3:01 AM, Magnus Hagander [EMAIL PROTECTED] wrote: Gurjeet Singh wrote: On Thu, Mar 6, 2008 at 5:01 PM, Gevik Babakhani [EMAIL PROTECTED] mailto:[EMAIL PROTECTED] wrote: No, it's still on the TODO. Gevik has also been looking a bit at it (I think - at least he's indicated that he is), and he recently got some new parser code to look at to see if we can use to fix it. I have the new parser code. Next week I have some time reserved to look at it. For the last few days I have been noticing that the _ (underscore) is no longer being treated as a word boundary. Can you confirm this has been fixed? AFAIK, nothing has been done yet :-) You are right I was looking for pg_standby and another pg_something and since I found those results on top, I thought there was some action taken. The problem persists Best regards, -- [EMAIL PROTECTED] [EMAIL PROTECTED] gmail | hotmail | indiatimes | yahoo }.com EnterpriseDB http://www.enterprisedb.com Mail sent from my BlackLaptop device
Re: [HACKERS] Problem with site doc search
On Thu, Mar 6, 2008 at 5:01 PM, Gevik Babakhani [EMAIL PROTECTED] wrote: No, it's still on the TODO. Gevik has also been looking a bit at it (I think - at least he's indicated that he is), and he recently got some new parser code to look at to see if we can use to fix it. I have the new parser code. Next week I have some time reserved to look at it. For the last few days I have been noticing that the _ (underscore) is no longer being treated as a word boundary. Can you confirm this has been fixed? Best regards, -- [EMAIL PROTECTED] [EMAIL PROTECTED] gmail | hotmail | indiatimes | yahoo }.com EnterpriseDB http://www.enterprisedb.com Mail sent from my BlackLaptop device
Re: [HACKERS] Problem with site doc search
Gurjeet Singh wrote: On Thu, Mar 6, 2008 at 5:01 PM, Gevik Babakhani [EMAIL PROTECTED] mailto:[EMAIL PROTECTED] wrote: No, it's still on the TODO. Gevik has also been looking a bit at it (I think - at least he's indicated that he is), and he recently got some new parser code to look at to see if we can use to fix it. I have the new parser code. Next week I have some time reserved to look at it. For the last few days I have been noticing that the _ (underscore) is no longer being treated as a word boundary. Can you confirm this has been fixed? AFAIK, nothing has been done yet :-) //Magnus - Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers
Re: [HACKERS] Problem with site doc search
No, it's still on the TODO. Gevik has also been looking a bit at it (I think - at least he's indicated that he is), and he recently got some nwe parser code to look at to see if we can use to fix it. //Magnus On Wed, Mar 05, 2008 at 10:23:28PM -0500, Bruce Momjian wrote: Has this been addressed? --- Oleg Bartunov wrote: On Tue, 5 Feb 2008, Magnus Hagander wrote: No. It's on the list, but other things around the release haev priority. I just returned from my Europe trip and have many things to do :) //Magnus On Mon, Feb 04, 2008 at 06:43:09PM -0800, Gurjeet Singh wrote: Hi guys any updates on this? Pinging you just so that we do not forget it in the heap of mails in our inboxes. Best regards, On Feb 3, 2008 8:40 AM, Magnus Hagander [EMAIL PROTECTED] wrote: Oleg Bartunov wrote: On Sat, 2 Feb 2008, Gurjeet Singh wrote: Hi All, I just noticed a minor bug in our search results. Searching for is_insteadbool in 8.3 docs returns the following page: http://www.postgresql.org/docs/8.3/static/catalog-pg-rewrite.html is_instead is a column, and bool is the datatype, both mentioned in different columns. I know it is based on postgres' own full text search, but am not sure about the method how docs are read by the search engine. It seems that the problem lies in the the way the doc was read and fed to the index builder, because the initial search results show these two words combined. Also, is it possible to teach our search engine to *not* treat _ (underscore) as a word separator? This would be great help and would result much better results. Sure, there are many ways to do this. Magnus ? Which way would you recommend? You're the tsearch master ;-) //Magnus -- [EMAIL PROTECTED] [EMAIL PROTECTED] gmail | hotmail | indiatimes | yahoo }.com EnterpriseDB http://www.enterprisedb.com 17? 29' 34.37N, 78? 30' 59.76E - Hyderabad 18? 32' 57.25N, 73? 56' 25.42E - Pune 37? 47' 19.72N, 122? 24' 1.69 W - San Francisco * http://gurjeet.frihost.net Mail sent from my BlackLaptop device ---(end of broadcast)--- TIP 7: You can help support the PostgreSQL project by donating at http://www.postgresql.org/about/donate Regards, Oleg _ Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru), Sternberg Astronomical Institute, Moscow University, Russia Internet: [EMAIL PROTECTED], http://www.sai.msu.su/~megera/ phone: +007(495)939-16-83, +007(495)939-23-83 ---(end of broadcast)--- TIP 1: if posting/reading through Usenet, please send an appropriate subscribe-nomail command to [EMAIL PROTECTED] so that your message can get through to the mailing list cleanly -- Bruce Momjian [EMAIL PROTECTED]http://momjian.us EnterpriseDB http://postgres.enterprisedb.com + If your life is a hard drive, Christ can be your backup. + -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://mail.postgresql.org/mj/mj_wwwusr?domain=postgresql.orgextra=pgsql-hackers
Re: [HACKERS] Problem with site doc search
No, it's still on the TODO. Gevik has also been looking a bit at it (I think - at least he's indicated that he is), and he recently got some new parser code to look at to see if we can use to fix it. I have the new parser code. Next week I have some time reserved to look at it. -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://mail.postgresql.org/mj/mj_wwwusr?domain=postgresql.orgextra=pgsql-hackers
Re: [HACKERS] Problem with site doc search
Has this been addressed? --- Oleg Bartunov wrote: On Tue, 5 Feb 2008, Magnus Hagander wrote: No. It's on the list, but other things around the release haev priority. I just returned from my Europe trip and have many things to do :) //Magnus On Mon, Feb 04, 2008 at 06:43:09PM -0800, Gurjeet Singh wrote: Hi guys any updates on this? Pinging you just so that we do not forget it in the heap of mails in our inboxes. Best regards, On Feb 3, 2008 8:40 AM, Magnus Hagander [EMAIL PROTECTED] wrote: Oleg Bartunov wrote: On Sat, 2 Feb 2008, Gurjeet Singh wrote: Hi All, I just noticed a minor bug in our search results. Searching for is_insteadbool in 8.3 docs returns the following page: http://www.postgresql.org/docs/8.3/static/catalog-pg-rewrite.html is_instead is a column, and bool is the datatype, both mentioned in different columns. I know it is based on postgres' own full text search, but am not sure about the method how docs are read by the search engine. It seems that the problem lies in the the way the doc was read and fed to the index builder, because the initial search results show these two words combined. Also, is it possible to teach our search engine to *not* treat _ (underscore) as a word separator? This would be great help and would result much better results. Sure, there are many ways to do this. Magnus ? Which way would you recommend? You're the tsearch master ;-) //Magnus -- [EMAIL PROTECTED] [EMAIL PROTECTED] gmail | hotmail | indiatimes | yahoo }.com EnterpriseDB http://www.enterprisedb.com 17? 29' 34.37N, 78? 30' 59.76E - Hyderabad 18? 32' 57.25N, 73? 56' 25.42E - Pune 37? 47' 19.72N, 122? 24' 1.69 W - San Francisco * http://gurjeet.frihost.net Mail sent from my BlackLaptop device ---(end of broadcast)--- TIP 7: You can help support the PostgreSQL project by donating at http://www.postgresql.org/about/donate Regards, Oleg _ Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru), Sternberg Astronomical Institute, Moscow University, Russia Internet: [EMAIL PROTECTED], http://www.sai.msu.su/~megera/ phone: +007(495)939-16-83, +007(495)939-23-83 ---(end of broadcast)--- TIP 1: if posting/reading through Usenet, please send an appropriate subscribe-nomail command to [EMAIL PROTECTED] so that your message can get through to the mailing list cleanly -- Bruce Momjian [EMAIL PROTECTED]http://momjian.us EnterpriseDB http://postgres.enterprisedb.com + If your life is a hard drive, Christ can be your backup. + -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://mail.postgresql.org/mj/mj_wwwusr?domain=postgresql.orgextra=pgsql-hackers
Re: [HACKERS] Problem with site doc search
On Tue, 5 Feb 2008, Magnus Hagander wrote: No. It's on the list, but other things around the release haev priority. I just returned from my Europe trip and have many things to do :) //Magnus On Mon, Feb 04, 2008 at 06:43:09PM -0800, Gurjeet Singh wrote: Hi guys any updates on this? Pinging you just so that we do not forget it in the heap of mails in our inboxes. Best regards, On Feb 3, 2008 8:40 AM, Magnus Hagander [EMAIL PROTECTED] wrote: Oleg Bartunov wrote: On Sat, 2 Feb 2008, Gurjeet Singh wrote: Hi All, I just noticed a minor bug in our search results. Searching for is_insteadbool in 8.3 docs returns the following page: http://www.postgresql.org/docs/8.3/static/catalog-pg-rewrite.html is_instead is a column, and bool is the datatype, both mentioned in different columns. I know it is based on postgres' own full text search, but am not sure about the method how docs are read by the search engine. It seems that the problem lies in the the way the doc was read and fed to the index builder, because the initial search results show these two words combined. Also, is it possible to teach our search engine to *not* treat _ (underscore) as a word separator? This would be great help and would result much better results. Sure, there are many ways to do this. Magnus ? Which way would you recommend? You're the tsearch master ;-) //Magnus -- [EMAIL PROTECTED] [EMAIL PROTECTED] gmail | hotmail | indiatimes | yahoo }.com EnterpriseDB http://www.enterprisedb.com 17° 29' 34.37N, 78° 30' 59.76E - Hyderabad 18° 32' 57.25N, 73° 56' 25.42E - Pune 37° 47' 19.72N, 122° 24' 1.69 W - San Francisco * http://gurjeet.frihost.net Mail sent from my BlackLaptop device ---(end of broadcast)--- TIP 7: You can help support the PostgreSQL project by donating at http://www.postgresql.org/about/donate Regards, Oleg _ Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru), Sternberg Astronomical Institute, Moscow University, Russia Internet: [EMAIL PROTECTED], http://www.sai.msu.su/~megera/ phone: +007(495)939-16-83, +007(495)939-23-83 ---(end of broadcast)--- TIP 1: if posting/reading through Usenet, please send an appropriate subscribe-nomail command to [EMAIL PROTECTED] so that your message can get through to the mailing list cleanly
Re: [HACKERS] Problem with site doc search
No. It's on the list, but other things around the release haev priority. //Magnus On Mon, Feb 04, 2008 at 06:43:09PM -0800, Gurjeet Singh wrote: Hi guys any updates on this? Pinging you just so that we do not forget it in the heap of mails in our inboxes. Best regards, On Feb 3, 2008 8:40 AM, Magnus Hagander [EMAIL PROTECTED] wrote: Oleg Bartunov wrote: On Sat, 2 Feb 2008, Gurjeet Singh wrote: Hi All, I just noticed a minor bug in our search results. Searching for is_insteadbool in 8.3 docs returns the following page: http://www.postgresql.org/docs/8.3/static/catalog-pg-rewrite.html is_instead is a column, and bool is the datatype, both mentioned in different columns. I know it is based on postgres' own full text search, but am not sure about the method how docs are read by the search engine. It seems that the problem lies in the the way the doc was read and fed to the index builder, because the initial search results show these two words combined. Also, is it possible to teach our search engine to *not* treat _ (underscore) as a word separator? This would be great help and would result much better results. Sure, there are many ways to do this. Magnus ? Which way would you recommend? You're the tsearch master ;-) //Magnus -- [EMAIL PROTECTED] [EMAIL PROTECTED] gmail | hotmail | indiatimes | yahoo }.com EnterpriseDB http://www.enterprisedb.com 17° 29' 34.37N, 78° 30' 59.76E - Hyderabad 18° 32' 57.25N, 73° 56' 25.42E - Pune 37° 47' 19.72N, 122° 24' 1.69 W - San Francisco * http://gurjeet.frihost.net Mail sent from my BlackLaptop device ---(end of broadcast)--- TIP 7: You can help support the PostgreSQL project by donating at http://www.postgresql.org/about/donate
Re: [HACKERS] Problem with site doc search
Hi guys any updates on this? Pinging you just so that we do not forget it in the heap of mails in our inboxes. Best regards, On Feb 3, 2008 8:40 AM, Magnus Hagander [EMAIL PROTECTED] wrote: Oleg Bartunov wrote: On Sat, 2 Feb 2008, Gurjeet Singh wrote: Hi All, I just noticed a minor bug in our search results. Searching for is_insteadbool in 8.3 docs returns the following page: http://www.postgresql.org/docs/8.3/static/catalog-pg-rewrite.html is_instead is a column, and bool is the datatype, both mentioned in different columns. I know it is based on postgres' own full text search, but am not sure about the method how docs are read by the search engine. It seems that the problem lies in the the way the doc was read and fed to the index builder, because the initial search results show these two words combined. Also, is it possible to teach our search engine to *not* treat _ (underscore) as a word separator? This would be great help and would result much better results. Sure, there are many ways to do this. Magnus ? Which way would you recommend? You're the tsearch master ;-) //Magnus -- [EMAIL PROTECTED] [EMAIL PROTECTED] gmail | hotmail | indiatimes | yahoo }.com EnterpriseDB http://www.enterprisedb.com 17° 29' 34.37N, 78° 30' 59.76E - Hyderabad 18° 32' 57.25N, 73° 56' 25.42E - Pune 37° 47' 19.72N, 122° 24' 1.69 W - San Francisco * http://gurjeet.frihost.net Mail sent from my BlackLaptop device
Re: [HACKERS] Problem with site doc search
Oleg Bartunov wrote: On Sat, 2 Feb 2008, Gurjeet Singh wrote: Hi All, I just noticed a minor bug in our search results. Searching for is_insteadbool in 8.3 docs returns the following page: http://www.postgresql.org/docs/8.3/static/catalog-pg-rewrite.html is_instead is a column, and bool is the datatype, both mentioned in different columns. I know it is based on postgres' own full text search, but am not sure about the method how docs are read by the search engine. It seems that the problem lies in the the way the doc was read and fed to the index builder, because the initial search results show these two words combined. Also, is it possible to teach our search engine to *not* treat _ (underscore) as a word separator? This would be great help and would result much better results. Sure, there are many ways to do this. Magnus ? Which way would you recommend? You're the tsearch master ;-) //Magnus ---(end of broadcast)--- TIP 9: In versions below 8.0, the planner will ignore your desire to choose an index scan if your joining column's datatypes do not match
[HACKERS] Problem with site doc search
Hi All, I just noticed a minor bug in our search results. Searching for is_insteadbool in 8.3 docs returns the following page: http://www.postgresql.org/docs/8.3/static/catalog-pg-rewrite.html is_instead is a column, and bool is the datatype, both mentioned in different columns. I know it is based on postgres' own full text search, but am not sure about the method how docs are read by the search engine. It seems that the problem lies in the the way the doc was read and fed to the index builder, because the initial search results show these two words combined. Also, is it possible to teach our search engine to *not* treat _ (underscore) as a word separator? This would be great help and would result much better results. Thanks and best regards, -- [EMAIL PROTECTED] [EMAIL PROTECTED] gmail | hotmail | indiatimes | yahoo }.com EnterpriseDB http://www.enterprisedb.com 17° 29' 34.37N, 78° 30' 59.76E - Hyderabad 18° 32' 57.25N, 73° 56' 25.42E - Pune 37° 47' 19.72N, 122° 24' 1.69 W - San Francisco * http://gurjeet.frihost.net Mail sent from my BlackLaptop device
Re: [HACKERS] Problem with site doc search
On Sat, 2 Feb 2008, Gurjeet Singh wrote: Hi All, I just noticed a minor bug in our search results. Searching for is_insteadbool in 8.3 docs returns the following page: http://www.postgresql.org/docs/8.3/static/catalog-pg-rewrite.html is_instead is a column, and bool is the datatype, both mentioned in different columns. I know it is based on postgres' own full text search, but am not sure about the method how docs are read by the search engine. It seems that the problem lies in the the way the doc was read and fed to the index builder, because the initial search results show these two words combined. Also, is it possible to teach our search engine to *not* treat _ (underscore) as a word separator? This would be great help and would result much better results. Sure, there are many ways to do this. Magnus ? Thanks and best regards, Regards, Oleg _ Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru), Sternberg Astronomical Institute, Moscow University, Russia Internet: [EMAIL PROTECTED], http://www.sai.msu.su/~megera/ phone: +007(495)939-16-83, +007(495)939-23-83 ---(end of broadcast)--- TIP 6: explain analyze is your friend