[HACKERS] Re: [GENERAL] Text search parser's treatment of URLs and emails

2010-10-12 Thread Bruce Momjian
[ sent to hackers where it belongs ] Thom Brown wrote: Hi, I noticed that if I run this: SELECT alias, description, token FROM ts_debug('http://www.postgresql.org:2345/directory/page.html?version=9.1build=alpha1#summary'); I get: alias | description |

Re: [HACKERS] Re: [GENERAL] Text search parser's treatment of URLs and emails

2010-10-12 Thread Tom Lane
Bruce Momjian br...@momjian.us writes: [ sent to hackers where it belongs ] Thom Brown wrote: It could be me being picky, but I don't regard parameters or page fragments as part of the URL path. Wow, that is a tough one. One the one hand, it seems nice to be able to split stuff out more,

Re: [HACKERS] Re: [GENERAL] Text search parser's treatment of URLs and emails

2010-10-12 Thread Sushant Sinha
On Tue, 2010-10-12 at 19:31 -0400, Tom Lane wrote: This seems much of a piece with the existing proposal to allow individual words of a URL to be reported separately: https://commitfest.postgresql.org/action/patch_view?id=378 As I said in that thread, this could be done in a