two document fields one named positive one called negative
you query have to look somehow like this:
positive: (keyword1 keywordN) AND NOT negative:(keyword1 keywordN)
Am 23.10.2005 um 20:50 schrieb Sam Lee:
Yes, I thought of that. But since the ads have
negative keywords, it's very possible for the webpage
to match the ads but not the other way around because
of the negative keywords. So the system cannot be
sure that the ads match the webpage until it uses ads'
keyword and negative keywords to rematch the webpage.
This is a lot of resource for having each ad to match
the webpage again..
webpage www.mysite.com --match--> ad1.....ad101
Then I match each ad with the webpage.
But due to negative keywords:
ad1....ad100 --NOT match--> www.mysite.com
ad101 --match--> www.mysite.com
# of queries = 102
If there is a way to match content with boolean
keywords, then # of query is 1 only. Huge difference!
Any idea how to accomplish this?
--- Stefan Groschupf <[EMAIL PROTECTED]> wrote:
Index the keywords of your ads with lucene.
Extract all words from your page (ajax), remove stop
words, build a
query from the page words by connect the words with
OR and you will
find the best matching ad.
You may need to limit the words per page or set the
maximum clauses
to a much higher number.
HTH
Stefan
Am 23.10.2005 um 18:39 schrieb Sam Lee:
ok, I am implementing a google
adsense/adwords-like
system. For examples, the website has keywords
"nike
red shoe", so it can match text ad with keywords
"nike
shoe -blue". Of course, I can always use the text
ad
keywords to match the website's keywords. But it
will
take too much resource to have all ads to rematch
the
new websites whenever new websites joins the ad
network. So I need a way for the new websites to
"reverse match" the text ads.
So if new website has "nike red shoes" as
keywords,
then it should match all text ads with "nike shoes
-blue". The only difference is that it is doing
it in
reverse.
many thanks.
--- Erik Hatcher <[EMAIL PROTECTED]>
wrote:
Sam - I'm not quite sure I follow you, but let's
see
if this fits...
you want to have a document and see if a query
matches it? Please
elaborate more on what you're after. Maybe what
you're looking for
is the contrib/memory and the MemoryIndex within
that Subversion area.
Erik
On 22 Oct 2005, at 18:54, Sam Lee wrote:
Hi,
Normally, lucene or Nutch can match query
"nike
shoe
-blue" with "red nike shoe".
But what about matching "red nike shoe" with
query
"nike shoe -blue"? It is the other way around.
Can I
do it with a combinations of API?
Many thanks.
__________________________________________________
Do You Yahoo!?
Tired of spam? Yahoo! Mail has the best spam
protection around
http://mail.yahoo.com
---------------------------------------------------------------------
To unsubscribe, e-mail:
[EMAIL PROTECTED]
For additional commands, e-mail:
[EMAIL PROTECTED]
---------------------------------------------------------------------
To unsubscribe, e-mail:
[EMAIL PROTECTED]
For additional commands, e-mail:
[EMAIL PROTECTED]
__________________________________
Yahoo! Mail - PC Magazine Editors' Choice 2005
http://mail.yahoo.com
---------------------------------------------------------------------
To unsubscribe, e-mail:
[EMAIL PROTECTED]
For additional commands, e-mail:
[EMAIL PROTECTED]
---------------------------------------------------------------
company: http://www.media-style.com
forum: http://www.text-mining.org
blog: http://www.find23.net
__________________________________
Yahoo! Mail - PC Magazine Editors' Choice 2005
http://mail.yahoo.com
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
---------------------------------------------------------------
company: http://www.media-style.com
forum: http://www.text-mining.org
blog: http://www.find23.net