Let me qualify this - ad banner rotation is dealt with - I'm looking for
something that will use our Nutch engine to serve up relevant links from
people who pay for that privilege. We do not want to serve up ad's from
someone else's system i.e. the big G or Y, but use our own Nutch search
results to serve up relevant paying links that we have sold and
maintain. In a simple relational SQL world we would add a flag and
another table with the links and scores and look that up and pass back
when needed. Problem with that is that we lose the whole multi word
scoring capability in Nutch i.e. pizza beer Chicago, should serve up a
Chicago pizza ad first and beer ads further down, just like our search
results have relevancy (not a great example but you get the idea).
Re-writing a scoring engine to do that in SQL seems like a waste when
Nutch already does it just fine.
So in a nutshell - we need to do what the big G and Y and other do when
serving up key word based sponsor links. My thought - automate the
build of a dummy page with the key words bought that would be indexed
and served up just like regular crawled and indexed pages, using the
scoring to rank them in terms of relevancy and placement - I have not
seen any snippets of code to do simple insert/update/delete operations
on a Nutch segment or index however....
This is the idea gathering phase - think like a school/college search
engine with local paying advertisers - we want to serve those links up
to the searchers to help offset the cost of the service and serve up or
flag links that rank first because of payment followed by normal search
link results....
rp
Sean Dean wrote:
I might be totally off base with what your asking to do, but take a look at
this open source project: http://phpadsnew.com/two/.
Its basically an advertising engine, built on PHP. Integration within any application is a breeze, and it supports external advertising such as Google Ads.
Sean
----- Original Message ----
From: RP <[EMAIL PROTECTED]>
To: [email protected]
Sent: Tuesday, December 19, 2006 10:52:56 AM
Subject: How best to add "sponsored link" support..??
Hi all,
I've been tasked with looking into this and am not a coder - that said,
Nutch is doing great and the bean counters have asked me to look into
adding sponsored link results and I'm wondering how best to add this.
It would be nice to utilize the Nutch engine to come up with the pages
versus just doing a lookup on words and results in a flat file but the
key word data could change daily (hourly) and would need to be able to
be hand entered (or automated) as people sign up (re-index is not really
an option). I'm not sure this would fly within the main Nutch segments
and index, but I could see maybe a separate index or possibly adding a
flag to the existing data but I've not seen any easy to use tools to
change/update/insert records into what is already there (yes Luke on the
index but that does not touch the segment data, right?). I don't want
to change existing searched data and I don't see an issue with having
duplicate results (sponsored up top and existing entry down below
somewhere) but it would be more elegant to not have that occur. I also
see issues in a simple flat file look up as a multiple word search is
best handled inside Nutch to "score" the results versus having to do
something similar in the sponsored results. I can see the need to
control the summary text displayed and also pass thru any codes in the
URL which are currently being stripped during the main crawl/index
cycle. I also see issues with seriously customizing the internals as
they would have to be maintained as Nutch itself is updated....
If anyone has looked at this and has at least some ideas on how best to
do this let me know. I need to come up with a preliminary estimate
before I can engage and pay the coders to make this happen so if there
are any easy or "best practices" ways on doing this any help/pointers
would be appreciated....