I have this work completed in my workspace and I am almost ready to check it in, so I'd like to ask for comments/opinions/objections one more time.

Proposal is here ...

http://rollerweblogger.org/wiki/Wiki.jsp?page=Proposal_IndependentHitCounts

-- Allen


Allen Gilliland wrote:
Anyone have any other comments/opinions about doing independent hit count tracking?

-- Allen


Allen Gilliland wrote:


Elias Torres wrote:

Allen Gilliland wrote:

Elias Torres wrote:
I'm very much for the motivation behind this effort but I think we need to think this about it a little bit more. At first I was imagining that *both* referrers and hits would be tracked externally, but this proposal only solves half my problem. I'd support this if we did both at the same
time but we left this for 3.2 unless it was absolutely guaranteed you
could finish it off by the end of the week.
Asynchronous processing of referrers was implemented a while ago. It is
controlled by these 3 settings in the roller config file ...

referrers.asyncProcessing.enabled=false
referrers.queue.numWorkers=3
referrers.queue.sleepTime=10

So you would have to enable it, and then you can choose how many threads
run to process the incoming queue and how long each thread should sleep
(in seconds) between runs.  Each "run" processes any referrers in the
queue until the queue is empty.

I'll warn you though, AFAIK it works fine and we used it on
blogs.sun.com for a short while with no real problems, except that the
workers really couldn't keep up with the queue.  I think I had the
workers up to around 25 before I decided that it wasn't really worth it
anymore and that what I really wanted was to disable the referrer
processing all together.

So, if this proposal goes through then you would be able to control both
hit counting and referrer tracking independently of each other.  i.e.
enable/disable either or both of them, as well as control a bit how they
work.


good. In a large installation I don't want to do neither referral or hit
count tracking. However, I'm not sure if we can today disable referral
and do it externally because don't our queries assume that the table
contains raw data as opposed to summarized data? I would think we should
rethink our built-in referral think to assume massaged data as in the
tagging case for performance and then we can externalize it. Also, not
sure if your proposal for *just* dailyhits is enough. Maybe it is for
the one sidebar that uses this info today, but I'd think we would like
to keep track of a week's or months number of hits. Am I thinking the
wrong way?

I totally agree. I've been wanting to do something better with the referrer tracking for a while, but there were more important things to worry about first.

I would agree that having the summarized data makes more sense for performance, but I think that part of the way the referrers are used is like a comment which is gathered via linkback extraction. If it was only used for tracking hits from external sources then I don't see the benefit at all, any stats package in the world would give you better metrics for something like that. So I'm not sure that you could really do summarized work for the referrers stuff, it's currently setup to expect a single row per referrer :/

I also agree that just doing daily hits doesn't make a whole ton of sense and I had tried to think of a few ways to add some more interesting counts in there, but opted for speed in getting this done rather than adding new features. We could certainly add weekly and monthly counts in there, but they would be restricted the same way that daily is now, eventually it has to be cleared because it's not meant for archiving. I had also considered the idea of trying to maintain an average # of hits for a weblog so that you could somehow identify if that weblog had gotten extra hits on a given day and could then be identified as a hot weblog that day, even though relatively speaking it may not get as hit of a hit count as many other blogs in the system. i.e. basing some of the hot blogs off of % increase in traffic for a day, rather than purely off of raw hit counts.

In any case, it's an open proposal, so we can design it to do whatever we think would be useful.

-- Allen



BTW, is the task locking work completed? I have a couple of tasks I want
to develop and I'm dependent on that feature. Please let me know when
it's ready to be tested. Thanks.
Yes, it should be working.  I still want to do some more testing on it,
but I'd be glad to get some help with that :) You should be able to add
new tasks simply by extending the new RollerTask class and then setting
the right config elements in the config file.

-- Allen


-Elias

Allen Gilliland wrote:
I have a new proposal for separating out the hit count tracking from the
existing referrer tracking system so that they can each function
independently.  This would hopefully happen for 3.1.

http://rollerweblogger.org/wiki/Wiki.jsp?page=Proposal_IndependentHitCounts


Please take a look at the proposal soon and response with any questions,
comments, objections, etc.  I plan to begin working on the
implementation very soon and would likely try and commit before the end
of the week.

-- Allen

Reply via email to