On Sat, Apr 6, 2013 at 5:19 PM, janI <j...@apache.org> wrote:

> On 6 April 2013 22:21, Rob Weir <robw...@apache.org> wrote:
>
> > On Sat, Apr 6, 2013 at 12:37 PM, janI <j...@apache.org> wrote:
> >
> > > On 6 April 2013 13:37, Rob Weir <robw...@apache.org> wrote:
> > >
> > > > On Sat, Apr 6, 2013 at 5:51 AM, janI <j...@apache.org> wrote:
> > > >
> > > > > Hi.
> > > > >
> > > > > Now that our wiki (mwiki) runs smoothly and performs quite well, it
> > is
> > > > > slowly time to take a look at our other services.
> > > > >
> > > > > In order to determine what should/can be done, I need some figures.
> > > > >
> > > > > Can anybody help me with traffic figures for
> > > > >    wiki.o.o, forum.o.o (total for the vm not per language) and
> > www.o.o
> > > > >
> > > > > Does end-users download directly from www.o.o or from a secondary
> > site
> > > ?
> > > > >
> > > > > I need the following numbers (if possible):
> > > > >    - daily number of clicks (average)
> > > > >    - peak number of clicks within a short period like 1 hour
> > > > >
> > > > >
> > > >
> > > > From Google Analytics we have:
> > > >
> > > > For wiki:  around 33K page views/day
> > > >
> > > > For forums, I have no idea.  They are using a Google Analytics
> account
> > (
> > > > UA-1087265-8) that is not connected with the rest of the website.  If
> > > that
> > > > could be updated to match the rest of the website we'd have typical
> > > numbers
> > > > in a week.
> > > >
> > >
> > > Thanks for your answers, but I am a bit scared you (as a PMC) tell me
> > > "they" use an account unknown to us. Who has the forum account ?
> > >
> > > I have the technical power to change it (sudo access) but if a PMC tell
> > we
> > > do no know who owns it, it would not be right just to change it. Who
> but
> > > the "owner" can authorize a change ?
> > >
> > > I trust this is a misunderstanding somewhere, it cannot be that we run
> a
> > > service, that we do not control, in the sense of change management.
> > >
> > >
> > I assume this is just a legacy account from Sun days.  The same GA code
> was
> > used for other OOo-associated web services as well.
> >
> > I raised this issue a year ago on this mailing list:
> >
> > http://markmail.org/thread/mtknhgrwjkajj2ux
> >
> > I'm assuming this is just a "lack of maintenance issue" and we can add it
> > to the list of other things that need attention.
> >
>
> Ok, I just wanted to be sure, when I come around to it, I will find the
> header page in forum and change. Sadly enough the setup of the forum vm is
> not "a piece of cake", it seems to have gone through quite a lot of changes
> (but it works and that is important).
>
>
> >
> > > For the website as a whole (all subdomains tracked) it is around
> 730K/day
> > > > for a peak day.  If you subtract out the forum traffic and the
> > downloads
> > > > (which serviced by SourceForge) then the static website is probably
> > more
> > > > like 600K/day visits.
> > > >
> > >
> > > Why do I need to subtract forum, I thought they ran as a separate dns ?
> > do
> > > we not count on basis of our template (that loads the GA account).
> > >
> > >
> >
> > Sorry, I meant "subtract out the wiki" traffic.  GA reports the total
> > traffic for all tracked domains, and then allows filtering by URL
> pattern.
> > So total traffic minus the /wiki/ traffic minus the download traffic
> gives
> > us the static website traffic.
> >
>
> ok, got it.
>
> >
> >
> > >
> > > >
> > > > So that indirectly answers your other question -- the big download
> > files
> > > > are from the SourceForge distribution network.  But we do have
> smaller
> > > > downloads, of documentation PDF's.  These are mainly from the wiki.
> > > >
> > > > In terms of peaks, our biggest day usually Wednesday, lowest on
> Sunday.
> > > > The variation there around 200K/visits.
> > > >
> > >
> > > Super just what I need.
> > >
> > >
> > > >
> > > > Hourly peak is around 15K visits for all tracked subdomains together.
> > > >
> > >
> > > Do you know the division between mwiki, forum and www ?
> > >
> > >
> > I can't seem to filter on the GA screen that gives hourly detail, so I
> only
> > see the count cross-subdomain.
> >
> >
> > >
> > > > Note:  for all of these I'm talking about "pages", where a single
> page
> > > > could include several HTTP calls, for the HTML, the Javascript,
> images,
> > > > etc.  Since anyone with Javascript not enabled would not register,
> the
> > > > actual numbers are probably a little bit higher than this.
> > > >
> > >
> > > Good point, that is very important for a a traffic server, where e.g.
> > > javascripts would be cached.
> > >
> > >
> > We might be able to get the httpd logs from Infra, in which case we could
> > do more traditional traffic stats
>
> I have the httpd logs for forum and mwiki, but I miss a tool to make proper
> statistic. I have asked in the httpd channel, but no luck.
>
>
I use a commercial tool called WebLog Expert.  I'm happy to run a report on
the logs if they are accessible.

-Rob



>
> > .
> >
> >
> >
> > > I am trying to convince Infra, that we should have a common
> trafficserver
> > > for wiki and forum (and maybe www). Currently we use eresbus as proxy,
> > and
> > > all packets are forwarded to the ooo-wiki2-vm where that trafficserver
> > > filters cached pages. Forum goes direct to httpd, so every packet
> causes
> > > mysql activity. If we had a trafficserver in eresbus, we would for
> wiki2
> > > save loads of network traffic and for forum network traffic as well as
> > > mysql activity.
> > >
> > >
> > > >
> > > > In any case, one approach would be to enable GA for the Forums, track
> > > that
> > > > for a week or two and then compare that to the Wiki numbers.  If the
> > > stacks
> > > > are similar, then we could think of the traffic ratio between the two
> > and
> > > > tune based on that.
> > > >
> > >
> > > I agree, but not being PMC, I do not feel I am in the position to
> "just"
> > > make that change. Who "owns" the forum, they should authorize such a
> > > change. I can make the change (provided you tell give me the GA line to
> > > insert) when I am sure I am not doing a change that someone else depend
> > on.
> > >
> > >
> > I've certainly raised the issue on the list previously and there were no
> > objections.   If you have access let me know offline and I can give you
> the
> > codes.
> >
> > Note that this root page is not in the Apache CMS.  It doesn't look
> > phpBB-related so there might be a static page out there on that host that
> > would need to be updated manually:  http://forum.openoffice.org/
> >
>
> thx for the info.
>
> rgds
> Jan I.
>
> >
> >
> > > >
> > > >
> > > > > I would like to tune at least forum before we release 4.0 to
> prepare
> > > for
> > > > a
> > > > > higher load. Tuning of www depend on a discussion with infra,
> because
> > > it
> > > > is
> > > > > a shared service.
> > > > >
> > > > >
> > > > If it is like the release of 3.4.0 and 3.4.1 we'll get a few hot
> pages:
> > > >
> > > > 1) The announcement blog post.  With both 3.4.0 and 3.4.1 releases
> > these
> > > > were heavily linked to by the press.  Roller went down in the 3.4.1
> > case.
> > > > I don't know if that was due to the traffic or not.  But we might
> want
> > to
> > > > plan on having the announcement post be a static webpage just to be
> > sure.
> > > >
> > >
> > > The blog post is on the general apache server and not our www, is that
> > > correct ?
> > >
> > >
> > Correct.
> >
> >
> > > >
> > > > 2) The homepage and http://www.openoffice.org/download page will be
> > hot.
> > > > You can see from the download chart the boost we get when a new
> release
> > > > occurs:  http://www.openoffice.org/stats/downloads.html  (AOO 3.4.0
> > was
> > > > May
> > > > 8th 2012 and AOO 3.4.1 was Aug 23rd).
> > > >
> > >
> > > That is a real obvious case for trafficserver, and until it is in
> place I
> > > can ask to have the httpd use memcached (which it might be already).
> > > Currently the www is on a general apache server, so I need strong
> > arguments
> > > to make changes (high load is a strong argument)
> > >
> > >
> > > >
> > > >
> > > >
> > > >
> > > > > Translate-vm also needs tunings, but that is in progress, and infra
> > has
> > > > > been very kind and prepared an extra vm, so we/I install/tune
> without
> > > > > affecting the active system.
> > > > >
> > > > > thx in advance for figures.
> > > > > rgds
> > > > > jan I.
> > > >
> > >
> > > THANKS a lot for your answer, which help me a lot doing calc.. Sorry
> for
> > > being frank about the forum, but it really make me nervous that we
> have a
> > > service where part it unknown.
> > >
> > > rgds
> > > Jan I
> > >
> > > > >
> > > >
> > >
> >
>

Reply via email to