Okay, here's a Lua filter and accompanying TOML config that should work for you:

https://gist.github.com/rafrombrc/3858a7885e766fbefa61

Some notes:

* This should *replace* all of the 'nginx_access_stat' and 'nginx_access_stats' related sections in your original config.

* Depending on how large your stat set grows, you may bump up against some of the sandbox limits. You can raise these with the memory_limit, instruction_limit, or output_limit settings in the SandboxFilter config (see http://hekad.readthedocs.org/en/v0.9.1/config/common_sandbox_parameter.html#config-common-sandbox-parameters).

* I set up a LogOutput to capture any heka.sandbox-terminated messages. This way if the filter crashes you'll have some console output explaining why. The DashboardOutput shows this automatically, but I know you're running w/o a dashboard until we get #1422 resolved.

* It occurred to me while writing this that the bulk of the logic in the 'timer_event' function should move into a Lua module that we ship with Heka. Then generating all of the various counter and timer values from the input stats data would just be a matter of calling a few functions, which would make similar filters trivial to develop in the future.

* Hopefully this is all clear enough, and a solid enough start, that you can further customize and tailor it as your needs change.
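
For reference, the sandbox limits and the LogOutput mentioned in the notes above might look roughly like this in the TOML. This is only a sketch: the section names, filename, message matcher for the filter, ticker interval, choice of encoder, and all of the limit values are placeholders I'm assuming for illustration, not the contents of the gist, so check them against the docs for your Heka version.

```toml
# Hypothetical filter section; the name, filename, and matcher are placeholders.
[NginxStatsFilter]
type = "SandboxFilter"
filename = "lua_filters/nginx_stats.lua"
message_matcher = "Type == 'nginx.access'"
ticker_interval = 10

# Illustrative values only. Raise whichever limit the
# heka.sandbox-terminated message says was exceeded.
memory_limit = 8388608       # bytes the sandbox may consume
instruction_limit = 1000000  # instructions allowed per function call
output_limit = 65536         # bytes in the sandbox output buffer

# Encoder for the LogOutput below (assumed choice; any encoder should do).
[RstEncoder]

# Print sandbox termination messages to the console so a crashed
# filter leaves some explanation behind.
[LogOutput]
message_matcher = "Type == 'heka.sandbox-terminated'"
encoder = "RstEncoder"
```

If the filter does get terminated, the payload of the message printed by the LogOutput should say which limit was hit, and that's the setting to raise.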

Try this out and let me know how it goes... hopefully you'll see much better throughput.

-r


On 04/01/2015 02:46 AM, Alex Muller wrote:
(In reply to https://mail.mozilla.org/pipermail/heka/2015-March/000388.html)

Hi Rob,

Thanks for your email a couple of weeks ago with some suggestions for
alleviating issues with Heka that we've seen.

 > The easy solution is to remove the plugin_chansize change,
 > and to bump your global pool_size setting up to about 130.

We've had this running for about a week now. The updated output from
sending a SIGUSR1 to the hekad process looks like this:

https://gist.github.com/alexmuller/73f824184f92adcc56d6

Unfortunately our machines which see most of our traffic (our cache nodes)
still can't keep up.

 > This would probably be about 30 minutes of work, if you're interested in
 > trying it out I'd be happy to whip something up for you to try out.

If you're willing to give this a go we'd be very happy to run it and see
what happens.

Cheers,

Alex



_______________________________________________
Heka mailing list
[email protected]
https://mail.mozilla.org/listinfo/heka
