Re: [rsyslog] how can a parser insert data into a message

david Thu, 10 Feb 2011 23:42:08 -0800

On Fri, 11 Feb 2011, Rainer Gerhards wrote:

-----Original Message-----
From: [email protected] [mailto:rsyslog-
[email protected]] On Behalf Of [email protected]
On Fri, 11 Feb 2011, Rainer Gerhards wrote:
Have a look at ./runtime/parser.c, function SanitizeMsg. It builds anew buffer and uses MsgSetRawMsg to set the new buffer. MsgSetRawMsghandles the "dirty" internals of message object buffer manipulation.
Note that it may be quicker to manipulate the buffer pointersyourself. But then you must be very careful. MsgSetRawMsg shouldprovide the necessary hints. The thing to keep on your mind is that upto a certain message length, a buffer is used from the msg objectitself (thus saving one malloc/free call) whereas for larger sizemessages, memory is allocated. You need to keep that straight duringmanipulation.
I'll look at it and see how hard it is to separate these two cases.
thanks
for the pointer here.
Just let me add that I did find it of questionable value to try avoid the
malloc here. At least in the sanitization problem, this would have resulted
in very complex code. And while saving memory writes and calls to the malloc
subsystem is useful, I thought that it would not have brought much benefit in
that case. Depending on what you intend to do (well-defined insert at late
point) things may be different, though.


My initial thought is something along the following

1. find out how much space is available in whatever buffer the message isin (potentially 0 if the buffer is exactly the right size)

document what needs to happen to adjust how much of the buffer is used(I've already figured out some of this with the existing parser modules)

2. if there is not enough space, document what the process is to allocatea new buffer and make the system use it.

at this point it should be fairly straightforward to write a routine to dosomething along the lines of 'make sure I have enough space in the bufferto add X characters' and have it either return immediatly if there'senough space or allocate the larger buffer if needed and return afterdoing that.

there will be some things that will need to be documented as side effects(pointers into the existing message may be invalid at that point,including values in the msg structure)

this could be mis-used (running this routine for every control characterfound could result in many malloc/free pairs for example), and so exampleswill need to be given of doing a 2-pass routine, pass 1 to figure out whatyou want to do, and then make sure there's enough space and do pass 2 tomodify the buffer as needed.

Using this for sanitizing would still be slightly less efficient than theapproach you probably use now (allocate a new buffer, copy things into itas you go to construct a new message, then set the message into thestructure), but probably not by more than two copies of the text. As aresult, it may be that the result will be enough cleaner to be worth thecost. I'm thinking that the new routine would be to copy the text from theold buffer to the new one, then copy everything after your first insert tothe end of the buffer. after that you are copying data from late in thebuffer to earlier in the buffer, which may even be faster than copyingsmall amounts of data from one buffer to another as it may result inbetter cache behavior.

in fact, this pattern is probably common enough to make it a routineitself


something like

int InsertIntoRawMsg(int offset, int count)

inserts at least count spaces into the message at position offset fromthe beginning of the message, returns the number of spaces actuallyinserted (may be more than the number requested)

or would it be better to return the number of extra characters availablein the buffer after the end of the string?

I figure error checking on the return is not needed because if it can'tallocate the space we need to bail out (with whatever rsyslog does when itruns out of memory, probably aborting the message entirely)


David Lang

Rainer

As a side-note, it would probably be useful if you could take somebullet points on how to modify things, so that others can find thatinformation in the case they want to do that themselves. Could go tothe wiki or I could include it in the doc set. Just a suggestion,though...


I'll see what I can do.

David Lang

Rainer

-----Original Message-----
From: [email protected] [mailto:rsyslog-
[email protected]] On Behalf Of [email protected]
Sent: Friday, February 11, 2011 5:38 AM
To: rsyslog-users
Subject: [rsyslog] how can a parser insert data into a message

the various parser modules that I've submitted are all removing data
from
the log message or overwriting the data in place.

But I've now run across a situation where I need to insert

information

into the message. I know that this can be done because the

sanitizing

call
does exactly this. I am assuming that this is doing something like
allocating a new string and copying the data into the new string.

the concern is how to do this in a way that will survive the exit

from

the
module, not confuse any of the many pointers or sizes that are
involved,
and make sure everything is properly freed afterwords.

should I just search for the sanitizing routine and copy what it

does

(and
can you point me at it?), or do you want me to wait until you have

time

to
write something up on this?

David Lang
_______________________________________________
rsyslog mailing list
http://lists.adiscon.net/mailman/listinfo/rsyslog
http://www.rsyslog.com

_______________________________________________
rsyslog mailing list
http://lists.adiscon.net/mailman/listinfo/rsyslog
http://www.rsyslog.com

_______________________________________________
rsyslog mailing list
http://lists.adiscon.net/mailman/listinfo/rsyslog
http://www.rsyslog.com

_______________________________________________
rsyslog mailing list
http://lists.adiscon.net/mailman/listinfo/rsyslog
http://www.rsyslog.com

_______________________________________________
rsyslog mailing list
http://lists.adiscon.net/mailman/listinfo/rsyslog
http://www.rsyslog.com

Re: [rsyslog] how can a parser insert data into a message

Reply via email to