Re: [Syslog] Syslog RFC feedback

Chris Lonvick Fri, 19 Dec 2008 08:35:23 -0800

Hi Raffy,

On Thu, 18 Dec 2008, Raffael Marty wrote:

I am really late to the game here. I know. I just wanted to share somefeedback that I had about the current state of the syslog document:

You are late. In fact, the document has been approved to become an RFC sono changes are allowed in that document.

I see that you note some areas of the specification that you disagreewith. This is expected as the IETF process (which is completed in thiscase) is a series of proposals, discussions, and compromises. It is notexpected that everyone will be entirely happy with the results, butoverall we feel that we are pushing the technology forward and areproducing a better product. In fact, everyone can tell that we know thatwe havn't produced a perfect product since we have version numbers. ;-)


Some comments inline.

- Let me say this first: I really like some of the changes that have beenincorporated.

:-)

- Syslog message facility: Why do we still keep this? The only reason that Isee people using the facility is to filter messages. There are better ways todo that. Some of the pre-assigned groups are fairly arbitrary and not evenreally implemented in most OSs. UUCP subsystem? Who is still using that? Iguess the reason for keeping it is backwards compatibility? If possible, Iwould really like this to be gone.- Priority calculation. The whole priority field is funky. The priority doesnot really have any meaning. The order does not imply importance. Why havingthis at all?

The PRIs are being used today (although I agree that most labels aredefunct) and the WG felt that having some backwards compatability wouldincrease adoption of the spec.

- Timestamps: What's the reason for having the "T" in the timestamp? Havinglooked at hundreds of different log formats, I have never seen anything likethat. Why doing this?


http://www.ietf.org/rfc/rfc3339.txt?number=3339

- Hostname: I am not comfortable with the whole hostname spec. I like thatthere is an ordering and people are supposed to use FQDN, but there are manyquestions about this. To start with, in a lot of UNIX configurations,/etc/hosts contains an entry like:
127.0.0.1   localhost.localdomain  localhost
The second column is the FQDN (technically). Is that one that can be used?Can you make it clear that this is not what should be used? Same for127.0.0.1 or the loopback address in general. How does a machine know whetheran IP address is static or dynamic? How does a logging application know? Idon't think you will ever know that. Did you mean a private versus a publicaddress? That might be interesting. Furthermore, it should specify whichinterface's IP address to use. The one the message is sent out on?

Domain names are defined in RFCs 1034 and 1035. "localhost.localdomain"is not an FQDN.

- Under the section of PROCID: The text is imprecise. This number is not theprocess ID of the syslog process, it's the ID of the writing process. Thethird paragraph talks about detecting restarted applications and somehowmixes in the syslog process. ("might be assigned the same process ID as theprevious syslog process".) This is not clear at all and very very confusing- MSGID: Make clear that this ID is local to the application. It's not aglobal ID at all.
The biggest issue I have around the SD-ID field:

- I like that the user can extend these.
- Why is this structure so complicated? Why not going with a simple set ofkey-value pairs? This whole structure thing is so complicated. Parsing it,you need to keep state! You need to remember the SD-ID for each SD-PARAM. Whyintroducing this? Just stick with simple key-value pairs. That makes parsingeasier. Much easier. And it makes the events easier to produce as well.- By keeping an explicit message field (the unstructured part), you encouragepeople to still log in that way. I recommend using an explicit field (orparameter) that can be used to include human readable text. Instead of this:
<165>1 2003-08-24T05:14:15.000003-07:00 192.0.2.1 myproc 8710 - - %% It'stime to make the do-nuts.
use:
<165>1 2003-08-24T05:14:15.000003-07:00 192.0.2.1 myproc 8710 - message="%%It's time to make the do-nuts."
or really:
1 2003-08-24 05:14:15.000003-07:00 host=192.0.2.1 process=myproc procid=8710message="%% It's time to make the do-nuts."

Those could be considered globally unique attribute-value pairs as therewould be no uncertainty about their context in those situations. Inspecific contexts, we decided that people will reuse the attribute labels.As an (unlikely) example, let's use the attribute label of "color". if itwere globally unique, then it would have to be globally defined andaccepted. We get around that by specifying the context. To use yourway, we would have:

{syslog header stuff} color="yellow" I don't know what this really means
or, as we've defined it:
{syslog header stuff} [box_identificat...@9 color="teal"] Cuz it's cisco

We figured that the alternative would be to make each attribute labelglobally unique such as:

{syslog header stuff} ciscos_box_identification_color="teal"
which is not really workable.

- I definitely like the consideration of some of the special fields(structured data IDs). However, they should be used as simple keys (orparameters) that have special meaning.- Parameter - origin: What does it mean to have multiple origins IPs? Is thata syslog forwarding chain? The document does not say anything about that.Also, we already have the host field in the beginning of the syslog messages.What's the relationship to that? Or is origin something completely different?

We had several discussions about device virtualization. Consider that ahosting machine will have its own IP address and hostname and that thevirtual machines will have their own names and addresses. The hostingdevice will now be able to send a message describing itself or one of itsvirtual machines.

- Parameters - I would really like to see some use-cases for all of the IDs.Especially the sequenceId. I am assuming this is something that the syslogdaemon assigns, not the logging application. Right? I think that needs to beclearer. For the sequenceId, what happens for forwarded messages? Are theseIDs local? Are they forwarded along with a message? Also, how does thelogging application know about the timeQuality? Or if that something that thesyslog daemon assigns, how does it know?

We've discussed writing a "use case" document. It's not going to happenunder our current charter but that's good input for rechartering.

(Note to group: We won't discuss rechartering the WG until AFTER we getsyslog-sign approved by our AD.)

- I would really like to see the parameters to go away and have a generickey-value extension. In addition, IANA should have a set of allowed/definedkeys. The parameters should be part of those. Each key has a special meaning(semantics). There should be a whole lot of them: src_ip, user_name, etc.Each producer should be free to add additional keys, realizing that not allconsumers would understand their semantics. However, the consumers couldstill read them.

See above. The IANA will update the parameters based upon RFCs. I thinkthat having those defined for specific uses is a really good idea and Ithink that discussion will be welcomed when we get around to talking aboutrechartering.


That's it for now.

Thanks a lot

-raffy


Best regards,
Chris
_______________________________________________
Syslog mailing list
[email protected]
https://www.ietf.org/mailman/listinfo/syslog

Re: [Syslog] Syslog RFC feedback

Reply via email to