Hi Raffy,

On Thu, 18 Dec 2008, Raffael Marty wrote:

I am really late to the game here. I know. I just wanted to share some feedback that I had about the current state of the syslog document:

You are late. In fact, the document has been approved to become an RFC so no changes are allowed in that document.

I see that you note some areas of the specification that you disagree with. This is expected as the IETF process (which is completed in this case) is a series of proposals, discussions, and compromises. It is not expected that everyone will be entirely happy with the results, but overall we feel that we are pushing the technology forward and are producing a better product. In fact, everyone can tell that we know that we havn't produced a perfect product since we have version numbers. ;-)

Some comments inline.


- Let me say this first: I really like some of the changes that have been incorporated.

:-)

- Syslog message facility: Why do we still keep this? The only reason that I see people using the facility is to filter messages. There are better ways to do that. Some of the pre-assigned groups are fairly arbitrary and not even really implemented in most OSs. UUCP subsystem? Who is still using that? I guess the reason for keeping it is backwards compatibility? If possible, I would really like this to be gone. - Priority calculation. The whole priority field is funky. The priority does not really have any meaning. The order does not imply importance. Why having this at all?

The PRIs are being used today (although I agree that most labels are defunct) and the WG felt that having some backwards compatability would increase adoption of the spec.


- Timestamps: What's the reason for having the "T" in the timestamp? Having looked at hundreds of different log formats, I have never seen anything like that. Why doing this?

http://www.ietf.org/rfc/rfc3339.txt?number=3339

- Hostname: I am not comfortable with the whole hostname spec. I like that there is an ordering and people are supposed to use FQDN, but there are many questions about this. To start with, in a lot of UNIX configurations, /etc/hosts contains an entry like:
127.0.0.1   localhost.localdomain  localhost
The second column is the FQDN (technically). Is that one that can be used? Can you make it clear that this is not what should be used? Same for 127.0.0.1 or the loopback address in general. How does a machine know whether an IP address is static or dynamic? How does a logging application know? I don't think you will ever know that. Did you mean a private versus a public address? That might be interesting. Furthermore, it should specify which interface's IP address to use. The one the message is sent out on?

Domain names are defined in RFCs 1034 and 1035. "localhost.localdomain" is not an FQDN.

- Under the section of PROCID: The text is imprecise. This number is not the process ID of the syslog process, it's the ID of the writing process. The third paragraph talks about detecting restarted applications and somehow mixes in the syslog process. ("might be assigned the same process ID as the previous syslog process".) This is not clear at all and very very confusing - MSGID: Make clear that this ID is local to the application. It's not a global ID at all.

The biggest issue I have around the SD-ID field:

- I like that the user can extend these.
- Why is this structure so complicated? Why not going with a simple set of key-value pairs? This whole structure thing is so complicated. Parsing it, you need to keep state! You need to remember the SD-ID for each SD-PARAM. Why introducing this? Just stick with simple key-value pairs. That makes parsing easier. Much easier. And it makes the events easier to produce as well. - By keeping an explicit message field (the unstructured part), you encourage people to still log in that way. I recommend using an explicit field (or parameter) that can be used to include human readable text. Instead of this:

<165>1 2003-08-24T05:14:15.000003-07:00 192.0.2.1 myproc 8710 - - %% It's time to make the do-nuts.

use:

<165>1 2003-08-24T05:14:15.000003-07:00 192.0.2.1 myproc 8710 - message="%% It's time to make the do-nuts."

or really:

1 2003-08-24 05:14:15.000003-07:00 host=192.0.2.1 process=myproc procid=8710 message="%% It's time to make the do-nuts."

Those could be considered globally unique attribute-value pairs as there would be no uncertainty about their context in those situations. In specific contexts, we decided that people will reuse the attribute labels. As an (unlikely) example, let's use the attribute label of "color". if it were globally unique, then it would have to be globally defined and accepted. We get around that by specifying the context. To use your way, we would have:
{syslog header stuff} color="yellow" I don't know what this really means
or, as we've defined it:
{syslog header stuff} [box_identificat...@9 color="teal"] Cuz it's cisco
We figured that the alternative would be to make each attribute label globally unique such as:
{syslog header stuff} ciscos_box_identification_color="teal"
which is not really workable.



- I definitely like the consideration of some of the special fields (structured data IDs). However, they should be used as simple keys (or parameters) that have special meaning. - Parameter - origin: What does it mean to have multiple origins IPs? Is that a syslog forwarding chain? The document does not say anything about that. Also, we already have the host field in the beginning of the syslog messages. What's the relationship to that? Or is origin something completely different?

We had several discussions about device virtualization. Consider that a hosting machine will have its own IP address and hostname and that the virtual machines will have their own names and addresses. The hosting device will now be able to send a message describing itself or one of its virtual machines.


- Parameters - I would really like to see some use-cases for all of the IDs. Especially the sequenceId. I am assuming this is something that the syslog daemon assigns, not the logging application. Right? I think that needs to be clearer. For the sequenceId, what happens for forwarded messages? Are these IDs local? Are they forwarded along with a message? Also, how does the logging application know about the timeQuality? Or if that something that the syslog daemon assigns, how does it know?

We've discussed writing a "use case" document. It's not going to happen under our current charter but that's good input for rechartering.

(Note to group: We won't discuss rechartering the WG until AFTER we get syslog-sign approved by our AD.)

- I would really like to see the parameters to go away and have a generic key-value extension. In addition, IANA should have a set of allowed/defined keys. The parameters should be part of those. Each key has a special meaning (semantics). There should be a whole lot of them: src_ip, user_name, etc. Each producer should be free to add additional keys, realizing that not all consumers would understand their semantics. However, the consumers could still read them.

See above. The IANA will update the parameters based upon RFCs. I think that having those defined for specific uses is a really good idea and I think that discussion will be welcomed when we get around to talking about rechartering.


That's it for now.

Thanks a lot

-raffy

Best regards,
Chris
_______________________________________________
Syslog mailing list
[email protected]
https://www.ietf.org/mailman/listinfo/syslog

Reply via email to