Change `Write([]byte)` to log full message to `message` field

[This is in part a continuation of a discussion started in PR #9 where we were also worried about message length.]

The `message` field (sent as `short_message` in GELF) and the `full_message` field are stored in the exact same way in Elasticsearch, so presumably they support the same sizes. I've actually tested `message` only up to 80 KB (via a null-delimited "RAW/Plaintext" interface) and 2500 lines or so, however. :-) It works fine, and the Graylog UI seems to do a reasonable job of dealing with it. (I have seen [a complaint](https://github.com/Graylog2/graylog2-server/issues/873) that long fields don't work, but that appears to be because they changed the default configuration to do term analysis on the `message` field, which isn't done by default.)

Having looked into this a bit, I find:

1. `full_message` is [optional to send in GELF](http://docs.graylog.org/en/2.2/pages/gelf.html#gelf-payload-specification), unlike `message`.

2. `full_message` is not parsed for things like sources and timestamps.

3. On its syslog inputs, Graylog has a `store_full_message: true` option which stores the raw syslog protocol message (usually [RFC 5424](https://tools.ietf.org/html/rfc5424) in the `full_message` field. This seems useful for debugging and handling situations where bad messages come in that the parser can't handle. We turned this on when we set up Graylog because it seemed like a good idea at the time, and it still doesn't strike me as a bad thing to do.

4. Graylog search functionality seems to be oriented mainly towards `message` and [it seems to take a bit of extra work](https://stackoverflow.com/questions/12130870/searching-in-graylog2-full-message) to search `full_message` instead.

From all this, I get the sense that `full_message` is considered something that can be safely ignored by clients retrieving and processing messages from Graylog. Given that, I think it's a bad idea, when given a message via the `io.Writer` interface, not to put the entire message into the `message` field.

As background, let me tell you about my use of Graylog at work. We have dozens of servers generating log messages into the Systemd Journal that we forward to a central Graylog installation so we can easily query across servers, do centralized analysis (such as with [OSSEC LIDS](http://www.ossec.net/files/auscert-2007-dcid.pdf) log analysis), keep historical records, and so on.

In order to keep things as simple as possible, I'm aiming to get all our non-journal logging also going into the journal, including logs from programs that write directly to `/var/log` such as `auditd`, output of jobs run by `cron`, and so on. (For example, I'm intending to replace `/usr/sbin/sendmail` on those servers with a program that eats the message and logs it to the journal.)

One of the questions I had to ask myself is whether I should be using the `full_message` field, and what with various programs partially duplicating information across `message` and `full_message`, that seems like a bad idea. How would the OSSEC log scanner, for example, know that for some log entries `full_message` is all the information in `message` and a few other fields in unparsed form, yet for other messages `message` could be an incomplete log message and `full_message` contains the complete message but without other log entry information such as the timestamp? That way lies a lot of pain, especially if I end up having to assemble new versions of log entries for log-scanning software that expects things to be re-assembled in a syslog-ish format with the timestamp, source etc. all in one big string.

So for these reasons, I'm suggesting we change the `io.Writer` API, `Write([]byte)`,  of `go-gelf` to record the entire bytestring it's handed in the `message` field and not send a `full_message` field. I think it's reasonable for it to continue adding things like the extra fields for file and line number of the caller.

Note that if anybody does want to use the `full_message` field for something, or any other fields for that matter, we have the `Write(*Message)` interface for that which allows (or should allow) full control over the GELF message being sent, within the limitations of the protocol. (We should not allow, e.g., a GELF message to be sent without a `message` field since that field is required by the protocol.)

If we're agreed on this, I'm happy to submit a PR for changes to the code and update the documentation appropriately, including this rationale in an appropriate place (probably the commit message).


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Change `Write([]byte)` to log full message to `message` field #11

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Change Write([]byte) to log full message to message field #11

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

Change `Write([]byte)` to log full message to `message` field #11