Obscuring data in Apache Logs

I am looking for the best way to obscure sensitive data being written to our Apache access logs. Some personal data can be passed in query strings, and it ends up in the logged request line.

One option is to use piped logging to run the access log messages through a regular expression that blots out the sensitive data. The upside is a lot of control over what does and does not get logged. The downsides are added complexity and some extra load on the web server boxes.

Here is Apache's info on piped logs:

[httpd.apache.org...]

The regex might look something like this:

s/(sensitive_param=)[^&;\s"]+/$1****/g;
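As a rough illustration of the piped-logging idea, here is a minimal sketch of a filter program in Python. It assumes Apache feeds it one log line at a time on standard input, and that the value to hide ends at the next `&`, `;`, whitespace, or double quote. The parameter name `sensitive_param` comes from the regex above; the script name `mask_log.py` and the log path are hypothetical.

```python
#!/usr/bin/env python3
"""Hypothetical piped-log filter: masks the value of one query-string parameter."""
import re
import sys

# Assumed parameter name; replace with the real one.
# The value is taken to end at '&', ';', whitespace, or a double quote.
SENSITIVE = re.compile(r'(sensitive_param=)[^&;\s"]+')


def mask_line(line: str) -> str:
    """Keep the parameter name, replace its value with a fixed mask."""
    return SENSITIVE.sub(r"\1****", line)


def main(log_path: str) -> None:
    # Line-buffered append, so each entry is written out as it arrives.
    with open(log_path, "a", buffering=1) as out:
        for line in sys.stdin:
            out.write(mask_line(line))


if __name__ == "__main__" and len(sys.argv) == 2:
    # Expects the destination log file as the only argument.
    main(sys.argv[1])
```

It would be hooked up with Apache's pipe syntax, along the lines of `CustomLog "|/usr/local/bin/mask_log.py /var/log/apache2/access.log" combined`, where both paths are placeholders. Using a fixed mask rather than echoing part of the value keeps the filter from leaking anything about the original data.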

Another option is to stop logging query strings entirely by changing the Apache LogFormat. This might lose some useful information, but it avoids adding load to the server. Another potential upside is that it also hides other data that users might prefer not to have recorded in the logs.

A good discussion of this method is here:

[webmasterworld.com...]

bird suggested:
\"%{REQUEST_METHOD}e %{SCRIPT_NAME}e %{SERVER_PROTOCOL}e\"

A third option considered was conditional logging. Although the goal could be accomplished this way, it became clear that it is not the right tool for the problem: it is meant for skipping entire log entries, not for masking part of one.

[httpd.apache.org...]

Any feedback is welcome.

Piped logging, logformats, conditional logging

timster

jdMorgan

timster
