Skip to main content
Version: 14.3.0

Configuration

To setup the architecture described previously, you need to configure the following components:

Sidecar

The Audit Trail sidecar is available in the marketplace and can be attached to any microservice in your project.

The only requirement for the sidecar to work is to share a volume with the main application container, so that logs are written to a shared log file, which is read by the sidecar to filter the audit logs based on their log level, enrich them with additional metadata and sends them to the Kafka topic.

To configure your sidecar and starts collecting the audit logs, you must:

  1. Attach the sidecar to the microservice (see the official documentation)
  2. Add a shared volume
  3. Configure the sidecar environmental variables

Shared volume

Once you have attached a sidecar to your microservice, you have two options:

Customize with Kustomize

Open the GitLab repository containing your Console project configuration and perform the following steps for each project environment you want to make the changes:

  1. Create a new YAML file with an arbitrary name - e.g. patch.yaml - under overlays/<environment>

  2. Open the overlays/<environment>/kustomization.yaml file and add a new entry in the patches section, specifying the name of your microservice under target/name and the name of the file created at the previous step under path:

apiVersion: kustomize.config.k8s.io/v1beta1
kind: Kustomization
patches:
- target:
kind: Deployment
name: <service-name>
path: patch.yaml
  1. In the patch.yaml file, add a new shared volume
- op: add
path: /spec/template/spec/volumes/-
value:
name: logs
emptyDir: {}
  1. In the patch.yaml file, attach to each container the shared volume ensuring the name property matches the name you chose for the shared volume defined previously.
# Let's assume the first container listed under `spec/template/spec/containers` in the deployment file is the main application container
- op: add
path: "/spec/template/spec/containers/0/volumeMounts/-"
value:
mountPath: "/logs/"
name: "logs"

# Let's assume the second container listed under `spec/template/spec/containers` in the deployment file is the sidecar container
- op: add
path: "/spec/template/spec/containers/1/volumeMounts/-"
value:
mountPath: "/logs/"
name: "logs"
  1. In the patch.yaml file, override the service Dockerfile command to send logs both to standard output and to the file specified in the sidecar LOG_FILE_PATH environment variable.
# Let's assume the first container listed under `spec/template/spec/containers` in the deployment file is the main application container
- op: add
path: "/spec/template/spec/containers/0/command"
value: ["/bin/sh", "-c"]
args: ["node ./dist/index.js | tee /logs/application.log"]
  1. From the Console, set the LOG_FILE_PATH environment variable of the sidecar with the absolute path of the log file, in the example /logs/application.log.

Customize with Console Raw Manifest

After enabling the Raw Manifest configuration on your service, you should be able to edit the deployment file, called configuration/<service-name>.deployment.yml.

  1. Under spec/template/spec/volumes, add a new shared volume.
apiVersion: apps/v1
kind: Deployment
spec:
template:
spec:
volumes:
- name: logs
emptyDir: {}
  1. Attach to each container listed under spec/template/spec/containers the shared volume, ensuring the name property matches the name you chose for the shared volume defined previously.
apiVersion: apps/v1
kind: Deployment
spec:
template:
spec:
containers:
- name: your-service
image: ...
volumeMounts:
- name: logs
mountPath: /logs/
- name: audit-trail
image: nexus.mia-platform.eu/mia-care/plugins/audit-trail:1.0.0
volumeMounts:
- name: logs
mountPath: /logs/
volumes:
- name: logs
emptyDir: {}
  1. Set the LOG_FILE_PATH environment variable to the absolute path of a file located inside the mount patch (/logs/ in our example); you can choose any name we want for the file, like application.log.
apiVersion: apps/v1
kind: Deployment
spec:
template:
spec:
containers:
- name: your-service
image: ...
volumeMounts:
- name: logs
mountPath: /logs/
- name: audit-trail
image: nexus.mia-platform.eu/mia-care/plugins/audit-trail:1.0.0
volumeMounts:
- name: logs
mountPath: /logs/
env:
- name: LOG_FILE_PATH
value: /logs/application.log
volumes:
- name: logs
emptyDir: {}
  1. Ensure your service sends logs both to standard output and to the file specified in the sidecar LOG_FILE_PATH environment variable. You can simply change the Dockerfile default command (CMD) to sends it output to both the standard output (as by default) and the log file using tee. If you cannot edit the Dockerfile directly, for example because you are using a marketplace plugin, you can simply override the Dockerfile default command in the microservice configuration args field.

Environment variables

To configure the sidecar you must set the following environment variables. The required environment variables are highlighted in bold. We strongly recommend using secrets to load sensitive information (like Kafka credentials and brokers address) into environment variables.

NameRequiredDefaultVersionDescription
AUDIT_TRAIL_LOG_LEVELNo1100*The log level associated to audit logs.
KAFKA_BROKERSYes-*A comma-separated list of Kafka brokers address.
KAFKA_CLIENT_IDYes-*The Kafka client ID, for example the identifier of the service the sidecar is attached to.
KAFKA_TOPIC-*Name of the Kafka topic to send audit logs to.
KAFKA_AUTH_MECHANISMPLAIN*Authentication mechanism, used only if KAFKA_USERNAME and KAFKA_PASSWORD are set.
KAFKA_USERNAME-*Username of the Kafka credentials.
KAFKA_PASSWORD-*Password of the Kafka credentials
KAFKA_CONNECTION_TIMEOUTNo1000*Time in milliseconds to wait for a successful connection.
KAFKA_AUTHENTICATION_TIMEOUTNo10000*Time in milliseconds to wait for a successful authentication.
LOG_FILE_PATHYes-*Absolute path of the log file inside the shared volume.
LOG_LEVELYes-*Logging level for the sidecar.
danger

When configuring your microservice, be careful to set a log level that is lower or equal than the AUDIT_TRAIL_LOG_LEVEL, otherwise the audit logs will be suppressed by the logger.

Kafka

To collect the logs you just need to create a dedicated topic in a Kafka cluster.

After creating a new topic, you should configure the retention period, taking into account the amount of audit logs generated by the systems. When choosing the retention period, you need to find a balance between the availability of logs for later processing, accounting for the unavailability of the downstream services (Kafka2Rest and the CRUD Service), and the amount of logs generated by your services, which can vary between environments depending on the level of traffic and the configured log level.

A log will appear in Kafka as a message with a payload looking like this:

{
"version": "1.0.0",
"timestamp": "2024-04-30T09:12:06.095Z",
"checksum": {
"algorithm": "sha512",
"value": "e474e95adfb31ef4cac7d992732a43d65e3313c852bd5e318dd84087b39ab62b19ff4c5590a6d5d5055ee5e3669c384c55eff0f36fe090205bd67d92d4aa9381"
},
"metadata": {
"level": 1100,
"msg": "Hello World Audit Log",
"event": "MiaCare/HelloWorld/v1",
"severity": "INFO"
},
"message": "Hello World Audit Log",
"rawLog": "{\"level\":1100,\"msg\":\"Hello World Audit Log\", ...}"
}

The audit logs are enriched with several fields before being sent from the sidecar to the Kafka topic:

  • version: the version of the audit logs reference schema;
  • timestamp: when the audit log was recorded ;
  • checksum: this checksum is generated automatically from the original log (available in the rawLog field);
  • metadata: this field contains all the log fields, including the ones passed as first argument to the logger;
  • message: this field contains the original log message (currently the message must be in the log msg field);
  • rawLog: the original log, as emitted by the application logger.

For more details, check the data model overview.

tip

If you need to record when the even occurred, you should pass it explicitly as field of the object passed as first argument to the logger, so it's recorded in the metadata and available later for querying.

Kafka2Rest

The Kafka2Rest service should authenticate with Kafka using SASL/SCRAM with Transport Layer Security (TLS) in combination with dedicated credentials, that have exclusive access to the topic.

You can also configure a validator processor, to performs additional filtering on the audit logs, and a body processor, to manipulate the audit log before sending it as payload of the POST to the CRUD Service.

CRUD Service

The CRUD Service should connect to MongoDB using dedicated credentials, which only allow insert or read operations, and must not be exposed.

You must create a CRUD collection with the custom fields described in the following table or you can easily import the fields from this JSON file

NameTypeRequiredNullable
versionStringNoNo
timestampDateNoNo
metadataRawObjectNoNo
checksumRawObjectNoNo
messageStringNoNo
rawLogStringNoNo

Performance tuning

After you have setup all the components of the architecture, you need to estimate the amount of logs generated by your microservices and appropriately scale your infrastructure.

Once you have established you overall performance requirements, in terms of expected incoming traffic and amount of generated logs by each service, you should carry a load test to assign the proper resources to each component, starting with the sidecar, which has to read all the logs of the main application container, filter the audit logs and forward them to Kafka.

We recommend starting with vertical scaling, trying to assign more CPU resources to process incoming logs faster, and eventually resolve to scale horizontally with multiple replicas of the service. According to our internal benchmarks, a sidecar with the following resources should be able process around 1000 logs/second:

ServiceVersionCPU (req/lim)RAM (req/lim)
CRUD Service5.4.2150/300100/150
Kafka2Rest1.1.1150/30050/100
Sidecar1.0.0150/30050/100

Then you can focus on scaling appropriately the Kafka2Rest and CRUD Service services.

Starting with Kafka2Rest, you can use a conservative configuration, since logs forwarded to the Kafka topic can be processed asynchronously. If you opt for multiple replicas, for optimal performance you should configure the topic to have the same number of partitions, so each Kafka2Rest replica can parallelize work processing logs from different partitions.

Finally, you can look at the CRUD Service and scale it to "match" the amount of requests generated by Kafka2Rest. We expect that, in most cases, the default configuration will work fine. For additional information and guidance, take a look at the CRUD Service performance documentation.