Confluent’s Secret Protection feature encrypts secrets in configuration files. Instead of storing passwords or other sensitive data as cleartext, Secret Protection encrypts it within the configuration file itself.
You may either step through this tutorial manually, or run the automated demo.
Before you start:
-
Download Confluent Platform 5.3 or higher
-
Get the new Confluent CLI (v0.128.0 or above)
In the most common use case, you would want to encrypt passwords. We have a Security tutorial that provides an example of how to enable security features on Confluent Platform, but that takes extra steps to generate the keys and certificates and add the TLS configurations. Therefore, instead of encrypting a password, we will encrypt a basic configuration parameter, but the steps are exactly the same.
First choose your master encryption key passphrase, a phrase that is much longer than a typical password and is easily remembered as a string of words, and enter this passphrase into a file to be passed into the CLI, to avoid logging history showing the passphrase. Then choose the location of where the secrets file will reside on your local host (not where the Confluent Platform services run). The secrets file will contain encrypted secrets for the master encryption key, data encryption key, and configuration parameters, along with their metadata such as which cipher was used for encryption. Now you are ready to generate the master encryption key:
# passphrase: /path/to/passphrase.txt
# local-secrets-file: /path/to/secrets.txt
$ confluent secret master-key generate --local-secrets-file /path/to/secrets.txt --passphrase @/path/to/passphrase.txt
Save the master key. It cannot be retrieved later.
+------------+----------------------------------------------+
| Master Key | Nf1IL2bmqRdEz2DO//gX2C+4PjF5j8hGXYSu9Na9bao= |
+------------+----------------------------------------------+
As the output indicates, the master encryption key cannot be retrieved later so save it somewhere. Export this key into the environment on the local host, as well as every host that will have a configuration file with secret protection:
$ export CONFLUENT_SECURITY_MASTER_KEY=Nf1IL2bmqRdEz2DO//gX2C+4PjF5j8hGXYSu9Na9bao=
To protect this environment variable in a production host, you can set the master encryption key at the process level instead of the global machine level. For example, you could set it in the systemd overrides for executed processes, restricting the environment directives file to root-only access.
Let’s use a configuration parameter available in a configuration file example that ships with Confluent Platform.
We will encrypt the parameter config.storage.topic
in $CONFLUENT_HOME/etc/schema-registry/connect-avro-distributed.properties
.
First make a backup of this file, because the CLI currently does in-place modification on the original file. Then choose the exact path for where the secrets file will reside on the remote hosts where the Confluent Platform services run. Now you are ready to encrypt this field:
# Value before encryption
$ grep "config\.storage\.topic" connect-avro-distributed.properties
config.storage.topic=connect-configs
# Encrypt it
# remote-secrets-file: /path/to/secrets-remote.txt
confluent secret file encrypt --local-secrets-file /path/to/secrets.txt --remote-secrets-file /path/to/secrets-remote.txt --config-file connect-avro-distributed.properties --config config.storage.topic
# Value after encryption
$ grep "config\.storage\.topic" connect-avro-distributed.properties
config.storage.topic = ${securepass:/path/to/secrets-remote.txt:connect-avro-distributed.properties/config.storage.topic}
As you can see, the configuration parameter config.storage.topic
setting was changed from connect-configs
to ${securepass:/path/to/secrets-remote.txt:connect-avro-distributed.properties/config.storage.topic}
.
This is a tuple that directs the service to use to look up the encrypted value of the file/parameter pair connect-avro-distributed.properties/config.storage.topic
from the secrets file /path/to/secrets-remote.txt
.
View the contents of the local secrets file /path/to/secrets.txt
, which now contains the encrypted secret for this file/parameter pair along with the metadata, e.g. which cipher was used for encryption:
$ cat /path/to/secrets.txt
...
connect-avro-distributed.properties/config.storage.topic = ENC[AES/CBC/PKCS5Padding,data:CUpHh5lRDfIfqaL49V3iGw==,iv:vPBmPkctA+yYGVQuOFmQJw==,type:str]
You can also decrypt the value into a file:
$ confluent secret file decrypt --local-secrets-file /path/to/secrets.txt --config-file connect-avro-distributed.properties --output-file decrypted.txt
$ cat decrypted.txt
config.storage.topic = connect-configs
You may have a requirement to update secrets on a regular basis, to help them from getting stale.
The configuration parameter config.storage.topic
was originally set to connect-configs
.
If you need to change the value in the future, you can update it directly using the CLI.
In the CLI below, pass in a file /path/to/updated-config-and-value
that has written config.storage.topic=newTopicName
, to avoid logging history showing the new value.
$ confluent secret file update --local-secrets-file /path/to/secrets.txt --remote-secrets-file /path/to/secrets-remote.txt --config-file connect-avro-distributed.properties --config @/path/to/updated-config-and-value
The configuration file connect-avro-distributed.properties
does not change because it’s just a pointer to the secrets file.
However the secrets file has a new value for the encrypted value for this file/parameter pair:
$ cat /path/to/secrets.txt
...
connect-avro-distributed.properties/config.storage.topic = ENC[AES/CBC/PKCS5Padding,data:CblF3k1ieNkFJzlJ51qAAA==,iv:dnZwEAm1rpLyf48pvy/T6w==,type:str]
That’s cool, but does it work?
Try it out yourself.
Run Kafka and start the modified connect worker with the encrypted value of config.storage.topic=newTopicName
# Start ZooKeeper and a Kafka broker
$ confluent local start kafka
# Run the modified connect worker
$ connect-distributed connect-avro-distributed.properties > connect.stdout 2>&1 &
# List the topics
$ kafka-topics --bootstrap-server localhost:9092 --list
__confluent.support.metrics
__consumer_offsets
_confluent-metrics
connect-offsets
connect-statuses
newTopicName <<<<<<<
So far we have covered how to create the master encryption key and encrypt secrets in the configuration files. We recommend that you operationalize this workflow by augmenting your orchestration tooling to distribute everything that you need for secret protection to work to the destination hosts. These hosts may include Kafka brokers, Connect workers, Confluent Schema Registry instances, KSQL servers, Confluent Control Center, etc., any service using password encryption. The CLI is flexible to accommodate whatever secret distribution model you prefer: you can either do the secret generation and configuration modification on each destination host directly, or do it all a single host and then distribute the encrypted secrets to the destination hosts. There are four required tasks:
-
Export the master encryption key into the environment on every host that will have a configuration file with secret protection.
-
Distribute the secrets file: copy the secrets file
/path/to/secrets.txt
from the local host on which you have been working to/path/to/secrets-remote.txt
on the destination hosts. -
Propagate the necessary configuration file changes: update the configuration file on all hosts so that the configuration parameter now has the tuple for secrets.
-
Restart the services if they were already running.
You may also have a requirement to rotate the master encryption key or data encryption key on a regular basis. You can do either of these with the CLI, and the example below is for rotating just the data encryption key.
$ confluent secret file rotate --data-key --local-secrets-file /path/to/secrets.txt --passphrase @/path/to/passphrase.txt