Nautobot

nautobot-job-queues --
reconciling Nautobot JobQueue records and assigning Jobs to queues
mTLS Certificate Renewal --
how mTLS client certificates for site workers are renewed and
distributed across clusters

mTLS Certificate Resources

All mTLS certificates are managed by cert-manager in the nautobot
namespace. The CA hierarchy bootstraps from a self-signed issuer and
issues all leaf certificates for PostgreSQL, Redis, and application
pods automatically.

Resource	Kind	Key Fields	Purpose
`mtls-selfsigned`	Issuer	`selfSigned: {}`	Bootstrap issuer for the CA chain
`mtls-ca`	Certificate	`isCA: true`, CN=`understack-mtls-ca`, ECDSA P-256, 10yr	Root CA for all mTLS certs
`mtls-ca-issuer`	Issuer	`ca: mtls-ca-key-pair`	Signs all leaf certificates
`mtls-ca-cert`	Certificate	secret=`mtls-ca-cert`, label `cnpg.io/reload`	CA public cert for CNPG/Redis verification
`nautobot-cluster-server-tls`	Certificate	CN=`nautobot-cluster-rw.nautobot.svc`, 1yr	PostgreSQL server TLS
`nautobot-cluster-replication`	Certificate	CN=`streaming_replica`, usage=`client auth`, 1yr	CNPG streaming replication
`nautobot-redis-server-tls`	Certificate	CN=`nautobot-redis-master.nautobot.svc`, 1yr	Redis server TLS
`nautobot-mtls-client`	Certificate	CN=`app`, usage=`client auth`, 3yr	Client cert for nautobot/celery pods
`nautobot-mtls-client-<site>`	Certificate	CN=`app`, usage=`client auth`, 3yr	Per-site client cert issued on the global cluster and copied to the site cluster through the secrets provider

Client Certificate Names

There are three relevant Secret names:

Global cluster nautobot-mtls-client: generated by the global
Certificate and mounted by global Nautobot web and Celery pods.
Global cluster nautobot-mtls-client-<site>: generated by a per-site
Certificate. Workloads do not mount this Secret directly; its cert/key
are copied to the external secrets provider for that site.
Site cluster nautobot-mtls-client: generated by ExternalSecret from
the provider data for that site. This is the Secret mounted by
site-level worker pods at /etc/nautobot/mtls/.

The server certificates (nautobot-cluster-server-tls and
nautobot-redis-server-tls) include site-specific dnsNames that vary
per deployment (e.g. external hostnames for the database and Redis
endpoints). All other resources are identical across sites.

PostgreSQL mTLS

All PostgreSQL connections -- both from global Nautobot pods and
site-level workers -- use mutual TLS with client certificate
authentication. The CNPG cluster enforces this with a single pg_hba
rule:

hostssl all all 0.0.0.0/0 cert

This means every client must connect over TLS and present a valid
client certificate signed by the mTLS CA. The certificate CN is mapped
to the PostgreSQL user (app).

CNPG Certificate Configuration

The CNPG Cluster resource has four certificate fields. Understanding
what each one does is critical for troubleshooting TLS errors:

Field	Secret	What CNPG Does With It
`serverTLSSecret`	`nautobot-cluster-server-tls`	Mounted as the PostgreSQL server cert. Presented to clients during the TLS handshake.
`serverCASecret`	`mtls-ca-cert`	The `ca.crt` from this secret is sent to clients so they can verify the server cert (`sslrootcert` on the client side).
`clientCASecret`	`mtls-ca-cert`	The `ca.crt` from this secret populates PostgreSQL's `ssl_ca_file`. This is what PostgreSQL uses to verify client certificates during `pg_hba cert` auth.
`replicationTLSSecret`	`nautobot-cluster-replication`	Client cert (`CN=streaming_replica`) used for streaming replication between PostgreSQL instances.

Key points:

clientCASecret is the field that controls client cert verification.
Without it, CNPG auto-generates its own internal CA and uses that for
ssl_ca_file. External client certs signed by the mTLS CA will be
rejected with tlsv1 alert unknown ca.
serverCASecret does NOT populate ssl_ca_file. It only provides
the CA cert that clients use to verify the server. This is a common
source of confusion.
replicationTLSSecret must be provided when setting clientCASecret.
Without it, CNPG tries to generate its own replication cert and needs
ca.key in the clientCASecret secret. Since mtls-ca-cert only
has ca.crt (not the CA private key), CNPG fails with
missing ca.key secret data.
Both clientCASecret and serverCASecret can point to the same
secret (mtls-ca-cert) when the same CA signs both server and client
certificates.
The replicationTLSSecret and client Certificate resources both set
usages: [client auth] in their cert-manager spec. This maps to the
X.509 Extended Key Usage (EKU) id-kp-clientAuth (OID
1.3.6.1.5.5.7.3.2), marking the certificate as valid only for
authenticating a client to a server. PostgreSQL's pg_hba cert rule
requires connecting certificates to carry this EKU -- without it the
handshake is rejected. Server certificates (e.g.
nautobot-cluster-server-tls) omit usages so cert-manager applies
the default set which includes server auth.
The mtls-ca-cert Certificate resource must have the label
cnpg.io/reload: "true". This tells the CNPG operator to watch the
resulting Secret for changes and automatically reload PostgreSQL
instances when cert-manager renews the CA certificate. Without this
label, a CA renewal would require a manual pod restart for CNPG to
trust the new certificate. See the
CNPG Labels and Annotations docs
for details.

How nautobot_config.py Handles SSL

The shared deploy config's ($deploy/apps/nautobot-config/nautobot_config.py)
SSL logic is gated on the NAUTOBOT_DB_SSLMODE
environment variable:

Value	Behavior	Use Case
`verify-ca`	Sets `sslmode`, `sslcert`, `sslkey`, `sslrootcert` on the Django DB connection. Validates cert files exist at startup.	Global pods and site workers (production).
`verify-full`	Same as `verify-ca` but also verifies the server hostname matches the cert.	Stricter verification if needed.
`require`	Sets `sslmode=require` only. Encrypts the connection but does not present a client cert or verify the server CA.	Not suitable for `pg_hba cert` -- use `verify-ca` instead.
Unset or empty	No SSL options applied. Plain TCP connection.	Will be rejected by `hostssl ... cert` pg_hba rule.

All pods (global and site) must set NAUTOBOT_DB_SSLMODE=verify-ca in
their extraEnvVars and have the mTLS client cert volume mounted at
/etc/nautobot/mtls/.

NAUTOBOT_DB_SSLNEGOTIATION is optional. If set to direct, the shared
config passes sslnegotiation=direct to libpq. This requires
PostgreSQL/libpq 17+ and NAUTOBOT_DB_SSLMODE=require or stronger.

Verifying the Certificate Chain

To confirm the CNPG cluster is using the correct CA for client cert
verification:

# Check what CA PostgreSQL is using for ssl_ca_file
kubectl exec -n nautobot nautobot-cluster-1 -c postgres -- \
  openssl x509 -noout -subject -issuer \
  -in /controller/certificates/client-ca.crt
# Expected: subject=CN=understack-mtls-ca

# Check the client cert CN and issuer
kubectl get secret nautobot-mtls-client -n nautobot \
  -o jsonpath='{.data.tls\.crt}' | base64 -d | \
  openssl x509 -noout -subject -issuer
# Expected: subject=CN=app, issuer=CN=understack-mtls-ca

# Verify the client cert against the CA
kubectl get secret mtls-ca-cert -n nautobot \
  -o jsonpath='{.data.ca\.crt}' | base64 -d > /tmp/ca.crt
kubectl get secret nautobot-mtls-client -n nautobot \
  -o jsonpath='{.data.tls\.crt}' | base64 -d > /tmp/client.crt
openssl verify -CAfile /tmp/ca.crt /tmp/client.crt
# Expected: /tmp/client.crt: OK

Common Errors

Error	Cause	Fix
`tlsv1 alert unknown ca`	`clientCASecret` not set or points to wrong secret. CNPG uses its internal CA for `ssl_ca_file`.	Set `clientCASecret: mtls-ca-cert` and `replicationTLSSecret: nautobot-cluster-replication`.
`missing ca.key secret data`	`clientCASecret` set but `replicationTLSSecret` not provided. CNPG needs CA key to generate replication certs.	Add `replicationTLSSecret` with a cert-manager Certificate (`CN=streaming_replica`).
`connection requires a valid client certificate`	Client connected over TLS but did not present a cert.	Set `NAUTOBOT_DB_SSLMODE=verify-ca` on the pod.
`certificate authentication failed for user`	Client cert CN does not match the PostgreSQL user.	Ensure cert has `commonName: app`.
`x509: certificate signed by unknown authority` (CNPG status)	Old replication secret signed by CNPG's internal CA, not the mTLS CA.	Delete the old secret: `kubectl delete secret nautobot-cluster-replication -n nautobot`. cert-manager recreates it.
`no pg_hba.conf entry`	Client is not connecting over TLS, or the source IP / auth method does not match any rule.	Ensure `NAUTOBOT_DB_SSLMODE=verify-ca` is set. Check that the pg_hba rules cover the connection type.

Forcing CNPG to Reconcile

After changing certificate fields on the CNPG Cluster resource, the
operator may not immediately pick up the change. Force a reconcile:

kubectl annotate cluster nautobot-cluster -n nautobot \
  cnpg.io/reconcile=$(date +%s) --overwrite

Check the result:

kubectl get cluster nautobot-cluster -n nautobot \
  -o jsonpath='{.status.phase}{"\n"}{.status.phaseReason}{"\n"}'

If the phase is healthy, the change was applied. If it shows an error,
see the Common Errors table above.

Handling Stale CNPG-Managed Secrets

When adding replicationTLSSecret, CNPG may have already created a
secret with the same name (e.g. nautobot-cluster-replication) using
its internal CA. cert-manager will not overwrite a secret it did not
create. You must delete the old secret first:

kubectl delete secret nautobot-cluster-replication -n nautobot
# cert-manager recreates it within seconds, signed by mtls-ca-issuer

Verify the new secret:

kubectl get secret nautobot-cluster-replication -n nautobot
# Should show DATA=3 (tls.crt, tls.key, ca.crt)

kubectl get secret nautobot-cluster-replication -n nautobot \
  -o jsonpath='{.data.tls\.crt}' | base64 -d | \
  openssl x509 -noout -subject -issuer
# Expected: subject=CN=streaming_replica, issuer=CN=understack-mtls-ca

Then force a CNPG reconcile (see above).

Restarting CNPG Pods

If the CNPG pods have not picked up updated certificate secrets (e.g.
client-ca.crt still shows the old CA), use the cnpg kubectl plugin
to perform a rolling restart:

kubectl cnpg restart nautobot-cluster -n nautobot

This performs a rolling restart of all instances, handling replica/primary
ordering automatically and waiting for each pod to be ready before
proceeding.

If you only need pods to reload configuration (e.g. updated pg_hba
or PostgreSQL parameters) without a full restart:

kubectl cnpg reload nautobot-cluster -n nautobot

pg_hba Behavior

pg_hba rules are evaluated top-to-bottom. PostgreSQL stops at the first
rule matching the connection type and source IP. If authentication fails
on that rule, the connection is rejected -- it does NOT fall through to
the next rule. This means two rules with the same
hostssl all all 0.0.0.0/0 prefix makes the second unreachable. Use
CIDR scoping if you need different auth methods for different source
networks.

Rollback to Password Auth

To revert global pods to password-based auth while keeping cert auth
for site workers:

Add back the host rule for local pods:

postgresql:
  pg_hba:
    - host all all 10.0.0.0/8 scram-sha-256
    - hostssl all all 0.0.0.0/0 cert

Remove NAUTOBOT_DB_SSLMODE from global pod extraEnvVars (keep
it on site workers).
Optionally remove clientCASecret and replicationTLSSecret from
the CNPG spec to let CNPG manage its own replication CA again.

Configuration Architecture

All config changes go in the deploy repo

The public nautobot_config.py at $understack/components/nautobot/nautobot_config.py
is intentionally kept as simple and generic as possible for open-source consumers.
It does not contain mTLS, plugin loading, UNDERSTACK_PARTITION,
UNDERSTACK_SITE, or any extra plugins mechanism. All deployment-specific
Nautobot configuration changes MUST be made in the shared deploy config at
$deploy/apps/nautobot-config/nautobot_config.py.
Do not modify the public config for private deployment needs.

Nautobot requires a nautobot_config.py file that defines Django
settings, plugin loading, database options, and authentication
backends. For private deployments, this file lives at
$deploy/apps/nautobot-config/nautobot_config.py and is injected into
pods using the Helm chart's fileParameters feature. The public repo
provides a minimal default at components/nautobot/nautobot_config.py
for non-private deployments.

How fileParameters Works

Both the nautobot and nautobot-worker ArgoCD Applications use a
multi-source setup. The Helm chart source includes a configurable
fileParameters entry:

helm:
  fileParameters:
    - name: nautobot.config
      path: <global.nautobot.nautobot_config or site.nautobot_worker.nautobot_config>

By default, both values point at
$understack/components/nautobot/nautobot_config.py. Private
deployments can override them to a deploy-repo file. The current
site and global deployments can set:

global:
  nautobot:
    nautobot_config: '$deploy/apps/nautobot-config/nautobot_config.py'

site:
  nautobot_worker:
    nautobot_config: '$deploy/apps/nautobot-config/nautobot_config.py'

ArgoCD reads the selected file content from either the understack or
deploy repo and passes it as the nautobot.config Helm value. The
Nautobot Helm chart then creates a ConfigMap from that content and
mounts it into pods at /opt/nautobot/nautobot_config.py. The
NAUTOBOT_CONFIG environment variable (set in the deploy repo values)
tells Nautobot to load its configuration from that path.

This approach means:

The config file is version-controlled in the selected ArgoCD source
($understack or $deploy)
Changes to the config trigger ArgoCD syncs and pod restarts
automatically (the Helm chart checksums the ConfigMap)
Global Nautobot and site-level workers can share the same private
config file when they need identical mTLS, plugin, and hardening
behavior

Why Not Use the Baked-In Config?

Container images may include their own nautobot_config.py at build
time (e.g. at /opt/nautobot_config/nautobot_config.py). While this
works for simple deployments, it has limitations:

Config changes require rebuilding and redeploying the container image
Different deployments (global vs site workers) may need different
settings (e.g. mTLS, plugin sets) but share the same image
Private deployment-specific settings (plugin credentials, SSO config)
get baked into the image

The Helm fileParameters approach decouples the config from the image.
The image provides the runtime (Nautobot + installed plugins), while
the git-managed config and deploy-repo environment variables control
behavior. This separation allows:

The same container image to be used across global and site deployments
with different configurations
mTLS, SSL, and other connection settings to be conditional on
environment variables rather than hardcoded
Private plugin configuration to be injected via environment variables
in the deploy repo without modifying the public config file

Config Layering

The effective configuration is built from multiple layers:

Nautobot defaults -- from nautobot.core.settings import *
provides all default Django and Nautobot settings
Selected config file -- either the public
$understack/components/nautobot/nautobot_config.py default or a
private deploy config such as
$deploy/apps/nautobot-config/nautobot_config.py. A
deployment-specific config can contain settings such as PostgreSQL
mTLS, Redis mTLS, SSO, production hardening, UNDERSTACK_PARTITION,
UNDERSTACK_SITE, plugin configuration, and logging.
Helm chart env vars -- the base components/nautobot/values.yaml
sets database, Redis, and other connection parameters as environment
variables that the config reads via os.getenv()
Deploy repo values -- site-specific overrides (hostnames, image
tags, credentials) that Helm merges on top of the base values

Important: Helm List Replacement

Helm merges scalar and map values from multiple value files, but
replaces lists entirely. If the base components/nautobot/values.yaml
defines:

nautobot:
  extraVolumes:
    - name: nautobot-sso
      secret:
        secretName: nautobot-sso

And the deploy repo values set:

nautobot:
  extraVolumes:
    - name: mtls-certs
      secret:
        secretName: nautobot-mtls-client

The result is only mtls-certs -- the nautobot-sso volume is
gone. The deploy values must re-include any base volumes they need to
preserve.

Plugin Loading

Plugin changes go in the deploy repo config

For private deployments, all plugin configuration is managed in the
shared deploy config at $deploy/apps/nautobot-config/nautobot_config.py.
The public config does not have any plugin loading mechanism -- it only
has a static PLUGINS_CONFIG entry for vni_custom_model. Do not add
plugins or plugin config to the public config.

Deployment-specific plugin credentials and integration settings can be
injected through the nautobot-custom-env secret, which is referenced by
both the global Nautobot values and the site nautobot-worker component
via extraEnvVarsSecret. Keep secret names and environment variable
names generic in public docs; document provider-specific mappings in the
deploy repo that owns those secrets.

Nautobot Django shell

You can access the Nautobot Django shell by connecting to the pod and running the
nautobot-server shell command.

# find one of the nautobot app pods
kubectl get pod -l app.kubernetes.io/component=nautobot-default
NAME                                READY   STATUS    RESTARTS   AGE
nautobot-default-598bddbc79-kbr72   1/1     Running   0          2d4h
nautobot-default-598bddbc79-lnjj6   1/1     Running   0          2d4h

# use the nautobot-server shell
kubectl exec -it nautobot-default-598bddbc79-kbr72 -- nautobot-server shell

Nautobot GraphQL Queries

Query for all servers in a specific rack

This queries devices with the role server located in rack rack-123
and includes the iDRAC/iLO BMC IP address.

query {
  devices(role: "server", rack: "rack-123") {
    id
    name
    interfaces(name: ["iDRAC", "iLO"]) {
      ip_addresses {
        host
      }
    }
  }
}

Output example:

rack-123-devices-output.json

{
  "data": {
    "devices": [
      {
        "id": "4933fb3d-aa7c-4569-ae25-0af879a11291",
        "name": "server-1",
        "interfaces": [
          {
            "ip_addresses": [
              {
                "host": "10.0.0.1"
              }
            ]
          }
        ]
      },
      {
        "id": "f6be9302-96b0-47e9-ad63-6056a5e9a8f5",
        "name": "server-2",
        "interfaces": [
          {
            "ip_addresses": [
              {
                "host": "10.0.0.2"
              }
            ]
          }
        ]
      }
    ]
  }
}

Some jq to help parse the output:

cat rack-123-devices-output.json | jq -r '.data.devices[] | "\(.id) \(.interfaces[0]["ip_addresses"][0]["host"])"'

Output:

4933fb3d-aa7c-4569-ae25-0af879a11291 10.0.0.1
f6be9302-96b0-47e9-ad63-6056a5e9a8f5 10.0.0.2