Chapter 05 · How to buy
Infrastructure choice

DNS hosting — and the 30-zone management problem.

DNS is the substrate of email authentication. SPF, DKIM, DMARC, MTA-STS, BIMI — every protocol in the authentication layer exists as records in zones served by some DNS provider. The choice constrains the authentication record set, the management workflow, the floor on propagation latency, and at the margin the receiver-side reputation signal. At 30 zones, it also determines whether the operator can actually change a record before the next deployment window closes.

The premise

A sending estate of 30 domains is, in operational terms, 30 DNS zones. Each zone publishes SPF, DMARC, one or more DKIM selectors, MTA-STS, TLS-RPT, and MX records pointing inbound mail at the mailbox provider. A typical sending zone holds 12 to 18 records before any vendor tracking domains, link-shortener CNAMEs, or BIMI assertions — on the order of 400 to 600 records across a 30-zone estate.

DNS is specified by RFC 1035 and is, by design, a globally distributed cache hierarchy. The authoritative nameservers for a zone are operated by whichever DNS provider the registrant delegates control to. Every choice downstream — record format, propagation behavior, API surface, DNSSEC chain-of-trust, observability — is a property of the provider, not of the protocol. The DNS provider decision therefore precedes the authentication-record decisions. A provider without DNSSEC cannot root a chain-of-trust. A provider whose API rate-limits aggressively cannot complete a coordinated 600-record update inside the deployment window. A provider that propagates over a 24-hour TTL cannot support a same-day rollback.

The DNS provider landscape

Three categories meaningfully differ for a cold-sending estate: registrar-bundled DNS, major-cloud DNS, and dedicated DNS services. The categories overlap at the edges, but operational properties cluster tightly enough that category is the correct level at which to decide. Registrar-bundled DNS is the default — included with the registration, pre-delegated at purchase, managed in the same control panel as renewal billing. Major-cloud DNS is offered by the large infrastructure platforms, typically used when the rest of production runs on the same cloud. Dedicated DNS providers are companies whose primary product is authoritative DNS — larger anycast networks, richer APIs, a wider feature set (DNSSEC, ALIAS at the apex, geographic routing, health-checked failover), priced as a metered utility.

Registrar-bundled DNS

Approximately 90% of new domain registrations remain on registrar-bundled DNS for the lifetime of the domain. The case for staying rests on a single property: there is no migration to perform. Nameservers are already delegated, zones already exist, the operator can begin populating records the moment the domain is purchased.

The case against it is the operational floor. Registrar-bundled DNS UIs are built for the single-domain consumer use case — form-per-record, one zone at a time, no bulk import, APIs that either do not exist or throttle aggressively enough to make scripted updates impractical. Adding the authentication record set to a single new zone is a 15-minute click-through; across 30 zones, seven and a half hours of repetitive UI work with the corresponding error rate. Propagation is adequate but uneven — default TTL of one hour or longer, smaller and less geographically distributed authoritative fleet, propagation under load rarely an engineering priority. For a zone that changes once per quarter, none of this matters. For a same-day rollback, it does.

Major-cloud DNS

Major-cloud DNS is the natural choice when the operator already runs production infrastructure on the same cloud. DNS is exposed through the same SDK, IAM model, logging stream, and infrastructure-as-code primitives as the rest of the stack — a 30-zone estate is a single Terraform module managed alongside compute. Pricing is per-zone-per-month plus per-query, low single digits per zone-month; for a 30-zone estate at moderate volume, monthly cost is $20 to $50.

The drawback is that DNS is now one of many services in a cloud account that may have a wider blast radius than the sending estate itself. A misconfigured IAM policy on the account becomes a misconfigured policy on the DNS layer of every sending domain. Diversification is harder to maintain when DNS shares a control plane with the rest of production.

Dedicated DNS

Dedicated DNS providers are the category most operators of a 30+ zone estate eventually adopt. The feature differential against registrar-bundled DNS is consistent: faster propagation on larger anycast networks, sub-minute record-change visibility, DDoS mitigation at the authoritative tier, native DNSSEC, per-record query observability, and an API that does not throttle on coordinated multi-zone updates. Cost is a small monthly fee per zone scaling with query volume — $30 to $80 per month for a 30-zone estate, plus the one-time migration. The migration is mechanical (export zone file, import to new provider, update delegation) but gated on TTL: the maximum TTL of any record being migrated sets the minimum overlap window.

The operational case for dedicated DNS at 30 zones is not the per-feature comparison. It is the API. A correctly configured dedicated provider allows the operator to express the entire estate as code, version-control zone definitions, and apply coordinated changes in a single deployment. That is the precondition for the management workflow below.

The DNS-provider-as-reputation-signal question

A recurring concern is whether the choice of DNS provider impacts deliverability at major mailbox providers. The empirical signal is that it mostly does not. Receiving mail servers perform a DNS lookup to verify SPF, DKIM, and DMARC; the lookup resolves through the global hierarchy and the receiver typically has no visibility into which provider serves the authoritative response.

Two edge cases. First, propagation latency: if the receiver caches a stale answer because the operator’s provider was slow to propagate a corrective update, the stale policy applies for the cached TTL. Second, provider reputation: a small number of free or low-cost providers have been historically associated with abusive infrastructure, and a handful of receivers consult provider-level reputation as a weak signal. Commercial-grade providers across all three categories are not on these lists. Optimize for operational properties; treat reputation as a non-factor at any commercial provider.

The 30-zone management problem

UI-only management of a 30-zone estate is, in production, untenable. A single coordinated DKIM-selector rotation is 30 record additions and 30 deletions, in defined order, with verification between steps — 90 minutes of uninterrupted attention at 90 seconds per record. Performed mid-deployment, under time pressure, switching between 30 nearly-identical tabs, the error rate is observably non-zero. The threshold at which UI-only management becomes the dominant source of operational risk is, in our observation, approximately 8 to 12 zones.

The required capabilities at the 30-zone tier are programmatic record management, an audit log of who changed what when, a dry-run mode previewing proposed changes, and a rollback path that does not depend on the operator remembering the prior value. Any provider category can support this — but only when the operator has actually adopted the API workflow. Registrar-bundled DNS with a click-through UI is, at 30 zones, the modal operator failure.

DNSSEC support and the chain-of-trust complexity

DNSSEC, specified by RFC 4033 through RFC 4035, cryptographically signs DNS records so that a resolver can verify the response was produced by the authoritative nameserver and has not been tampered with in transit. For email authentication, DNSSEC is the substrate beneath any future protocol asserting authenticity of the SPF, DKIM, or DMARC record itself — including DANE for SMTP and certain hardening profiles of MTA-STS. Major-cloud and dedicated DNS providers near-universally support DNSSEC; registrar-bundled DNS is uneven, with a non-trivial fraction supporting it through multi-step configuration or not at all.

The operational complexity is the chain-of-trust handoff between the DNS provider (which signs the zone) and the registrar (which publishes the DS record at the parent). A misaligned handoff — a DS record pointing to a key the provider has since rotated — produces a SERVFAIL response at every DNSSEC-validating resolver, which means the domain is unreachable. Rotation has to be coordinated across two providers with a defined overlap window, and the operator must monitor for DS-record drift on a recurring basis.

Propagation behavior across providers

DNS propagation is not, technically, a property of the authoritative provider — it is a property of the caching resolvers downstream. The authoritative provider publishes a record with a TTL; resolvers cache for up to that TTL; queries return the cached value until it expires. The provider-controlled lever is the default TTL on newly created records and the speed at which the authoritative fleet propagates a change across its own nameservers. Dedicated DNS propagates fleet-internal changes in seconds; registrar-bundled DNS is typically tens of seconds to minutes. The cache-invalidation pattern is dominated by the TTL, not by fleet propagation.

The operational implication for record-set updates during deployment: lower the TTL at least one TTL-period before the window, apply the change inside the window, observe the new value propagating, then optionally raise the TTL back. A typical pre-deployment TTL is 300 seconds; a typical steady-state TTL is 3600 seconds. The operator who omits the pre-deployment reduction discovers, during rollback, that the prior record is cached at a meaningful fraction of resolvers for the full original TTL.

The anycast-network propagation differential

Dedicated DNS providers compete in part on the size and geographic distribution of their anycast network. For inbound queries that resolve SPF, DKIM, and DMARC, the differential is on the order of milliseconds and is operationally invisible at typical mail-volume scale. It becomes visible at two margins: a coordinated DDoS against the nameservers (the larger anycast network absorbs the load across more POPs) and a same-day rollback (the larger network propagates the corrective change marginally faster). For most sending estates, neither matters. For an estate large enough to be a DDoS target, both do.

The single-point-of-failure risk and the diversification pattern

30 sending zones on one provider is a single point of failure. A multi-hour authoritative outage takes down resolution for every sending domain simultaneously — SPF, DKIM, and DMARC fail to resolve, inbound mail returns temporary failures, outbound mail fails alignment. The diversification pattern is to split the estate across two providers. The simplest variant hosts the corporate domain on the provider already in use for corporate infrastructure and the sending domains on a separate dedicated provider. The more rigorous variant delegates each sending zone to a primary and a secondary simultaneously via zone-transfer or API-replication. The cost is doubled DNS spend and the lift of keeping both in sync. For most 30-zone estates, the corporate-vs-sending split captures the majority of the benefit at a fraction of the cost.

DoH, DoT, and receiver-side considerations

DoH (DNS over HTTPS, RFC 8484) and DoT (DNS over TLS) are the encrypted transport protocols for DNS queries. They affect resolvers — the clients performing the lookup — rather than the authoritative provider. Receiving mail servers may resolve authentication records over DoH or DoT for privacy and integrity. For the operator of a sending estate, the relevant property is that the authoritative provider correctly serves responses to upstream recursive resolvers regardless of the receiver-resolver transport. All commercial-grade providers satisfy this. DoH and DoT are not a provider-selection criterion at the authoritative tier.

DNS-zone-level monitoring

A 30-zone estate requires three categories of monitoring, none of which most provider control panels offer by default. Record-change alerting: any modification to any record in any zone produces an alert, regardless of whether it came from the operator’s own API workflow or from a credential-compromised attacker. DNSSEC validation alerting: continuous validation of the chain-of-trust from the parent zone, with an alert on any validation failure. Propagation monitoring: periodic queries against a geographically distributed set of public resolvers to verify the expected record value is served everywhere it should be. The standard implementation is a small external monitoring service polling each zone’s critical records on a five-minute cadence — the authentication record set (SPF, DMARC, MTA-STS), the MX records, and the DNSSEC DS record at the parent. A typical 30-zone configuration tracks approximately 200 records.

Common operator failures observed in production

  • Registrar-bundled DNS at 30+ zones with no API. The operator scales past the UI-management threshold without migrating. The first coordinated rotation produces at least one zone with a mistyped record and a partial-week deliverability incident before the error is identified.
  • No DNSSEC anywhere. The estate is configured without DNSSEC across all 30 zones. The hardening for MTA-STS and any future DANE deployment is unavailable, and the operator has no protocol-level defense against authoritative-response tampering at downstream resolvers.
  • No record-change alerting. A misconfigured CI pipeline, a compromised API credential, or an erroneous manual edit modifies a critical record — typically the SPF or DMARC record at the corporate domain — and the change is detected days or weeks later when a deliverability metric finally reflects it.
  • TTL too long during deployment. The operator changes a DKIM selector or rotates an SPF include without lowering the TTL beforehand. A same-day rollback is not actually possible.
  • No diversification. The entire 30-zone estate, plus the corporate domain, plus production infrastructure DNS, all run on the same provider. A multi-hour outage simultaneously suspends every revenue-impacting flow the operator owns.
  • DNSSEC DS-record drift. The DNS provider rotates the zone-signing key on schedule, the registrar’s DS record is not updated to match, and the domain begins returning SERVFAIL at every DNSSEC-validating resolver. The failure is silent at non-validating resolvers, which delays detection.

Pre-deployment checklist

  • DNS provider selected, with the API or infrastructure-as-code workflow validated against a non-production zone first
  • Nameserver delegation at the registrar pointing to the chosen provider, propagated to the parent zone
  • DNSSEC enabled at the provider, DS record published at the registrar, chain-of-trust validated against a public DNSSEC validator
  • Default TTL set to support the rollback target — typically 300 to 600 seconds for sending zones
  • API credentials scoped to minimum necessary permissions, stored in secret-management infrastructure rather than a shell history file
  • Zone-level monitoring against the authentication record set, with alerts routed to a channel a human reads within one business day
  • Corporate domain DNS on a provider separate from sending-domain DNS, or a documented justification for accepting the single-provider risk
  • A documented runbook for a coordinated DKIM rotation, tested end-to-end against a staging zone before any production rotation

Where this fits

DNS hosting is the infrastructure layer beneath authentication. The decisions above — provider category, API posture, DNSSEC, monitoring, diversification — determine whether the records described in the cold email infrastructure reference can be created, rotated, and verified at the cadence the protocol demands.

For a 30-zone sending estate, the modal correct configuration is: corporate domain on the provider already in use for corporate infrastructure; sending domains on a dedicated DNS provider with API-first management, DNSSEC enabled, and zone-level monitoring; all 30 zones expressed as code, version-controlled, with a documented rollback path. Total operational cost is approximately $30 to $80 per month, plus the one-time migration. Every subsequent change to the authentication record set becomes a coordinated, auditable, reversible operation rather than 30 sequential click-throughs.

The next chapter moves up the stack to the mailbox-provider decision: per-mailbox cost, the SMTP-layer feature set, the sending-volume ceiling per account, and the deliverability reputation differential at the major receivers.

Skip the setup

Allston Labs operates the full sending estate as a service.

We provision domains, configure the entire authentication record set, run warmup, and monitor reputation across providers. The stack lives under your entity. The engineer on call lives in your Slack.