Chapter 05 · Build the list
Cohort design

Segmentation architecture — the per-segment reply-rate lift.

A single message variant sent across a heterogeneous prospect list converges, in steady state, to the reply rate of the lowest-yield cohort on the list. Segmentation breaks the list into addressable cohorts and assigns each cohort a variant calibrated to its reply-rate ceiling. The compounded per-cohort lift is the difference between an outbound motion that produces pipeline and one that produces unsubscribes.

Why a single variant collapses

A list of 5,000 prospects assembled under a single firmographic filter contains prospects in five or six distinct buyer states: never heard of the category, recognizes the problem but has not searched, actively evaluating vendors, previously chose a competitor, previously chose nothing. A single message variant frames a single buyer state. The message that lands for the actively-evaluating prospect reads as presumptuous to the unaware prospect; the message that educates the unaware prospect reads as condescending to the evaluating one.

The empirical consequence is a list-wide reply rate that sits at the floor of the constituent cohorts rather than the average. A campaign with a 2.1% reply rate across 5,000 prospects, broken into five cohorts and re-messaged with cohort-specific variants, typically produces per-cohort reply rates in the range of 1.2% to 8%. The aggregate after re-cohorting commonly lands between 3.5% and 5%, a 1.7-2.5x lift without changing volume, sender, or offer.

The dimensions that segmentation operates on

Six dimensions are independently load-bearing for outbound. They are not orthogonal — a prospect carries a value on each — but the empirical reply-rate differential is large enough on each dimension that ignoring any one of them is a measurable cost.

ICP attribute

The firmographic and technographic attributes from the upstream chapters: industry vertical, headcount band, revenue band, tech stack, business model. The reply-rate differential across ICP cohorts on a single campaign is typically 2-4x — a vertical-specific opener with vertical-specific reference customers outperforms a generic opener by a factor that exceeds the rounding error on any other lever.

Intent signal

External indicators that the prospect is in an active buying motion: a job posting that implies the pain, a tech-stack change that implies a transition window, a funding event that implies budget, a leadership change that implies a re-evaluation. Intent-flagged cohorts typically reply at 3-5x the no-signal baseline on the same campaign. Deep treatment is Chapter 06.

Stage of awareness

Five operationally distinct states: unaware, problem-aware, solution-aware, vendor-aware, and evaluating. The message frame inverts across the stages: the unaware prospect needs a problem-naming opener; the evaluating prospect needs a differentiation-claim opener. A single variant cannot serve both ends of this range.

Relationship temperature

The strength and recency of any prior touch: cold, warm-network (mutual connection or shared institution), prior-conversation (a prior thread that did not convert), and lost-deal (the prospect previously evaluated and chose not to buy). The reply-rate differential along this dimension is the largest of the six.

Role and seniority

Functional role and seniority band: practitioner, line-manager, director, VP, C-level. The reply-rate differential is moderate (1.5-2x across adjacent bands) but the message-frame differential is large — the practitioner replies to a tactical-pain opener, the VP to a strategic-outcome opener, the C-level to a board-level-priority opener.

Geography

Time zone, language, regulatory regime. The cleanest operational impact is on send time — a campaign sent at 9am sender-local time achieves a 2-3x open-rate differential across the time-zone bands of the recipient list — but the regulatory-regime split also matters for message constraints.

The empirical per-segment differential

A well-instrumented multi-cohort campaign, measured at the per-segment reply rate, typically produces a 3-7x ratio between the highest-yield and lowest-yield cohort on the same underlying offer. The shape of this differential is consistent across categories:

  • Top cohort: warm-network plus high-intent signal plus high-seniority match. Reply rate typically 8-15% on a well-written variant.
  • Median cohort: cold plus ICP-matched plus role-matched. Reply rate typically 2-4%.
  • Bottom cohort: cold plus broad-ICP plus role-mismatched or stage-mismatched. Reply rate typically 0.5-1.5%.

The operational implication: the bottom cohort consumes the same per-prospect contact spend as the top cohort and returns 5-10x less pipeline per dollar. Suppressing the bottom cohort entirely is, in most production motions, the first lever the operator pulls after measurement reveals the differential. The top cohort, conversely, justifies a 3-4x increase in per-prospect research depth.

The cohort-size threshold

A cohort is operationally useful only if it is large enough to (a) absorb a per-cohort message variant without compressing into a single message per prospect, and (b) produce a reply-count signal stable enough to A/B test future variants within it. The empirical minimum is 50-100 prospects per cohort for a 4-touch sequence on a 4-week campaign window.

Below 50, the cohort produces 0-3 replies in the campaign window — a sample size at which the difference between a 4% and 8% reply rate is indistinguishable from noise. Above 100, the reply count produces a stable per-cohort signal within two campaign cycles. The operator should presume any cohort below the threshold is unmeasurable and treat its variant as a hypothesis, not a tuned message.

The over-segmentation failure mode

The opposing failure is the operator who splits a 5,000-prospect list into 47 cohorts, each requiring a per-cohort variant, cadence, and reply analysis. The operational lift required to write, test, and maintain 47 variants exceeds, in steady state, the conversion-rate lift those variants produce relative to a 6-cohort architecture covering the same dimensions.

The diminishing-returns curve becomes visible at approximately 8-12 cohorts for a list under 10,000 prospects. Beyond that, each additional cohort produces a sub-50-prospect partition the operator cannot measure, and the per-variant authoring cost increases linearly while the per-variant pipeline lift becomes invisible. The correct response to a list that appears to demand 47 cohorts is to identify the two or three dimensions that account for the largest empirical differential and segment on those.

Stage-of-awareness segmentation

The stage-of-awareness dimension deserves treatment in isolation because the message-frame differential across stages is larger than on any other dimension and the failure mode is the most common.

StageOpener frameBody frameCTA frame
UnawareProblem-naming, third-party observationWhat this looks like in practiceResource share, no meeting ask
Problem-awarePain-quantificationWhat others in the cohort have triedSoft meeting offer, optional
Solution-awareCategory-positioningWhat distinguishes a serious solutionMeeting offer, with framing
Vendor-awareDifferentiation claimSpecific contrast with named alternativesDirect meeting ask
EvaluatingDecision-support frameReference customer, proof artifactDemo or trial ask, time-bound

The aggregate reply-rate differential when these five framings are correctly assigned versus a single solution-aware framing across the full list is typically 2-3x, with the largest absolute gain coming from the unaware and problem-aware cohorts — populations the single-variant operator typically writes off as low intent and that are, in practice, mis-framed.

Relationship-temperature segmentation

The temperature dimension is the dimension on which the per-cohort reply-rate differential is largest, and the cohort the operator is most likely to under-index on.

Warm-network

The warm-network cohort — prospects with a mutual connection, a shared institution, or a documented prior touch from a colleague — is in practice the highest-yield segment of any outbound motion. Reply rates of 15-30% are typical, against a cold-list baseline of 2-4%. The 5-10x differential is the central economic argument for spending the per-prospect research time required to identify warm paths into 200-500 accounts rather than running a 5,000-prospect cold list at the same total sender capacity.

A 200-prospect warm cohort at 20% reply rate produces 40 conversations; a 5,000-prospect cold cohort at 2.5% reply rate produces 125 conversations. The cold list looks larger in pipeline terms only until the per-conversation conversion-to-meeting and meeting-to-opportunity rates are factored in, at which point the warm cohort typically wins on opportunity count as well.

Prior-conversation and lost-deal

The prior-conversation cohort — prospects who replied to a sequence in a prior quarter but did not convert — is the second-highest-yield cohort in most motions. Re-engagement reply rates of 8-15% are typical. The frame acknowledges the prior thread, references a concrete update since the prior touch, and proposes a lower-friction next step than the original ask.

The lost-deal cohort — prospects who evaluated the offer in a prior cycle and chose either a competitor or nothing — is the cohort most operators ignore and the cohort with the most durable re-engagement signal. The empirical pattern is that 15-25% of lost deals re-evaluate within 12-18 months, typically driven by a triggering event (the competing vendor failed, the internal champion changed roles, the budget cycle reopened). A lost-deal cohort messaged on a 90-day cadence with a check-in frame produces reply rates in the 10-20% range and conversion-to-meeting rates 2-3x the cold baseline.

Intent-signal segmentation

Intent-signal segmentation — the use of external buying-intent indicators to cohort the list — is treated in depth in Chapter 06. The relevant observation here is that intent signals function operationally as a cohort partition, not as a sequence trigger: the intent-flagged subset of the list is messaged as its own cohort, with its own variant, its own cadence, and its own measurement track, rather than being interleaved into the general send queue.

Per-segment send cadence

Cadence is a per-cohort decision. High-confidence segments — warm-network, prior-conversation, intent-flagged, evaluating-stage — tolerate and benefit from faster cadences: a 4-touch sequence over 14 days produces a higher reply count than the same sequence over 28 days for these cohorts, because the implicit signal density justifies the higher sender presence.

Low-confidence segments — cold, ICP-broad, unaware-stage — produce the inverse result. A 14-day cadence on these cohorts compresses the touches into a window the prospect reads as pressure, and the reply rate falls below the same sequence stretched to 42 days. The per-cohort cadence is set as a function of expected reply rate and re-tuned on the per-cohort unsubscribe rate. An unsubscribe rate above 1% per touch on a cold cohort signals the cadence is too compressed.

Per-segment volume allocation

The operational pattern for distributing weekly send volume across cohorts is to allocate volume proportional to expected pipeline per send — reply rate times reply-to-meeting rate times meeting-to-opportunity rate times opportunity ACV — rather than to cohort size.

A concrete allocation pattern for a sender with 2,000 weekly sends of capacity, three cohorts, and measured per-cohort yields:

CohortReply rateMeeting rateACVAllocation
Warm-network20%40%$80k15% (300 sends)
Intent-flagged8%30%$60k35% (700 sends)
Cold ICP-matched3%20%$50k50% (1,000 sends)

The warm-network cohort gets the smallest share of volume because it is supply-constrained; the cold cohort gets the largest share because it is supply-abundant. The allocation is re-tuned monthly off the actual per-cohort funnel data; allocations set at campaign launch and never revisited are almost always over-indexed on cold and under-indexed on intent.

The segment-attribution problem

When replies come in, the operator faces a three-way attribution problem: cohort effect (the prospect was in the right state to reply to anything), copy effect (the variant resonated), or channel effect (the touch landed at the right channel and time). The three are confounded by default, and the standard response — reading the per-cohort aggregate reply rate as cohort-effect — overstates the cohort contribution.

The instrumented approach is to A/B test copy within cohort. A two-variant test inside the warm-network cohort isolates the copy effect; a two-cohort comparison holding copy constant isolates the cohort effect. Most operators do neither and conflate the two. The instrumentation pattern is described in Chapter 08.

Where this connects

Segmentation depends upstream on the ICP definition (Chapters 01-03) and the prospect-graph construction (Chapter 04), and feeds downstream into intent (Chapter 06), enrichment (Chapter 07), and operational list management (Chapter 08). Intent signals are the primary cohort partition on the buyer-state dimension; enrichment determines the data quality on the ICP and role-seniority dimensions; list management keeps the cohort assignments fresh as prospects change roles and prior conversations age out.

Common operator failures

  • Under-segmentation. The operator sends a single variant to the full list and reports the aggregate reply rate. This is the dominant failure mode and accounts for most outbound motions producing reply rates below 2%.
  • Over-segmentation. The operator splits the list into 30+ cohorts, cannot maintain the per-cohort variants, and ends up sending a generic message to most cohorts with cohort-specific subject lines. The operational complexity rises with no measurable reply-rate gain.
  • Cohort assignment without per-cohort messaging. The operator builds the cohort taxonomy in the CRM but sends the same message variant across cohorts. The data exists; the operator never closes the loop into the campaign layer. The measurement infrastructure produces post-hoc per-cohort reply rates that the operator reads as cohort-effects, when in practice they are noise on a constant variant.
  • Per-cohort messaging without per-cohort measurement. The operator writes five variants and sends them to five cohorts but never measures per-cohort reply rate, opt-out rate, or meeting rate. The variants are tuned on the operator's intuition rather than on the data, and the per-cohort allocation is set once and never re-tuned.
  • Treating the warm cohort as a side project. The operator concentrates effort on the cold cohort and treats warm-network as ad-hoc outreach outside the sequence tool. The warm cohort produces a disproportionate share of pipeline and is left unmeasured, so its per-touch yield cannot be referenced in the volume-allocation decision.
  • Ignoring the lost-deal cohort. The operator suppresses lost deals from the active list and never re-engages them. A cohort with a 15-25% re-evaluation rate over 12-18 months is left to a competitor by default.
  • Static cohort assignments. The operator assigns cohort at list import and never re-runs the assignment. Prospects move stages, intent signals decay, and the taxonomy decays into noise over a quarter.

Pre-deployment checklist

  • The list is partitioned into between three and eight cohorts, each above the 50-prospect floor
  • The dimensions selected for segmentation are documented and justified by either prior data or stated hypothesis
  • Each cohort has a distinct message variant, not a single template with token substitution
  • The per-cohort send cadence is set as a function of cohort confidence and is distinct across cohorts
  • The weekly send volume is allocated across cohorts on expected-pipeline-per-send, not on cohort size
  • The measurement track records reply rate, meeting rate, and opt-out rate per cohort, not at the aggregate level only
  • An A/B test is running within at least one cohort to isolate copy effect from cohort effect
  • The cohort assignment is scheduled for re-run on a defined cadence — weekly for intent-flagged, monthly for ICP-attribute, quarterly for relationship-temperature
  • The warm-network and lost-deal cohorts are explicitly named, sized, and assigned variants — not left to ad-hoc handling outside the sequence tool

Where segmentation fits

Segmentation is the layer at which upstream list-construction first translates into addressable campaign architecture. The prospect graph (Chapter 04) is the substrate; segmentation is the partition; intent, enrichment, and list management are the maintenance disciplines that keep the partition aligned with reality. Without segmentation, the downstream campaign layer operates on a single undifferentiated list and converges to the lowest-common-denominator reply rate. With it, each cohort is tuned to its own reply-rate ceiling, and the compounding per-cohort lift becomes the most reliable conversion-rate lever in the outbound stack.

Skip the setup

Allston Labs operates the full sending estate as a service.

We provision domains, configure the entire authentication record set, run warmup, and monitor reputation across providers. The stack lives under your entity. The engineer on call lives in your Slack.