Chapter 03 · Classification and routing

Operational discipline

Triage workflow — the four-hour window and the per-category SLA.

The four-hour discipline is the single highest-leverage rule in reply handling. Teams that hold the line convert positive outbound replies into booked meetings at 15%+. Teams that miss it convert the same replies at under 5%. Same campaign, same prospects — the only difference is whether a human touched the reply before intent decayed. For pure inbound, the bar is even steeper: 5 minutes, with an 80% vs 20% conversion gap.

TL;DR

Inbound leads: 5-minute response → 80% convert to meetings. 30+ minutes → 20%. A 4x swing on one variable.
Outbound positive replies: 4-hour window during business hours → 15% reply-to-meeting. 8+ hours → under 5%.
Be a scientist, not a lawyer — classify what the reply actually says, not what you want it to mean. Skip this and your dashboard reads green while the funnel rots.
Per-category SLAs are not optional: positive 1h, objection 4h, soft-pass 24h, auto-responder 24h, negative 72h.
The metric that matters is p95, not average. p95 inside SLA = real discipline. p50 inside SLA with a long tail = your highest-value reply is sitting for 38 hours.

The premise — speed is a math problem, not a posture

Most teams treat reply speed as a vibe. Disciplined teams treat it as a decay curve. A positive reply at hour zero is not the same asset twenty-four hours later — it's a fundamentally different opportunity with a lower terminal value. Reply triage is perishable inventory, and most outbound teams have no operational concept of perishability.

The pattern: a positive reply touched within the first hour converts to a held meeting at 60%+. Same reply at hour 24 converts at ~25%. At hour 72, ~10%, and below that the curve flattens to background noise. The prospect hasn't changed. Their intent hasn't vanished. What's decayed is the recency — your original message is no longer in working memory and the responder has to rebuild context cold.

For inbound leads — anyone who took an action that says “I want to talk now” — the curve is even steeper. The 5-minute response window converts at roughly 80%; wait longer and you drop to about 20%. That's the cleanest single benchmark in B2B sales operations.

The four-hour window isn't a target. It's the inflection point where the outbound-reply curve starts to fall hard. Hold first-touch on positive outbound replies inside four hours, every business day, and you've fixed the single most expensive gap in the stack.

The per-touch decay curve

The numbers are worth stating concretely because the operational decisions downstream — staffing, on-call rotation, automation investment — all calibrate against them.

Time to first touch	Positive-reply → meeting conversion	Operational state
Within 1 hour	~60%	Live triage, peak window
1–4 hours	~45%	Acceptable business-hours posture
4–24 hours	~25%	Overnight rollover, recoverable
24–72 hours	~10%	Weekend gap, mostly lost
72+ hours	<5%	Pipeline-equivalent to no reply

The fall from 60% to 25% across the first day is the single largest unit-economic lever in the entire reply-handling stack. A team that moves its median first-touch from hour 18 to hour 3 has, in a single operational change, roughly doubled the meeting throughput of the same campaign volume — with no incremental cost upstream, no new domains, no new sequences, no new copy.

Scientist not lawyer — the classification call drives the SLA

Before triage even starts, the reply has been classified (Chapter 1). The discipline at this step is to read what the reply actually says — pull-language vs polite-language, named-objection vs vague-deferral — and route on the empirical pattern, not on what you want the reply to be. The 4-hour SLA only earns its conversion lift if the reply you're racing to handle is actually positive intent. Mis-classify a soft-pass as positive and you've spent your scarcest triage budget on a contact that converts at 6% no matter how fast you respond.

The four-hour window — what it actually means

The four-hour SLA is business-hours-aware, time-zone-aware, and on-call-rotation-backed. Reading it as a flat “reply within four hours of receipt” is the most common misreading — a reply landing at 2am local should not trigger a clock that expires at 6am.

The operationally correct definition: a reply received during the prospect's business hours has a four-hour-of-prospect-business-hours clock attached. A reply received outside business hours has a clock that begins at the next business-hours-window open. The relevant time zone is the prospect's, not the responder's — a Pacific-time responder touching an East Coast reply has a clock that closes three hours earlier than they instinctively expect.

The team that ignores this produces a dashboard reading "average response time 3.8 hours" while their East Coast prospects experience 7 hours. The dashboard is not wrong. The dashboard is measuring the wrong thing.

Per-category triage urgency

Treating all replies as equally urgent is the second most common failure mode. The five reply categories (Chapter 1) carry materially different per-category SLAs because the per-category decay curves are materially different.

Category	SLA	Rationale
Positive intent	Within 1 hour	Steepest decay; meeting-booking window
Objection with signal	Within 4 hours	Active engagement, response library applies
Soft-pass	Within 24 hours	Nurture assignment, no live decay
Auto-responder	Within 24 hours	Date-update on CRM, requeue downstream
Negative	Within 72 hours	Suppression-list update only

The auto-responder SLA is not zero because the work is small: parse the return-by date, update the contact, requeue the sequence. The negative SLA is not zero because the work — suppression-list update — has no per-prospect urgency, only an aggregate compliance requirement before the next sequence send.

The positive SLA, in contrast, is everything. Every operational design downstream — the routing channel, the on-call rotation, the AE handoff — is a mechanism for protecting the one-hour-on-positive-replies number.

The on-call rotation pattern

Human triage requires human availability. This is the operational reality that operators with otherwise sophisticated outbound stacks routinely fail to plan for. The team-size-to-rotation pattern, in the configurations we have observed:

1–2 person team. No rotation; one person is always on. The SLA holds during their working hours and silently breaks outside them. The acceptable posture is to disable sends in time zones the team cannot cover, not to pretend the coverage exists.
3–5 person team. Daily rotation; one person owns the inbox each day, with a backup. The relevant discipline is that the on-call has no other deep-work assignments — interrupt-driven work and creative work do not share a person without one degrading.
6+ person team. Weekly rotation with split-shift coverage for the relevant time zones. A published schedule with named backups and an explicit escalation path for time-zone hand-offs.

The error mode is the team that announces a four-hour SLA but has no named on-call. The SLA holds opportunistically and the variance across days is enormous — median of 2 hours and a 95th-percentile of 38 hours. The 95th percentile is the number that matters; the high-intent reply that produced the 38-hour wait was a $40k opportunity.

The AE-vs-SDR triage split

The routing architecture (Chapter 2) determines which seat receives each category. The triage discipline reinforces it.

Positive replies → AE. The meeting-booking handoff is direct and immediate. Routing through an SDR for qualification first adds, in practice, four to eighteen hours of latency and a measurable drop in meeting-booking rate. The exception is a high-volume inside-sales motion where the SDR has full meeting-booking authority.
Objections → SDR. The per-objection response library (Chapter 4) lives with the SDR. An AE responding to objections produces both worse responses and a downstream pattern in which the AE's working hours are consumed by reply-triage rather than discovery and close.
Soft-pass → SDR. Nurture assignment (Chapter 6) is SDR work. Tag the contact, set the next touch date, return to the queue.

Multi-channel triage

A reply does not always arrive on the channel that sent the original message. The mature motion produces replies across email, LinkedIn, and inbound phone callbacks, and the triage discipline has to span all three.

Email replies. The default channel. The four-hour SLA applies directly.
LinkedIn replies. The timeline is shorter, not longer — message activity is itself a high-recency signal, and a reply unanswered for 24 hours competes with active feed-scrolling attention. Treat LinkedIn positive replies with a two-hour SLA, not four.
Phone callbacks. A returned voicemail or inbound call is the strongest intent signal in the reply stack. The SLA is the next available calendar opening of the recipient, not a clock-based number. The failure mode is the AE returning a callback four hours later when the prospect has already been pitched by a competitor.

Response-time SLA enforcement

A per-category SLA without a metric and a dashboard is a slogan, not a discipline. The minimum-viable dashboard captures: time-to-first-touch per reply, broken out by category, displayed as a distribution rather than an average. The 95th-percentile is the operationally relevant number; the average is misleading.

The review cadence is weekly. The team meeting that opens with the prior week's triage distribution — number of replies per category, percentage within SLA, identification of the specific replies that breached, the named cause of each breach — is the meeting that holds the discipline. The team meeting that opens with a green "average response time" number is the meeting that performs discipline without producing it.

Per-segment triage prioritization

When two positive replies land at the same minute, which one gets touched first matters. The prioritization rule we observe operating cleanly in production:

Named accounts — the high-ACV, strategically tagged target list — go first, every time, with no exception.
ICP-fit prospects — the prospects matching the documented ideal-customer profile but not on the named list — go second.
Low-tier prospects — replies from outside the ICP, often from sequences run as broader experiments — go last and may legitimately fall outside the four-hour SLA without operational concern.

The operator who treats all positive replies as identical produces, by random ordering, a dashboard in which the named-account 95th-percentile and the low-tier 95th-percentile look similar. The named-account 95th-percentile is the only one that affects revenue at scale; the low-tier number is operational vanity.

The lunch-and-weekend coverage problem

The four-hour SLA breaks predictably at the lunch hour, on Fridays after 4pm, and across the entire weekend. These are the gaps where the dashboards lie and the revenue leaks.

The lunch problem resolves with rotation discipline: the on-call does not take lunch unattended. The Friday-afternoon problem resolves with an explicit hand-off through the prospect's business day, not the responder's. The weekend problem is harder. A positive reply landing at 4pm Friday Eastern, without weekend coverage, has its clock running for 64 hours before a human touches it — per the decay curve, roughly a tenth of the asset it was when it landed.

The mature options: (a) weekend on-call with a reduced SLA — eight hours rather than four — explicitly published; (b) automated acknowledgement that establishes the conversation and buys time; (c) pausing Friday-afternoon sends in time zones the team cannot cover over the weekend. The unmature option, which we see most frequently, is to do none of these and accept the weekend gap as inevitable while continuing to send into it.

The volume-overload pattern

Reply volume scales with campaign volume and occasionally spikes above the team's steady-state triage capacity. The operational requirement is a load-shedding policy decided in advance, not improvised under load.

The order of preservation, when capacity is exceeded: named-account positive replies first, ICP-fit positive replies second, named-account objections third, ICP-fit objections fourth. Soft-pass replies, auto-responders, and negative replies are shed under load — they accumulate in a backlog queue and are processed in batch when capacity returns. The acceptable backlog age for these categories is 72 hours; beyond that, the negative-reply backlog risks compliance exposure on the next sequence send.

A team operating at 80%-of-capacity steady-state has the headroom to absorb spikes. A team operating at 110% has a four-hour-SLA dashboard that reads green on the average and breaches catastrophically at the 95th-percentile every time campaign volume increases. The capacity calculation is upstream of the triage discipline.

The AI-augmented triage pattern

An LLM in the triage loop, configured to read inbound replies and draft response candidates for human approval, is the highest-leverage operational change we have observed in the last twelve months. Every inbound reply is auto-classified and a response is drafted; the on-call reviews, edits, and sends. Median time-to-first-touch falls from 90 minutes to 15. The four-hour SLA holds at the 99th-percentile, not just the median.

The cost is non-trivial — token spend per reply runs in the low cents, response-quality QA requires a weekly review of sampled drafts, and the team has to develop the discipline of editing rather than rubber-stamping. The naive deployment, in which the LLM both drafts and sends without review, produces a measurable conversion penalty — the drafts are competent but generic, and the prospect can tell. The augmentation pattern, where the human approves every send, captures most of the speed benefit without the quality penalty.

The time-to-first-touch metric

The metric definition matters. Time-to-first-touch is measured from the timestamp of the inbound reply receipt to the timestamp of the outbound human-authored response, broken out by reply category and by named-vs-ICP-vs-low-tier segment. The aggregate distribution displayed on the dashboard:

Median (p50) — operational health
p75 — the visible variance
p95 — the breach indicator
p99 — the named-account incident detector

A team holding p95 inside the per-category SLA is operating with discipline. A team holding only p50 inside the SLA is operating on average and breaching specifically. The dashboard that surfaces only p50 is the dashboard that produces complacent quarterly reviews.

Common operator failures observed in production

No on-call coverage. The SLA is announced; no one is named; the four-hour window holds opportunistically. Most common in 3-to-5-person teams.
No per-category SLA. All replies share a single clock; positives drag through the same queue as auto-responders, producing wasted urgency on low-value categories and missed urgency on high-value ones.
No metric tracking. The team believes it operates inside the SLA; the dashboard does not exist; the belief is unfalsifiable.
Treating all replies as equal urgency. The SDR responding to a negative reply within five minutes while a positive waits four hours. The decay curve says the trade-off is roughly twenty-to-one against the SDR.
No escalation path for after-hours. The reply landing at 7pm Friday Pacific has no defined home — not the AE's problem until Monday, not the SDR's, not the on-call's because the rotation ended at 6pm. The reply rots.
Time-zone confusion. The responder's clock, not the prospect's, drives the dashboard. The team believes it operates at 3-hour median; East Coast prospects experience 7.
Capacity-blind growth. Campaign volume scales 2x, triage capacity scales 1x, the SLA degrades silently until the quarterly pipeline review surfaces the conversion drop.

Pre-deployment triage checklist

Per-category SLAs documented (positive 1h, objection 4h, soft-pass 24h, auto-responder 24h, negative 72h)
On-call rotation published with named owners and named backups
Time-zone-aware SLA-clock definition documented and implemented in the dashboard
AE-vs-SDR category split agreed and reflected in the routing rules (Chapter 2)
Multi-channel coverage — email, LinkedIn, phone — assigned to named owners with channel-specific SLAs
Time-to-first-touch dashboard live, surfacing p50/p75/p95/p99 by category and segment
Weekly triage review on the team calendar with a standing agenda
Weekend and after-hours policy documented — coverage rotation, reduced SLA, or explicit acceptance of the gap
Load-shedding policy documented for capacity-overload conditions
Named-account vs ICP-fit vs low-tier segmentation reflected in the dashboard breakdown
If using AI-augmented triage: human-approval gate enforced, weekly draft-quality QA scheduled

Where this fits

Triage is the operational layer that converts classification (Chapter 1) and routing (Chapter 2) into a held discipline. A correctly classified reply, routed to the correct channel, that no human touches inside the SLA, is operationally equivalent to a misclassified reply routed to nowhere — the conversion outcome is the same. The next two chapters cover what the human does once they touch the reply: objection handling (Chapter 4), which converts active resistance into a conversation, and meeting booking (Chapter 5), which converts a positive reply into a held calendar slot.

The four-hour discipline is the single number the operator should defend with the most institutional resistance. Every other reply-handling investment compounds against it. Without it, every other investment is lower-leverage than fixing it would have been.

Related chapters

Reply Routing Architecture — Slack vs CRM — the infrastructure that enables the four-hour SLA.
How to Classify Cold Email Replies — the per-category routing rules the SLA varies on.
How to Handle Cold Email Objections — what to do once a reply is triaged into the objection category.
Meeting Booking Mechanics — Reply to Calendar to Show — what happens to positive replies after triage.

Was this guide useful?

Skip the setup

Allston Labs operates the full sending estate as a service.

We provision domains, configure the entire authentication record set, run warmup, and monitor reputation across providers. The stack lives under your entity. The engineer on call lives in your Slack.

See the service →Book a call →