Use when designing a Service Cloud solution end-to-end: channel strategy (phone, email, chat, messaging, social), routing model (queue-based vs skills-based Omni-Channel), knowledge strategy, entitlement and SLA enforcement, Einstein Bot / Agentforce deflection, and integration points. Triggers: service cloud architecture, case routing design, omni-channel strategy, contact center design, channel strategy, knowledge deflection, service console architecture. NOT for individual feature configuration (use admin/case-management), NOT for Einstein Bot conversation design (use agentforce/einstein-bot-architecture), NOT for telephony CTI implementation details.
PranavNagrecha10 スター2026/04/06
職業
カテゴリ
アーキテクチャパターン
スキル内容
Use this skill when designing an end-to-end Service Cloud solution. It activates when a practitioner needs to make architectural decisions about channel strategy, routing model, knowledge deflection, SLA enforcement, and integration points across a contact center or customer service operation. It produces a structured architecture covering the full case lifecycle from creation through routing, assignment, resolution, and closure.
Before Starting
Gather this context before working on anything in this domain:
What channels does the business require today and in the next 12 months? Phone, email, web-to-case, chat, messaging (in-app, web, WhatsApp, SMS), social, self-service portal. Each channel has distinct routing, licensing, and integration implications.
What is the most common wrong assumption? That Omni-Channel is just a queue — it is a unified routing engine with two distinct models (queue-based and skills-based) that behave very differently at scale. Choosing the wrong model creates routing bottlenecks that are expensive to fix post-go-live.
What limits are in play? Omni-Channel has a maximum of 200 routing configurations per org. Skills-based routing supports up to 100 skills per routing configuration. Service Console performance degrades with more than 10-12 open tabs. Messaging sessions have concurrency limits per agent tied to Omni-Channel capacity. Knowledge article search is bound by SOSL query limits (2,000 results per query).
関連 Skill
Core Concepts
Omni-Channel Routing Engine
Omni-Channel is the unified routing engine for Service Cloud. It replaces legacy queue assignment rules for real-time work items (chat, messaging, phone) and can optionally route cases. Two routing models exist:
Queue-based routing: Work items land in queues. Omni-Channel pushes items from queues to agents based on capacity and queue priority. Simple to configure. Best when agent skill overlap is high and work is relatively homogeneous.
Skills-based routing: Work items carry required skill attributes. Omni-Channel matches items to agents whose skill profile satisfies the requirements. Supports skill weighting and minimum skill levels. Best when agent specialization matters — language, product line, certification level.
Queue-based routing is configured declaratively. Skills-based routing requires the Skills-Based Routing managed package and additional setup for skill definitions, service resources, and routing configurations. Both models respect agent capacity, which is expressed as a numeric value per channel (e.g., 1 for phone, 3 for chat, 5 for messaging).
Service Console as the Agent Workspace
The Service Console is a Lightning App optimized for high-throughput agent work. It uses a tabbed workspace with split views, utility bar components (Omni-Channel widget, History, Macros, Knowledge sidebar), and subtab navigation. Architectural decisions that affect Console performance:
Component count per page: More than 10 components on a single Console page causes measurable load time increase. Lazy-load components that are not needed on initial render.
Synchronous Apex on page load: Any Apex called synchronously during Console tab open blocks the UI thread. Use Lightning Data Service or wire services instead where possible.
Utility bar density: Each utility bar component consumes memory for the lifetime of the Console session. Limit to 6-8 utilities.
Knowledge as a Deflection Engine
Salesforce Knowledge serves two architectural purposes: agent-assist (suggesting articles during case work) and self-service deflection (exposing articles in Experience Cloud sites, Einstein Bots, and Agentforce). Architectural decisions:
Data categories control article visibility by audience (internal agents, partners, customers) and topic taxonomy. Design the category hierarchy before authoring content — restructuring later requires re-classifying every article.
Article types (record types on the Knowledge object) separate content by format: FAQ, troubleshooting guide, product documentation. Each type can have a distinct page layout.
Search optimization matters: Knowledge search uses SOSL under the hood. Articles need keyword-rich titles, summaries, and structured content to surface correctly. Einstein Search Answers (AI-powered) requires Lightning Knowledge and a minimum article corpus.
Entitlements and SLA Enforcement
Entitlements define what support a customer is entitled to (response time, resolution time, available channels). Milestones within entitlement processes enforce SLAs by tracking time-based deadlines and triggering escalation actions when milestones approach violation or are violated. Architectural considerations:
Entitlement processes support business-hours-aware milestone tracking.
Milestone actions fire at configurable percentage thresholds (e.g., warn at 50%, escalate at 75%, violate at 100%).
Entitlements can be auto-applied via lookup to the Account, Asset, or Contact on case creation.
Common Patterns
Pattern: Tiered Omni-Channel Routing with Overflow
When to use: Contact centers with specialized teams (Tier 1 general, Tier 2 product-specific, Tier 3 engineering) where work should route to the best-fit agent but must overflow to a generalist pool if wait time exceeds a threshold.
How it works:
Enable skills-based routing. Define skills for product line, language, and tier level.
Create routing configurations with primary skill requirements (e.g., Product=Billing, Tier=2).
Set the routing configuration timeout (e.g., 120 seconds). On timeout, the work item re-routes with relaxed skill requirements — dropping the Tier skill so any available agent can accept.
Use Omni-Channel Flow to implement the timeout and re-routing logic.
Why not the alternative: Queue-based routing with multiple queues per tier creates a proliferation of queues (Products x Tiers x Languages). Agent membership becomes unmanageable at 50+ queues, and priority ordering across queues is fragile.
Pattern: Messaging-First Channel Strategy
When to use: Organizations moving from synchronous chat (legacy Live Agent) to asynchronous messaging to improve agent utilization and customer experience.
How it works:
Deploy Messaging for In-App and Web (replaces legacy Live Agent for new implementations). This uses the Messaging channel type in Omni-Channel.
Configure Embedded Service deployment for web and mobile surfaces.
Set agent capacity: messaging is asynchronous, so agents can handle 4-6 concurrent sessions vs 2-3 for synchronous chat.
Implement Einstein Bot or Agentforce as the first responder — gather case category, check authentication, attempt knowledge deflection. Escalate to a human agent only when the bot cannot resolve.
Enable Messaging session persistence so customers can return to existing conversations without re-authenticating.
Why not the alternative: Synchronous chat (legacy Live Agent) requires an agent to be continuously engaged. If the customer goes idle for 5 minutes, the session times out. Messaging allows hours or days between responses, which dramatically improves agent utilization for intermittent interactions.
Pattern: Knowledge-Centered Service (KCS) Architecture
When to use: Organizations where knowledge creation should be embedded in the case resolution workflow, not treated as a separate authoring activity.
How it works:
Configure Knowledge in the Service Console with the Knowledge sidebar component.
Build a screen Flow or Quick Action that lets agents create or update articles directly from a case — pre-populating fields from case data.
Implement an article lifecycle: Draft (created during case work) -> Review (flagged for SME review) -> Published. Use approval processes or Flow orchestration.
Expose published articles through Experience Cloud, Einstein Bot, and Agentforce to drive self-service deflection.
Track deflection metrics: article views that did not result in a case creation (measured via custom event tracking in the Experience Cloud site).
Decision Guidance
Situation
Recommended Approach
Reason
Fewer than 50 agents, homogeneous skills
Queue-based Omni-Channel routing
Simpler to configure and maintain; skills-based overhead is unjustified
50+ agents with distinct specializations
Skills-based Omni-Channel routing
Skill matching improves first-contact resolution and reduces transfers
New chat/messaging deployment
Messaging for In-App and Web
Replaces legacy Live Agent; supports asynchronous conversations and bot integration
Existing Live Agent deployment
Evaluate migration to Messaging
Salesforce is deprecating legacy Live Agent; plan migration before forced end-of-life
High case volume, immature knowledge base
Invest in KCS before adding channels
Adding more channels without deflection just increases volume; knowledge is the multiplier
Telephony integration needed
Amazon Connect (Service Cloud Voice) or Open CTI partner
Service Cloud Voice is the native option; Open CTI for existing PBX investments
SLA enforcement required
Entitlement processes with milestones
Only native mechanism for business-hours-aware, multi-stage SLA tracking
Recommended Workflow
Step-by-step instructions for an AI agent or practitioner designing a Service Cloud architecture:
Gather requirements — Document the required channels, expected case volume, agent count, SLA requirements, and existing systems. Confirm the Salesforce edition (Enterprise+ required for Omni-Channel; Service Cloud Voice requires additional licensing).
Design the channel strategy — Map each customer touchpoint to a Salesforce channel type (Email-to-Case, Web-to-Case, Messaging for In-App/Web, Service Cloud Voice, Social Customer Service). Document which channels are synchronous vs asynchronous and the expected volume split.
Select the routing model — Choose queue-based or skills-based Omni-Channel routing based on agent count and specialization requirements. Define capacity model (units per channel per agent). Design overflow and escalation paths.
Design the case lifecycle — Map the full case journey: creation (auto or manual), categorization, routing, assignment, work in progress, resolution, closure. Define case record types, page layouts per support tier, and automation (assignment rules, escalation rules, entitlement processes).
Architect knowledge and deflection — Define the knowledge taxonomy (data categories), article types, and authoring workflow. Plan bot/Agentforce deflection for the top case categories by volume.
Design integrations — Document telephony integration (Service Cloud Voice vs Open CTI), external system integrations (ERP for order lookup, billing systems), and data synchronization requirements. Use Named Credentials for all external callouts.
Validate and document — Review the architecture against the Review Checklist below. Produce the solution architecture document, channel strategy matrix, and capacity model. Walk through the architecture with stakeholders before build begins.
Review Checklist
Run through these before marking work in this area complete:
Channel strategy covers all required customer touchpoints with documented rationale
Routing model (queue-based or skills-based) is selected with capacity model showing agent concurrency per channel
Case lifecycle is documented end-to-end: creation, routing, assignment, resolution, closure
Entitlement processes with milestones are designed for all SLA requirements
Knowledge taxonomy (data categories) and article types are defined
Deflection strategy is documented with target deflection rate for top case categories
Integration architecture covers telephony, external systems, and data flows
Service Console layout is optimized: fewer than 10 components per page, utility bar limited to 6-8 items
Security model reviewed: case sharing rules, Knowledge article visibility by audience, agent profile permissions
Licensing confirmed: Service Cloud User, Omni-Channel, Messaging, Service Cloud Voice as needed
Salesforce-Specific Gotchas
Non-obvious platform behaviors that cause real production problems:
Omni-Channel capacity is per-presence-status, not global — An agent's capacity is consumed only while they are in an Omni-Channel presence status. If an agent switches to "Offline" and back to "Available," any previously assigned work items no longer count toward their capacity. This can cause double-assignment if agents toggle status to game the system.
Messaging sessions persist but consume capacity until closed — Unlike chat, messaging sessions do not auto-expire on idle. An agent with 5 open messaging sessions that went quiet yesterday still has 5 capacity units consumed today. You must architect a session-close policy (auto-close after N hours of inactivity via Flow or scheduled job).
Skills-based routing requires the managed package and is not reversible in-place — Once you enable skills-based routing and agents have active sessions, you cannot switch back to queue-based without re-creating routing configurations. Test thoroughly in a sandbox before enabling in production.
Knowledge article search returns max 2,000 results via SOSL — If you have 10,000+ articles, poorly structured search queries will hit the SOSL ceiling and return incomplete results. This is invisible to the user — they just do not see the right article. Mitigate with data categories, keyword optimization, and promoted search terms.
Email-to-Case threading breaks on modified subject lines — Email-to-Case uses a thread ID token in the email subject to match replies to existing cases. If a customer or external system modifies the subject line, a new case is created instead of threading to the existing one. This causes duplicate cases that are hard to detect at scale.
Output Artifacts
Artifact
Description
Solution architecture document
End-to-end Service Cloud design covering channels, routing, case lifecycle, knowledge, integrations, and security
Channel strategy matrix
Table mapping each customer touchpoint to a Salesforce channel type with rationale, volume estimate, and licensing requirement
Routing and capacity model
Omni-Channel routing design (queue or skills-based) with agent capacity units per channel and overflow rules
Knowledge taxonomy
Data category hierarchy and article record type definitions
Integration architecture
Diagram and documentation of telephony, external system, and data integration points
Related Skills
omni-channel-capacity-model — Use for detailed capacity modeling and Omni-Channel tuning after the high-level routing decision is made
multi-channel-service-architecture — Use when the focus is specifically on cross-channel orchestration and channel switching
einstein-bot-architecture — Use when designing the conversational bot/Agentforce deflection layer in detail
knowledge-vs-external-cms — Use when deciding whether to use Salesforce Knowledge or an external CMS for article content
security-architecture-review — Use to deep-dive on the security model (sharing, FLS, Shield) for the Service Cloud implementation
integration-framework-design — Use when building the reusable integration layer for telephony and external system callouts