ClawDesk Architecture Quiz

Test your understanding of ClawDesk's architecture, crate responsibilities, and design decisions. Click on each answer to reveal the explanation.

How to Use

Work through each question, form your answer, then click "Click to reveal answer" to check. Aim for 15/20 to confirm solid understanding.

Section 1: Crate Responsibilities

Question 1: Which crate defines `InboundMessage`?

A message arrives from Telegram. Which crate contains the enum definition for InboundMessage?

Click to reveal answer

clawdesk-channel

clawdesk-channel defines the Channel trait and all associated types including InboundMessage, OutboundMessage, ChannelId, and ChannelMeta. Don't confuse it with clawdesk-channels (plural), which contains the implementations for specific platforms like Telegram and Discord.

📖 Channel Traits Architecture

Question 2: Where does normalization happen?

The normalize() function converts InboundMessage into NormalizedMessage. Which crate owns this function?

Click to reveal answer

clawdesk-domain

Normalization is a domain concern — it maps platform-specific representations into the application's canonical form. The domain crate sits between channel (Layer 3) and the agent pipeline (Layer 4) in the dependency DAG.

📖 Message Flow Tutorial

Question 3: What does `clawdesk-sochdb` provide?

True or false: clawdesk-sochdb is responsible for all of ClawDesk's persistent storage.

Click to reveal answer

False — but it's close.

clawdesk-sochdb is the SochDB integration crate — it provides the ACID-compliant embedded vector database used for conversation history, knowledge retrieval, and context assembly. However, clawdesk-storage defines the abstract storage traits, and clawdesk-memory provides the higher-level memory service that uses SochDB.

The dependency chain: clawdesk-memory → clawdesk-sochdb → clawdesk-storage → clawdesk-types

📖 Storage Layer Architecture · clawdesk-sochdb Reference

Question 4: Event bus crate

Which crate would you modify to add a new system-wide event (e.g., a "provider health changed" event)?

Click to reveal answer

clawdesk-domain

Domain events are defined in clawdesk-domain because they represent business-level occurrences that multiple crates may need to react to. The domain crate is the natural boundary — it's high enough to be meaningful but low enough in the dependency graph that most crates can depend on it.

📖 Crate Dependency Graph

Question 5: Where is the Tauri IPC bridge?

You want to add a new IPC command that the desktop frontend can call. Which crate do you modify?

Click to reveal answer

clawdesk-tauri

The Tauri crate sits at Layer 8 (the top of the DAG) alongside clawdesk-cli. It defines IPC commands using Tauri's #[tauri::command] macro, which the frontend can invoke. These commands delegate to the ACP (Agent Communication Protocol) layer below.

📖 Tauri IPC Commands API

Section 2: Traits & Types

Question 6: Channel trait methods

List the 5 methods defined by the Layer 0 Channel trait.

Click to reveal answer

id() — Returns the ChannelId
meta() — Returns ChannelMeta (name, description, capabilities)
start(processor) — Start the channel (bind listeners, connect to APIs)
send(message) — Send an OutboundMessage through the channel
stop() — Gracefully shut down the channel

These are the only required methods. Layer 1 traits (Threaded, Streaming, Reactions, GroupManagement, Directory, Pairing) are opt-in.

📖 Build a Channel Tutorial

Question 7: Provider trait

What is the method signature of the Provider trait's primary method?

Click to reveal answer

async fn send(&self, req: ProviderRequest) -> Result<ProviderResponse, ProviderError>

The trait is intentionally minimal — one method for sending requests. The complexity lives in error mapping for the fallback FSM. The trait also defines id(), supported_models(), and health().

📖 Build a Provider Tutorial

Question 8: InboundMessage variant count

How many variants does the InboundMessage enum have, and can you name at least 8?

Click to reveal answer

13 variants:

Telegram
Discord
Slack
WhatsApp
Signal
IMessage
Matrix
WebChat
Email
Sms
Cli
Irc
Custom

Each variant wraps a platform-specific struct (e.g., TelegramInbound) that carries only that platform's fields — no Option<T> waste.

📖 Type Algebra Deep Dive

Question 9: Sum type vs product type

Why is InboundMessage a sum type (enum) rather than a product type (struct with Options)? Give the information-theoretic argument.

Click to reveal answer

A struct with 60 Option<T> fields creates $2^{60} \approx 10^{18}$ possible states, but only 13 are valid. That's 56.3 bits of wasted entropy — states the type allows but that never occur.

The enum has exactly 13 variants, requiring only $\lceil\log_2(13)\rceil = 4$ bits. Every representable state is a valid state.

$$\frac{|\text{Valid}|}{|\text{Total}_\text{struct}|} = \frac{13}{2^{60}} \approx 10^{-17}$$

Additionally, the enum gives exhaustive matching — the compiler ensures every variant is handled. With the struct approach, you must manually check which fields are populated, with no compiler help.

📖 Type Algebra Deep Dive

Section 3: Agent Pipeline

Question 10: Pipeline stage ordering

Put these 6 pipeline stages in the correct order: ContextGuard, FailoverDecide, HistorySanitize, AuthResolve, Execute, ToolSplit

Click to reveal answer

AuthResolve — Resolve user identity and permissions
HistorySanitize — Fetch and clean conversation history
ContextGuard — Enforce token limits and content policy
ToolSplit — Select skills via knapsack under token budget
Execute — Send request to LLM provider
FailoverDecide — Evaluate response, trigger fallback if needed

The order is deliberate: authentication must come first (reject unauthorized early), history before context guard (need history to compute token budget), tool selection before execution (tools are part of the request), and failover after execution (needs the result to decide).

📖 Message Flow Tutorial

Question 11: What triggers ToolSplit?

What determines which skills are selected in the ToolSplit stage?

Click to reveal answer

Three factors:

Context matching — Does the skill's pattern match the incoming message?
Token budget — Does the skill's token_cost (including dependencies) fit in the remaining budget?
Priority weight — Among fitting skills, higher priority/cost ratio wins

The selector uses a greedy knapsack approximation after topological sort of dependencies:

$$\max \sum_{i} w_i \cdot x_i \quad \text{s.t.} \quad \sum_{i} c_i \cdot x_i \leq B$$

Solved in $O(k \log k)$ where $k$ is the number of candidate skills.

📖 Build a Skill Tutorial

Section 4: Fallback FSM

Question 12: FSM states

Name all 7 states of the fallback finite state machine.

Click to reveal answer

Idle — No active request
Selecting — Choosing next provider candidate
Attempting — LLM request in-flight
Succeeded — Valid response received (terminal)
Retrying — Waiting before retry of same provider
Exhausted — All candidates tried, none succeeded (terminal)
Aborted — Fatal error or cancellation (terminal)

Three states are terminal: Succeeded, Exhausted, Aborted. The FSM always terminates in one of these.

📖 Fallback FSM Deep Dive

Question 13: Termination bound

What is the maximum number of transitions the fallback FSM can make with $n$ provider candidates and 1 retry per provider?

Click to reveal answer

$$T_{\max} = 2n + 1$$

More generally, with $r$ retries per provider: $T_{\max} = n \cdot (r + 1) + 1$

The proof relies on the progress function: the candidate queue is finite and strictly decreasing. Each provider is attempted at most $r + 1$ times before being removed.

📖 Fallback FSM Deep Dive — Termination Proof

Question 14: Error classification

A provider returns HTTP 429 (Too Many Requests). How does the FSM classify this error, and what transition occurs?

Click to reveal answer

Error: ProviderError::RateLimit(retry_after)
Classification: ErrorClass::Transient
FSM Event: TransientFailure
Transition: Attempting → Retrying

The FSM waits for the retry_after duration, then transitions Retrying → Attempting to retry the same provider. If the retry also fails, it transitions Retrying → Selecting to try the next candidate.

Compare with HTTP 401 (Auth Error), which is Fatal and causes Attempting → Aborted immediately.

📖 Build a Provider — Error Mapping

Section 5: Concurrency

Question 15: CancellationToken hierarchy

What happens to child tasks when a parent CancellationToken is cancelled?

Click to reveal answer

All child tokens are immediately cancelled. Cancellation propagates automatically from parent to child via child_token():

let parent = CancellationToken::new();
let child = parent.child_token();
let grandchild = child.child_token();

parent.cancel();
// child.is_cancelled() == true
// grandchild.is_cancelled() == true

This is zero-cost propagation — no event listeners, no polling, no manual wiring. Compare to Node.js AbortSignal which requires explicit addEventListener('abort', ...) at each level.

📖 Structured Concurrency Deep Dive

Question 16: JoinSet vs tokio::spawn

Why does ClawDesk use JoinSet for parallel tool execution instead of bare tokio::spawn?

Click to reveal answer

JoinSet provides structured ownership over spawned tasks:

Feature	`tokio::spawn`	`JoinSet`
Task ownership	Fire-and-forget	Owned by the set
Wait for all	Manual tracking	`join_next()` loop
Cancel all	Per-task tracking	`abort_all()` or drop
Result collection	Manual channels	Built-in
Task count	Unknown	`.len()`

With tokio::spawn, tasks are detached — if you forget to track the JoinHandle, the task runs forever (resource leak). With JoinSet, dropping the set aborts all tasks.

📖 Structured Concurrency Deep Dive

Question 17: spawn_blocking

When should you use tokio::task::spawn_blocking instead of regular async execution?

Click to reveal answer

Use spawn_blocking for CPU-bound work that would block a Tokio worker thread for more than ~1ms:

Regex matching on large text (security scanning)
Token estimation on large documents
Cryptographic operations (HMAC verification)
JSON serialization of large payloads (>1MB)
File I/O (synchronous syscalls)

Never use it for network I/O (use async reqwest/hyper instead) or database queries (use async drivers).

spawn_blocking moves the work to a separate blocking threadpool so that Tokio's async worker threads remain free to handle concurrent I/O.

📖 Structured Concurrency Deep Dive

Section 6: Architecture Concepts

Question 18: Security cascade order

What are the 4 layers of the security cascade, and in what order do they execute?

Click to reveal answer

Allowlist — Is the user on the allowlist? (cheapest check first)
Content Scanning — Regex → AST → Semantic analysis (progressively more expensive)
ACL — Does the user have permission for this action?
Audit — Log the access to the audit trail

The cascade is deliberately ordered by cost: ~95% of messages are cleared or blocked by the allowlist check alone. Only messages that pass all 4 layers are processed.

📖 Security Model Architecture

Question 19: TOON format

What does TOON stand for, and what token savings does it achieve?

Click to reveal answer

TOON = Token-Optimized Output Notation

It achieves 58–67% fewer tokens compared to standard JSON formatting for context sections.

Example:

// JSON: ~85 tokens
{"conversation_history":[{"role":"user","content":"Hello",...}]}

// TOON: ~32 tokens
[H]
U|10:30|Hello

TOON uses single-character role prefixes, pipe delimiters, section headers, and omits redundant structure. The LLM can still parse it because the format is documented in the system prompt.

📖 Context Assembly Deep Dive

Question 20: Crate DAG layering

In the crate dependency graph, what is the fundamental rule that prevents circular dependencies?

Click to reveal answer

Strict layering: A crate at layer $L_a$ may only depend on crates at layers $L_b < L_a$.

$$\text{Layer}(A) > \text{Layer}(B) \Rightarrow A \text{ may depend on } B$$ $$\text{Layer}(A) \leq \text{Layer}(B) \Rightarrow A \text{ must NOT depend on } B$$

Layer	Crates
0	`types`
1	`storage`
2	`domain`
3	`channel`, `sochdb`
4	`channels`, `agents`, `skills`, `memory`, `security`
5	`providers`
6	`gateway`
7	`acp`
8	`cli`, `tauri`

This is enforced by Cargo's dependency resolver — circular dependencies are a compile error in Rust.

📖 Crate Dependency Graph

Score Guide

Score	Level	Recommendation
18–20	Expert	Ready to contribute core crates
15–17	Proficient	Ready for features and bug fixes
10–14	Intermediate	Review the Deep Dives
5–9	Beginner	Start with the Tutorials
0–4	New	Read the Learning Path intro and start from the beginning

Want more?

Check the Architecture Explorer for interactive diagrams that reinforce these concepts visually.

Section 1: Crate Responsibilities​

Question 1: Which crate defines InboundMessage?​

Question 2: Where does normalization happen?​

Question 3: What does clawdesk-sochdb provide?​

Question 4: Event bus crate​

Question 5: Where is the Tauri IPC bridge?​

Section 2: Traits & Types​

Question 6: Channel trait methods​

Question 7: Provider trait​

Question 8: InboundMessage variant count​

Question 9: Sum type vs product type​

Section 3: Agent Pipeline​

Question 10: Pipeline stage ordering​

Question 11: What triggers ToolSplit?​

Section 4: Fallback FSM​

Question 12: FSM states​

Question 13: Termination bound​

Question 14: Error classification​

Section 5: Concurrency​

Question 15: CancellationToken hierarchy​

Question 16: JoinSet vs tokio::spawn​

Question 17: spawn_blocking​

Section 6: Architecture Concepts​

Question 18: Security cascade order​

Question 19: TOON format​

Question 20: Crate DAG layering​

Score Guide​

Section 1: Crate Responsibilities

Question 1: Which crate defines `InboundMessage`?

Question 2: Where does normalization happen?

Question 3: What does `clawdesk-sochdb` provide?

Question 4: Event bus crate

Question 5: Where is the Tauri IPC bridge?

Section 2: Traits & Types

Question 6: Channel trait methods

Question 7: Provider trait

Question 8: InboundMessage variant count

Question 9: Sum type vs product type

Section 3: Agent Pipeline

Question 10: Pipeline stage ordering

Question 11: What triggers ToolSplit?

Section 4: Fallback FSM

Question 12: FSM states

Question 13: Termination bound

Question 14: Error classification

Section 5: Concurrency

Question 15: CancellationToken hierarchy

Question 16: JoinSet vs tokio::spawn

Question 17: spawn_blocking

Section 6: Architecture Concepts

Question 18: Security cascade order

Question 19: TOON format

Question 20: Crate DAG layering

Score Guide