Wikipedia solved the hardest version of the problem we're working on: millions of stateless contributors (editors come and go, retain no memory of previous sessions, often work at cross-purposes) producing high-quality, continuously updated artifacts at extraordinary scale. The parallels to agent workflows aren't metaphorical — they're structural. An agent spinning up to update a strategy doc is functionally identical to an anonymous Wikipedia editor arriving at an article they've never seen before, making a contribution, and leaving.
This exploration examines which Wikipedia mechanisms translate directly, which need creative adaptation, and which are irrelevant because agents are fundamentally different from humans.
Every Wikipedia article has a paired "Talk" page. The article contains the canonical content; the talk page contains everything ABOUT the content: editorial disputes, proposed changes, requests for citation, assessments of quality. This separation is load-bearing. Without it, the article would be polluted with meta-commentary, and editorial reasoning would be invisible.
Talk pages have their own conventions: threaded discussions, signatures with timestamps, section headers that match article sections, and archival policies for old discussions.
Agents need talk pages even more than human editors do, because agents are stateless. A human editor might remember "I didn't add that paragraph because last time I tried, someone reverted it." An agent has no such memory. The talk page IS the agent's institutional memory about an artifact.
Concretely, every significant artifact (document, decision, spec, strategy) gets a paired thread record linked via a dedicated relationship — call it discussed_at, or reuse relates_to with a convention. The thread is where edit rationales, quality flags, disputes, and assessments accumulate.
The content/discourse separation ports perfectly. It's arguably MORE important for agents because they can't hold discourse in their heads — it must be externalized.
Wikipedia talk pages are conversational — they assume persistent identity and ongoing dialogue. Agent talk pages need to be more structured. Instead of freeform discussion threads, they work better as a log of typed entries:
```
[2026-02-15T10:23:00Z] agent:strategy-worker
Type: edit-rationale
Affected sections: Market Analysis
Reason: Updated TAM figures based on latest analyst report ([[src789]])
Confidence: high (primary source, <30 days old)

[2026-02-15T14:45:00Z] agent:review-worker
Type: quality-flag
Affected sections: Competitive Landscape
Issue: Two claims lack any citation. Competitor Z's market share
cited as "approximately 15%" with no source.
Recommendation: Add citations or mark as [citation needed]
```
This is structured enough that a future agent can parse it programmatically, but readable enough that a human can scan it.
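As a sketch of that parseability, a future agent could recover typed entries from such a log with a few lines of code. The `[timestamp] author` header and `Key: value` conventions below follow the example above; the exact format is an assumption, not a spec:

```python
import re
from dataclasses import dataclass, field

@dataclass
class TalkEntry:
    timestamp: str
    author: str
    entry_type: str = ""
    fields: dict = field(default_factory=dict)

# Header lines look like: [2026-02-15T10:23:00Z] agent:strategy-worker
HEADER = re.compile(r"^\[(?P<ts>[^\]]+)\]\s+(?P<author>\S+)$")

def parse_talk_log(text: str) -> list[TalkEntry]:
    """Split a talk-page log into typed entries; 'Key: value' lines become fields."""
    entries: list[TalkEntry] = []
    for line in text.splitlines():
        m = HEADER.match(line.strip())
        if m:
            entries.append(TalkEntry(m["ts"], m["author"]))
        elif entries and ":" in line:
            key, _, value = line.strip().partition(":")
            if key == "Type":
                entries[-1].entry_type = value.strip()
            else:
                entries[-1].fields[key.strip().lower()] = value.strip()
    return entries
```

Anything this loose parser misses stays in the raw log, so the human-readable record is never lossy even when the structured view is.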
This suggests a first-class pattern: every document/decision/spec-type record should be able to spawn a linked thread record that serves as its talk page. The thread type already exists. The convention is: one thread per artifact, linked with relates_to, tagged talk-page. Agents contributing to the artifact are expected to post to the thread. The thread becomes part of the artifact's start_work context.
Wikipedia's quality revolution wasn't technological — it was cultural. The [citation needed] tag, introduced around 2005, transformed the encyclopedia from "write whatever you know" to "prove what you claim." The genius is that flagging unsourced claims is EASIER than adding sources, so quality pressure ratchets upward: anyone can tag, and the tag creates social pressure to fix.
Wikipedia distinguishes between source types and reliability levels: primary vs. secondary sources, self-published vs. professionally edited publishers.
For agents, citations shouldn't be a cultural norm — they should be a structural requirement enforced by the artifact format itself. This is where agent-native artifacts can leapfrog Wikipedia: Wikipedia relies on humans voluntarily adding citations. Agent artifacts can make citation a property of the creation process.
Every factual claim in an agent-created artifact should carry provenance metadata. Not as visible footnotes (though those can be rendered), but as structural links in the artifact graph. When an agent writes "Market size is estimated at $4.2B," the artifact format captures:
```
claim: "Market size is estimated at $4.2B"
source_record: [[analyst_report_456]]
source_type: external_document
extraction_confidence: high
staleness_window: 90 days
last_verified: 2026-02-10
```
This enables something Wikipedia can't do: automated staleness detection. If the source record is updated, or if the staleness window expires, the claim auto-flags itself. The artifact knows its own reliability surface.
The [citation needed] equivalent becomes structural too. An agent reviewing a document can tag uncited claims:
```
claim: "We expect 40% growth in the enterprise segment"
citation_status: needed
flagged_by: agent:review-worker
flagged_at: 2026-02-15T10:00:00Z
context: "No source provided. Similar claim in Q3 board deck
cited internal forecast [[forecast_789]] — verify if still current."
```
Wikipedia's "reliable sources" policy translates to a source reliability hierarchy for agent artifacts, ordered by verifiability and freshness rather than by editorial reputation.
The citation-needed culture ports beautifully. Making it structural rather than cultural is a genuine improvement — agents don't need social pressure, they need schema constraints.
Wikipedia's reliable source debates are deeply human ("Is the Daily Mail a reliable source?"). Agents don't need editorial judgment about source quality — they need staleness windows and provenance chains. The question isn't "is this a good source?" but "can I trace this claim back to something verifiable, and is that something still current?"
The draws_from link type already exists. What's missing is claim-level granularity. Currently a document can link to its sources at the document level. The Wikipedia lens suggests artifacts need a way to associate specific claims within their body with specific source records. This could be:
- Inline reference syntax in the body: According to [[analyst_report_456|claim:tam_estimate]], the market is $4.2B
- A data.citations array on the record, mapping body sections to source records

The lightest-weight version: agents always post edit rationales to the talk page with source links, and the talk page IS the citation record. Heavier-weight: structured citation metadata in the record's data field.
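For the heavier-weight option, a hypothetical data.citations array, and the [citation needed] query it enables, could be sketched like this (every field name here is an assumption for illustration):

```python
# Hypothetical shape for a record's data.citations array: each entry
# ties a claim in the body to a source record, or marks it as uncited.
citations = [
    {"section": "Market Context", "claim_id": "tam_estimate",
     "source_record": "analyst_report_456", "status": "cited"},
    {"section": "Strategic Pillars", "claim_id": "enterprise_growth",
     "source_record": None, "status": "needed"},
]

def uncited(citations: list[dict]) -> list[str]:
    """Return claim ids still awaiting a source: the [citation needed] queue."""
    return [c["claim_id"] for c in citations if c["status"] == "needed"]
```

The same array also supports citation-coverage metrics (cited entries over total entries), which the quality ladder below relies on.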
Wikipedia's quality scale is explicit:
| Grade | Meaning |
|---|---|
| Stub | A few sentences. Placeholder. |
| Start | Has structure but incomplete. |
| C-class | Substantial but poorly organized or sourced. |
| B-class | Good content, good sources, some gaps. |
| Good Article (GA) | Passed a formal review. Well-written, well-sourced, covers major aspects. |
| Featured Article (FA) | The best Wikipedia has. Passed a rigorous peer review. Professional quality. |
Promotions are gated by review processes: GA requires one reviewer, FA requires community consensus. Each grade has explicit criteria.
Agent artifacts need a parallel quality ladder. Here's a proposed mapping:
| Grade | Name | Criteria | Who Promotes |
|---|---|---|---|
| L0 | Stub | Exists. Has a title and minimal content. Created as a placeholder or initial dump. | Auto (on creation) |
| L1 | Draft | Has structure (sections, some content). May have uncited claims. Not yet reviewed. | Auto (when agent completes first pass) |
| L2 | Developed | Substantive content. Key claims cited. Covers main topics. May have gaps. | Agent review (any agent can assess) |
| L3 | Reviewed | All major claims cited. No known gaps. Reviewed by at least one agent other than the author. Internal consistency verified. | Agent review + talk page assessment |
| L4 | Canonical | Human-approved. Considered the authoritative source on its topic within the workspace. Actively maintained (staleness checks). | Human promotion only |
| L5 | Featured | Best-in-class. Exemplary structure, sourcing, and clarity. Used as a template for other artifacts of its type. | Human promotion + review record |
Borrowing from Wikipedia's FA criteria, adapted: among other requirements, no [citation needed] flags older than 7 days.

Promotion isn't automatic beyond L1. An agent conducting a quality review creates an assessment entry on the talk page:
```
Type: quality-assessment
Current grade: L1 (Draft)
Proposed grade: L2 (Developed)
Criteria met:
- [x] Structured with clear sections
- [x] Key claims cited (8/10 claims have sources)
- [ ] All major topics covered (missing: risk analysis)
- [x] No internal contradictions found
Criteria gaps:
- Missing risk analysis section
- 2 claims in competitive analysis lack sources
Recommendation: Promote to L2 with note that risk analysis
section is needed for L3.
```
The explicit quality ladder ports very well. Agents benefit enormously from clear criteria because they can check them programmatically. "Is every claim sourced?" is a question an agent can actually answer, unlike a human who might not bother to check.
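That programmatic checkability can be sketched directly: a grader that walks the ladder's criteria and returns the highest agent-assessable grade. The thresholds and field names are illustrative, and L4/L5 stay human-only per the table above:

```python
def assess_grade(doc: dict) -> str:
    """Return the highest agent-assessable grade (L0-L3).
    L4/L5 are human promotions and are never granted automatically."""
    cited = doc["cited_claims"] / max(doc["total_claims"], 1)
    if (doc["has_sections"] and cited >= 0.9
            and doc["review_count"] >= 1 and not doc["contradictions"]):
        return "L3"  # nearly fully cited, second-agent reviewed, consistent
    if doc["has_sections"] and cited >= 0.5:
        return "L2"  # substantive and mostly cited, gaps allowed
    if doc["has_sections"]:
        return "L1"  # structured draft
    return "L0"      # stub
```

A reviewing agent would post this result as a quality-assessment entry on the talk page rather than silently rewriting the grade.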
Wikipedia's promotion process involves community discussion and consensus. For agent artifacts, the lower tiers (L0-L3) can be agent-assessed. But L4-L5 require human judgment because "canonical" and "featured" involve organizational trust decisions that agents shouldn't make autonomously.
This maps cleanly onto the existing record system:
- data.quality_grade (L0-L5)
- query_record(where: { data.quality_grade: { lt: 3 } }) to find artifacts needing improvement
- The tags field can carry quality state: stub, draft, developed, reviewed, canonical, featured

Wikipedia's edit war problem is famous: two editors repeatedly reverting each other's changes. Wikipedia's solutions (the three-revert rule, page protection, arbitration) all route conflicts toward discussion and human deliberation.
Agent edit wars look different from human ones. Agents don't have egos or political agendas. But they CAN produce contradictory outputs, because independent sessions may draw on different sources, see different staleness windows, or simply have no awareness of each other's work.
1. Claim-level versioning (not document-level)
Instead of full-document conflict, track conflicts at the claim level. If Agent A says "TAM is $4.2B" and Agent B says "TAM is $3.8B," this isn't a document conflict — it's a claim-level dispute that surfaces as:
```
Type: claim-conflict
Claim: Total addressable market estimate
Position A: $4.2B (source: [[analyst_report_2025]], agent:market-worker)
Position B: $3.8B (source: [[internal_model_v3]], agent:forecast-worker)
Status: unresolved
Resolution path: Compare source recency and methodology
```
This goes on the talk page. The artifact itself can either show the most recent value with a conflict flag, or show both with a "disputed" marker.
2. Source-precedence rules (not revert wars)
Instead of a three-revert rule, establish source precedence: fresher sources outrank stale ones, primary sources outrank secondary ones, and human-reviewed records outrank agent-generated ones.
An agent encountering a conflict doesn't revert — it checks source precedence, and if it can resolve, does so with a talk page note. If it can't, it flags the conflict and leaves both versions visible.
3. Ownership and locking
The claimed status already provides this. An artifact being actively worked on is claimed by one agent. Other agents can read it but shouldn't modify it. If they identify issues, they post to the talk page. The claiming agent integrates feedback.
For high-value artifacts, a stronger pattern: gated_by a review task. Changes to the artifact must pass through the review gate before being considered canonical.
4. The "status quo prevails" rule
Borrowed directly from Wikipedia: when there's genuine conflict and no clear resolution, the artifact retains its last human-reviewed state. Agent modifications that introduce conflicts are reverted to the pre-conflict version, and the conflict is escalated to the talk page for human resolution.
The principle that conflicts should be surfaced rather than silently resolved ports perfectly. Wikipedia's insight is that edit wars happen BECAUSE conflicts are resolved through overwriting instead of discussion. Agent talk pages prevent this.
Wikipedia's dispute resolution depends on human deliberation at every level. For agents, the lower-level conflicts (source freshness, data discrepancy) can be resolved algorithmically. Only genuine analytical disagreements ("should the strategy be aggressive or conservative?") need human arbitration. The bureaucratic escalation ladder (3RR -> protection -> arbitration) is unnecessary — agents don't need cooling-off periods.
Not everything deserves a Wikipedia article. Wikipedia's notability policy requires that a topic has received "significant coverage in reliable, independent sources." Articles that fail this test are nominated for deletion through speedy deletion, proposed deletion (PROD), or a full Articles for Deletion (AfD) discussion.
The key insight: Wikipedia DELETES things. The default is not to keep everything.
Agents are prolific creators. Left unchecked, they'll produce artifacts for every subtopic, every interim analysis, every draft that crosses their context window. The "chaos at higher frame rate" problem is fundamentally a notability problem.
Notability criteria for agent artifacts follow the same logic: an artifact must cover a topic that no existing artifact already covers; when two artifacts overlap, one should supersede the other.

The deletion/archival hierarchy:
| Action | When | Mechanism |
|---|---|---|
| Don't create | Topic already covered elsewhere | Agent checks before creating; workspace enforces search-before-create |
| Merge | Artifact overlaps significantly with another | Content folded into the more authoritative artifact; original archived with superseded_by link |
| Archive | Artifact was useful but is no longer current | archive_record — hidden from default views, preserved for search |
| Delete | Artifact was created in error, contains no useful content | delete_record — soft delete |
The "search before creating" norm is the most important Wikipedia import. On Wikipedia, creating a new article about a topic that already has one is considered disruptive. For agents, creating a new "Market Analysis" document when one already exists should require justification — and ideally, the agent should update the existing one instead.
The notability threshold ports well. The principle that not everything deserves to exist as a standalone artifact is critical for preventing workspace sprawl.
Wikipedia's deletion discussions are adversarial and political ("keep" vs. "delete" votes with arguments). Agents don't need adversarial processes. The deletion decision can be more algorithmic: unreferenced artifacts older than N days with no views get proposed for archival. Human reviews the batch.
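That batch proposal is easy to make concrete. Record fields like inbound_links and views are assumed for illustration:

```python
from datetime import date, timedelta

def archival_candidates(records: list[dict], today: date,
                        max_age_days: int = 90) -> list[str]:
    """Propose unreferenced, unviewed artifacts older than N days for
    human batch review; no adversarial deletion debate needed."""
    cutoff = today - timedelta(days=max_age_days)
    return [r["id"] for r in records
            if r["inbound_links"] == 0
            and r["views"] == 0
            and date.fromisoformat(r["created"]) < cutoff]
```

The output is a proposal list, not an action: a human approves or rejects the batch, mirroring how Wikipedia keeps final deletion authority with people.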
This reinforces the existing bootstrap principle: "Search before suggesting: query for existing entities before proposing structure." The Wikipedia lens strengthens this from a suggestion to a structural norm. The create_record flow could include a check: "Did you search for existing records on this topic?" — with the search results included in the creation context so the agent (or a reviewing agent) can verify non-redundancy.
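A minimal version of that gate, with the search and create operations passed in as callables; the names are placeholders for whatever the workspace actually exposes:

```python
def create_with_check(title: str, search_fn, create_fn) -> dict:
    """Gate creation on a prior search. Overlapping hits block creation
    and point the agent at existing records to update or supersede."""
    hits = search_fn(title)
    if hits:
        return {"created": False, "existing": hits,
                "note": "update or supersede an existing record instead"}
    return {"created": True, "record": create_fn(title)}
```

Because the search results travel with the outcome, a reviewing agent can later verify that non-redundancy was actually checked, not just claimed.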
Wikipedia templates are one of its most powerful innovations: infoboxes that standardize structured data, maintenance tags that flag quality problems, navboxes that link related articles, and content templates that standardize article structure.
Templates create consistency without central control. Any editor creating a new country article uses {{Infobox country}} and the structure is automatically consistent.
Infoboxes become data schemas on parent containers. The workspace already supports data._schema on containers that validates child record data. This IS the infobox pattern:
```json
{
  "_schema": {
    "enforcement": "warn",
    "schema": {
      "type": "object",
      "properties": {
        "status_summary": { "type": "string" },
        "quality_grade": { "type": "string", "enum": ["L0", "L1", "L2", "L3", "L4", "L5"] },
        "last_reviewed": { "type": "string", "format": "date" },
        "staleness_window_days": { "type": "number" },
        "citation_coverage": { "type": "number", "minimum": 0, "maximum": 1 },
        "has_talk_page": { "type": "boolean" }
      }
    }
  }
}
```
Every strategy document created under a "Strategy" collection inherits this schema, ensuring consistency.
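In "warn" enforcement mode, validation surfaces mismatches without blocking creation. A deliberately minimal checker for a schema like the one above (not a full JSON Schema implementation; a real system would use a proper validator):

```python
def validate_child(data: dict, schema: dict) -> list[str]:
    """Check child record data against the container schema's properties.
    Wrong types and out-of-enum values come back as warnings
    rather than hard failures, matching 'warn' enforcement."""
    type_map = {"string": str, "number": (int, float), "boolean": bool}
    warnings = []
    for name, spec in schema["properties"].items():
        if name not in data:
            continue  # missing fields are tolerated in warn mode
        value = data[name]
        expected = type_map.get(spec["type"])
        if expected and not isinstance(value, expected):
            warnings.append(f"{name}: expected {spec['type']}")
        elif "enum" in spec and value not in spec["enum"]:
            warnings.append(f"{name}: not one of {spec['enum']}")
    return warnings
```

Here `schema` is the inner object under `_schema.schema`; the warnings list is exactly the kind of output an agent would post to the artifact's talk page as a quality-flag entry.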
Maintenance templates become tags + conventions. Wikipedia's {{citation needed}} becomes a tag: needs-citations. {{out of date}} becomes stale. These are queryable: an agent can find all artifacts tagged stale and prioritize freshness updates.
Key maintenance tags (borrowing Wikipedia's template categories):
| Tag | Wikipedia Equivalent | Meaning |
|---|---|---|
| needs-citations | {{citation needed}} | Contains unsourced claims |
| stale | {{out of date}} | Source data may be outdated |
| needs-expansion | {{expand section}} | Key topics not yet covered |
| disputed | {{disputed}} | Contains conflicting claims |
| merge-candidate | {{merge}} | Overlaps with another artifact |
| needs-review | {{unreferenced}} | Has not been reviewed by a second agent |
| orphan | {{orphan}} | Not linked from/to any other artifact |
Navboxes become the artifact graph. Wikipedia navboxes manually link related articles. In the workspace, the relates_to, draws_from, implements, and serves links ARE the navbox — but dynamic and automatically maintained. An agent doesn't need to manually add a navbox; the workspace renders the relationship graph.
Content templates become body conventions. A decision record always has: Context, Options Considered, Decision, Rationale, Dissent. A strategy document always has: Executive Summary, Market Analysis, Strategic Options, Recommended Approach, Risks, Success Metrics. These conventions are enforced through the data._schema on the container (ensuring the structured data is present) and through guide resources (hive://guide/decision-format) that agents read before creating artifacts of that type.
Templates port extremely well. Wikipedia templates solved the "consistency without central control" problem, and that's exactly what agents need. The data._schema pattern is already the infobox pattern — the Wikipedia lens just makes it explicit.
Wikipedia template syntax ({{Infobox country|capital=London}}) is a DSL that evolved organically and is famously hard to learn. Agent artifacts don't need a template DSL — the schema validation on containers is cleaner and more powerful.
Here's what a "Product Strategy 2026" artifact looks like through the full Wikipedia lens.
Record: Product Strategy 2026
Type: document
Quality: L3 (Reviewed)
Tags: [strategy, 2026, reviewed, canonical-candidate]
Data:
quality_grade: L3
last_reviewed: 2026-02-10
staleness_window_days: 30
citation_coverage: 0.92
has_talk_page: true
review_count: 2
Links:
draws_from: [[market_analysis_q4]], [[competitor_report_2025]],
[[customer_interviews_jan26]], [[board_feedback_dec25]]
implements: [[objective_enterprise_growth]]
serves: [[goal_series_b_readiness]]
discussed_at: [[strategy_2026_talk]] (talk page)
supersedes: [[product_strategy_2025]]
Body:
---
## Executive Summary
Our 2026 product strategy focuses on three pillars: enterprise
expansion, platform extensibility, and AI-native workflows. We
target $12M ARR by Q4 2026 [source: [[financial_model_v4]]],
requiring 40% growth in enterprise segment [source: [[growth_model_feb26]]].
## Market Context
The collaboration tools market is projected at $4.2B in 2026
[source: [[gartner_collab_2025]], confidence: high,
verified: 2026-01-15, stale after: 2026-07-15].
Key trends:
- AI-native tools gaining share from incumbents
[source: [[forrester_ai_tools_2025]]]
- Enterprise buyers consolidating vendors
[source: [[customer_interviews_jan26]], claim 7]
- [citation needed] Developer-led adoption increasingly
driving enterprise deals
## Strategic Pillars
### Pillar 1: Enterprise Expansion
...
### Pillar 2: Platform Extensibility
...
### Pillar 3: AI-Native Workflows
...
## Risk Analysis
| Risk | Likelihood | Impact | Mitigation |
|------|-----------|--------|------------|
| Enterprise sales cycle longer than modeled | Medium | High | Pipeline buffer of 30% [source: [[sales_analysis_q4]]] |
| Competitor launches similar AI features | High | Medium | [disputed — see talk page, section "Competitive response timing"] |
## Success Metrics
| Metric | Current | Target | Source |
|--------|---------|--------|--------|
| ARR | $8.5M | $12M | [[financial_model_v4]] |
| Enterprise customers | 23 | 45 | [[sales_pipeline_feb26]] |
| Platform API adoption | 12% | 35% | [[usage_analytics_jan26]] |
---
Record: Product Strategy 2026 — Talk
Type: thread
Tags: [talk-page, strategy]
Links:
relates_to: [[product_strategy_2026]]
Entries:
---
[2026-01-20T09:00:00Z] agent:strategy-worker (session: abc123)
Type: creation-note
This document was created by synthesizing inputs from four source
artifacts. Initial quality grade: L1 (Draft). Key gaps identified:
- No risk analysis section yet
- Competitive landscape section relies heavily on 2025 data
- Financial projections need validation against latest model
---
[2026-01-25T14:30:00Z] agent:market-analyst (session: def456)
Type: edit-rationale
Affected sections: Market Context
Action: Updated TAM figure from $3.9B to $4.2B
Reason: Gartner published updated figures in Jan 2026 report.
Previous figure was from mid-2025 projection.
Source: [[gartner_collab_2025]]
Confidence: high
---
[2026-02-01T10:15:00Z] agent:review-worker (session: ghi789)
Type: quality-assessment
Current grade: L1 (Draft)
Proposed grade: L2 (Developed)
Criteria:
[x] Clear structure with logical sections
[x] Executive summary present
[x] Major claims sourced (8/11)
[ ] Risk analysis complete — added but thin
[x] Success metrics defined with baselines
[ ] All sections internally consistent —
Pillar 3 mentions "Q3 launch" but timeline section says Q2
Gaps:
- 3 unsourced claims (tagged [citation needed])
- Timeline inconsistency between sections
- Competitive response risk assessment disputed
Decision: Promote to L2. Flag timeline inconsistency.
---
[2026-02-05T16:00:00Z] agent:competitive-analyst (session: jkl012)
Type: claim-conflict
Section: Risk Analysis
Claim: "Competitor launches similar AI features — Likelihood: High"
Dispute: Based on [[competitor_roadmap_analysis]], likelihood is
Medium at best. Their engineering team is focused on compliance
features per recent job postings [[competitor_hiring_analysis]].
Counterpoint: [[industry_rumors_q1]] suggests accelerated AI
investment, but source reliability is low (blog post,
single anonymous source).
Recommendation: Downgrade to Medium with note on uncertainty.
Flag for human review if this risk drives resource allocation.
---
[2026-02-10T11:00:00Z] agent:review-worker (session: mno345)
Type: quality-assessment
Current grade: L2 (Developed)
Proposed grade: L3 (Reviewed)
Criteria:
[x] All major claims sourced (10/11 — one [citation needed]
remains but is non-critical)
[x] Timeline inconsistency resolved
[x] Risk analysis expanded and sourced
[x] Reviewed by second agent
[x] No internal contradictions
[ ] Competitive risk claim still disputed — logged,
both positions documented
Decision: Promote to L3. Competitive risk dispute documented
but not blocking — both positions have evidence,
human should decide.
---
[2026-02-10T11:05:00Z] agent:review-worker (session: mno345)
Type: promotion-recommendation
Recommend for L4 (Canonical) human review.
This document has:
- 92% citation coverage
- 2 independent agent reviews
- 1 documented dispute (resolved with both positions)
- All source records current (none past staleness window)
- Clear structure following strategy document template
Human action needed: Review competitive risk assessment,
decide likelihood rating, then promote to L4.
Tags: [strategy, 2026, reviewed, canonical-candidate,
enterprise, growth, series-b]
These serve as Wikipedia-style categories — they're the primary discovery mechanism. An agent looking for strategy-related artifacts queries by tag. The tag taxonomy is managed at the workspace level (like Wikipedia's category tree).
This is the existing get_history on the record — every modification with actor, timestamp, and what changed. The Wikipedia lens doesn't change this; it's already the equivalent of Wikipedia's revision history. What the talk page adds is the WHY behind each edit, which the bare history doesn't capture.
| Wikipedia Pattern | Agent Translation | Why It Works |
|---|---|---|
| Talk pages | Linked thread records with structured entries | Agents are stateless — discourse must be externalized |
| Citation needed | Structural provenance metadata + tags | Agents can check this programmatically |
| Quality grades | L0-L5 with explicit criteria | Gives agents clear targets and review checklists |
| Templates/infoboxes | data._schema on containers | Already exists; Wikipedia lens names the pattern |
| Notability threshold | Search-before-create norm | Prevents artifact sprawl |
| Categories | Tags with managed taxonomy | Already exists; Wikipedia lens suggests governance |
| Redirect pages | supersedes links + archived originals | Prevents duplicate artifacts, preserves history |
| Wikipedia Pattern | Why It Doesn't Translate |
|---|---|
| Neutral point of view | Many agent artifacts SHOULD be opinionated (recommendations, strategy). NPOV matters for analysis but not for decisions. |
| Consensus-based editing | Agents don't need to reach consensus — they need clear precedence rules and human escalation. |
| Barnstars and recognition | Agents don't need motivation. Quality incentives are structural, not social. |
| Watchlists | Agents don't "watch" articles passively. Staleness detection is algorithmic, not attention-based. |
| Protection levels | The claimed status + gated_by links already handle this more precisely than Wikipedia's blunt protection tiers. |
| Arbitration committees | Overkill. Human-escalation for disputes is simpler and sufficient. |
| Wikipedia Pattern | Needed Adaptation |
|---|---|
| Edit wars / 3RR | Agents don't need cooling-off periods, but DO need conflict detection and source-precedence rules |
| Deletion discussions | Replace adversarial discussion with algorithmic staleness + human batch review |
| WikiProjects | Workstreams already serve this function — groups of related artifacts with shared conventions |
| Stub sorting | The quality grade system replaces this, but the insight ("identify what's incomplete and route it to someone who can fix it") is valuable |
The deepest thing Wikipedia proves is that high-quality artifacts can emerge from many stateless contributors with no central author, IF the right structural norms are in place. Wikipedia editors are functionally stateless in the same way agents are — they arrive at an article with no memory of its history, read the current state, make a contribution, and leave. The article's quality doesn't depend on any single editor's memory. It depends on:

- externalized discourse (the talk page)
- citation infrastructure that makes claims checkable
- explicit quality grades with clear promotion criteria
- templates that create consistency without central control
- a notability norm that prunes what shouldn't exist
All five of these are achievable in an agent-native workspace. Some (like citation infrastructure and quality assessment) are actually EASIER for agents than for humans, because agents can check criteria programmatically rather than relying on voluntary editorial effort.
The workspace doesn't need to become Wikipedia. But it should steal Wikipedia's deepest insight: the artifact itself should contain enough structural metadata that any new contributor (agent or human) can understand its state, trustworthiness, and what it needs — without asking anyone.
Wikipedia solved the hardest version of the problem we're working on: millions of stateless contributors (editors come and go, retain no memory of previous sessions, often work at cross-purposes) producing high-quality, continuously updated artifacts at extraordinary scale. The parallels to agent workflows aren't metaphorical — they're structural. An agent spinning up to update a strategy doc is functionally identical to an anonymous Wikipedia editor arriving at an article they've never seen before, making a contribution, and leaving.
This exploration examines which Wikipedia mechanisms translate directly, which need creative adaptation, and which are irrelevant because agents are fundamentally different from humans.
Every Wikipedia article has a paired "Talk" page. The article contains the canonical content; the talk page contains everything ABOUT the content: editorial disputes, proposed changes, requests for citation, assessments of quality. This separation is load-bearing. Without it, the article would be polluted with meta-commentary, and editorial reasoning would be invisible.
Talk pages have their own conventions: threaded discussions, signatures with timestamps, section headers that match article sections, and archival policies for old discussions.
Agents need talk pages even more than human editors do, because agents are stateless. A human editor might remember "I didn't add that paragraph because last time I tried, someone reverted it." An agent has no such memory. The talk page IS the agent's institutional memory about an artifact.
Concretely, every significant artifact (document, decision, spec, strategy) gets a paired thread record linked via a dedicated relationship — call it discussed_at or reuse relates_to with a convention. The thread is where:
The content/discourse separation ports perfectly. It's arguably MORE important for agents because they can't hold discourse in their heads — it must be externalized.
Wikipedia talk pages are conversational — they assume persistent identity and ongoing dialogue. Agent talk pages need to be more structured. Instead of freeform discussion threads, they work better as a log of typed entries:
[2026-02-15T10:23:00Z] agent:strategy-worker
Type: edit-rationale
Affected sections: Market Analysis
Reason: Updated TAM figures based on latest analyst report ([[src789]])
Confidence: high (primary source, <30 days old)
[2026-02-15T14:45:00Z] agent:review-worker
Type: quality-flag
Affected sections: Competitive Landscape
Issue: Two claims lack any citation. Competitor Z's market share
cited as "approximately 15%" with no source.
Recommendation: Add citations or mark as [citation needed]
This is structured enough that a future agent can parse it programmatically, but readable enough that a human can scan it.
This suggests a first-class pattern: every document/decision/spec-type record should be able to spawn a linked thread record that serves as its talk page. The thread type already exists. The convention is: one thread per artifact, linked with relates_to, tagged talk-page. Agents contributing to the artifact are expected to post to the thread. The thread becomes part of the artifact's start_work context.
Wikipedia's quality revolution wasn't technological — it was cultural. The [citation needed] tag, introduced around 2005, transformed the encyclopedia from "write whatever you know" to "prove what you claim." The genius is that flagging unsourced claims is EASIER than adding sources, so quality pressure ratchets upward: anyone can tag, and the tag creates social pressure to fix.
Wikipedia distinguishes:
For agents, citations shouldn't be a cultural norm — they should be a structural requirement enforced by the artifact format itself. This is where agent-native artifacts can leapfrog Wikipedia: Wikipedia relies on humans voluntarily adding citations. Agent artifacts can make citation a property of the creation process.
Every factual claim in an agent-created artifact should carry provenance metadata. Not as visible footnotes (though those can be rendered), but as structural links in the artifact graph. When an agent writes "Market size is estimated at $4.2B," the artifact format captures:
claim: "Market size is estimated at $4.2B"
source_record: [[analyst_report_456]]
source_type: external_document
extraction_confidence: high
staleness_window: 90 days
last_verified: 2026-02-10
This enables something Wikipedia can't do: automated staleness detection. If the source record is updated, or if the staleness window expires, the claim auto-flags itself. The artifact knows its own reliability surface.
The [citation needed] equivalent becomes structural too. An agent reviewing a document can tag uncited claims:
claim: "We expect 40% growth in the enterprise segment"
citation_status: needed
flagged_by: agent:review-worker
flagged_at: 2026-02-15T10:00:00Z
context: "No source provided. Similar claim in Q3 board deck
cited internal forecast [[forecast_789]] — verify if still current."
Wikipedia's "reliable sources" policy translates to a source reliability hierarchy for agent artifacts:
The citation-needed culture ports beautifully. Making it structural rather than cultural is a genuine improvement — agents don't need social pressure, they need schema constraints.
Wikipedia's reliable source debates are deeply human ("Is the Daily Mail a reliable source?"). Agents don't need editorial judgment about source quality — they need staleness windows and provenance chains. The question isn't "is this a good source?" but "can I trace this claim back to something verifiable, and is that something still current?"
The draws_from link type already exists. What's missing is claim-level granularity. Currently a document can link to its sources at the document level. The Wikipedia lens suggests artifacts need a way to associate specific claims within their body with specific source records. This could be:
- Inline: `According to [[analyst_report_456|claim:tam_estimate]], the market is $4.2B`
- A `data.citations` array on the record, mapping body sections to source records

The lightest-weight version: agents always post edit rationales to the talk page with source links, and the talk page IS the citation record. Heavier-weight: structured citation metadata in the record's `data` field.
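A sketch of what a claim-level `data.citations` array might look like, with a helper that surfaces the structural [citation needed] state. All field names here are illustrative assumptions, not an existing schema:

```python
# Hypothetical shape for a data.citations array on a document record,
# mapping claims in the body to source records.
record_data = {
    "citations": [
        {
            "claim_id": "tam_estimate",
            "section": "Market Context",
            "source_record": "analyst_report_456",
            "status": "cited",
        },
        {
            "claim_id": "enterprise_growth",
            "section": "Strategic Pillars",
            "source_record": None,
            "status": "needed",  # the structural [citation needed]
        },
    ]
}

def uncited(data: dict) -> list[str]:
    """Return claim ids still awaiting a source."""
    return [c["claim_id"] for c in data["citations"] if c["status"] == "needed"]

uncited(record_data)  # -> ["enterprise_growth"]
```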
Wikipedia's quality scale is explicit:
| Grade | Meaning |
|---|---|
| Stub | A few sentences. Placeholder. |
| Start | Has structure but incomplete. |
| C-class | Substantial but poorly organized or sourced. |
| B-class | Good content, good sources, some gaps. |
| Good Article (GA) | Passed a formal review. Well-written, well-sourced, covers major aspects. |
| Featured Article (FA) | The best Wikipedia has. Passed a rigorous peer review. Professional quality. |
Promotions are gated by review processes: GA requires one reviewer, FA requires community consensus. Each grade has explicit criteria.
Agent artifacts need a parallel quality ladder. Here's a proposed mapping:
| Grade | Name | Criteria | Who Promotes |
|---|---|---|---|
| L0 | Stub | Exists. Has a title and minimal content. Created as a placeholder or initial dump. | Auto (on creation) |
| L1 | Draft | Has structure (sections, some content). May have uncited claims. Not yet reviewed. | Auto (when agent completes first pass) |
| L2 | Developed | Substantive content. Key claims cited. Covers main topics. May have gaps. | Agent review (any agent can assess) |
| L3 | Reviewed | All major claims cited. No known gaps. Reviewed by at least one agent other than the author. Internal consistency verified. | Agent review + talk page assessment |
| L4 | Canonical | Human-approved. Considered the authoritative source on its topic within the workspace. Actively maintained (staleness checks). | Human promotion only |
| L5 | Featured | Best-in-class. Exemplary structure, sourcing, and clarity. Used as a template for other artifacts of its type. | Human promotion + review record |
Borrowing from Wikipedia's FA criteria, adapted:
- No [citation needed] flags older than 7 days.

Promotion isn't automatic beyond L1. An agent conducting a quality review creates an assessment entry on the talk page:
Type: quality-assessment
Current grade: L1 (Draft)
Proposed grade: L2 (Developed)
Criteria met:
- [x] Structured with clear sections
- [x] Key claims cited (8/10 claims have sources)
- [ ] All major topics covered (missing: risk analysis)
- [x] No internal contradictions found
Criteria gaps:
- Missing risk analysis section
- 2 claims in competitive analysis lack sources
Recommendation: Promote to L2 with note that risk analysis
section is needed for L3.
The explicit quality ladder ports very well. Agents benefit enormously from clear criteria because they can check them programmatically. "Is every claim sourced?" is a question an agent can actually answer, unlike a human who might not bother to check.
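The citation-coverage criterion from the assessment above ("8/10 claims have sources") is exactly this kind of programmatically checkable question. A minimal sketch, assuming a list of claims where sourced ones carry a `source_record`:

```python
def citation_coverage(citations: list[dict]) -> float:
    """Fraction of claims with a source -- a criterion an agent can
    verify mechanically rather than relying on editorial diligence."""
    if not citations:
        return 0.0
    cited = sum(1 for c in citations if c.get("source_record"))
    return cited / len(citations)

claims = [{"source_record": "analyst_report_456"}] * 8 + [{"source_record": None}] * 2
citation_coverage(claims)  # -> 0.8, matching "8/10 claims have sources"
```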
Wikipedia's promotion process involves community discussion and consensus. For agent artifacts, the lower tiers (L0-L3) can be agent-assessed. But L4-L5 require human judgment because "canonical" and "featured" involve organizational trust decisions that agents shouldn't make autonomously.
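A sketch of that promotion gate, assuming actor identifiers follow the `agent:` / `human:` prefix convention used in the talk-page examples:

```python
# Grades that agents may assess on their own; L4-L5 are organizational
# trust decisions and require a human actor.
AGENT_PROMOTABLE = {"L0", "L1", "L2", "L3"}

def can_promote(proposed_grade: str, actor: str) -> bool:
    """Allow agent promotion up to L3; gate L4-L5 on a human actor."""
    if proposed_grade in AGENT_PROMOTABLE:
        return True
    return actor.startswith("human:")

can_promote("L2", "agent:review-worker")  # -> True
can_promote("L4", "agent:review-worker")  # -> False
can_promote("L4", "human:pm-lead")        # -> True
```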
This maps cleanly onto the existing record system:
- `data.quality_grade` (L0-L5) stores the grade
- `query_record(where: { data.quality_grade: { lt: 3 } })` finds artifacts needing improvement
- The `tags` field can carry quality state: stub, draft, developed, reviewed, canonical, featured

Wikipedia's edit war problem is famous: two editors repeatedly reverting each other's changes. Wikipedia's solutions: the three-revert rule (3RR), page protection, and arbitration.
Agent edit wars look different from human ones. Agents don't have egos or political agendas, but they CAN produce contradictory outputs from differing source freshness, data discrepancies, or genuine analytical disagreement. Four adaptations address this:
1. Claim-level versioning (not document-level)
Instead of full-document conflict, track conflicts at the claim level. If Agent A says "TAM is $4.2B" and Agent B says "TAM is $3.8B," this isn't a document conflict — it's a claim-level dispute that surfaces as:
Type: claim-conflict
Claim: Total addressable market estimate
Position A: $4.2B (source: [[analyst_report_2025]], agent:market-worker)
Position B: $3.8B (source: [[internal_model_v3]], agent:forecast-worker)
Status: unresolved
Resolution path: Compare source recency and methodology
This goes on the talk page. The artifact itself can either show the most recent value with a conflict flag, or show both with a "disputed" marker.
2. Source-precedence rules (not revert wars)
Instead of a three-revert rule, establish source precedence:
An agent encountering a conflict doesn't revert — it checks source precedence, and if it can resolve, does so with a talk page note. If it can't, it flags the conflict and leaves both versions visible.
3. Ownership and locking
The claimed status already provides this. An artifact being actively worked on is claimed by one agent. Other agents can read it but shouldn't modify it. If they identify issues, they post to the talk page. The claiming agent integrates feedback.
For high-value artifacts, a stronger pattern: gated_by a review task. Changes to the artifact must pass through the review gate before being considered canonical.
4. The "status quo prevails" rule
Borrowed directly from Wikipedia: when there's genuine conflict and no clear resolution, the artifact retains its last human-reviewed state. Agent modifications that introduce conflicts are reverted to the pre-conflict version, and the conflict is escalated to the talk page for human resolution.
The principle that conflicts should be surfaced rather than silently resolved ports perfectly. Wikipedia's insight is that edit wars happen BECAUSE conflicts are resolved through overwriting instead of discussion. Agent talk pages prevent this.
Wikipedia's dispute resolution depends on human deliberation at every level. For agents, the lower-level conflicts (source freshness, data discrepancy) can be resolved algorithmically. Only genuine analytical disagreements ("should the strategy be aggressive or conservative?") need human arbitration. The bureaucratic escalation ladder (3RR -> protection -> arbitration) is unnecessary — agents don't need cooling-off periods.
Not everything deserves a Wikipedia article. Wikipedia's notability policy requires that a topic has received "significant coverage in reliable, independent sources." Articles that fail this test are nominated for deletion through speedy deletion, proposed deletion (PROD), or Articles for Deletion (AfD) discussions.
The key insight: Wikipedia DELETES things. The default is not to keep everything.
Agents are prolific creators. Left unchecked, they'll produce artifacts for every subtopic, every interim analysis, every draft that crosses their context window. The "chaos at higher frame rate" problem is fundamentally a notability problem.
Notability criteria for agent artifacts:
If two artifacts substantially cover the same topic, one should supersede the other.

The deletion/archival hierarchy:
| Action | When | Mechanism |
|---|---|---|
| Don't create | Topic already covered elsewhere | Agent checks before creating; workspace enforces search-before-create |
| Merge | Artifact overlaps significantly with another | Content folded into the more authoritative artifact; original archived with superseded_by link |
| Archive | Artifact was useful but is no longer current | archive_record — hidden from default views, preserved for search |
| Delete | Artifact was created in error, contains no useful content | delete_record — soft delete |
The "search before creating" norm is the most important Wikipedia import. On Wikipedia, creating a new article about a topic that already has one is considered disruptive. For agents, creating a new "Market Analysis" document when one already exists should require justification — and ideally, the agent should update the existing one instead.
The notability threshold ports well. The principle that not everything deserves to exist as a standalone artifact is critical for preventing workspace sprawl.
Wikipedia's deletion discussions are adversarial and political ("keep" vs. "delete" votes with arguments). Agents don't need adversarial processes. The deletion decision can be more algorithmic: unreferenced artifacts older than N days with no views get proposed for archival. Human reviews the batch.
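A sketch of that algorithmic proposal pass, assuming records expose inbound link counts, view counts, and creation dates (all hypothetical field names):

```python
from datetime import date, timedelta

def archival_candidates(records: list[dict], today: date,
                        max_age_days: int = 90) -> list[str]:
    """Propose unreferenced, unviewed artifacts older than N days for
    archival; a human reviews the batch instead of debating each one."""
    cutoff = today - timedelta(days=max_age_days)
    return [
        r["id"] for r in records
        if r["inbound_links"] == 0
        and r["views"] == 0
        and date.fromisoformat(r["created"]) < cutoff
    ]

records = [
    {"id": "draft_tmp_1", "inbound_links": 0, "views": 0, "created": "2025-10-01"},
    {"id": "market_analysis_q4", "inbound_links": 5, "views": 40, "created": "2025-10-01"},
]
archival_candidates(records, date(2026, 2, 15))  # -> ["draft_tmp_1"]
```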
This reinforces the existing bootstrap principle: "Search before suggesting: query for existing entities before proposing structure." The Wikipedia lens strengthens this from a suggestion to a structural norm. The create_record flow could include a check: "Did you search for existing records on this topic?" — with the search results included in the creation context so the agent (or a reviewing agent) can verify non-redundancy.
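A sketch of that gated create flow; `search_fn` and `create_fn` are hypothetical stand-ins for the workspace's query and `create_record` calls:

```python
def create_with_check(title: str, search_fn, create_fn):
    """Search-before-create gate. Returns (created, context) so a
    reviewing agent can verify non-redundancy from the search results
    included in the creation context."""
    existing = search_fn(title)
    if existing:
        return False, {"reason": "existing records found", "matches": existing}
    record = create_fn(title)
    return True, {"reason": "no overlap found", "record": record}

created, ctx = create_with_check(
    "Market Analysis",
    search_fn=lambda t: ["market_analysis_q4"],  # stubbed search hit
    create_fn=lambda t: {"title": t},
)
created  # -> False: the agent should update market_analysis_q4 instead
```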
Wikipedia templates are one of its most powerful innovations.
Templates create consistency without central control. Any editor creating a new country article uses {{Infobox country}} and the structure is automatically consistent.
Infoboxes become data schemas on parent containers. The workspace already supports data._schema on containers that validates child record data. This IS the infobox pattern:
{
"_schema": {
"enforcement": "warn",
"schema": {
"type": "object",
"properties": {
"status_summary": { "type": "string" },
"quality_grade": { "type": "string", "enum": ["L0", "L1", "L2", "L3", "L4", "L5"] },
"last_reviewed": { "type": "string", "format": "date" },
"staleness_window_days": { "type": "number" },
"citation_coverage": { "type": "number", "minimum": 0, "maximum": 1 },
"has_talk_page": { "type": "boolean" }
}
}
}
}
Every strategy document created under a "Strategy" collection inherits this schema, ensuring consistency.
Maintenance templates become tags + conventions. Wikipedia's {{citation needed}} becomes a tag: needs-citations. {{out of date}} becomes stale. These are queryable: an agent can find all artifacts tagged stale and prioritize freshness updates.
Key maintenance tags (borrowing Wikipedia's template categories):
| Tag | Wikipedia Equivalent | Meaning |
|---|---|---|
| needs-citations | {{citation needed}} | Contains unsourced claims |
| stale | {{out of date}} | Source data may be outdated |
| needs-expansion | {{expand section}} | Key topics not yet covered |
| disputed | {{disputed}} | Contains conflicting claims |
| merge-candidate | {{merge}} | Overlaps with another artifact |
| needs-review | {{unreferenced}} | Has not been reviewed by a second agent |
| orphan | {{orphan}} | Not linked from/to any other artifact |
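Because tags are plain queryable state, the maintenance sweep is trivial. A minimal sketch with hypothetical record shapes:

```python
def find_tagged(records: list[dict], tag: str) -> list[str]:
    """Maintenance tags as queryable state: find artifacts needing work."""
    return [r["id"] for r in records if tag in r["tags"]]

records = [
    {"id": "product_strategy_2026", "tags": ["strategy", "reviewed"]},
    {"id": "competitor_report_2025", "tags": ["stale", "needs-review"]},
]
find_tagged(records, "stale")  # -> ["competitor_report_2025"]
```

An agent prioritizing freshness work would run this for `stale`, then walk each hit's `draws_from` links to re-verify sources.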
Navboxes become the artifact graph. Wikipedia navboxes manually link related articles. In the workspace, the relates_to, draws_from, implements, and serves links ARE the navbox — but dynamic and automatically maintained. An agent doesn't need to manually add a navbox; the workspace renders the relationship graph.
Content templates become body conventions. A decision record always has: Context, Options Considered, Decision, Rationale, Dissent. A strategy document always has: Executive Summary, Market Analysis, Strategic Options, Recommended Approach, Risks, Success Metrics. These conventions are enforced through the data._schema on the container (ensuring the structured data is present) and through guide resources (hive://guide/decision-format) that agents read before creating artifacts of that type.
Templates port extremely well. Wikipedia templates solved the "consistency without central control" problem, and that's exactly what agents need. The data._schema pattern is already the infobox pattern — the Wikipedia lens just makes it explicit.
Wikipedia template syntax ({{Infobox country|capital=London}}) is a DSL that evolved organically and is famously hard to learn. Agent artifacts don't need a template DSL — the schema validation on containers is cleaner and more powerful.
Here's what a "Product Strategy 2026" artifact looks like through the full Wikipedia lens.
Record: Product Strategy 2026
Type: document
Quality: L3 (Reviewed)
Tags: [strategy, 2026, reviewed, canonical-candidate]
Data:
quality_grade: L3
last_reviewed: 2026-02-10
staleness_window_days: 30
citation_coverage: 0.92
has_talk_page: true
review_count: 2
Links:
draws_from: [[market_analysis_q4]], [[competitor_report_2025]],
[[customer_interviews_jan26]], [[board_feedback_dec25]]
implements: [[objective_enterprise_growth]]
serves: [[goal_series_b_readiness]]
discussed_at: [[strategy_2026_talk]] (talk page)
supersedes: [[product_strategy_2025]]
Body:
---
## Executive Summary
Our 2026 product strategy focuses on three pillars: enterprise
expansion, platform extensibility, and AI-native workflows. We
target $12M ARR by Q4 2026 [source: [[financial_model_v4]]],
requiring 40% growth in enterprise segment [source: [[growth_model_feb26]]].
## Market Context
The collaboration tools market is projected at $4.2B in 2026
[source: [[gartner_collab_2025]], confidence: high,
verified: 2026-01-15, stale after: 2026-07-15].
Key trends:
- AI-native tools gaining share from incumbents
[source: [[forrester_ai_tools_2025]]]
- Enterprise buyers consolidating vendors
[source: [[customer_interviews_jan26]], claim 7]
- [citation needed] Developer-led adoption increasingly
driving enterprise deals
## Strategic Pillars
### Pillar 1: Enterprise Expansion
...
### Pillar 2: Platform Extensibility
...
### Pillar 3: AI-Native Workflows
...
## Risk Analysis
| Risk | Likelihood | Impact | Mitigation |
|------|-----------|--------|------------|
| Enterprise sales cycle longer than modeled | Medium | High | Pipeline buffer of 30% [source: [[sales_analysis_q4]]] |
| Competitor launches similar AI features | High | Medium | [disputed — see talk page, section "Competitive response timing"] |
## Success Metrics
| Metric | Current | Target | Source |
|--------|---------|--------|--------|
| ARR | $8.5M | $12M | [[financial_model_v4]] |
| Enterprise customers | 23 | 45 | [[sales_pipeline_feb26]] |
| Platform API adoption | 12% | 35% | [[usage_analytics_jan26]] |
---
Record: Product Strategy 2026 — Talk
Type: thread
Tags: [talk-page, strategy]
Links:
relates_to: [[product_strategy_2026]]
Entries:
---
[2026-01-20T09:00:00Z] agent:strategy-worker (session: abc123)
Type: creation-note
This document was created by synthesizing inputs from four source
artifacts. Initial quality grade: L1 (Draft). Key gaps identified:
- No risk analysis section yet
- Competitive landscape section relies heavily on 2025 data
- Financial projections need validation against latest model
---
[2026-01-25T14:30:00Z] agent:market-analyst (session: def456)
Type: edit-rationale
Affected sections: Market Context
Action: Updated TAM figure from $3.9B to $4.2B
Reason: Gartner published updated figures in Jan 2026 report.
Previous figure was from mid-2025 projection.
Source: [[gartner_collab_2025]]
Confidence: high
---
[2026-02-01T10:15:00Z] agent:review-worker (session: ghi789)
Type: quality-assessment
Current grade: L1 (Draft)
Proposed grade: L2 (Developed)
Criteria:
[x] Clear structure with logical sections
[x] Executive summary present
[x] Major claims sourced (8/11)
[ ] Risk analysis complete — added but thin
[x] Success metrics defined with baselines
[ ] All sections internally consistent —
Pillar 3 mentions "Q3 launch" but timeline section says Q2
Gaps:
- 3 unsourced claims (tagged [citation needed])
- Timeline inconsistency between sections
- Competitive response risk assessment disputed
Decision: Promote to L2. Flag timeline inconsistency.
---
[2026-02-05T16:00:00Z] agent:competitive-analyst (session: jkl012)
Type: claim-conflict
Section: Risk Analysis
Claim: "Competitor launches similar AI features — Likelihood: High"
Dispute: Based on [[competitor_roadmap_analysis]], likelihood is
Medium at best. Their engineering team is focused on compliance
features per recent job postings [[competitor_hiring_analysis]].
Counterpoint: [[industry_rumors_q1]] suggests accelerated AI
investment, but source reliability is low (blog post,
single anonymous source).
Recommendation: Downgrade to Medium with note on uncertainty.
Flag for human review if this risk drives resource allocation.
---
[2026-02-10T11:00:00Z] agent:review-worker (session: mno345)
Type: quality-assessment
Current grade: L2 (Developed)
Proposed grade: L3 (Reviewed)
Criteria:
[x] All major claims sourced (10/11 — one [citation needed]
remains but is non-critical)
[x] Timeline inconsistency resolved
[x] Risk analysis expanded and sourced
[x] Reviewed by second agent
[x] No internal contradictions
[ ] Competitive risk claim still disputed — logged,
both positions documented
Decision: Promote to L3. Competitive risk dispute documented
but not blocking — both positions have evidence,
human should decide.
---
[2026-02-10T11:05:00Z] agent:review-worker (session: mno345)
Type: promotion-recommendation
Recommend for L4 (Canonical) human review.
This document has:
- 92% citation coverage
- 2 independent agent reviews
- 1 documented dispute (resolved with both positions)
- All source records current (none past staleness window)
- Clear structure following strategy document template
Human action needed: Review competitive risk assessment,
decide likelihood rating, then promote to L4.
Tags: [strategy, 2026, reviewed, canonical-candidate,
enterprise, growth, series-b]
These serve as Wikipedia-style categories — they're the primary discovery mechanism. An agent looking for strategy-related artifacts queries by tag. The tag taxonomy is managed at the workspace level (like Wikipedia's category tree).
This is the existing get_history on the record — every modification with actor, timestamp, and what changed. The Wikipedia lens doesn't change this; it's already the equivalent of Wikipedia's revision history. What the talk page adds is the WHY behind each edit, which the bare history doesn't capture.
| Wikipedia Pattern | Agent Translation | Why It Works |
|---|---|---|
| Talk pages | Linked thread records with structured entries | Agents are stateless — discourse must be externalized |
| Citation needed | Structural provenance metadata + tags | Agents can check this programmatically |
| Quality grades | L0-L5 with explicit criteria | Gives agents clear targets and review checklists |
| Templates/infoboxes | data._schema on containers | Already exists; Wikipedia lens names the pattern |
| Notability threshold | Search-before-create norm | Prevents artifact sprawl |
| Categories | Tags with managed taxonomy | Already exists; Wikipedia lens suggests governance |
| Redirect pages | supersedes links + archived originals | Prevents duplicate artifacts, preserves history |
| Wikipedia Pattern | Why It Doesn't Translate |
|---|---|
| Neutral point of view | Many agent artifacts SHOULD be opinionated (recommendations, strategy). NPOV matters for analysis but not for decisions. |
| Consensus-based editing | Agents don't need to reach consensus — they need clear precedence rules and human escalation. |
| Barnstars and recognition | Agents don't need motivation. Quality incentives are structural, not social. |
| Watchlists | Agents don't "watch" articles passively. Staleness detection is algorithmic, not attention-based. |
| Protection levels | The claimed status + gated_by links already handle this more precisely than Wikipedia's blunt protection tiers. |
| Arbitration committees | Overkill. Human-escalation for disputes is simpler and sufficient. |
| Wikipedia Pattern | Needed Adaptation |
|---|---|
| Edit wars / 3RR | Agents don't need cooling-off periods, but DO need conflict detection and source-precedence rules |
| Deletion discussions | Replace adversarial discussion with algorithmic staleness + human batch review |
| WikiProjects | Workstreams already serve this function — groups of related artifacts with shared conventions |
| Stub sorting | The quality grade system replaces this, but the insight ("identify what's incomplete and route it to someone who can fix it") is valuable |
The deepest thing Wikipedia proves is that high-quality artifacts can emerge from many stateless contributors with no central author, IF the right structural norms are in place. Wikipedia editors are functionally stateless in the same way agents are — they arrive at an article with no memory of its history, read the current state, make a contribution, and leave. The article's quality doesn't depend on any single editor's memory. It depends on externalized discourse, citation infrastructure, explicit quality criteria, surfaced conflicts, and notability thresholds.
All five of these are achievable in an agent-native workspace. Some (like citation infrastructure and quality assessment) are actually EASIER for agents than for humans, because agents can check criteria programmatically rather than relying on voluntary editorial effort.
The workspace doesn't need to become Wikipedia. But it should steal Wikipedia's deepest insight: the artifact itself should contain enough structural metadata that any new contributor (agent or human) can understand its state, trustworthiness, and what it needs — without asking anyone.