Pipeline Phases

The paper pipeline runs 10 sequential phases. Verification and review phases can loop back for revision. The entire cycle takes 30 minutes to 4 hours depending on model speed and revision depth.

Agent Action Timeline

Literature Scout Novelty Engines Methodology Data Engineer Writer Verifier Reviewers Formatter

Phase 0: Voice Calibration

Input: 2-3 paragraphs of the author’s published writing Output: Voice style profile (sentence length, vocabulary, paragraph patterns, punctuation)

The profile is loaded by every Writer agent. Writers match the author’s voice at the sentence level, not just the topic level.

Phase 1: Literature Review

5 parallel scouts hit different sources:

Scout	Source	Max Results
1	arXiv OAI-PMH	200
2	Semantic Scholar	100
3	CrossRef	50
4	OpenAlex	50
5	Field-specific (NCBI, etc.)	varies

Results are deduplicated by title similarity. The orchestrator builds a master literature graph from all discovered papers.

Phase 1.5: Novelty Generation

All 6 novelty engines run in parallel, generating 50+ hypotheses scored by novelty × tractability. Top 3 become the paper’s contribution.

Engine	Angle	Output
Contrarian	Invert field claims	10 counter-hypotheses
Cross-Pollinator	Import from distant fields	Top 5 analogies
Assumption Excavator	Find unstated assumptions	5 testable assumptions
Counterfactual Generator	Rewrite field history	5 counterfactual histories
Paradox Sifter	Cross-reference limitations	Paradoxes + elephants
Heretic	50 wild guesses from title alone	50 hypotheses + haunting idea

Phase 2: Hypothesis Selection

Top hypotheses are presented to the user (if connected) or selected automatically. Each includes: primary claim, evidence base, gap filled, proposed approach.

Phase 3: Methodology Design

Recommends correct statistical tests, performs power analysis, designs experimental protocol, flags confounds. Outputs a reproducible analysis plan.

Phase 4: Data Engineering

Writes Python analysis code, generates publication-ready figures (SciencePlots, 300 DPI, colorblind-safe), computes all statistics with correct reporting.

Phase 5: Parallel Writing

5 Writer subagents write simultaneously:

Writer	Section
Writer 1	Abstract
Writer 2	Introduction
Writer 3	Methods
Writer 4	Results
Writer 5	Related Work + Discussion

All 41 Humanizer patterns are hard constraints during generation, not post-hoc fixes.

Phase 6: Verification

3 parallel verification modules:

Citation Verifier — Every citation checked against Semantic Scholar + CrossRef. Must be found in 2+ sources.
Statistical Auditor — Validates p-values, effect sizes, sample sizes, test selection. Flags p-hacking and multiple comparison errors.
AI-Pattern Detector — Scans for all 41 Humanizer patterns. Density must be < 2/1000 words.

All 3 must pass. If any fails, the section goes back to the Writer with annotations.

Phase 7: Adversarial Review

10 reviewer personas (Theorist, Empiricist, Pragmatist, Skeptic, Historian, Methodologist, Ethicist, Competitor, Student, Dreamer) review independently. All 10 must recommend acceptance.

Phase 8: Revision

Writers receive annotated critiques and revise. The loop continues until all 10 reviewers accept or the orchestrator determines a critique has been adequately addressed.

Phase 9: Style Audit

Complete paper scanned for all 41 patterns:

Pattern density < 1 per 2000 words
Em dash count: ZERO
Voice must match author profile

PASS → Proceed to formatting. FAIL → Back to Writer with line-level annotations.

Phase 10: Formatting → Submission

Formatter loads venue-specific LaTeX template, generates BibTeX from verified citations, embeds figures, compiles PDF. Outputs: paper.tex, references.bib, paper.pdf.

Supported venue templates: NeurIPS, ICML, ICLR, ACL, Nature/Springer, arXiv.

Architecture Novelty Engines