Agentic harness · Codex CLI / Claude Code

Write the review paper.
Verify every citation.

A highly customizable agentic harness for arXiv-ready ML/AI review papers and beyond. It drives agentic AI through a gated LaTeX workflow where every BibTeX citation is checked against a live source before it lands.

100%
cites verified
A Survey of State-of-the-Art
Generative Image Models
Generated with the arXiv Review Paper Harness
IEEEtran · review article · preprint

Abstract. This review surveys diffusion, autoregressive, and masked generative models for high-fidelity image synthesis, with a unified taxonomy and evaluation protocol.

I. Introduction
prior work [12] established the
II. Model Families
Fig. 1. Taxonomy of model families.
100%
citations verified against a live source
2
prompts to a complete first draft
0
hallucinated or fabricated references
4
portable skills in the bundle
01 / The Workflow

A gated pipeline, not a single prompt.

Discovery, a human approval gate, an issue-by-issue writing loop, verification, and compilation. The agent cannot write a paragraph into the paper until you approve the plan.

01

Research snapshot

Topic becomes keywords, then 10 to 20 key papers. No prose yet.

02

Scaffold

An IEEEtran two-column project, compiled early to catch errors.

03 · gate

Approval

You approve the plan and outline. Nothing is written until you do.human in the loop

04

Issues CSV

The execution contract. One tracked row per section and task.

05

Write & verify

Each section researched and drafted, every citation checked against a live source.

06

Rhythm refinement

Sentence and paragraph length varied for readability, citations preserved.

07

QA & compile

Overfull boxes fixed. Zero undefined citations before delivery.

08 · output

Ready as a preprint

main.pdf and a verified ref.bib.compiles clean

02 / Why It Is Different

Discipline the model does not have on its own.

Guardrails that forbid the two things agents are worst at, writing before a plan exists and citing sources they have not checked.

Hard quality gates

No prose before approval. Nothing marked done until its acceptance criteria are met.

Verified citations only

Every citation is checked against a live source before it enters ref.bib. No fabricated references.

Issue-driven execution

A CSV is the single source of truth. Scope grows by splitting issues, never by quietly guessing.

Deterministic scripts

Scaffolding, arXiv discovery, validation, and compile run the same way on every pass.

Runs where you work

One portable Agent Skill bundle, equally at home in Codex CLI and Claude Code.

Compiles or it is not done

Delivery requires a clean pdflatex and bibtex build with no warnings.

04 / Quickstart

Two prompts to a compiled paper.

01. Name a topic. The harness runs a literature pass, drafts a framework, proposes titles, and writes a plan for you to approve.

02. Hand it the decisions. Even a deliberately vague reply produces a complete, compiling LaTeX project.

Read the full quickstart
codex / claude code
prompt 1 ›write a review article for arxiv that is about SOTA generative image models
prompt 2 ›I will let you choose the best title and the topics and inclusion of material that you see the best fit
The second prompt was intentionally vague. The harness still delivered a complete, compiling paper with verified citations.
05 / Questions

Good to know.

How does it prevent hallucinated or invented citations?
Guardrails, built into the workflow. Every citation is verified against a live source before it enters ref.bib, and any claim without evidence becomes a TODO rather than a fabricated reference.
Can I use it on an existing LaTeX project?
Yes. Point it at your project and a citation-validation pass audits and repairs ref.bib without re-scaffolding anything.
Can it write original or experimental research papers, not just reviews?
Yes, with a little tailoring. Review and survey articles are its sweet spot out of the box, but nothing locks it there. Shape the plan and inputs to your goal, and the same gated workflow extends to original or experimental work.
Open source · MIT

Star it. Fork it.
Write your paper.

A highly customizable agentic harness for preprint-ready review papers and beyond.