Step 4.1: Quality Evaluation
Invoke report-reviewer agent:
```
Task: report-reviewer
Prompt: Evaluate this weekly report draft for quality and understandability.
[Include the draft report]
```
Step 4.2: Fact Checking (Hallucination Prevention) - CRITICAL
β οΈ HARD CONSTRAINT: Hallucinationμ λ¨ νλλ νμ©νμ§ μλλ€.
λͺ¨λ μ¬μ€μ μ£Όμ₯μ λ°λμ κ²μ¦ν΄μΌ νλ©°, κ²μ¦λμ§ μμ μ 보λ λ³΄κ³ μμ ν¬ν¨ν μ μλ€.
#### 4.2.1: Hallucination μ νλ³ κ²μ¦ λ°©λ²
| Hallucination μ ν | κ²μ¦ λ°©λ² | λꡬ |
|-------------------|----------|------|
| μ‘΄μ¬νμ§ μλ κΈ°λ₯ | ν΄λΉ μ½λ νμΌμμ κΈ°λ₯ ꡬν μ¬λΆ νμΈ | Read tool |
| μλͺ»λ μμΉ/νλΌλ―Έν° | config νμΌ, μ½λ, κ²°κ³Ό νμΌκ³Ό λμ‘° | Read tool |
| νμ λ
Όλ¬Έ μΈμ© | λ
Όλ¬Έ μ‘΄μ¬ μ¬λΆ λ° λ΄μ© κ²μ¦ | WebSearch |
| μλͺ»λ μ μ/μ°λ | μ€μ λ
Όλ¬Έ λ©νλ°μ΄ν° νμΈ | WebSearch |
| κ³Όμ₯λ μ±λ₯ μ£Όμ₯ | μ€ν κ²°κ³Ό νμΌκ³Ό μ§μ λμ‘° | Read tool |
| μ‘΄μ¬νμ§ μλ API/ν¨μ | 곡μ λ¬Έμ λλ μ½λλ² μ΄μ€ νμΈ | WebSearch, Read |
#### 4.2.2: κ²μ¦ νλ‘μΈμ€
```
λ³΄κ³ μμ λͺ¨λ λ¬Έμ₯ μν:
β
ββ [κΈ°μ μ μ£Όμ₯] β μ½λ νμΌμμ μ§μ νμΈ
β μ: "LTPO νμ΅λ₯ μ 0.03μ΄λ€"
β κ²μ¦: ltpo/config.yaml λλ ν΄λΉ .py νμΌ μ½κΈ°
β
ββ [μμΉ λ°μ΄ν°] β μλ³Έ λ°μ΄ν°μ λμ‘°
β μ: "μ€ν κ²°κ³Ό 87% μ νλ λ¬μ±"
β κ²μ¦: results/*.json νμΌμμ μ€μ κ° νμΈ
β
ββ [λ
Όλ¬Έ μΈμ©] β WebSearchλ‘ κ²μ¦
β μ: "Zhang et al. (2024)μ λ°λ₯΄λ©΄..."
β κ²μ¦: λ
Όλ¬Έ μ‘΄μ¬ μ¬λΆ, μ μ, μ°λ, λ΄μ© μΌμΉ νμΈ
β
ββ [ꡬν λ΄μ©] β Git diff λ° μ½λ νμΈ
β μ: "μλ‘μ΄ reward ν¨μλ₯Ό μΆκ°νλ€"
β κ²μ¦: git log, ν΄λΉ νμΌμμ ν¨μ μ‘΄μ¬ νμΈ
β
ββ [μ€μ κ°] β config νμΌ νμΈ
μ: "batch size 32λ‘ νμ΅"
κ²μ¦: configs/*.yaml νμΌμμ νμΈ
```
#### 4.2.3: Fact Base μ
λ°μ΄νΈ
κ²μ¦ κ²°κ³Όλ₯Ό Fact Baseμ κΈ°λ‘:
```
β Verified:
- "LTPO lr=0.03" (ltpo/memgen_ltpo.py:36)
- "Titans λ
Όλ¬Έ 2025λ
" (WebSearch: arXiv:2501.00663)
- "GPT νμ°μ¨ 4.6%" (results/gpt_corrected.json:bankruptcy_rate)
β HALLUCINATION DETECTED:
- "99% μ νλ λ¬μ±" β μ€μ : 87% (results/exp1.json)
- "Kim et al. (2024)" β WebSearch: ν΄λΉ λ
Όλ¬Έ μμ
- "μλ μ μ₯ κΈ°λ₯" β μ½λμ ν΄λΉ κΈ°λ₯ μμ
β οΈ Needs Verification:
- "MemGenμ 2024λ
λ°ν" β WebSearch νμ
```
#### 4.2.4: Hallucination λ°κ²¬ μ μ‘°μΉ
- μ¦μ ν΄λΉ λ΄μ© μμ λλ μμ
- μ¬λ°λ₯Έ μ λ³΄λ‘ λ체 (κ²μ¦λ μ¬μ€λ§ μ¬μ©)
- κ²μ¦ λΆκ°λ₯ν λ΄μ©μ λ³΄κ³ μμμ μ μΈ
- μ¬κ²μ¦ ν λ€μ iteration μ§ν
μ λ κΈμ§ μ¬ν:
- κ²μ¦ μμ΄ μμΉ μΈκΈ β
- μ½μ§ μμ λ
Όλ¬Έ μΈμ© β
- νμΈνμ§ μμ μ½λ κΈ°λ₯ μ€λͺ
β
- μΆμΈ‘μ± λ΄μ©μ μ¬μ€μ²λΌ μμ β
Step 4.3: Check Termination Conditions
Success Criteria (all must be met):
- Critical issues: 0
- Hallucinations: 0
- Overall score: >= 80
If criteria met: Save final report and complete
If criteria not met: Proceed to rewriting
Step 4.4: Rewriting Based on Feedback
Address each issue:
- Add missing definitions for undefined terms
- Add compare-contrast for unexplained new approaches
- Improve paragraph flow with connective words
- Fix any factual errors identified by fact checker
- Simplify complex sections for better understandability
Step 4.5: Loop Control
- Maximum iterations: 5
- If max iterations reached without meeting criteria:
- Save best version so far
- Report unresolved issues to user
- Request manual intervention
---