docs(flows): DoD obligatorio con user-facing surface + abrir issues 0100-0103 (taxonomia, frontmatter migration, dev_console, work dashboard)
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
This commit is contained in:
@@ -68,6 +68,33 @@ Probar end-to-end el stack: navegator AutoExtract -> recipe -> dag_engine schedu
|
||||
- `dag_engine.dag_step_results`: step `extract` con `function_id='cdp_extract_recipe_py_pipelines'`.
|
||||
- `call_monitor.calls`: chain function call.
|
||||
|
||||
## Definition of Done
|
||||
|
||||
Ver `README.md` seccion DoD + user-facing.
|
||||
|
||||
### Generico
|
||||
|
||||
- [ ] **Repetibilidad**: corre 3 veces consecutivas via cron sin intervencion.
|
||||
- [ ] **Observabilidad**: `call_monitor.calls` registra `cdp_extract_recipe_py_pipelines` + `data_factory.runs` muestra `node_id=hn_top_stories`.
|
||||
- [ ] **Error-path**: si Chrome :9222 cae, el step falla con mensaje claro (no crash silencioso del DAG).
|
||||
- [ ] **Idempotencia**: dedup `dedup_duckdb_table_by_hash_py_pipelines` corre tras extract; mismo HTML 2x = 0 filas nuevas.
|
||||
- [ ] **Secrets**: N/A (HN publico).
|
||||
- [ ] **Docs**: `## Notas` con comandos para reproducir + onboarding.
|
||||
- [ ] **Registry-first**: extract sin codigo inline en el DAG.
|
||||
- [ ] **INDEX + status**: `status: done` + `INDEX.md` + movido a `completed/`.
|
||||
|
||||
### User-facing
|
||||
|
||||
- [ ] **User-facing**: usuario abre `data_factory.exe` → tab "All Runs" filtra `node_id=hn_top_stories` → ve >=30 filas con rank/title/url/points.
|
||||
- [ ] **User-facing repeat**: vuelve manana al mismo tab, ve runs frescos (cada 30 min) y tabla actualizada.
|
||||
- [ ] **User-facing onboarding**: parrafo en `## Notas`: "Para ver HN top: lanzar `data_factory.exe` → tab Extractors → `hn_top_stories`. DuckDB en `apps/data_factory/data/hn_top_stories.duckdb` tabla `hn_stories`."
|
||||
- [ ] **User-facing latencia**: cron `*/30 * * * *` → datos frescos en <31 min p95.
|
||||
|
||||
### Custom
|
||||
|
||||
- [ ] 7/7 campos cubiertos en TODOS los runs ultimas 24h (rank/title/url/points/author/age/comments).
|
||||
- [ ] Latencia extract <30s p95 (cdp_extract_recipe + render).
|
||||
|
||||
## Notas
|
||||
|
||||
(rellenas tras correr)
|
||||
|
||||
Reference in New Issue
Block a user