# sec-cyBERT

Bun workspace monorepo. Three packages:

- `packages/schemas/`: shared Zod schemas (`@sec-cybert/schemas`). Import directly by path: `from "@sec-cybert/schemas/label.ts"`
- `ts/`: GenAI labeling pipeline (CLI scripts, Vercel AI SDK, OpenRouter)
- `labelapp/`: Next.js human labeling webapp (Drizzle, Postgres, shadcn/ui, Playwright)

## Quick reference

| What | Where |
|------|-------|
| Shared schemas (Zod) | `packages/schemas/src/` |
| Labeling codebook (source of truth for all category/specificity definitions) | `docs/LABELING-CODEBOOK.md` |
| Codebook ethos (reasoning behind every codebook decision) | `docs/CODEBOOK-ETHOS.md` |
| Project narrative (decisions, roadblocks, lessons) | `docs/NARRATIVE.md` |
| Project status & todo list | `docs/STATUS.md` |
| Specificity improvement plan (pending threshold tuning) | `docs/SPECIFICITY-IMPROVEMENT-PLAN.md` |
| Training docs (DAPT procedure, data quality audit, strategy notes) | `docs/training/` |
| Data pipeline reference (tech guide, HTML cleaning, filing generators) | `docs/data-pipeline/` |
| v1 archive (codebook, narrative, iteration logs, analyses) | `docs/archive/v1/` |
| Planning archive (project overview, implementation plan, labelapp plan) | `docs/archive/planning/` |
| Professor-provided reference materials | `docs/reference/` |
| Labelapp-specific agent guide | `labelapp/AGENTS.md` |
| Docker compose (Postgres) | `docker-compose.yaml` (root) |
| DB credentials | `sec_cybert` / `sec_cybert` / `sec_cybert` on localhost:5432 |

## Root scripts

All commands run from repo root via `bun run