9 Commits

Author SHA1 Message Date
Joey Eamigh
67beaede45
quantization + onnx sweeps
Phase 10.8: torchao/bnb quant sweep on iter1-independent. bf16 already
optimal; torchao int8-wo gives -19% VRAM at no F1 cost; all 4-bit
variants collapse (ModernBERT-large too quant-sensitive).

Phase 10.9: ONNX export + ORT eval. Legacy exporter only working path
(dynamo adds 56 Memcpy nodes); ORT fp32 -22% latency vs torch via
kernel fusion but bf16+flash-attn-2 still wins; fp16 broken on rotary;
dynamic int8 silently CPU-fallback + 0.5 F1 collapse.

Driver scripts wired to bun run py:quant / py:onnx; full reports at
results/eval/{quant,onnx}/REPORT.md.
2026-04-07 05:10:38 -04:00
Joey Eamigh
a5f06f2db7
infra: migrate from DVC to Git LFS with xxh3 change detection
Replace DVC pipeline with Git LFS on self-hosted Gitea. New scripts
use per-file xxh3 hashing for change detection and parallel zstd-19
compression. Supports separate data/checkpoint push modes.
2026-04-05 16:21:14 -04:00
Joey Eamigh
6f4d6c57a4
labelapp updates v2 2026-04-05 00:55:53 -04:00
Joey Eamigh
e2c7a21c99
human labeling done 2026-04-02 00:28:31 -04:00
Joey Eamigh
8190950f1a
fix some bugs for tapt 2026-03-30 20:44:10 -04:00
Joey Eamigh
c0273c9e2e
adding dvc backend so data can be cleanly pulled 2026-03-30 16:53:35 -04:00
Joey Eamigh
ca4bc288c9
labelapp timing fixes and migration 2026-03-29 16:37:51 -04:00
Joey Eamigh
11cb91564b
fix docker scripts 2026-03-29 01:17:00 -04:00
Joey Eamigh
3260a9c5d9
labelapp scaffold 2026-03-28 23:44:37 -04:00