Prune stale editor and Wayland surface area

Stop shipping code that implied Aman supported a two-pass editor, external API cleanup, or a Wayland scaffold when the runtime only exercises single-pass local cleanup on X11.\n\nCollapse aiprocess to the active single-pass Llama contract, delete desktop_wayland and the empty wayland extra, and make model_eval reject pass1_/pass2_ tuning keys while keeping pass1_ms/pass2_ms as report compatibility fields.\n\nRemove the unused pillow dependency, switch to SPDX-style license metadata, and clean setuptools build state before packaging so deleted modules do not leak into wheels. Update the methodology and repo guidance docs, and add focused tests for desktop adapter selection, stale param rejection, and portable wheel contents.\n\nValidate with uv lock, python3 -m unittest discover -s tests -p 'test_*.py', python3 -m py_compile src/*.py tests/*.py, and python3 -m build --wheel --sdist --no-isolation.
This commit is contained in:
Thales Maciel 2026-03-14 17:48:23 -03:00
parent dd2813340b
commit 94ead25737
12 changed files with 98 additions and 811 deletions

View file

@ -8,17 +8,14 @@ Find a local model + generation parameter set that significantly reduces latency
All model candidates must run with the same prompt framing:
- XML-tagged system contract for pass 1 (draft) and pass 2 (audit)
- A single cleanup system prompt shared across all local model candidates
- XML-tagged user messages (`<request>`, `<language>`, `<transcript>`, `<dictionary>`, output contract tags)
- Strict JSON output contracts:
- pass 1: `{"candidate_text":"...","decision_spans":[...]}`
- pass 2: `{"cleaned_text":"..."}`
- Strict JSON output contract: `{"cleaned_text":"..."}`
Pipeline:
1. Draft pass: produce candidate cleaned text + ambiguity decisions
2. Audit pass: validate ambiguous corrections conservatively and emit final text
3. Optional heuristic alignment eval: run deterministic alignment against
1. Single local cleanup pass emits final text JSON
2. Optional heuristic alignment eval: run deterministic alignment against
timed-word fixtures (`heuristics_dataset.jsonl`)
## Scoring
@ -37,6 +34,13 @@ Per-run latency metrics:
- `pass1_ms`, `pass2_ms`, `total_ms`
Compatibility note:
- The runtime editor is single-pass today.
- Reports keep `pass1_ms` and `pass2_ms` for schema stability.
- In current runs, `pass1_ms` should remain `0.0` and `pass2_ms` carries the
full editor latency.
Hybrid score:
`0.40*parse_valid + 0.20*exact_match + 0.30*similarity + 0.10*contract_compliance`