Thales Maciel ed950cb7c4 Polish onboarding flow docs and retry acceptance tests		2026-02-26 17:58:16 -03:00

aman

Local amanuensis

Python X11 speech-to-text (STT) daemon that records audio, transcribes it with Whisper, applies local AI cleanup, and injects the resulting text.

Requirements

X11
sounddevice (PortAudio)
faster-whisper
llama-cpp-python
Tray icon deps: gtk3, libayatana-appindicator3
Python deps (core): numpy, pillow, faster-whisper, llama-cpp-python, sounddevice
X11 extras: PyGObject, python-xlib

System packages (example names): portaudio/libportaudio2.

Ubuntu/Debian

sudo apt install -y portaudio19-dev libportaudio2 python3-gi gir1.2-gtk-3.0 libayatana-appindicator3-1

Arch Linux

sudo pacman -S --needed portaudio gtk3 libayatana-appindicator

Fedora

sudo dnf install -y portaudio portaudio-devel gtk3 libayatana-appindicator-gtk3

openSUSE

sudo zypper install -y portaudio portaudio-devel gtk3 libayatana-appindicator3-1

Python Daemon

Install Python deps:

X11 (supported):

uv sync --extra x11

Quickstart

uv run python3 src/aman.py run

On first launch, Aman opens a graphical setup wizard automatically.
The wizard asks for:

microphone input
hotkey
output backend
writing profile

Config

Create ~/.config/aman/config.json, or let Aman create it automatically on first start if it is missing:

{
  "daemon": { "hotkey": "Cmd+m" },
  "recording": { "input": "0" },
  "stt": { "model": "base", "device": "cpu" },
  "injection": {
    "backend": "clipboard",
    "remove_transcription_from_clipboard": false
  },
  "ux": {
    "profile": "default",
    "show_notifications": true
  },
  "advanced": {
    "strict_startup": true
  },
  "vocabulary": {
    "replacements": [
      { "from": "Martha", "to": "Marta" },
      { "from": "docker", "to": "Docker" }
    ],
    "terms": ["Systemd", "Kubernetes"]
  }
}

Recording input can be a device index (preferred) or a substring of the device name. If recording.input is explicitly set and cannot be resolved, startup fails instead of falling back to a default device.
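
The resolution rule above can be sketched as follows. This is a minimal illustration, not Aman's actual code: the function name resolve_input_device is hypothetical, and the device list is assumed to be a list of dicts with "name" and "max_input_channels" keys, which is the shape sounddevice.query_devices() returns.

```python
def resolve_input_device(spec, devices):
    """Return the index of the input device selected by `spec`.

    Hypothetical helper: `spec` is the recording.input value, `devices` is a
    list of dicts with "name" and "max_input_channels" keys.
    """
    # Only devices that can actually capture audio are candidates.
    inputs = [(i, d) for i, d in enumerate(devices) if d["max_input_channels"] > 0]
    text = str(spec).strip()
    if not text:
        raise ValueError("recording.input: value is empty")

    # A purely numeric value is treated as a device index.
    if text.isdigit():
        index = int(text)
        for i, _ in inputs:
            if i == index:
                return i
        raise ValueError(f"recording.input: no input device with index {index}")

    # Otherwise match a case-insensitive substring of the device name.
    needle = text.lower()
    for i, d in inputs:
        if needle in d["name"].lower():
            return i

    # No silent fallback to a default device: fail so startup can abort.
    raise ValueError(f"recording.input: no input device name contains {text!r}")
```

Raising instead of returning a default mirrors the fail-fast behavior described above; a caller would turn the ValueError into a startup error.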

Config validation is strict: unknown fields are rejected with a startup error. Validation errors include the exact field and an example fix snippet.
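
As a rough sketch of what strict validation means in practice, the check below rejects any section or field that is not on an allow-list and names the exact field in the error. The allow-list is an assumption built only from the example config above, and find_unknown_fields is a hypothetical name, not Aman's real validator.

```python
# Assumed allow-list, derived only from the example config in this README.
ALLOWED_FIELDS = {
    "daemon": {"hotkey"},
    "recording": {"input"},
    "stt": {"model", "device"},
    "injection": {"backend", "remove_transcription_from_clipboard"},
    "ux": {"profile", "show_notifications"},
    "advanced": {"strict_startup"},
    "vocabulary": {"replacements", "terms"},
}


def find_unknown_fields(config, allowed=ALLOWED_FIELDS):
    """Return one error string per unknown section or field, with its dotted path."""
    errors = []
    for section, value in config.items():
        if section not in allowed:
            errors.append(f"{section}: unknown section")
            continue
        if not isinstance(value, dict):
            errors.append(f"{section}: expected an object")
            continue
        for key in value:
            if key not in allowed[section]:
                errors.append(f"{section}.{key}: unknown field")
    return errors
```

A daemon following this approach would refuse to start whenever the returned list is non-empty, printing each entry alongside a suggested fix.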

Profile options:

ux.profile=default: baseline cleanup behavior.
ux.profile=fast: lower-latency AI generation settings.
ux.profile=polished: same cleanup depth as default.
advanced.strict_startup=true: keep fail-fast startup validation behavior.

Hotkey notes:

Use one key plus optional modifiers (for example Cmd+m, Super+m, Ctrl+space).
Super and Cmd are equivalent aliases for the same modifier.
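
The hotkey format can be illustrated with a small parser. This is a sketch under assumptions: parse_hotkey and the alias table are hypothetical, and only Cmd, Super, and Ctrl are confirmed by the notes above (Alt, Shift, and Control are added here for illustration).

```python
# Assumed modifier table; Cmd and Super map to the same modifier.
MODIFIER_ALIASES = {
    "cmd": "super",
    "super": "super",
    "ctrl": "ctrl",
    "control": "ctrl",
    "alt": "alt",
    "shift": "shift",
}


def parse_hotkey(spec):
    """Split 'Mod+Mod+key' into (frozenset of modifiers, key), case-insensitively."""
    parts = [p.strip().lower() for p in spec.split("+")]
    if any(not p for p in parts):
        raise ValueError(f"daemon.hotkey: malformed hotkey {spec!r}")

    # Everything before the last '+' must be a modifier; the last part is the key.
    *mods, key = parts
    modifiers = set()
    for mod in mods:
        if mod not in MODIFIER_ALIASES:
            raise ValueError(f"daemon.hotkey: unknown modifier {mod!r}")
        modifiers.add(MODIFIER_ALIASES[mod])
    if key in MODIFIER_ALIASES:
        raise ValueError("daemon.hotkey: exactly one non-modifier key is required")
    return frozenset(modifiers), key
```

With this normalization, parse_hotkey("Cmd+m") and parse_hotkey("Super+m") compare equal, which is one way to make the two spellings interchangeable.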

AI cleanup is always enabled and uses the locked local Llama-3.2-3B GGUF model downloaded to ~/.cache/aman/models/ during daemon initialization. Model downloads use a network timeout and SHA256 verification before activation. Cached models are checksum-verified on startup; mismatches trigger a forced redownload.
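
The checksum step can be sketched with the standard library alone. The function names below are hypothetical, and the expected digest is assumed to be a hex string pinned somewhere in the code; how Aman stores it is not stated here.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Hash a file in chunks so a multi-gigabyte model never sits in memory at once."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def cached_model_is_valid(path, expected_sha256):
    """Return True only if the file exists and its SHA256 matches the pinned digest."""
    path = Path(path)
    return path.is_file() and sha256_of(path) == expected_sha256.lower()
```

A startup routine built on this would trigger a fresh download whenever cached_model_is_valid returns False, which covers both a missing file and a checksum mismatch.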

Use -v/--verbose to enable DEBUG logs, including recognized/processed transcript text and llama.cpp logs (llama:: prefix). Without -v, logs are INFO level.

Vocabulary correction:

vocabulary.replacements is deterministic correction (from -> to).
vocabulary.terms is a preferred spelling list used as hinting context.
Wildcards are intentionally rejected (*, ?, [, ], {, }) to avoid ambiguous rules.
Rules are deduplicated case-insensitively; conflicting replacements are rejected.
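
The replacement rules above can be sketched as a single normalization pass. This is an illustration rather than Aman's implementation: normalize_replacements is a hypothetical name, and a "conflict" is assumed to mean the same source word (ignoring case) mapped to two different targets.

```python
WILDCARD_CHARS = set("*?[]{}")


def normalize_replacements(rules):
    """Validate and deduplicate a list of {"from": ..., "to": ...} rules."""
    seen = {}  # casefolded "from" -> chosen "to"
    result = []
    for rule in rules:
        source, target = rule["from"].strip(), rule["to"].strip()
        if WILDCARD_CHARS & set(source):
            raise ValueError(f"vocabulary.replacements: wildcard in {source!r}")

        key = source.casefold()
        if key in seen:
            # Same source seen before: identical target is a duplicate, else a conflict.
            if seen[key] != target:
                raise ValueError(
                    f"vocabulary.replacements: {source!r} maps to both "
                    f"{seen[key]!r} and {target!r}"
                )
            continue
        seen[key] = target
        result.append({"from": source, "to": target})
    return result
```

The first spelling of a duplicated rule is kept, so the output order follows the config file.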

STT hinting:

Vocabulary is passed to Whisper as hotwords/initial_prompt only when those arguments are supported by the installed faster-whisper runtime.
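
One way to pass hints only when the runtime supports them is to inspect the signature of the transcribe callable before building its keyword arguments. The sketch below assumes that approach; supported_hint_kwargs is a hypothetical helper, and whether Aman detects support this way is not stated here.

```python
import inspect


def supported_hint_kwargs(transcribe, vocabulary_text):
    """Build hint kwargs containing only the parameters `transcribe` declares."""
    params = inspect.signature(transcribe).parameters
    hints = {}
    for name in ("hotwords", "initial_prompt"):
        # Skip any hint argument the installed runtime does not know about.
        if name in params:
            hints[name] = vocabulary_text
    return hints
```

A caller would then invoke the model as transcribe(audio, **supported_hint_kwargs(transcribe, "Systemd, Kubernetes")), so an older runtime without hotwords never receives an unexpected keyword argument.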

systemd user service

mkdir -p ~/.local/share/aman/src/assets ~/.config/systemd/user
cp src/*.py ~/.local/share/aman/src/
cp src/assets/*.png ~/.local/share/aman/src/assets/
cp systemd/aman.service ~/.config/systemd/user/aman.service
systemctl --user daemon-reload
systemctl --user enable --now aman

Service notes:

The user unit launches uv via /usr/bin/env; ensure uv is available in your user PATH (for example ~/.local/bin).
Inspect failures with systemctl --user status aman and journalctl --user -u aman -f.

Usage

Press the hotkey once to start recording.
Press it again to stop and run STT.
Press Esc while recording to cancel without processing.
Esc is only captured during active recording.
Recording start is aborted if the cancel listener cannot be armed.
Transcript contents are logged only when -v/--verbose is used.
Tray menu includes: Setup Aman..., Pause/Resume Aman, Reload Config, Run Diagnostics, Open Config Path, and Quit.
If setup is not completed, Aman enters a Setup Required tray mode and does not capture audio.
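
The hotkey and Esc behavior above amounts to a small state machine. The sketch below is illustrative only: the state names, event names, and next_state function are all hypothetical, and tray modes such as Setup Required or Pause are left out.

```python
from enum import Enum


class State(Enum):
    IDLE = "idle"
    RECORDING = "recording"
    PROCESSING = "processing"


def next_state(state, event):
    """Return the state that follows `state` when `event` occurs."""
    if state is State.IDLE and event == "hotkey":
        return State.RECORDING   # first press starts recording
    if state is State.RECORDING and event == "hotkey":
        return State.PROCESSING  # second press stops and runs STT
    if state is State.RECORDING and event == "esc":
        return State.IDLE        # cancel without processing
    if state is State.PROCESSING and event == "done":
        return State.IDLE        # text injected, ready for the next take
    return state                 # every other event is ignored
```

Because Esc only has a transition out of RECORDING, it has no effect in the other states, which matches the note that Esc is captured only during active recording.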

Wayland note:

Wayland is not supported yet. When started in a Wayland session, Aman exits with a message explaining this.

Injection backends:

clipboard: copy to clipboard and inject via Ctrl+Shift+V (GTK clipboard + XTest)
injection: type the text with simulated keypresses (XTest)
injection.remove_transcription_from_clipboard: when true and backend is clipboard, restores/clears the clipboard after paste so the transcript is not kept there
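
The clipboard backend with remove_transcription_from_clipboard enabled can be sketched as save, set, paste, restore. Everything below is a stand-in: InMemoryClipboard replaces the real GTK clipboard, send_paste stands for the simulated Ctrl+Shift+V keypress, and inject_via_clipboard is a hypothetical name.

```python
class InMemoryClipboard:
    """Stand-in for a real clipboard so the flow can run without X11."""

    def __init__(self, text=None):
        self.text = text

    def get(self):
        return self.text

    def set(self, text):
        self.text = text


def inject_via_clipboard(text, clipboard, send_paste, remove_after=False):
    """Paste `text` through the clipboard, optionally restoring what was there."""
    previous = clipboard.get()
    clipboard.set(text)
    send_paste()  # assumed to trigger the paste shortcut in the focused window
    if remove_after:
        # Restore the earlier contents, or clear the clipboard if it was empty.
        clipboard.set(previous if previous is not None else "")
```

A real implementation would also need to wait until the target application has read the clipboard before restoring it; that timing concern is omitted from this sketch.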

AI processing:

Local llama.cpp model only (no remote provider configuration).

Control:

make run
make doctor
make check

CLI (internal/support fallback):

uv run python3 src/aman.py run --config ~/.config/aman/config.json
uv run python3 src/aman.py doctor --config ~/.config/aman/config.json --json
uv run python3 src/aman.py init --config ~/.config/aman/config.json --force