Every Ultron session is persisted. Transcripts are saved turn-by-turn as JSONL. Sessions can be resumed exactly where they left off.Documentation Index
Fetch the complete documentation index at: https://docs.51ultron.com/llms.txt
Use this file to discover all available pages before exploring further.
Context injection at session start
Before you type your first message, Ultron loads context in a specific order:Session persistence
- Transcripts saved as JSONL — one JSON object per message turn
- Includes: message content, tool calls, tool results, token counts, timestamps
- Resume at any time — context is fully reconstructed from the transcript
- Cost tracking accumulated per session and per-turn
Automatic compression
As conversations grow, the compression engine manages context automatically. You never need to manually summarize or restart a session:- MicroCompact fires first — removes stale tool outputs from old turns (no API call)
- Session Memory Compact — uses stored memories as compression baseline (no API call)
- API Digest — full LLM summary via fast model (last resort)
Cost tracking
| What’s tracked | Where to see it |
|---|---|
| Input tokens per turn | Settings → Usage |
| Output tokens per turn | Settings → Usage |
| Model used per call | Settings → Usage |
| Total session cost | Settings → Usage |