mirror of
https://github.com/kennethnym/freya
synced 2026-06-19 08:01:17 +01:00
Set reasoning effort to none in the LLM client to reduce latency and token usage. Fall back to the reasoning field when content is absent in the response. Co-authored-by: Ona <no-reply@ona.com>