Note: OpenAI API key is required for text-to-speech and speech-to-text functionality. Groq, Gemini, and FishAudio are optional alternatives for text generation and audio processing.
LLM Configuration
Live2D Configuration
Models Information
OpenAI: GPT-4 (gpt-4o-mini) - Versatile for general use
Groq: LLaMA 3.1 70B - Recommended for speed and large conversations
Gemini: Gemini 1.5 Flash - Recommended for screen sharing
Custom: OpenAI-compatible API with custom base URL
FishAudio: Advanced text-to-speech capabilities
Note: When screen sharing is enabled, the last captured frame is sent to the selected model for analysis.