Andere Modelle (Image, Audio etc.)

Andere Modelle: Audio, OCR und Bilder

Auf dieser Seite stehen die nicht-LLM-Modelle in einer kompakten Form. Auch hier gilt: Im Wiki zeigen wir keine Basis-URLs oder internen Service-Hosts, sondern nur den Modell-Parameter, das Zielmodell, den Host-Typ und die relevanten Limits.

Speech-to-Text

Modellparameter	Zielmodell	myLab THL \| GWDG	API	Hinweise
`whisper-3-large`	`whisper-3-large`	THL	`audio/transcriptions`, `audio/translations`	Direktname
`whisper-1`	`whisper-3-large`	THL	`audio/transcriptions`, `audio/translations`	Alias auf unser Whisper-Backend

Text-to-Speech

Modellparameter	Zielmodell	myLab THL \| GWDG	API	Hinweise
`xtts-v2`	`xtts-v2qwen3-tts`	THL	`audio/speech`	Direktname; unterstützt die Stimmen `alloy`, `echo`, `fable`, `onyx`, `nova`, `shimmer`
`tts-1-hd`	`xtts-v2qwen3-tts`	THL	`audio/speech`	Alias für openAI default

fishaudio-s2-pro fishaudio-s2-pro ~~THL~~ audio/speech ~~Alternative TTS-Variante, derzeit nur default voice~~

OCR

Modellparameter	Zielmodell	myLab THL \| GWDG	API	Hinweise
`ocr-extract`	`chandra-ocr-2`	THL	`chat/completions`	Nutzerfreundlicher OCR-Alias
`chandra-ocr-2`	`chandra-ocr-2`	THL	`chat/completions`	32.768 Kontext

Bildmodelle

Modellparameter	Zielmodell	myLab THL \| GWDG	API-Familien	Hinweise
`image-gen-hd`	`flux.2-dev`	THL	`images/generations`, `images/edits`, `images/variations`, `images/data`	`health_timeout=180`; `request_timeout=900` für Generierung, sonst `300`
`image-gen-fast`	`flux.2-klein`	THL	`images/generations`, `images/edits`, `images/data`	`health_timeout=180`; `request_timeout=300`
`flux.1-dev`	`flux.2-dev`	THL	`images/generations`, `images/edits`, `images/variations`, `images/data`	Alias für den Dev-Workflow
`flux.1-kontext`	`flux.2-dev`	THL	`images/generations`, `images/edits`, `images/data`	Kontext-/Edit-Alias
`flux.2-dev`	`flux.2-dev`	THL	`images/generations`, `images/edits`, `images/variations`, `images/data`	Direktname
`flux.2-klein`	`flux.2-klein`	THL	`images/generations`, `images/edits`, `images/data`	Direktname
`dall-e-3`	`flux.2-dev`	THL	`images/generations`, `images/edits`, `images/data`	Alias für die THL-Bildpipeline
`gpt-image-1`	`flux.2-dev`	THL	`images/generations`, `images/edits`, `images/data`	Alias für die THL-Bildpipeline

Qualitätseinstellungen für `image-gen-hd` und `flux.1-dev`

Quality Parameter	Guidance Scale	Num Inference Steps
`standard`	3.5	25
`standard+`	5.5	25
`standard++`	7.0	25
`bfl`	3.5	50
`hd`	5.5	50
`xhd`	7.0	50

Qualitätseinstellungen für `image-gen-fast` und `flux.1-kontext`

Quality Parameter	Guidance Scale	Num Inference Steps
`schnell-hd`	5.5	6
`schnell-standard`	3.5	3
`standard`	2.5	25
`standard+`	4.5	25
`standard++`	7.0	25
`bfl`	3.5	50
`hd`	5.5	50
`xhd`	7.0	50

Hinweise

Nicht-LLM-Modelle gelten weiterhin als experimenteller als die Chat-Modelle; bitte Verfügbarkeit vor produktiver Nutzung prüfen.