Commit Graph

100 Commits

Author SHA1 Message Date
Copilot 78628f7dbb Integrate Cloudflare Workers AI text-to-image into SD extension (#5434)
* feat: integrate Cloudflare Workers AI for text-to-image generation in SD extension

Agent-Logs-Url: https://github.com/SillyTavern/SillyTavern/sessions/efc79e4d-2119-4cdb-8afb-f26e318a38ef

Co-authored-by: Cohee1207 <18619528+Cohee1207@users.noreply.github.com>

* fix: address review - use oai_settings for account ID, sort dropdown alphabetically, remove Account ID input, move debug log

Agent-Logs-Url: https://github.com/SillyTavern/SillyTavern/sessions/bf0dda38-df40-44f4-8a63-0c952b48905d

Co-authored-by: Cohee1207 <18619528+Cohee1207@users.noreply.github.com>

* Clean-up diffs

* feat: add refresh models button to Workers AI section

Agent-Logs-Url: https://github.com/SillyTavern/SillyTavern/sessions/ab6b5e7a-84d2-44d1-9f6e-3d330de04ef1

Co-authored-by: Cohee1207 <18619528+Cohee1207@users.noreply.github.com>

* fix: revert unrelated package-lock.json changes

Agent-Logs-Url: https://github.com/SillyTavern/SillyTavern/sessions/ab6b5e7a-84d2-44d1-9f6e-3d330de04ef1

Co-authored-by: Cohee1207 <18619528+Cohee1207@users.noreply.github.com>

* Fix models loading

* refactor: update model refresh button ID and add class to select elements

* Send formData to BFL models

* fix: adjust use FormData condition

* fix: validate Workers AI account ID before proceeding with image model loading

---------

Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: Cohee1207 <18619528+Cohee1207@users.noreply.github.com>
2026-04-15 22:00:08 +03:00
YKDZ 966ab8336b Merge pull request #5171 from YKDZ/release
i18n: Comprehensive update and refactor for Simplified Chinese (zh-CN)
2026-02-16 21:00:26 +02:00
Copilot 23ba3e5bb2 Add rename buttons for ComfyUI workflows and style presets (#5124)
* Initial plan

* Add rename buttons for ComfyUI workflows and style presets

Co-authored-by: Cohee1207 <18619528+Cohee1207@users.noreply.github.com>

* Address review feedback: pencil icon, filename validation, 204 status, error prefix, frontend duplicate check, re-render styles select

Co-authored-by: Cohee1207 <18619528+Cohee1207@users.noreply.github.com>

* fix: ensure existing workflow check only considers HTMLOptionElement instances

---------

Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: Cohee1207 <18619528+Cohee1207@users.noreply.github.com>
2026-02-09 23:04:34 +02:00
Kristy Aurelia d6fafdd983 Add Minimal Prompt Processing option (#5095)
* Implement Minimal Prompt Processing

* Fix and rename cmd argument.

* Whitespace

* Handle processing selection with a enum

* Update processing argument description

* Fix typo in minimal prompt processing label title

* Add missing semi

---------

Co-authored-by: Cohee <18619528+Cohee1207@users.noreply.github.com>
2026-02-05 01:38:55 +02:00
Jay4242 12452491cb Stable diffusion.cpp server support (#5074)
* feat: Add stable-diffusion.cpp server endpoints

* feat: Add stable-diffusion.cpp server settings

* feat: Add stable-diffusion.cpp server

* Add debug log for sdcpp payload

* feat: add loadSdcppSamplers and loadSdcppSchedulers for sdcpp backend

* feat: add sdcpp to sampler and scheduler source lists

---------

Co-authored-by: Cohee <18619528+Cohee1207@users.noreply.github.com>
2026-01-29 00:24:45 +02:00
Cohee 5a7875ba28 Update Pollinations API (#5060)
* Upgrade Pollinations API
Done: text, caption
To do: TTS, image
Fixes #5020

* Update Pollinations TTS to new API

* Update Pollinations API for images
2026-01-26 20:31:13 +02:00
Mooki bce1372ed4 feat(sd): Add Z.AI GLM-Image model support (#5012)
* feat(sd): Add Z.AI GLM-Image model support

Add the new GLM-Image model to the Z.AI image generation source:
- Add 'glm-image' to the model dropdown with friendly display name
- Handle GLM-Image's requirement for dimensions in multiples of 32
  (vs CogView's multiples of 16)
- Show quality dropdown for GLM-Image (supports standard/hd)

The GLM-Image model uses the same API endpoint as CogView
but has different dimension constraints.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* feat(sd): Enhance Z.AI GLM-Image support

- Use regex for GLM-Image model detection (futureproofing)
- Skip 2^21 pixel limit for GLM-Image (CogView-specific)
- Add Z.AI recommended resolutions (1280x1280, 1568x1056, etc.)
- Add "Use Coding API" toggle for GLM Coding Plan users
- Add better error logging for image fetch debugging

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* refactor(sd): Address PR review feedback for Z.AI GLM-Image

- Remove custom zai_coding_api setting, use existing oai_settings.zai_endpoint
- Always use Common API for image generation (avoids rate limits)
- Keep ZAI_ENDPOINT import for consistency with other extensions

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* fix: always use Common API for Z.AI image generation

Removes conditional endpoint selection since we decided to always use
Common API for image generation (Coding API has stricter rate limits).

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* It's not only cogview anymore

* Remove unused param from request payload

* Remove redundant debug logs

* Loosen the check on image quality data attribute

* Bring back coding API notice

---------

Co-authored-by: mschienbein <mschienbein@users.noreply.github.com>
Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>
Co-authored-by: Cohee <18619528+Cohee1207@users.noreply.github.com>
2026-01-14 23:58:53 +02:00
Cohee ad860f4447 Z.AI: Add video generation models 2026-01-10 00:59:33 +02:00
Cohee 7320aa948d Audio inlining for OpenAI and Custom-compatible (#4964)
* Audio inlining for OpenAI and Custom-compatible

* Add context sizes

* chatgpt-image-latest

* Add quality control for gpt-image
2026-01-06 13:27:13 +02:00
9nbf7c4q6b-lgtm 3098c4c5bf Comfyui serverless runpod image generation (#4891)
* Add comfyrunpod generation endpoints.

Get a bunch of settings added.

Probably get runpod api secret stored properly.

sending to runpod.

Correct workflow wrapping for runpod.

* Reuse comfyui workflow setting block for runpod.

* Update API key url to point to runpod settings

* Remove debug print.

* Combine normal and runpod comfyui generation functions

* Remove unused server endpoints for runpod.

* Remove debug prints from server.

* Remove redundant switch case.

* Check for url to see if image generation is possible for comfy and comfy runpod.

* lint

* Correct/update doc comments.

* Make runpod serverless a sub-type under comfyui source in the UI.

* Set comfy_type in stable-diffusion defaultSettings

* Allow loading comfy workflows even if url isnt set yet.

* Change default runpod url to template example

* Rename runpod api key to comfy_runpod

* Handle default switches separately from standard now that default setting is ensured.

* Update runpod api key name on server.

* Move example runpod url to placeholder.

* lint

* Remove incorrect comment.

* Hide toasts remote models toasts

* React to key updates

* Add type annotation for data and improve workflow handling in generate endpoint

---------

Co-authored-by: Cohee <18619528+Cohee1207@users.noreply.github.com>
2025-12-24 23:37:57 +02:00
Cohee 6fccccca6d OpenRouter: add as image generation source 2025-12-18 21:17:01 +02:00
Cohee bb3ea08b5a Z.AI: Add image generation and web search (#4895)
* Z.AI: Add image generation

* Z.AI: Add web search endpoint

* Add image URL validation

* Combine validation conditions
2025-12-16 00:39:17 +02:00
Ben 55a07d445d Chutes integration (#4844)
* Chutes integration

* Fix eslint

* Fix key saving

* Fix logo coloration

* Fix tool checks

* Unhide image inlining controls

* Fix order of options

* Fix type use in TTS extension script

* Add Chutes as a vector storage source

* Change log levels to debug

* Fix streamed reasoning parsing

* Skip remote models update

* TTS: Fix API key highlight

* Sort image models A-Z

* TTS: Fixes

* Remove unused SD endpoint

* Skip setting context size if models list is not yet loaded

* remove chutes quota / balance

* Fix: streamed tool calling

* Hide reasoning effort control

* Add image request debug log

* Fix: scroll down on media load in extensions

* Unhide some samplers

* Bring back reasoning effort

* This code will never execute

* Reformat else if cases

* Add stop strings to request

* Remove conditional from reasoning_effort body param

* Preserve original pricing fields

* Unhide logit bias setting

* Pass repetition penalty and logit bias to backend

* Swap llama tokenizer for llama3

* Pass min_p, remove supported_sampling_parameters checks

* Enable logprobs

---------

Co-authored-by: Cohee <18619528+Cohee1207@users.noreply.github.com>
2025-12-01 00:17:49 +02:00
Cohee b99ccc3f0c OpenAI: Add Sora 2 API (#4748)
* OpenAI: Add Sora 2 API

* Add duration control

* Support client generation abort

* Reduce poll log amount

* Simplify selector

* Simplify model-specific control handling
2025-11-16 17:49:21 +02:00
Cohee ad72fa806e Google: Add Veo models (#4742)
* Google: Add Veo models

* Combine retry into a singe loop

* Improve error logging and reporting

* Add client abort handling for video generation endpoint
2025-11-06 19:58:01 +02:00
Ngo Dinh Gia Bao f0ceba43e9 feat: [Electron Hub] Support Vector Storage, Better searching for image engine (#4540)
* feat: [Electron Hub] Add Vector Storage, Better searching for image model list

* feat: [Electron Hub] Add quality parameter for Image Engine

* fixed ESLint

* Update public/scripts/extensions/vectors/index.js

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

* small tweaks

* Use default getOpenAIVector

* Refactor and clean-up code

* Move endpoint filtering logic to backend

---------

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Cohee <18619528+Cohee1207@users.noreply.github.com>
2025-10-21 23:38:12 +03:00
Yokayo 1e05eac7df Update ru-ru locale (#4522)
* Work on tl

* eslint fix

* Update data-i18n attribute for expand view button

* Add data-i18n to button

---------

Co-authored-by: Yokayo <nonameutubecomm@gmail.com>
Co-authored-by: Cohee <18619528+Cohee1207@users.noreply.github.com>
2025-09-14 20:57:33 +03:00
Ngo Dinh Gia Bao 8687bb99f3 Add Electron Hub as Chat Completions Provider (#4458)
* fixed merge conflicts

* Supported max tokens + fixed wrong image model mapping

* fixed merge conflicts

* fixed merge conflicts

* updated the logic

* updated the logic

* replaced hard coded reasoning_effort mode list with a dynamic function

* replaced hard coded reasoning_effort model list with a dynamic function

* Fix eslint

* Adjust reasoning effort logic

* Code clean-up

* Add logo

* Add inline image quality

* Fix multimodal models list

* Fix seed not passed

* Add "detail" error parser

---------

Co-authored-by: Cohee <18619528+Cohee1207@users.noreply.github.com>
2025-09-04 21:25:31 +03:00
Yokayo e9e14001e3 Update ru-ru locale (#4449)
* Work on tl

* Small fix

* Fix formatting

---------

Co-authored-by: Yokayo <nonameutubecomm@gmail.com>
Co-authored-by: Cohee <18619528+Cohee1207@users.noreply.github.com>
2025-08-26 20:37:55 +03:00
sirius422 4d9f229383 Add Variety Boost option for NovelAI image generation. (#4417)
Co-authored-by: sirius422 <sirius42@gfw.moe>
2025-08-22 20:27:17 +03:00
Cohee 2d28f5cdaf Google: Add Imagen image generation 2025-06-20 20:44:21 +03:00
Dmitry fece612f09 Merge pull request #4135 from D1m7asis/release
feat: Added AI/ML API Provider Support
2025-06-13 20:59:18 +03:00
Cohee ed2e6fff6e OpenAI: add gpt-image-1 model 2025-05-23 20:33:27 +03:00
Cohee 0c4c86ef06 Add xAI for image generation extension 2025-04-11 20:32:06 +03:00
equal-l2 2a31f6af2d Remove Block Entropy references
Block Entropy shut down their service at the end of 2024.
2025-03-28 00:47:30 +09:00
Kristan Schlikow b033b98532 Address issues raised in PR 2025-02-13 21:09:13 +01:00
Kristan Schlikow 6e0ed8552f Add support for FAL.AI as image gen provider 2025-02-13 19:34:34 +01:00
Cohee f83dccda39 Mark 'Extras' option as deprecated in built-in extensions 2025-02-08 21:56:54 +02:00
Cohee 4a4e02450c locales: update API key hint from Horde KoboldAI to AI Horde 2024-12-26 21:54:24 +02:00
Cohee e6107ad447 Add NAI Diffusion V4 2024-12-22 15:39:41 +02:00
Cohee ba7e34c195 Image Generation: Use wrench symbol for function tool 2024-12-21 18:47:46 +02:00
Cohee e83182c03b Image Generation: Add tool message visibility toggle 2024-12-19 19:33:29 +02:00
Cohee 67869364a5 ImageGen: add BFL API for image generation 2024-11-19 00:30:28 +02:00
ceruleandeep 5992117904 Add GGUF models and denoise parameter for ComfyUI 2024-11-18 17:55:30 +11:00
Cohee 77be125a99 NanoGPT: Add to image generation extension 2024-11-03 20:37:30 +02:00
Cohee ae784ddb1a Add detailer checkbox for SDNext 2024-10-17 22:47:09 +03:00
Cohee a3693ccf6d Use deepMerge to combine payloads 2024-10-17 22:45:33 +03:00
error-usernotavailable 301a31b8b2 Update settings.html
add checkbox in stable diffusion extension tab for ADetailer
2024-10-16 22:28:09 -04:00
Cohee a4256ccf57 ImageGen: rename Enable function tool => Use function tool 2024-10-06 21:19:06 +03:00
Cohee c853547b11 Add a function tool for image generation 2024-10-04 13:04:19 +00:00
Cohee 9af4d62cdf Extend /sd command 2024-09-30 21:51:24 +03:00
Cohee ae53527c23 #2888 Hide non-functioning checkboxes from Comfy 2024-09-21 16:36:20 +03:00
Cohee 87f2dd9ae5 Remove pollinations refine toggle 2024-09-10 10:54:41 +00:00
Cohee 2f5e40691c Implement more smart textarea autosizes 2024-09-05 20:55:06 +03:00
Cohee 3f026aea15 SD: Add prompt expansion warning. Disable quantization 2024-08-23 19:57:25 +03:00
Cohee a9700c0ff0 Small fixes 2024-08-20 10:53:37 +00:00
Alex Yancey ea0508b910 Update settings.html 2024-08-20 01:33:14 -07:00
Alex Yancey 135ba2336e Hugging Face inference API for image generation 2024-08-19 23:02:15 -07:00
Edward Kim 88ada0f67d fixed ordering in image generation source, and max token length for llm models 2024-08-10 21:55:52 -04:00
Edward Kim b9857eb315 Added block entropy endpoints for chat and image 2024-08-08 00:08:03 -04:00