SillyTavern

Author	SHA1	Message	Date
TanJeeSchuan	e5d4ff5fae	Enhance Vectorize All process with error handling, retries and minor improvements (#5479) * fix (vectors): Fixed Vectorize All progress report and ETA issues * fix (vectors): Added strip reasoning block function for extras/WebLLM summaries * feat(vectors): Retry failed summaries with configurable attempts * feat(vectors): Skip summarization for short messages * feat(vectors): Skip failed messages during Vectorize All instead of aborting all Prevents the "Vectorize All" process from stopping on single-message errors. Failed items are now skipped and reported at the end of the session rather than aborting the entire sync. Summarization: Implements per-message retries; failures use the original text as a fallback or mark for skipping. Vector Insertion: Differentiates fatal configuration errors (abort) from transient batch failures (skip and notify). * Resolved: 'account_id_missing' is missing * Resolved: Refactored out summarizeSkipOnFailure() functionality into summarize() via options parameter * Fix eslint and type checks * feat(vectors): add types to maps and sets, improve summarize function options --------- Co-authored-by: Cohee <18619528+Cohee1207@users.noreply.github.com>	2026-04-20 02:33:12 +03:00
Tony Gies	a9c377c3c8	feat: add Workers AI text embeddings and multimodal captioning (#5414) * feat: add Workers AI text embeddings and multimodal captioning Extends the Cloudflare Workers AI integration to the vectors and caption extensions. Embeddings: adds workers_ai source to the vectors extension using the OpenAI-compatible /v1/embeddings endpoint, with dynamic model listing from the Cloudflare model search API. Captioning: adds workers_ai as a multimodal caption API with dynamic vision model discovery via the multimodal-models endpoint. * Add logo svg * Refactor caption dropdown population * Fix order of sources * feat: add error handling for missing Workers AI account ID --------- Co-authored-by: Cohee <18619528+Cohee1207@users.noreply.github.com>	2026-04-08 23:43:21 +03:00
Copilot	c918f4f36d	Add opt-in toggle to keep hidden messages in chat vector index (#5378) * Initial plan * Add opt-in toggle to keep hidden messages in chat vector index Agent-Logs-Url: https://github.com/SillyTavern/SillyTavern/sessions/eadc80a1-a417-40df-a374-76d7c4a46ce3 Co-authored-by: Cohee1207 <18619528+Cohee1207@users.noreply.github.com> * Revert package-lock changes * Fix: Remove 'vectorized' class from chat messages before adding it --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: Cohee1207 <18619528+Cohee1207@users.noreply.github.com>	2026-03-30 23:39:19 +03:00
Xiangzhe	2cb1861db6	feat: add SiliconFlow.cn chat completion and embedding support (#5316) * feat: add SiliconFlow.cn endpoint support and embedding vectors Chat completion: - Add endpoint selection dropdown (Global/.com vs China/.cn) to existing SiliconFlow provider, following the Z.AI endpoint pattern - Backend switches API URL based on selected endpoint - Add /api-url slash command support for endpoint switching Embeddings: - Add SiliconFlow as a vector/embedding source (OpenAI-compatible) - Support both .com and .cn endpoints via siliconflow_endpoint setting borrowed from the main connection panel (Vertex AI pattern) - Superset model list with platform attribution (.cn) markers - Models: Qwen3-Embedding (0.6B/4B/8B) + BGE/BCE models (.cn only) * Add filter by models type * Load embedding models from endpoint * Improve api-url command declaration * Support endpoint override in custom-request service --------- Co-authored-by: Cohee <18619528+Cohee1207@users.noreply.github.com>	2026-03-22 00:52:03 +02:00
Maksim	c0a4682dcc	feat: Add gemini-embedding-2-preview in google vectorization models (#5274) Co-authored-by: Maksim Sharkov <msharkov@nobilis.team>	2026-03-13 19:58:06 +02:00
Copilot	5832cb8b07	Add NanoGPT embeddings support for Vector Storage (#5150) * Initial plan * Add NanoGPT embeddings support for Vector Storage Co-authored-by: Cohee1207 <18619528+Cohee1207@users.noreply.github.com> * Fix models loading --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: Cohee1207 <18619528+Cohee1207@users.noreply.github.com>	2026-02-14 00:31:30 +02:00
Ben	55a07d445d	Chutes integration (#4844) * Chutes integration * Fix eslint * Fix key saving * Fix logo coloration * Fix tool checks * Unhide image inlining controls * Fix order of options * Fix type use in TTS extension script * Add Chutes as a vector storage source * Change log levels to debug * Fix streamed reasoning parsing * Skip remote models update * TTS: Fix API key highlight * Sort image models A-Z * TTS: Fixes * Remove unused SD endpoint * Skip setting context size if models list is not yet loaded * remove chutes quota / balance * Fix: streamed tool calling * Hide reasoning effort control * Add image request debug log * Fix: scroll down on media load in extensions * Unhide some samplers * Bring back reasoning effort * This code will never execute * Reformat else if cases * Add stop strings to request * Remove conditional from reasoning_effort body param * Preserve original pricing fields * Unhide logit bias setting * Pass repetition penalty and logit bias to backend * Swap llama tokenizer for llama3 * Pass min_p, remove supported_sampling_parameters checks * Enable logprobs --------- Co-authored-by: Cohee <18619528+Cohee1207@users.noreply.github.com>	2025-12-01 00:17:49 +02:00
Cohee	845bd8c999	OpenRouter: Add as vector storage provider (#4736) * OpenRouter: Add as vector storage provider * Better array filtering	2025-11-03 21:53:17 +02:00
Ngo Dinh Gia Bao	f0ceba43e9	feat: [Electron Hub] Support Vector Storage, Better searching for image engine (#4540) * feat: [Electron Hub] Add Vector Storage, Better searching for image model list * feat: [Electron Hub] Add quality parameter for Image Engine * fixed ESLint * Update public/scripts/extensions/vectors/index.js Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> * small tweaks * Use default getOpenAIVector * Refactor and clean-up code * Move endpoint filtering logic to backend --------- Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by: Cohee <18619528+Cohee1207@users.noreply.github.com>	2025-10-21 23:38:12 +03:00
Cohee	780633e05a	text-embedding-005	2025-10-08 21:24:55 +03:00
Cohee	82d9fa79e8	Add vectorization via Vertex AI (#4311) * Add vectorization via Vertex AI * Enable batching for Google vectors * Add batch embeddings for Vertex * Split embed methods for Vertex/AI Studio	2025-07-25 23:59:30 +03:00
Cohee	579ef1e86c	Refactor API key management in extensions	2025-06-26 21:24:51 +03:00
Cohee	a122109e0c	Add new model option 'embed-v4.0' to Cohere vectorization settings Closes #3951	2025-05-04 12:26:44 +03:00
pl752	07fb92b37d	Added vector secondary url placeholder example	2025-04-16 17:39:29 +05:00
pl752	f8bccb472f	Adjusted naming and validation	2025-04-16 17:34:58 +05:00
pl752	5cf3198da1	Added option to use secondary API URL in vector extension	2025-04-16 16:04:33 +05:00
Cohee	78bda9954d	Increase maximum injection depth and WI order (#3800)	2025-04-13 21:31:57 +03:00
Cohee	70fe5b6e01	Add Gemini embedding model for vector storage	2025-04-01 21:42:26 +03:00
Cohee	80e821d12d	Add support for KoboldCpp embeddings in Vector Storage (#3795) * Add support for KoboldCpp embeddings in vector processing * Add validation for KoboldCpp embeddings to handle empty data * Improve toast handling	2025-04-01 21:21:29 +03:00
Cohee	1cb9287684	Vectors WebLLM (#3631) * Add WebLLM support for vectorization * Load models when WebLLM extension installed * Consistency updated * Move checkWebLlm to initEngine * Refactor vector request handling to use getAdditionalArgs * Add error handling for unsupported WebLLM extension * Add prefix to error causes	2025-03-09 00:51:44 +02:00
Cohee	f83dccda39	Mark 'Extras' option as deprecated in built-in extensions	2025-02-08 21:56:54 +02:00
Cohee	77bee453ab	Vectors: Add only custom boundary chunking	2025-01-03 20:44:25 +02:00
Cohee	df3d7a048e	Deprecate unscoped vectors	2024-09-24 21:51:10 +03:00
Cohee	9ef3385255	Implement collection scopes for vector models (#2846) * Implement collection scopes for vector models * Update makersuite-vectors.js to use Gemini API text-embedding-004 model * Add model scope for Google vectors * Fix purge log * Refactor header setting * Fix typo * Only display UI warning if scopes disabled * Remove i18n attribute --------- Co-authored-by: ceruleandeep <83318388+ceruleandeep@users.noreply.github.com>	2024-09-16 09:29:39 +03:00
Cohee	06e3d5f8de	Rename MakerSuite => AI Studio	2024-08-21 21:00:17 +03:00
Cohee	53c24174e7	Merge branch 'staging' into webgpu-summary	2024-08-13 20:54:46 +03:00
Cohee	d0b368f29c	Vector Storage: change NomicAI key input	2024-08-13 20:50:41 +03:00
Cohee	4888e3c2b0	Vector Storage: summarize with WebLLM extension	2024-08-13 19:34:11 +03:00
Cohee	b8830e34d3	Add ollama download shortcut to vector storage	2024-06-22 16:38:00 +03:00
Cohee	bba16f5263	Custom vector chunk boundary	2024-06-16 02:16:27 +03:00
steve green	3ede4aafbe	Update zh-cn.json (#2376) * Update zh-cn.json * html changes * `div`->`span` * fixes * fixes * more fixes * Revert last divs --------- Co-authored-by: Cohee <18619528+Cohee1207@users.noreply.github.com>	2024-06-14 22:46:40 +03:00
Cohee	4e822eeebb	Add VLLM as vector source	2024-06-09 01:03:22 +03:00
Cohee	39721b6a8f	Add file chunks overlap control	2024-06-06 21:45:47 +03:00
Cohee	865c48bcc0	Cohee forgot the ABC	2024-05-28 22:57:07 +03:00
Cohee	2b3dfc5ae2	Add ollama and llamacpp as vector sources	2024-05-28 22:54:50 +03:00
Cohee	967a7980f5	Add vector retrieval score threshold	2024-05-23 17:28:43 +03:00
Cohee	4370db6bdc	Implement World Info activation using Vector Storage	2024-04-23 03:09:52 +03:00
Cohee	ca89be8930	Add experimental setting for file translation	2024-04-21 03:24:01 +03:00
Cohee	80de3fdd4c	Add buttons to process and purge file vectors for current chat	2024-04-19 00:16:23 +03:00
Cohee	25cb598694	Add Cohere as embedding source	2024-04-19 00:07:12 +03:00
Cohee	16785ae005	Merge branch 'staging' into neo-server	2024-04-18 22:57:27 +03:00
Cohee	5e18b1872b	Hide vector summarization unless chat vectors is enabled	2024-04-18 22:32:01 +03:00
Kristan Schlikow	abd8acd9fc	Add experimental flask icon with title	2024-04-17 20:35:51 +02:00
Kristan Schlikow	977c1d4ac4	Add missing hr to the settings html	2024-04-17 20:35:51 +02:00
Kristan Schlikow	21b5bb5a29	Rearrange settings	2024-04-17 20:35:51 +02:00
Kristan Schlikow	10692523f0	Improve settings, add settings for prompt, sent message summarize toggle	2024-04-17 20:35:50 +02:00
Kristan Schlikow	a022c9eccb	Set css class for summary dropdown in vectors	2024-04-17 20:35:50 +02:00
Kristan Schlikow	4cfd0c71cf	Add extras summarization API, fix missing substitution of placeholders	2024-04-17 20:35:49 +02:00
Kristan Schlikow	a1473dedd0	Make summarization toggleable	2024-04-17 20:35:49 +02:00
Cohee	9a1ea7f226	Implement Data Bank vectors querying	2024-04-17 02:09:22 +03:00

73 Commits