mirrors/zed

mirror of https://github.com/zed-industries/zed.git synced 2025-01-12 21:32:40 +00:00

Author	SHA1	Message	Date
Cole Miller	ee6f834028	Fuse LLM completion stream to avoid a panic (#21914 ) `LanguageModel::stream_completion_text` can poll the `stream_completion` stream (ultimately a `futures::Unfold`) after it's returned `Ready(None)`, which leads to a panic; avoid this by fusing the stream. Release Notes: - Fixed a panic when streaming language model completions	2024-12-12 11:39:35 -05:00
Marshall Bowers	937186da12	gpui: Don't export named `Context` from prelude (#21869 ) This PR updates the `gpui::prelude` to not export the `Context` trait named. This prevents some naming clashes in downstream consumers. Release Notes: - N/A	2024-12-11 13:21:40 -05:00
Marshall Bowers	f3140f54d8	assistant2: Wire up error messages (#21426 ) Some checks are pending CI / Check Postgres and Protobuf migrations, mergability (push) Waiting to run Details CI / Check formatting and spelling (push) Waiting to run Details CI / (macOS) Run Clippy and tests (push) Waiting to run Details CI / (Linux) Run Clippy and tests (push) Waiting to run Details CI / (Linux) Build Remote Server (push) Waiting to run Details CI / (Windows) Run Clippy and tests (push) Waiting to run Details CI / Create a macOS bundle (push) Blocked by required conditions Details CI / Create a Linux bundle (push) Blocked by required conditions Details CI / Create arm64 Linux bundle (push) Blocked by required conditions Details CI / Auto release preview (push) Blocked by required conditions Details Deploy Docs / Deploy Docs (push) Waiting to run Details Docs / Check formatting (push) Waiting to run Details Script / ShellCheck Scripts (push) Waiting to run Details This PR wires up the error messages for Assistant 2 so that they are shown to the user: <img width="1138" alt="Screenshot 2024-12-02 at 4 28 02 PM" src="https://github.com/user-attachments/assets/d8a5b9bd-0cef-4304-b561-b2edadbc70ef"> <img width="1138" alt="Screenshot 2024-12-02 at 4 29 09 PM" src="https://github.com/user-attachments/assets/0dd70841-0d5a-4de6-bebe-82c563246b65"> <img width="1138" alt="Screenshot 2024-12-02 at 4 32 49 PM" src="https://github.com/user-attachments/assets/a8838866-fad1-43a9-8935-490dc1936016"> @danilo-leal I kept the existing UX from Assistant 1, as I didn't see any errors in the design prototype, but we can revisit if another approach would work better. Release Notes: - N/A	2024-12-02 16:54:46 -05:00
Marshall Bowers	968ffaa3fd	assistant2: Restructure storage of tool uses and results (#21194 ) Some checks are pending CI / Check Postgres and Protobuf migrations, mergability (push) Waiting to run Details CI / Check formatting and spelling (push) Waiting to run Details CI / (macOS) Run Clippy and tests (push) Waiting to run Details CI / (Linux) Run Clippy and tests (push) Waiting to run Details CI / (Linux) Build Remote Server (push) Waiting to run Details CI / (Windows) Run Clippy and tests (push) Waiting to run Details CI / Create a macOS bundle (push) Blocked by required conditions Details CI / Create a Linux bundle (push) Blocked by required conditions Details CI / Create arm64 Linux bundle (push) Blocked by required conditions Details CI / Auto release preview (push) Blocked by required conditions Details Deploy Docs / Deploy Docs (push) Waiting to run Details Docs / Check formatting (push) Waiting to run Details Script / ShellCheck Scripts (push) Waiting to run Details This PR restructures the storage of the tool uses and results in `assistant2` so that they don't live on the individual messages. It also introduces a `LanguageModelToolUseId` newtype for better type safety. Release Notes: - N/A	2024-11-25 21:53:27 -05:00
Marshall Bowers	cbba44900d	Add `language_models` crate to house language model providers (#20945 ) This PR adds a new `language_models` crate to house the various language model providers. By extracting the provider definitions out of `language_model`, we're able to remove `language_model`'s dependency on `editor`, which improves incremental compilation when changing `editor`. Release Notes: - N/A	2024-11-20 18:49:34 -05:00
Marshall Bowers	e076f55d78	language_model: Remove dependency on `inline_completion_button` (#20930 ) This PR removes a dependency on the `inline_completion_button` crate from the `language_model` crate. We were taking on this dependency solely to call `initiate_sign_in`, which can easily be moved to the `copilot` crate. This allows `language_model` to move up in the crate dependency graph. Release Notes: - N/A	2024-11-20 16:19:20 -05:00
Siddharth M. Bhatia	97e9137cb7	Update references of Ollama Llama 3.1 to model Llama 3.2 (#20757 ) Release Notes: - N/A	2024-11-16 11:18:53 -05:00
Thorsten Ball	aee01f2c50	assistant: Remove `low_speed_timeout` (#20681 ) This removes the `low_speed_timeout` setting from all providers as a response to issue #19509. Reason being that the original `low_speed_timeout` was only as part of #9913 because users wanted to _get rid of timeouts_. They wanted to bump the default timeout from 5sec to a lot more. Then, in the meantime, the meaning of `low_speed_timeout` changed in #19055 and was changed to a normal `timeout`, which is a different thing and breaks slower LLMs that don't reply with a complete response in the configured timeout. So we figured: let's remove the whole thing and replace it with a default _connect_ timeout to make sure that we can connect to a server in 10s, but then give the server as long as it wants to complete its response. Closes #19509 Release Notes: - Removed the `low_speed_timeout` setting from LLM provider settings, since it was only used to _increase_ the timeout to give LLMs more time, but since we don't have any other use for it, we simply remove the setting to give LLMs as long as they need. --------- Co-authored-by: Antonio <antonio@zed.dev> Co-authored-by: Peter Tripp <peter@zed.dev>	2024-11-15 07:37:31 +01:00
Danilo Leal	187356ab9b	assistant: Show only configured models in the model picker (#20392 ) Closes https://github.com/zed-industries/zed/issues/16568 This PR introduces some changes to how we display models in the model selector within the assistant panel. Basically, it comes down to this: - If you don't have any provider configured, you should see _all_ available models in the picker - But, once you've configured some, you should _only_ see models from them in the picker Visually, nothing's changed much aside from the added "Configured Models" label at the top to ensure the understanding that that's a list of, well, configured models only. 😬 <img width="700" alt="Screenshot 2024-11-07 at 23 42 41" src="https://github.com/user-attachments/assets/219ed386-2318-43a6-abea-1de0cda8dc53"> Release Notes: - Change model selector in the assistant panel to only show configured models	2024-11-08 10:08:59 -03:00
Jonathan Toledo	67be6ec3b5	copilot: Add support for new models (#19968 ) Closes #19963 This PR implements integration with the newly announced GitHub Copilot LLM models, including: - Claude 3.5 Sonnet - o1-mini - o1-preview Release Notes: - N/A --------- Co-authored-by: Bennet Bo Fenner <bennet@zed.dev>	2024-11-04 10:55:20 +01:00
Boris Cherny	b87c4a1e13	assistant: Add health telemetry (#19928 ) This PR adds a bit of telemetry for Anthropic models, in order to understand model health. With this logging, we can monitor and diagnose dips in performance, for example due to model rollouts. Release Notes: - N/A --------- Co-authored-by: Max Brunsfeld <maxbrunsfeld@gmail.com>	2024-10-31 16:21:26 -07:00
Conrad Irwin	273cb1921f	Fix wrong UpdateWorktree chunk size being used in release mode (#19912 ) Release Notes: - Fixed slowness when collaborating Co-authored-by: Thorsten <thorsten@zed.dev>	2024-10-29 11:22:41 -06:00
Thorsten Ball	6686f66949	ollama: Ensure only single task fetches models (#19830 ) Before this change, we'd see a ton of requests from the Ollama provider trying to fetch models: ``` [2024-10-28T15:00:52+01:00 DEBUG reqwest::connect] starting new connection: http://localhost:11434/ [2024-10-28T15:00:52+01:00 DEBUG reqwest::connect] starting new connection: http://localhost:11434/ [2024-10-28T15:00:52+01:00 DEBUG reqwest::connect] starting new connection: http://localhost:11434/ [2024-10-28T15:00:52+01:00 DEBUG reqwest::connect] starting new connection: http://localhost:11434/ [2024-10-28T15:00:52+01:00 DEBUG reqwest::connect] starting new connection: http://localhost:11434/ [2024-10-28T15:00:52+01:00 DEBUG reqwest::connect] starting new connection: http://localhost:11434/ [2024-10-28T15:00:52+01:00 DEBUG reqwest::connect] starting new connection: http://localhost:11434/ [2024-10-28T15:00:52+01:00 DEBUG reqwest::connect] starting new connection: http://localhost:11434/ [2024-10-28T15:00:52+01:00 DEBUG reqwest::connect] starting new connection: http://localhost:11434/ [2024-10-28T15:00:52+01:00 DEBUG reqwest::connect] starting new connection: https://api.zed.dev/ [2024-10-28T15:00:52+01:00 DEBUG reqwest::connect] starting new connection: http://localhost:11434/ [2024-10-28T15:00:52+01:00 DEBUG reqwest::connect] starting new connection: http://localhost:11434/ [2024-10-28T15:00:52+01:00 DEBUG reqwest::connect] starting new connection: http://localhost:11434/ [2024-10-28T15:00:52+01:00 DEBUG reqwest::connect] starting new connection: http://localhost:11434/ [2024-10-28T15:00:52+01:00 DEBUG reqwest::connect] starting new connection: http://localhost:11434/ [2024-10-28T15:00:52+01:00 DEBUG reqwest::connect] starting new connection: http://localhost:11434/ [2024-10-28T15:00:52+01:00 DEBUG reqwest::connect] starting new connection: http://localhost:11434/ ``` Turns out we'd send a request on every change to settings. Now, with this change, we only send a single request. Release Notes: - N/A Co-authored-by: Bennet <bennet@zed.dev>	2024-10-28 15:40:50 +01:00
David Soria Parra	8a96ea25c4	context_servers: Support tools (#19548 ) This PR depends on #19547 This PR adds support for tools from context servers. Context servers are free to expose tools that Zed can pass to models. When called by the model, Zed forwards the request to context servers. This allows for some interesting techniques. Context servers can easily expose tools such as querying local databases, reading or writing local files, reading resources over authenticated APIs (e.g. kubernetes, asana, etc). This is currently experimental. Things to discuss * I want to still add a confirm dialog asking people if a server is allows to use the tool. Should do this or just use the tool and assume trustworthyness of context servers? * Can we add tool use behind a local setting flag? Release Notes: - N/A --------- Co-authored-by: Marshall Bowers <elliott.codes@gmail.com>	2024-10-28 10:37:58 -04:00
Peter Tripp	291af664e1	Switch to Anthropic -latest tags (#19615 ) - Closes: https://github.com/zed-industries/zed/issues/19609 Switches us to using `-latest` tags with Anthropic models instead of pinning to a specific date version. See: [Anthropic Model Docs](https://docs.anthropic.com/en/docs/about-claude/models) This is a no-op for: - Claude 3 Opus (`claude-3-opus-20240229`) - Claude 3 Sonnet (`claude-3-sonnet-20240229`) - Claude 3 Haiku (`claude-3-haiku-20240307`) For Claude 3.5 Sonnet this will update us from `claude-3-5-sonnet-20240620` to `claude-3-5-sonnet-20241022`. We will also pickup any subsequent model updates automatically when Anthropic updates the `latest` tag. This matches the behavior for OpenAI where use `gpt-4o` as the model_name and not `gpt-4o-2024-08-06`.	2024-10-23 15:13:52 -04:00
Marshall Bowers	2bcf9fc490	Add `client::zed_urls` module for constructing zed.dev URLs (#19391 ) This PR adds a new `zed_urls` module to the `client` crate. This module contains functions for constructing URLs to Zed properties, such as zed.dev. The URLs produced by this module will respect the server URL set via settings or the `ZED_SERVER_URL` environment variable. This allows them to correctly reflect the current environment (such as when testing Zed against a local collab/zed.dev). Release Notes: - N/A	2024-10-17 16:18:35 -04:00
Marshall Bowers	84b61c8b1a	assistant: Add support for displaying billing-related errors (#19082 ) This PR adds support to the assistant for display billing-related errors. Pulling this out of #19081 to make it easier to cherry-pick. Release Notes: - N/A Co-authored-by: Antonio <antonio@zed.dev> Co-authored-by: Richard <richard@zed.dev>	2024-10-11 13:22:45 -04:00
Boris Cherny	01ad22683d	telemetry: Add `language_name` and `model_provider` (#18640 ) This PR adds a bit more metadata for assistant logging. Release Notes: - Assistant: Added `language_name` and `model_provider` fields to telemetry events. --------- Co-authored-by: Marshall Bowers <elliott.codes@gmail.com> Co-authored-by: Max <max@zed.dev>	2024-10-04 14:37:27 -04:00
Richard Feldman	caaa9a00a9	Remove Qwen2 model (#18444 ) Removed deprecated Qwen2 7B Instruct model from zed.dev provider (staff only). Release Notes: - N/A	2024-09-27 13:30:25 -04:00
Conrad Irwin	e28496d4e2	Stop leaking isahc assumption (#18408 ) Users of our http_client crate knew they were interacting with isahc as they set its extensions on the request. This change adds our own equivalents for their APIs in preparation for changing the default http client. Release Notes: - N/A	2024-09-26 14:01:05 -06:00
Roy Williams	5905fbb9ac	Allow Anthropic custom models to override temperature (#18160 ) Some checks are pending CI / Check formatting and spelling (push) Waiting to run Details CI / (macOS) Run Clippy and tests (push) Waiting to run Details CI / (Linux) Run Clippy and tests (push) Waiting to run Details CI / (Windows) Run Clippy and tests (push) Waiting to run Details CI / Create a macOS bundle (push) Blocked by required conditions Details CI / Create a Linux bundle (push) Blocked by required conditions Details CI / Create arm64 Linux bundle (push) Blocked by required conditions Details Deploy Docs / Deploy Docs (push) Waiting to run Details Docs / Check formatting (push) Waiting to run Details Release Notes: - Allow Anthropic custom models to override "temperature" This also centralized the defaulting of "temperature" to be inside of each model's `into_x` call instead of being sprinkled around the code.	2024-09-20 14:59:12 -06:00
jvmncs	9f6ff29a54	Reuse OpenAI low_speed_timeout setting for zed.dev provider (#18144 ) Release Notes: - N/A	2024-09-20 12:57:35 -04:00
Antonio Scandurra	15b4130fa5	Introduce the ability to cycle between alternative inline assists (#18098 ) Release Notes: - Added a new `assistant.inline_alternatives` setting to configure additional models that will be used to perform inline assists in parallel. --------- Co-authored-by: Nathan <nathan@zed.dev> Co-authored-by: Roy <roy@anthropic.com> Co-authored-by: Adam <wolffiex@anthropic.com>	2024-09-19 17:50:00 -06:00
Peter Tripp	67f149a4bc	Ollama: Specify keep_alive via settings (#17906 )	2024-09-16 18:47:25 -04:00
Danilo Leal	29a5def12c	Refine assistant config UI (#17871 ) This PR does a little bit of a touch-up on the copywriting on the assistant config UI. I had friends reporting to me that some of the writing could be clearer, and hopefully, this goes into that direction! Release Notes: - N/A	2024-09-16 08:12:07 -03:00
Peter Tripp	d245f5e75c	OpenAI o1-preview and o1-mini support (#17796 ) Release Notes: - Added support for OpenAI o1-mini and o1-preview models. --------- Co-authored-by: Jason Mancuso <7891333+jvmncs@users.noreply.github.com> Co-authored-by: Bennet <bennet@zed.dev>	2024-09-13 16:23:55 -04:00
jvmncs	c71f052276	Add ability to use o1-preview and o1-mini as custom models (#17804 ) This is a barebones modification of the OpenAI provider code to accommodate non-streaming completions. This is specifically for the o1 models, which do not support streaming. Tested that this is working by running a `/workflow` with the following (arbitrarily chosen) settings: ```json { "language_models": { "openai": { "version": "1", "available_models": [ { "name": "o1-preview", "display_name": "o1-preview", "max_tokens": 128000, "max_completion_tokens": 30000 }, { "name": "o1-mini", "display_name": "o1-mini", "max_tokens": 128000, "max_completion_tokens": 20000 } ] } }, } ``` Release Notes: - Changed `low_speed_timeout_in_seconds` option to `600` for OpenAI provider to accommodate recent o1 model release. --------- Co-authored-by: Peter <peter@zed.dev> Co-authored-by: Bennet <bennet@zed.dev> Co-authored-by: Marshall Bowers <elliott.codes@gmail.com>	2024-09-13 15:42:15 -04:00
Richard Feldman	91ffa02e2c	/auto (#16696 ) Some checks are pending CI / Check formatting and spelling (push) Waiting to run Details CI / (macOS) Run Clippy and tests (push) Waiting to run Details CI / (Linux) Run Clippy and tests (push) Waiting to run Details CI / (Windows) Run Clippy and tests (push) Waiting to run Details CI / Create a macOS bundle (push) Blocked by required conditions Details CI / Create a Linux bundle (push) Blocked by required conditions Details CI / Create arm64 Linux bundle (push) Blocked by required conditions Details Deploy Docs / Deploy Docs (push) Waiting to run Details Docs / Check formatting (push) Waiting to run Details Add `/auto` behind a feature flag that's disabled for now, even for staff. We've decided on a different design for context inference, but there are parts of /auto that will be useful for that, so we want them in the code base even if they're unused for now. Release Notes: - N/A --------- Co-authored-by: Antonio Scandurra <me@as-cii.com> Co-authored-by: Marshall Bowers <elliott.codes@gmail.com>	2024-09-13 13:17:49 -04:00
Amin Ahmed Khan	ef5a7e1642	Fix OpenAI key URL (#17675 ) Update the create Open AI Key URL Release Notes: - Fixed a link in the Assistant panel to the OpenAI console.	2024-09-10 23:14:43 -04:00
maan2003	d6663fcb29	Pass temperature to Anthropic (#17509 ) Release Notes: - N/A --------- Co-authored-by: Marshall Bowers <elliott.codes@gmail.com>	2024-09-10 18:09:00 -04:00
Peter Tripp	bd1ff476b9	Revert tokenizer for custom OpenAI models (#17660 ) Fix for custom openai models tokenizer settings.	2024-09-10 15:38:27 -04:00
Peter Tripp	fb9d01b0d5	assistant: Add display_name for OpenAI and Gemini (#17508 )	2024-09-10 13:41:06 -04:00
Bennet Bo Fenner	a7ac37156c	assistant: Fix configuration page showing incorrect Anthropic API key label (#17650 ) Release Notes: - N/A	2024-09-10 11:23:50 -04:00
Piotr Osiewicz	e6c1c51b37	chore: Fix several style lints (#17488 ) Some checks are pending CI / Check formatting and spelling (push) Waiting to run Details CI / (macOS) Run Clippy and tests (push) Waiting to run Details CI / (Linux) Run Clippy and tests (push) Waiting to run Details CI / (Windows) Run Clippy and tests (push) Waiting to run Details CI / Create a macOS bundle (push) Blocked by required conditions Details CI / Create a Linux bundle (push) Blocked by required conditions Details CI / Create arm64 Linux bundle (push) Blocked by required conditions Details Deploy Docs / Deploy Docs (push) Waiting to run Details Docs / Check formatting (push) Waiting to run Details It's not comprehensive enough to start linting on `style` group, but hey, it's a start. Release Notes: - N/A	2024-09-06 11:58:39 +02:00
Bennet Bo Fenner	f413ea90bf	assistant: Fix Google AI provider not respecting `low_speed_timeout_in_seconds` (#17423 ) Release Notes: - Fixed an issue when using Google Gemini models, where the setting `low_speed_timeout_in_seconds` was not respected	2024-09-05 18:16:30 +02:00
Marshall Bowers	497356b2ba	language_model: Add tool uses to message content (#17381 ) Some checks are pending CI / Check formatting and spelling (push) Waiting to run Details CI / (macOS) Run Clippy and tests (push) Waiting to run Details CI / (Linux) Run Clippy and tests (push) Waiting to run Details CI / (Windows) Run Clippy and tests (push) Waiting to run Details CI / Create a macOS bundle (push) Blocked by required conditions Details CI / Create a Linux bundle (push) Blocked by required conditions Details CI / Create arm64 Linux bundle (push) Blocked by required conditions Details Deploy Docs / Deploy Docs (push) Waiting to run Details Docs / Check formatting (push) Waiting to run Details This PR updates the message content for an LLM request to allow it contain tool uses. We need to send the tool uses back to the model in order for it to recognize the subsequent tool results. Release Notes: - N/A	2024-09-04 19:29:11 -04:00
Marshall Bowers	965b23fffe	language_model: Remove unused `impl` for `MessageContent` (#17377 ) This PR removes an unused `impl` for the `MessageContent` type. Release Notes: - N/A	2024-09-04 18:51:35 -04:00
Marshall Bowers	30b2133336	language_model: Add tool results to message content (#17363 ) This PR updates the message content for an LLM request to allow it contain tool results. Release Notes: - N/A	2024-09-04 13:29:01 -04:00
Marshall Bowers	f38956943b	assistant: Propagate LLM stop reason upwards (#17358 ) This PR makes it so we propagate the `stop_reason` from Anthropic up to the Assistant so that we can take action based on it. The `extract_content_from_events` function was moved from `anthropic` to the `anthropic` module in `language_model` since it is more useful if it is able to name the `LanguageModelCompletionEvent` type, as otherwise we'd need an additional layer of plumbing. Release Notes: - N/A	2024-09-04 12:31:10 -04:00
Marshall Bowers	452272e5df	assistant: Stream tool uses as structured data (#17322 ) This PR adjusts the approach we use to encoding tool uses in the completion response to use a structured format rather than simply injecting it into the response stream as text. In #17170 we would encode the tool uses as XML and insert them as text. This would require then re-parsing the tool uses out of the buffer in order to use them. The approach taken in this PR is to make `stream_completion` return a stream of `LanguageModelCompletionEvent`s. Each of these events can be either text, or a tool use. A new `stream_completion_text` method has been added to `LanguageModel` for scenarios where we only care about textual content (currently, everywhere that isn't the Assistant context editor). Release Notes: - N/A	2024-09-03 15:04:51 -04:00
Marshall Bowers	68ea661711	assistant: Add foundation for receiving tool uses from Anthropic models (#17170 ) This PR updates the Assistant with support for receiving tool uses from Anthropic models and capturing them as text in the context editor. This is just laying the foundation for tool use. We don't yet fulfill the tool uses yet, or define any tools for the model to use. Here's an example of what it looks like using the example `get_weather` tool from the Anthropic docs: <img width="644" alt="Screenshot 2024-08-30 at 1 51 13 PM" src="https://github.com/user-attachments/assets/3614f953-0689-423c-8955-b146729ea638"> Release Notes: - N/A	2024-08-30 14:05:55 -04:00
Marshall Bowers	8901d926eb	anthropic: Use separate `Content` type in requests and responses (#17163 ) Some checks are pending CI / Check formatting and spelling (push) Waiting to run Details CI / (macOS) Run Clippy and tests (push) Waiting to run Details CI / (Linux) Run Clippy and tests (push) Waiting to run Details CI / (Windows) Run Clippy and tests (push) Waiting to run Details CI / Create a macOS bundle (push) Blocked by required conditions Details CI / Create a Linux bundle (push) Blocked by required conditions Details CI / Create arm64 Linux bundle (push) Blocked by required conditions Details Deploy Docs / Deploy Docs (push) Waiting to run Details Docs / Check formatting (push) Waiting to run Details This PR splits the `Content` type for Anthropic into two new types: `RequestContent` and `ResponseContent`. As I was going through the Anthropic API docs it seems that there are different types of content that can be sent in requests vs what can be returned in responses. Using a separate type for each case tells the story a bit better and makes it easier to understand, IMO. Release Notes: - N/A	2024-08-30 11:46:03 -04:00
Peter Tripp	b62e63349b	Ollama max_tokens settings (#17025 ) - Support `available_models` for Ollama - Clamp default max tokens (context length) to 16384. - Add documentation for ollama context configuration.	2024-08-30 08:52:00 -04:00
Peter Tripp	d401ab1efc	Make links in assistant configuration clickable (#17011 )	2024-08-30 08:50:25 -04:00
Peter Tripp	0332eaf797	Remove reference to Copilot plugin (#16916 )	2024-08-26 16:43:22 -04:00
Jason Lee	938d93a64c	gpui: Add `truncate` and `text_ellipsis` to TextStyle (#14850 ) Release Notes: - N/A Ref issue #4996 ## Demo ``` cargo run -p gpui --example text_wrapper ``` https://github.com/user-attachments/assets/a7fcebf7-f287-4517-960d-76b12722a2d7 --------- Co-authored-by: Marshall Bowers <elliott.codes@gmail.com>	2024-08-23 14:02:51 -04:00
Thorsten Ball	7647644602	zed ai: Show ToS form in Configuration View (#16736 ) Related #16618 Release Notes: - N/A	2024-08-23 11:17:21 +02:00
Marshall Bowers	93642c9c51	Pass through Anthropic cache configuration when using Zed provider (#16685 ) This PR makes it so the model's cache configuration gets passed through from the base model when using the Zed provider. Release Notes: - Fixed caching for Anthropic models when using the Zed provider.	2024-08-22 12:48:47 -04:00
邻二氮杂菲	f1778dd9de	Add max_output_tokens to OpenAI models and integrate into requests (#16381 ) ### Pull Request Title Introduce `max_output_tokens` Field for OpenAI Models https://platform.deepseek.com/api-docs/news/news0725/#4-8k-max_tokens-betarelease-longer-possibilities ### Description This commit introduces a new field `max_output_tokens` to the OpenAI models, which allows specifying the maximum number of tokens that can be generated in the output. This field is now integrated into the request handling across multiple crates, ensuring that the output token limit is respected during language model completions. Changes include: - Adding `max_output_tokens` to the `Custom` variant of the `open_ai::Model` enum. - Updating the `into_open_ai` method in `LanguageModelRequest` to accept and use `max_output_tokens`. - Modifying the `OpenAiLanguageModel` and `CloudLanguageModel` implementations to pass `max_output_tokens` when converting requests. - Ensuring that the `max_output_tokens` field is correctly serialized and deserialized in relevant structures. This enhancement provides more control over the output length of OpenAI model responses, improving the flexibility and accuracy of language model interactions. ### Changes - Added `max_output_tokens` to the `Custom` variant of the `open_ai::Model` enum. - Updated the `into_open_ai` method in `LanguageModelRequest` to accept and use `max_output_tokens`. - Modified the `OpenAiLanguageModel` and `CloudLanguageModel` implementations to pass `max_output_tokens` when converting requests. - Ensured that the `max_output_tokens` field is correctly serialized and deserialized in relevant structures. ### Related Issue https://github.com/zed-industries/zed/pull/16358 ### Screenshots / Media N/A ### Checklist - [x] Code compiles correctly. - [x] All tests pass. - [ ] Documentation has been updated accordingly. - [ ] Additional tests have been added to cover new functionality. - [ ] Relevant documentation has been updated or added. ### Release Notes - Added `max_output_tokens` field to OpenAI models for controlling output token length.	2024-08-21 00:39:10 -04:00
Max Brunsfeld	b5bd8a5c5d	Add logic for closed beta LLM models (#16482 ) Release Notes: - N/A --------- Co-authored-by: Marshall <marshall@zed.dev>	2024-08-19 11:09:52 -07:00

1 2 3

104 commits