This removes the `low_speed_timeout` setting from all providers as a
response to issue #19509.
The original `low_speed_timeout` was only added as part of #9913
because users wanted to _get rid of timeouts_: they wanted to bump the
default timeout from 5 seconds to something much higher.
Then, in #19055, `low_speed_timeout` was repurposed into a normal
`timeout`, which is a different thing and breaks slower LLMs that don't
reply with a complete response within the configured timeout.
So we figured: let's remove the whole thing and replace it with a
default _connect_ timeout to make sure that we can connect to a server
in 10s, but then give the server as long as it wants to complete its
response.
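To make the distinction concrete, here is a minimal sketch at the socket
level (using the standard library, not Zed's actual `http_client` code):
the connect phase is bounded at 10s, but reading the response is not.

```rust
use std::io;
use std::net::{TcpStream, ToSocketAddrs};
use std::time::Duration;

// Sketch only: bound the connection phase, never the response.
fn connect_only_timeout(host: &str, port: u16) -> io::Result<TcpStream> {
    let addr = (host, port)
        .to_socket_addrs()?
        .next()
        .ok_or_else(|| io::Error::new(io::ErrorKind::NotFound, "no address resolved"))?;
    // Fail fast if the server is unreachable within 10 seconds...
    let stream = TcpStream::connect_timeout(&addr, Duration::from_secs(10))?;
    // ...but leave the read timeout unset, so a slow LLM may take as long
    // as it needs to finish streaming its response.
    stream.set_read_timeout(None)?;
    Ok(stream)
}
```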
Closes #19509
Release Notes:
- Removed the `low_speed_timeout` setting from LLM provider settings,
since it was only ever used to _increase_ the timeout to give LLMs more
time. With no other use for it, we removed the setting entirely and now
give LLMs as long as they need to respond.
---------
Co-authored-by: Antonio <antonio@zed.dev>
Co-authored-by: Peter Tripp <peter@zed.dev>
- Closes: https://github.com/zed-industries/zed/issues/19609
Switches us to using `-latest` tags with Anthropic models instead of
pinning to a specific date version.
See: [Anthropic Model
Docs](https://docs.anthropic.com/en/docs/about-claude/models)
This is a no-op for:
- Claude 3 Opus (`claude-3-opus-20240229`)
- Claude 3 Sonnet (`claude-3-sonnet-20240229`)
- Claude 3 Haiku (`claude-3-haiku-20240307`)
For Claude 3.5 Sonnet this will update us from
`claude-3-5-sonnet-20240620` to `claude-3-5-sonnet-20241022`. We will
also pick up any subsequent model updates automatically when Anthropic
updates the `latest` tag.
This matches the behavior for OpenAI, where we use `gpt-4o` as the
`model_name` and not `gpt-4o-2024-08-06`.
This PR extends LLM usage tracking to cover cache writes and reads for
Anthropic models.
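As a rough sketch of what such a record might look like (field names
follow Anthropic's usage payload; the actual schema in Zed's service may
differ):

```rust
// Assumed shape, not Zed's real schema. The two cache counters mirror
// Anthropic's `cache_creation_input_tokens` and `cache_read_input_tokens`.
#[derive(Debug, Default, Clone, Copy)]
struct TokenUsage {
    input_tokens: u64,
    output_tokens: u64,
    cache_creation_input_tokens: u64,
    cache_read_input_tokens: u64,
}

impl TokenUsage {
    // Accumulate usage reported across streamed events for one completion.
    fn add(&mut self, other: TokenUsage) {
        self.input_tokens += other.input_tokens;
        self.output_tokens += other.output_tokens;
        self.cache_creation_input_tokens += other.cache_creation_input_tokens;
        self.cache_read_input_tokens += other.cache_read_input_tokens;
    }
}
```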
Release Notes:
- N/A
---------
Co-authored-by: Antonio Scandurra <me@as-cii.com>
Co-authored-by: Antonio <antonio@zed.dev>
Users of our `http_client` crate had to know they were interacting with
isahc, since they set isahc-specific extensions on the request. This
change adds our own equivalents for those APIs in preparation for
changing the default HTTP client.
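For instance, a client-agnostic extension could look something like this
sketch built on the `http` crate's typed extensions (`RedirectPolicy`
and `HttpRequestExt` are illustrative names, not the crate's actual
API):

```rust
// The policy is stored as a typed extension on the request; whichever
// client backend is active reads it back and maps it onto its own config.
#[derive(Clone, Debug)]
pub enum RedirectPolicy {
    None,
    FollowAll,
    FollowLimit(u32),
}

pub trait HttpRequestExt {
    /// Attach a redirect policy without naming any specific backend.
    fn redirect_policy(self, policy: RedirectPolicy) -> Self;
}

impl HttpRequestExt for http::request::Builder {
    fn redirect_policy(self, policy: RedirectPolicy) -> Self {
        self.extension(policy)
    }
}
```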
Release Notes:
- N/A
Release Notes:
- Allow Anthropic custom models to override `temperature`
This also centralizes the defaulting of `temperature` inside each
model's `into_x` call instead of sprinkling it around the code.
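A minimal sketch of that centralization, with assumed names (the real
conversions live on each provider's model types):

```rust
// Illustrative only: the optional override lives on the settings type and
// collapses to a concrete value exactly once, at request-conversion time.
struct AnthropicModelSettings {
    name: String,
    temperature: Option<f32>,
}

struct AnthropicRequest {
    model: String,
    temperature: f32,
}

impl AnthropicModelSettings {
    // Analogous to an `into_x` call; 1.0 as the default is an assumption.
    fn into_request(self) -> AnthropicRequest {
        AnthropicRequest {
            model: self.name,
            temperature: self.temperature.unwrap_or(1.0),
        }
    }
}
```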
This PR makes it so we propagate the `stop_reason` from Anthropic up to
the Assistant so that we can take action based on it.
The `extract_content_from_events` function was moved from the
`anthropic` crate to the `anthropic` module in `language_model`, since
it is more useful when it can name the `LanguageModelCompletionEvent`
type; otherwise we'd need an additional layer of plumbing.
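As a sketch of what acting on it can mean downstream (variant names
mirror Anthropic's documented `stop_reason` values, but the types here
are assumed):

```rust
// Anthropic documents stop reasons such as "end_turn", "max_tokens",
// "stop_sequence", and "tool_use"; this enum is an assumed stand-in.
enum StopReason {
    EndTurn,
    MaxTokens,
    StopSequence,
    ToolUse,
}

fn handle_stop(reason: StopReason) {
    match reason {
        // The model wants a tool run before it can continue.
        StopReason::ToolUse => { /* fulfill the tool use, then resume */ }
        // The response was cut off; the assistant can surface a warning.
        StopReason::MaxTokens => { /* notify about truncation */ }
        // Normal completion paths.
        StopReason::EndTurn | StopReason::StopSequence => { /* finish */ }
    }
}
```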
Release Notes:
- N/A
This PR adjusts the approach we use to encode tool uses in the
completion response, using a structured format rather than simply
injecting them into the response stream as text.
In #17170 we encoded the tool uses as XML and inserted them as text,
which then required re-parsing the tool uses out of the buffer in order
to use them.
The approach taken in this PR is to make `stream_completion` return a
stream of `LanguageModelCompletionEvent`s. Each of these events can be
either text, or a tool use.
A new `stream_completion_text` method has been added to `LanguageModel`
for scenarios where we only care about textual content (currently,
everywhere that isn't the Assistant context editor).
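Conceptually, the stream and the text-only adapter look something like
this sketch (simplified stand-ins for the real definitions, which return
results and run inside Zed's async runtime):

```rust
use futures::{Stream, StreamExt};

// Each streamed event is either plain text or a structured tool use.
pub enum LanguageModelCompletionEvent {
    Text(String),
    ToolUse {
        id: String,
        name: String,
        input: serde_json::Value,
    },
}

// In the spirit of `stream_completion_text`: callers that don't care
// about tool uses just see the concatenated text.
pub async fn completion_text(
    events: impl Stream<Item = LanguageModelCompletionEvent>,
) -> String {
    events
        .filter_map(|event| async move {
            match event {
                LanguageModelCompletionEvent::Text(text) => Some(text),
                LanguageModelCompletionEvent::ToolUse { .. } => None,
            }
        })
        .collect()
        .await
}
```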
Release Notes:
- N/A
This PR updates the Assistant with support for receiving tool uses from
Anthropic models and capturing them as text in the context editor.
This just lays the foundation for tool use. We don't yet fulfill the
tool uses or define any tools for the model to use.
Here's an example of what it looks like using the example `get_weather`
tool from the Anthropic docs:
<img width="644" alt="Screenshot 2024-08-30 at 1 51 13 PM"
src="https://github.com/user-attachments/assets/3614f953-0689-423c-8955-b146729ea638">
Release Notes:
- N/A
This PR removes the `cache_control` field from the variants in
`ResponseContent`.
This field is used on requests to control the caching behavior, but is
not needed on content in the response.
Release Notes:
- N/A
This PR splits the `Content` type for Anthropic into two new types:
`RequestContent` and `ResponseContent`.
Going through the Anthropic API docs, it seems there are different
types of content that can be sent in requests versus what can be
returned in responses.
Using a separate type for each case tells the story a bit better and
makes it easier to understand, IMO.
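A sketch of the split, with variants loosely following Anthropic's docs
(the crate's real enums carry more variants and fields):

```rust
// Illustrative shapes only. Note that `cache_control` appears on request
// content but not response content, matching the change described above.
pub struct CacheControl {
    pub cache_type: String, // e.g. "ephemeral"
}

pub enum RequestContent {
    Text {
        text: String,
        cache_control: Option<CacheControl>,
    },
    ToolResult {
        tool_use_id: String,
        content: String,
    },
}

pub enum ResponseContent {
    Text {
        text: String,
    },
    ToolUse {
        id: String,
        name: String,
        input: serde_json::Value,
    },
}
```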
Release Notes:
- N/A
- Cloudflare provides ISO-3166-1 country codes for protectorates. Expand our allowlist to include the territories of countries on the allowlist (US, UK, France, Australia, New Zealand), as sketched below.
- Also include the `country_code` in the error message when we block.
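A sketch of the expansion (the territory codes shown are examples; the
service's real allowlist is more complete):

```rust
use std::collections::HashSet;

// Illustrative subset: each allowlisted country plus a few of its
// territories' ISO-3166-1 codes, e.g. PR (Puerto Rico) and GU (Guam)
// under US, GG (Guernsey) and IM (Isle of Man) under UK/GB, PF (French
// Polynesia) under FR, NF (Norfolk Island) under AU, CK (Cook Islands)
// and NU (Niue) under NZ.
fn country_allowlist() -> HashSet<&'static str> {
    let mut allowed = HashSet::from(["US", "GB", "FR", "AU", "NZ"]);
    allowed.extend(["PR", "GU", "GG", "IM", "PF", "NF", "CK", "NU"]);
    allowed
}
```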
Co-authored-by: Marshall Bowers <elliott.codes@gmail.com>
Release Notes:
- Adds support for Prompt Caching in Anthropic. For models that support
it, this can dramatically lower cost while improving performance.
This PR is a refactor to pave the way for allowing the user to view and
edit workflow step resolutions. I've made tool calls work more like
normal streaming completions for all providers. The `use_any_tool`
method returns a stream of strings (which contain chunks of JSON). I've
also done some minor cleanup of language model providers in general,
removing the duplication around handling streaming responses.
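In other words, callers accumulate the chunks and parse the result once
the stream ends; roughly (names assumed):

```rust
use futures::{Stream, StreamExt};

// Sketch of the consuming side of `use_any_tool`: each chunk is a string
// fragment of JSON, and only the full concatenation parses.
async fn collect_tool_input(
    mut chunks: impl Stream<Item = String> + Unpin,
) -> serde_json::Result<serde_json::Value> {
    let mut buffer = String::new();
    while let Some(chunk) = chunks.next().await {
        buffer.push_str(&chunk);
    }
    serde_json::from_str(&buffer)
}
```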
Release Notes:
- N/A
This PR makes it so hitting upstream rate limits from Anthropic results
in an HTTP 429 response instead of an HTTP 500.
To do this we need to surface structured errors out of the `anthropic`
crate.
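The shape of the fix is roughly this (the error type is an assumption,
not the crate's exact definition):

```rust
// Illustrative error surface for callers of the `anthropic` crate.
pub enum AnthropicApiError {
    // Upstream returned 429; carry that so the LLM service can relay it.
    RateLimit { retry_after_secs: Option<u64> },
    Other(String),
}

fn response_status(error: &AnthropicApiError) -> u16 {
    match error {
        // Previously this was swallowed into a generic HTTP 500.
        AnthropicApiError::RateLimit { .. } => 429,
        AnthropicApiError::Other(_) => 500,
    }
}
```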
Release Notes:
- N/A
When Anthropic releases a new version of their models, Zed AI users
should always get access to the new version even when using an old
version of Zed.
Co-Authored-By: Thorsten <thorsten@zed.dev>
Release Notes:
- N/A
Co-authored-by: Thorsten <thorsten@zed.dev>
This PR updates the LLM service to authorize access to language model
providers based on the requester's country.
We detect the country using Cloudflare's
[`CF-IPCountry`](https://developers.cloudflare.com/fundamentals/reference/http-request-headers/#cf-ipcountry)
header.
The country code is then checked against the list of supported countries
for the given LLM provider. Countries that are not supported will
receive an `HTTP 451: Unavailable For Legal Reasons` response.
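A minimal sketch of the check (the function shape is assumed; the real
service integrates this with its routing and per-provider
configuration):

```rust
// Returns the HTTP status to respond with when the origin country is not
// supported for the requested provider.
fn authorize_country(
    cf_ipcountry: Option<&str>,
    supported_countries: &[&str],
) -> Result<(), u16> {
    match cf_ipcountry {
        Some(code) if supported_countries.contains(&code) => Ok(()),
        // Unsupported or unknown origin: 451 Unavailable For Legal Reasons.
        _ => Err(451),
    }
}
```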
Release Notes:
- N/A
We also eliminated the `completion` crate and moved its logic into
`LanguageModelRegistry`.
Release Notes:
- N/A
---------
Co-authored-by: Nathan <nathan@zed.dev>
In this pull request, we change the zed.dev protocol so that we pass the
raw JSON for the specified provider directly to our server. This avoids
the need to define a protobuf message that's a superset of all these
formats.
@bennetbo: We also changed the settings for `available_models` under
`zed.dev` to be a flat format, because the nesting seemed too confusing.
Can you help us upgrade the local provider configuration to be
consistent with this? We'll do whatever we need to do when parsing the
settings to make this simple for users, even if it's a bit more complex
on our end. We want to use versioning to avoid breaking existing users,
but we need to keep making progress.
```json
"zed.dev": {
  "available_models": [
    {
      "provider": "anthropic",
      "name": "some-newly-released-model-we-havent-added",
      "max_tokens": 200000
    }
  ]
}
```
Release Notes:
- N/A
---------
Co-authored-by: Nathan <nathan@zed.dev>
<img width="624" alt="image"
src="https://github.com/user-attachments/assets/f492b0bd-14c3-49e2-b2ff-dc78e52b0815">
- [x] Correctly set custom model token count
- [x] How to count tokens for Gemini models?
- [x] Feature flag zed.dev provider
- [x] Figure out how to configure custom models
- [ ] Update docs
Release Notes:
- Added support for quickly switching between multiple language model
providers in the assistant panel
---------
Co-authored-by: Antonio <antonio@zed.dev>
Release Notes:
- Added support for interacting with Claude in the assistant panel. You
can enable it by adding the following to your `settings.json`:
```json
"assistant": {
  "version": "1",
  "provider": {
    "name": "anthropic"
  }
}
```
Adds a Supermaven provider for completions. There are various other
refactors in this branch, primarily to make `copilot` no longer a
dependency of `project` and to properly show LSP logs for global LSPs
like Copilot.
This feature is not enabled by default. We're going to seek to refine it
in the coming weeks.
Release Notes:
- N/A
---------
Co-authored-by: Antonio Scandurra <me@as-cii.com>
Co-authored-by: Nathan Sobo <nathan@zed.dev>
Co-authored-by: Max <max@zed.dev>
Co-authored-by: Max Brunsfeld <maxbrunsfeld@gmail.com>