mirrors/jj

mirror of https://github.com/martinvonz/jj.git synced 2025-01-24 21:13:47 +00:00

Author	SHA1	Message	Date
dploch	3a0929b876	index_store: add more scope to write_index() This is more consistent with the other method and it makes some extension operations easier, by giving access to the OpStore and other relevant context for custom index extensions.	2024-05-10 09:22:09 -04:00
Martin von Zweigbergk	dbba2edc57	commit: add a helper for returning parent tree of `Commit` The pattern of getting the parent tree of a commit gets repeated a bit. Let's add a helper on `Commit`.	2024-05-07 19:35:03 -07:00
Martin von Zweigbergk	428e209304	cleanup: consistently use `BackendResult` We have the type alias so we should use it consistently.	2024-05-07 19:35:03 -07:00
Martin von Zweigbergk	2e44741e02	repo: move new commit ids into `RewriteType` enum `RewriteType::Rewritten` must have exactly one replacement. I think it's better to encode that in the type by attaching the value to the enum variant. I also renamed the type to just `Rewrite` since it now has attached data and `Type` sounds like a traditional data-free enum to me.	2024-05-06 12:58:40 -07:00
Martin von Zweigbergk	c6bb94de42	repo: make `RewriteType` private Looks like I forgot this in some recent refactoring. I don't really see any harm in making the type public later. I might want to make `rebase_descendants()` not clear `parent_mapping` and instead provide a way of accessing it afterwards (removing the need for the `_return_map()` flavors). We'll see if that ends up happening. For now it can be private anyway.	2024-05-06 12:58:40 -07:00
dploch	20af8c79ef	revset: support custom filter extensions	2024-05-06 10:42:01 -04:00
dploch	387ae9bce1	revset: support defining custom revset functions	2024-05-06 10:42:01 -04:00
dploch	cfa595199a	revset: make a bunch of parsing types public	2024-05-06 10:42:01 -04:00
dploch	4e0abf0631	revset: make RevsetParseContext opaque	2024-05-06 10:42:01 -04:00
Ilya Grigoriev	70b517ca64	conflicts.rs: label conflict number and sides next to conflict markers For example, ``` <<<<<<< Conflict 1 of 3 +++++++ Contents of side #1 left 3.1 left 3.2 left 3.3 %%%%%%% Changes from base to side #2 -line 3 +right 3.1 >>>>>>> ``` or ``` <<<<<<< Conflict 1 of 1 %%%%%%% Changes from base to side #1 -line 3 +right 3.1 +++++++ Contents of side #2 left 3.1 left 3.2 left 3.3 >>>>>>> ``` Currently, there is no way to disable these, this is TODO for a future PR. Other TODOs for future PRs: make these labels configurable. After that, we could support a `diff3/git`-like conflict format as well, in principle. Counting conflicts helps with knowing whether you fixed all the conflicts while you are in the editor. While labeling "side #1", etc, does not tell you the commit id or description as requested in #1176, I still think it's an improvement. Most importantly, I hope this will make `jj`'s conflict format less scary-looking for new users. I've used this for a bit, and I like it. Without the labels, I would see that the two conflicts have a different order of conflict markers, but I wouldn't be able to remember what that means. For longer diffs, it can be tricky for me to quickly tell that it's a diff as opposed to one of the sides. This also creates some hope of being able to navigate a conflict with more than 2 sides. Another not-so-secret goal for this is explained in https://github.com/martinvonz/jj/pull/3109#issuecomment-2014140627. The idea is a little weird, but I think it could be helpful, and I'd like to experiment with it.	2024-05-05 18:42:14 -07:00
Ilya Grigoriev	f43a810fe0	conflicts.rs: Teach `jj` to parse conflict markers that are followed by a label The format is 7 characters of the separator followed by a space and arbitrary text, followed by a newline. Separator followed by a newline is also allowed. E.g.: <<<<<<< Random text %%%%%%% Random text line 2 -line 3 +left line 4 +++++++ Random text right %%%%%%% Random text line 2 +forward line 3 line 4 >>>>>>> Random text This commit only allows reading such conflicts. I considered allowing longer separators (`<<<<<<<<<<<<<< Random text`), but we wouldn't currently write them, so let's be strict for now. 7 characters if they are followed by a space and arbitrary text	2024-05-05 18:42:14 -07:00
Martin von Zweigbergk	3dab92d2e9	cli: move `revsets.log` default to config file	2024-05-05 09:08:14 -07:00
Martin von Zweigbergk	f958957f9e	cli: drop support for old `ui.default-revset` config We replaced it by `revsets.log` in 0.8.0, which is long enough that users should have been able to switch.	2024-05-05 09:08:14 -07:00
Martin von Zweigbergk	0d1ff8a150	merged_tree: propagate errors from `TreeEntriesIterator` We shouldn't panic if we fail to read a tree from the backend.	2024-05-01 06:10:08 -07:00
Martin von Zweigbergk	dbf2a98903	rewrite: add `CommitRewriter::record_abandoned_commit()` We already have two uses for this function and I think we're soon going to have more. The function record the old commit as abandoned with the new parents, which is typically what you want. We could record it as abandoned with the old parents instead but then we'd have to do an extra iteration to find the parents when rebasing any children. It would also be confusing if `rewriter.set_parents(new_parents).record_abandoned_commit()` didn't respect the new parents.	2024-04-30 20:03:57 -07:00
dploch	586ab1f076	revset: add a SymbolResolverExtension trait to provide custom resolvers	2024-04-26 10:55:34 -04:00
dploch	bad9e9e3d7	revset: convert commit and change prefix resolvers into partial symbol resolvers	2024-04-26 10:55:34 -04:00
dploch	7bdf2b3945	revset: homogenize the logic of various symbol resolution steps into a common trait	2024-04-26 10:55:34 -04:00
dploch	cf78532bd8	revset: add two new error variants to support extensions	2024-04-26 10:55:34 -04:00
Yuya Nishihara	5b769c5c9e	fileset, revset, templater: add support for single-quoted raw string literals Since fileset/revset/template expressions are specified as command-line arguments, it's sometimes convenient to use single quotes instead of double quotes. Various scripting languages parse single-quoted strings in various ways, but I choose the TOML rule because it's simple and practically useful. TOML is our config language, so copying the TOML syntax would be less surprising than borrowing it from another language. https://github.com/toml-lang/toml/issues/188	2024-04-25 11:14:33 +09:00
Yuya Nishihara	a74bf89df5	revset: reuse parse_symbol_rule_as_literal() to parse string symbol For the same reason as the previous commit. Single-quoted string literal will be handled there.	2024-04-25 11:14:33 +09:00
Yuya Nishihara	37bd966357	fileset: extract inner function that parses string-like literal I'm going to add single-quoted string literal.	2024-04-25 11:14:33 +09:00
Yuya Nishihara	528ccb318e	fileset: fall back to bare pattern/string if no operator-like character found While I like strict parsing, it's not uncommon that we have to deal with file names containing spaces, and doubly-quoted strings such as '"Foo Bar"' look ugly. So, this patch adds an exception that accepts top-level bare strings. This parsing rule is specific to command arguments, and won't be enabled when loading fileset aliases.	2024-04-24 12:02:07 +09:00
Martin von Zweigbergk	9d7ed54f8e	git_backend: add a README to conflicted commits When you use e.g. `git switch` to check out a conflicted commit, you're going to end up with the `.jjconflicts-*` directories in your working copy. It's probably not obvious what those mean. This patch adds a README file to the root tree to try to explain to users what's going on and how to recover. The authoritative information about conflicts is stored in the `jj:trees` commit header. The contents of conflicted commits is only used for preventing GC. We can therefore add contents to the tree without much consequence.	2024-04-22 06:22:54 -07:00
Yuya Nishihara	9f4a7318c7	tests: compare git refs loaded from disk, not in-memory cache values This addresses the test instability. The underlying problem still exists, but it's unlikely to trigger user-facing issues because of that. A repo instance won't be reused after gc() call. Fixes #3537	2024-04-22 18:46:28 +09:00
Yuya Nishihara	527713a851	tests: fix potential mtime flakiness in git gc tests Apparently, these gc() invocations rely on that the previous "git gc" packed all refs so there are no loose refs to compare mtimes. If there were new (or remaining) loose refs, mtime comparison could fail. I also added +1sec to effectively turn off the keep_newer option, which isn't important in these tests.	2024-04-22 18:46:28 +09:00
Evan Mesterhazy	f9a3021a7a	Simplify calls to `CommitRewriter::replace_parents()` Now that it takes `IntoIterator` the caller doesn't need to clone the input `CommitIds`.	2024-04-21 23:31:17 -04:00
Evan Mesterhazy	2b0aa84c9d	CommitRewriter::rewrite_parents(): Take `IntoIterator` instead of `&[CommitId]` CommitIds are often manipulated by reference, so this makes the API more flexible for cases where the caller doesn't already have a Vec or array of owned CommitIds. In many cases `rewrite_parents()` does not even need to clone the input CommitIds. This refactor allows the clone to be avoided if it's unnecessary. There might be other APIs that would benefit from a similar change. In general, it seems like there are a lot of places where we're writing `&[commit_x.id().clone, commit_y.id().clone()]` and similiar. - [Rust API Guidelines](https://rust-lang.github.io/api-guidelines/flexibility.html#functions-minimize-assumptions-about-parameters-by-using-generics-c-generic)	2024-04-21 23:31:17 -04:00
Evan Mesterhazy	bbd9c7c7cb	Implement advance-branches for jj commit ## Feature Description If enabled in the user or repository settings, the local branches pointing to the parents of the revision targeted by `jj commit` will be advanced to the newly created commit. Support for `jj new` will be added in a future change. This behavior can be enabled by default for all branches by setting the following in the config.toml: ``` [experimental-advance-branches] enabled-branches = ["glob:"] ``` Specific branches can also be disabled: ``` [experimental-advance-branches] enabled-branches = ["glob:"] disabled-branches = ["main"] ``` Branches that match a disabled pattern will not be advanced, even if they also match an enabled pattern. This implements feature request #2338.	2024-04-20 10:26:04 -04:00
Martin von Zweigbergk	8bb92fa6fa	working_copy: allow `load_working_copy()` to return error It's reasonable for a `WorkingCopy` implementation to want to return an error. `LocalWorkingCopyFactory` doesn't because it loads all data lazily. The VFS-based one at Google wants to be able to return an error, however.	2024-04-19 15:22:37 -07:00
Austin Seipp	ddfdf5e357	cli: allow `snapshot.max-new-file-size` to be a raw u64 Previously, this command would work: jj --config-toml='snapshot.max-new-file-size="1"' st And is equivalent to this: jj --config-toml='snapshot.max-new-file-size="1B"' st But this would not work, despite looking like it should: jj --config-toml='snapshot.max-new-file-size=1' st This is extremely confusing for users. This config value is deserialized via serde; and while the `HumanByteSize` struct allegedly implemented Serde's `visit_u64` method, it was not called by the deserialize visitor. Strangely, adding an `visit_i64` method did work, but then requires handling of overflow, etc. This is likely because TOML integers are naturally specified in `i64`. Instead, just don't bother with any of that; implement a `TryFrom<String>` instance for `HumanByteSize` that uses `u64::from_str` to try parsing the string immediately; then fall back to `parse_human_byte_size` if that doesn't work. This not only fixes the behavior but, IMO, is much simpler to reason about; we get our `Deserialize` instance for free from the `TryFrom` instance. Finally, this adjusts the test for `max-new-file-size` to now use a raw integer literal, to ensure it doesn't regress. (There are already in-crate tests for parsing the human readable strings.) Signed-off-by: Austin Seipp <aseipp@pobox.com> Change-Id: I8dafa2358d039ad1c07e9a512c1d10fed5845738	2024-04-19 13:03:24 -05:00
Martin von Zweigbergk	d6b41c18c9	parallelize: rewrite using `transform_descendants()` `jj parallelize` was a good example of a command that can be simplified by the new API, so I decided to rewrite it as an example. The rewritten version is more flexible and doesn't actually need the restrictions from the old version (such as checking that the commits are connected). I still left the check for now to keep this patch somewhat small. A subsequent commit will remove the restrictions.	2024-04-18 21:06:52 -07:00
Martin von Zweigbergk	e682543570	repo: take owned commit IDs to `MutableRepo::new_parents()` We always call `.to_vec()` on the slice, so let's just have the caller pass in an owned vector instead.	2024-04-18 21:06:52 -07:00
Martin von Zweigbergk	87c65ee0f9	rewrite: make `CommitRewriter::replace_parents()` remove repeats	2024-04-18 21:06:52 -07:00
Martin von Zweigbergk	96f5ca47d4	repo: add method for tranforming descendants, use in `rebase_descendants()` There are several existing commands that would benefit from an API that makes it easier to rewrite a whole graph of commits while transforming them in some way. `jj squash` is one example. When squashing into an ancestor, that command currently rewrites the ancestor, then rebases descendants, and then rewrites the rewritten source commit. It would be better to rewrite the source commit (and any descendants) only once. Another example is the future `jj fix`. That command will want to rewrite a graph while updating the trees. There's currently no good API for that; you have to manually iterate over descendants and rewrite them. This patch adds a new `MutableRepo::transform_descendants()` method that takes a callback which gets a `CommitRewriter` passed to it. The callback can then decide to change the parents, the tree, etc. The callback is also free to leave the commit in place or to abandon it. I updated the regular `rebase_descendants()` to use the new function in order to exercise it. I hope we can replace all of the `rebase_descendant_*()` flavors later. I added a `replace_parent()` method that was a bit useful for the test case. It could easily be hard-coded in the test case instead, but I think the method will be useful for `jj git sync` and similar in the future.	2024-04-18 21:06:52 -07:00
Yuya Nishihara	18f94bbb8b	cli: suggest root:"<path>" if cwd-relative path is not in workspace Closes #3216	2024-04-19 09:35:47 +09:00
Martin von Zweigbergk	d38228d0c5	rewrite: move check for unchanged parents onto `CommitRewriter`	2024-04-18 08:08:51 -07:00
Martin von Zweigbergk	ad1ee2d1d2	rewrite: pass root commits into `find_descendants_to_rebase()` I'm going to add another caller that wants to rebase from given roots instead.	2024-04-18 08:08:51 -07:00
Martin von Zweigbergk	a5e6b1f997	rewrite: inline specialized `rebase_commit_with_options()` in `rebase()` `rebase_commit_with_options()` now does very little, and we don't want most of it in `rebase()`.	2024-04-18 08:08:51 -07:00
Martin von Zweigbergk	2859277941	rewrite: pass `CommitRewriter` into `rebase_commit_with_options()` `CommitRewriter` wraps 3 of the arguments, so I think it makes sense to pass it instead. More importantly, I hope to continue refactoring so many of the callers already have a `CommitRewriter`.	2024-04-18 08:08:51 -07:00
Martin von Zweigbergk	b2993f2b23	rewrite: add `rebase()` method to `CommitRewriter` The new `rebase()` method is meant to be called after deciding on the new parents (typically by leaving them unchanged). It returns a `CommitBuilder` for setting any additional values. There will probably be a `reparent()` method in the future.	2024-04-18 08:08:51 -07:00
Martin von Zweigbergk	b13cb8db26	rewrite: make `EmptyBehavior` implement `Copy`	2024-04-18 08:08:51 -07:00
Martin von Zweigbergk	402d94dbd7	rewrite: add a method for simplifying ancestors to `CommitRewriter`	2024-04-18 08:08:51 -07:00
Martin von Zweigbergk	dc6c7a98d6	rewrite: create a helper type for rewriting commits This patch adds a struct that's meant to help when rewriting commits. It contains the old commits and the new parents. I hope to move most of the logic from `rebase_commit_with_options()` onto it in coming patches. Then this type can be passed in a callback to make it easier to do custom rewriting of commits that is currently hard to do because `rebase_descendants()` does not give the caller any control over the process. The helper is similar to `CommmitBuilder`, but it is a bit different by also embedding information about the source commit, so I don't think the API would be as convenient if we just used `CommitBuilder` directly.	2024-04-18 08:08:51 -07:00
Ilya Grigoriev	9fa01e0246	lib `git.rs`: minor simplification, fixup to `62b14e1f` As suggested by @yuja in https://github.com/martinvonz/jj/pull/3516#discussion_r1568466814	2024-04-17 19:51:57 -07:00
Yuya Nishihara	4474577ceb	fileset: parse cwd/root-glob patterns Mercurial appears to resolve cwd-relative path first, so "glob:.c" could be parsed as "/.c" if cwd was literally "*". It wouldn't practically matter, but isn't correct. Instead, jj's parser first splits glob into literal part and pattern. That's mainly because we want to parse the user input texts into type-safe objects, and (RepoPathBuf, glob::Pattern) pairs are the simplest ones. The current parser can't handle patterns like "foo//.." (= "foo" ?), and errors out. I believe this restriction is acceptable. Unlike literal paths, the 'glob:' pattern anchors to the whole file path. I don't think "prefix"-matching glob is useful, and making it the default would be rather confusing.	2024-04-18 11:09:54 +09:00
Yuya Nishihara	147668cdf2	matchers: add matcher for glob patterns Patterns are specified as (dir, pattern) pairs because we need to handle parse errors prior to constructing a matcher, and it's convenient to split literal directory paths there.	2024-04-18 11:09:54 +09:00
Ilya Grigoriev	62b14e1fa2	lib `git.rs`: remove workaround for a now-fixed libgit2 bug https://github.com/libgit2/libgit2/issues/3178 is now fixed.	2024-04-17 12:00:37 -07:00
Martin von Zweigbergk	93baff0b8a	rewrite: pass just IDs of new parents into `rewrite::rebase*()` It's cheap to look up commits again from the cache in `Store` but it can be expensive to look up commits we didn't end up needing. This will make it easier to refactor further and be able to cheaply set preliminary parents for a rewritten commits and then let the caller update them.	2024-04-17 06:13:54 -07:00
Martin von Zweigbergk	057b7c8d0b	rewrite: take commit and new parents by value in `rebase_commit()` I'm going to add a helper struct to help with rewriting commits. I want to make that struct own the old commit and the new parents to simplify lifetimes. This patch prepares for that by passing the commits by value to `rebase_commit()`.	2024-04-17 06:13:54 -07:00
Martin von Zweigbergk	dca9c6f884	repo: propagate errors from `find_descendants_to_rebase()`	2024-04-17 06:13:54 -07:00
Martin von Zweigbergk	8ce099470b	cargo: explicitly indicate paths to publish Running `cargo publish` from a non-colocated repo (such as my usual repo) is currently quite scary because it uploads all non-hidden files, even if they're ignored by `.gitignore` (https://github.com/rust-lang/cargo/issues/2063). I noticed this a while ago and have always run the command from a fresh clone since then. To avoid the need for that, let's use the workaround mentioned on the bug, which is to explicitly list patterns we want to publish.	2024-04-15 20:37:00 -07:00
Martin von Zweigbergk	955c9bf27b	jj-lib-proc-macros: add missing LICENSE file We publish this crate on crates.io, so it should have a LICENSE file.	2024-04-15 20:37:00 -07:00
Yuya Nishihara	7bed5dd222	matchers: turn RepoPathTree into generic map-like type This prepares for adding glob matcher, which will be backed by RepoPathTree<Vec<glob::Pattern>>. FilesNodeKind/PrefixNodeKind are basically boolean types, but implemented as enums for better code readability.	2024-04-16 10:10:09 +09:00
Yuya Nishihara	10f5540b3b	matchers: rewrite RepoPathTree::to_visit_sets() to not depend on is_dir flag The is_dir flag will be removed soon. Since FilesMatcher doesn't set is_dir flag explicitly, is_dir is equivalent to !entries.is_empty(). OTOH, PrefixMatcher always sets is_dir, so all tree nodes are directories.	2024-04-16 10:10:09 +09:00
Yuya Nishihara	e0d5217450	matchers: inline RepoPathTree::get_visit_sets()	2024-04-16 10:10:09 +09:00
Yuya Nishihara	8e196d0025	matchers: simply derive Default for RepoPathTree Perhaps, I didn't do that because it's important to initialize is_dir/file to false. Since I'm going to extract a generic map-like API, and is_dir/file will be an enum, this won't be a problem.	2024-04-16 10:10:09 +09:00
Yuya Nishihara	f92e5b911f	matchers: inline RepoPathTree::add_file()	2024-04-16 10:10:09 +09:00
Yuya Nishihara	0153cc1bc7	matchers: remove tests that directly modify RepoPathTree I'm going to extract generic map from RepoPathTree, and .get_visit_sets() will be inlined into FilesMatcher/PrefixMatcher. These removed tests should be covered by the corresponding matcher tests.	2024-04-16 10:10:09 +09:00
Yuya Nishihara	9a83338079	matchers: don't allow dead_code	2024-04-16 10:10:09 +09:00
Martin von Zweigbergk	0bbebaf4f9	rewrite: move calculation of set to rebase to `MutableRepo` This lets us make `parent_mapping` private again.	2024-04-15 07:09:12 -07:00
Martin von Zweigbergk	53a0e23759	rewrite: move functions for updating refs to `MutableRepo` The functions now depend only on `MutableRepo`, so I think they belong on that type. This gets us closer to being able to make `parent_mapping` private again.	2024-04-15 07:09:12 -07:00
Martin von Zweigbergk	f716116249	rewrite: remove unnecessary assertions I think the recent refactorings (especially `9c382fd8c6`) make it pretty clear that `DescendantRebaser` will not attempt to rebase the same commit twice, so I think we can remove the assertions. This removes some of the places where `DescendantRebaser` reaches into `MutableRepo`'s internals.	2024-04-15 07:09:12 -07:00
Martin von Zweigbergk	656250d6d0	rewrite: pass UserSettings into `update_all_references()` With this change, `update_all_references()` only uses `self` to get to `mut_repo`. I'll move the function onto `MutableRepo` next.	2024-04-15 07:09:12 -07:00
Martin von Zweigbergk	750002594e	rewrite: inline and rewrite `ref_target_update()` I rewrote `old_target` and `new_target` to more accurately represent the change; the old target should be a normal (singleton) ref.	2024-04-15 07:09:12 -07:00
Martin von Zweigbergk	f696f5b727	rewrite: leverage `root_id()` helper on commit object	2024-04-15 07:09:12 -07:00
Martin von Zweigbergk	0525dc9d86	politics: delete references to Pijul The Pijul maintainer has opinions that I don't understand about how we mention Pijul (they consider the current mentions offensive as "bashing Pijul"). Let's just remove the references so we don't have to deal with it. I think the references to Darcs we already had in most of these places are sufficient.	2024-04-14 13:16:08 -07:00
Yuya Nishihara	aaa2025dfc	git: on fetch, pin visible untracked remote refs This implements the other workaround described in `57167cefda` "git: on import_refs(), don't abandon ancestors of newly fetched refs": > I think there are two ways to fix the problem: > a. pin non-tracking remote branches just like local refs > b. pin newly fetched refs in addition to local refs > This patch implements (b) because it's simpler and more obvious that the > fetched commits would never be abandoned immediately. The idea of (a) is that untracked remote branches are independent read-only refs, and read-only branches shouldn't be rewritten implicitly. Once the branch gets rewritten or abandoned by user, these remote refs will be hidden, and won't be pinned anymore. Since (a) effectively supersedes (b), this patch also removes the original workaround. Fixes #3495	2024-04-14 11:38:21 +09:00
dploch	57a5d7dd64	cli_util: support multiple extensions consistently If we ever implement some sort of ABI for dynamic extension loading, we'll need these underlying APIs to support multiple extensions, so we might as well do that first.	2024-04-12 14:07:33 -04:00
Yuya Nishihara	30984dae4a	cli: if enabled, parse path arguments as fileset expressions If this doesn't work out, maybe we can try one of these: a. fall back to bare file name if expression doesn't contain any operator-like characters (e.g. "f(x" is an error, but "f x" can be parsed as bare string) b. introduce command-line flag to opt in (e.g. -e FILESET) c. introduce pattern prefix to opt in (e.g. set:FILESET) Closes #3239, #2915, #2286	2024-04-12 11:36:40 +09:00
Ilya Grigoriev	8fa256ebac	New `jj debug watchman status` command This command checks not only whether Watchman works, but also whether it's enabled in the config. Also, the output is easier to understand than that of the other `jj debug watchman` commands. It would be nice if `jj debug watchman` called `jj debug watchman status`, but it's not trivial in `clap` to have a default subcommand.	2024-04-11 10:55:59 -07:00
Yuya Nishihara	33beb8d456	fileset: add recursive iterator over explicit paths The primary use case is to warn unmatched paths. I originally thought paths in negated expressions shouldn't be checked, but doing that seems rather inconsistent than useful. For example, "~x" in "jj split '~x'" should match at least one file to split to non-empty revisions.	2024-04-11 00:51:19 +09:00
Yuya Nishihara	57b423e3d7	fileset: relax identifier rule to accept more path-like strings Since fileset is primarily used in CLI, it's better to avoid inner quoting if possible. For example, ".." would have to be quoted in the original grammar derived from the revset. This patch also adds a stricter version of an identifier rule. If we add a symbol alias, it will follow the "strict_identifier" rule.	2024-04-09 20:42:09 +09:00
Yuya Nishihara	653173abad	fileset: implement name resolution stage, add all()/none() functions #3239	2024-04-09 20:42:09 +09:00
Yuya Nishihara	9c28fe954c	fileset: add grammar and implement parser (without name resolution) The fileset grammar is basically a stripped-down version of the revset grammar, with a few adjustments: * extract function call to "function" rule (like templater) * inline "symbol" rule (because "identifier" and "string" should be treated differently at the early parsing stage.) The parser will have a separate name resolution stage. This will help to do alias substitution properly. I'll probably rewrite the revset parser in the same way. It will also help if we want to embed fileset expression in file() revset.	2024-04-09 20:42:09 +09:00
Yuya Nishihara	521bcd81ab	dsl_util: deduplicate collect_similar() from revset and templater For convenience, sort and dedup are done by collect_similar().	2024-04-09 20:42:09 +09:00
Evan Mesterhazy	379849b4b8	Fix documentation for `RevsetExpression::ancestors_at` and `descendants_at` The current documentation is wrong. This is a follow up for https://github.com/martinvonz/jj/pull/3461#discussion_r1555011289	2024-04-08 08:29:14 -04:00
Yuya Nishihara	73508730aa	revset: rewrite identifier rule in common infix-op rule pattern I don't remember why I made it defined recursively, but it's basically the same as "primary ~ (infix_op ~ primary)*" rule.	2024-04-08 00:37:25 +09:00
Yuya Nishihara	7f1f73b0fa	revset: move whitespace rule to top The whitespace rule is a bit special, and it seemed weird that the rule is defined between literals and operator tokens.	2024-04-08 00:37:25 +09:00
Yuya Nishihara	c8f93c50fc	revset: remove redundant Result<..> from parse_symbol_rule_as_literal()	2024-04-08 00:37:25 +09:00
Yuya Nishihara	d442cd872f	revset: backport \-escapes parsing from templater	2024-04-08 00:37:25 +09:00
Yuya Nishihara	d1ae2d72c8	revset: rename Rule::literal_string to string_literal	2024-04-08 00:37:25 +09:00
Yuya Nishihara	274183fa66	dsl_util: extract helper that parses string literal with \-escapes The top-level assertion is removed since it's now obvious that the pair represents a Rule::string_literal.	2024-04-08 00:37:25 +09:00
Yuya Nishihara	8b32a8a916	revset: add support for file(kind:pattern) syntax There are no more callers of parse_function_argument_to_string(), so it's removed. This function was a thin wrapper of literal parser, and can be easily reintroduced if needed.	2024-04-07 19:43:29 +09:00
Yuya Nishihara	850887cf09	fileset: add basic pattern parsing functions Naming convention is described in FilePattern::from_str_kind(). It's based on Mercurial's pattern prefixes, but hopefully fixes some inconsistencies. https://github.com/martinvonz/jj/issues/2915#issuecomment-1956401114 #3239	2024-04-07 19:43:29 +09:00
Yuya Nishihara	3c1d485452	revset: extract function that handles kind:"value" pattern syntax I also removed comment about the error span. It's unclear whether the kind was invalid or the value had syntax error.	2024-04-07 19:43:29 +09:00
Yuya Nishihara	47150d2bb4	revset: migrate file() predicate to be based on FilesetExpression	2024-04-06 23:59:54 +09:00
Yuya Nishihara	3e029537c6	fileset: add basic AST-level object and matcher builder FilesetExpression is similar to RevsetExpression, but there are two major differences: - Union is represented as N-ary operator, - Expression node isn't Rc-ed. The former is because of the nature of the runtime Matcher objects. It's easier to construct a Matcher from flattened union expressions than from a binary tree. The latter choice comes from UnionAll(Vec<FilesetExpression>), which doesn't have to be Vec<Rc<FilesetExpression>>, and Rc<[FilesetExpression]> can't be constructed from [Rc<_>, ..]. Anyway, the internal representation may change as needed. Another design decision I made is Vec<Pattern(RepoPathBuf)> vs Pattern(Vec<RepoPathBuf>). I chose the former because it will be more closer to the parsed tree of the fileset language.	2024-04-06 23:59:54 +09:00
Yuya Nishihara	7acfab695a	matchers: impl custom Debug for RepoPathTree to get stable and concise output The default Debug output entries aren't sorted by name, which was inconvenient while writing snapshot tests.	2024-04-06 23:59:54 +09:00
Yuya Nishihara	c9b21a16be	matchers: require Matcher to be Debug This helps to write snapshot tests.	2024-04-06 23:59:54 +09:00
Yuya Nishihara	f3485c9efb	repo_path: make Debug formatting of RepoPathComponent less verbose Since RepoPath is formatted as a string, it should be okay for RepoPathComponent to do the same.	2024-04-06 23:59:54 +09:00
Yuya Nishihara	1134dc159e	repo_path: use write!() macro to implement Debug	2024-04-06 23:59:54 +09:00
Yuya Nishihara	0b833ea9c0	repo_path: qualify fmt::Error, use fmt::Result for short "Error" is super common type name, so I think better to not pollute the namespace with a very specific Error type.	2024-04-06 23:59:54 +09:00
Ilya Grigoriev	93cebcd0c0	protos: `cargo update prost prost-builder` and regenerate protobufs	2024-04-05 16:56:20 -07:00
Austin Seipp	4b45dde8c6	clippy: disable bogus lints for nightly clippy The nightly compiler has several clippy fix-its that, if applied, break the build. There are various bugs about this, but there isn't enough space in the margins to detail it all. Just ignore these on a per-function basis; about 70% of them are just multiple instances happening inside a single function. This makes `cargo clippy --workspace --all-targets` run clean, even with the nightly compiler. Signed-off-by: Austin Seipp <aseipp@pobox.com> Change-Id: Ic26a025d3c62b12fbf096171308b56e38f7d1bb9	2024-04-05 11:39:29 -05:00
Yuya Nishihara	a364310b56	matchers: add binary UnionMatcher This will be needed to concatenate patterns of different types (such as "prefix/dir" exact:"file/path".) The implementation is basically a copy of IntersectionMatcher, with some logical adjustments. In Mercurial, unionmatcher supports list of matchers as input, but I think binary version is good enough.	2024-04-05 10:26:01 +09:00
Yuya Nishihara	c4d7425de5	matchers: abstract matcher combinators over Matcher trait In order to implement a fileset, we'll need owned variants of these matchers. We can of course let callers move Box<dyn Matcher> into these adapters, but we might need to somehow clone Box<dyn Matcher>. So, I simply made adapters generic.	2024-04-05 10:26:01 +09:00
Yuya Nishihara	a7d5a9c99a	commit: actually remove boxing from CommitIteratorExt::ids() Also simplified lifetime bound a bit.	2024-04-05 00:16:42 +09:00
Evan Mesterhazy	d4a04779c0	Make check_rewritable take an iterator of &CommitId instead of &Commit This function doesn't actually need commits, it only needs their IDs. In some contexts we may only have commit IDs, so there's no need to require an iterator of Commits. This commit also adds a `CommitIteratorExt` that makes it easy to convert an iterator of `&Commit` to an iterator of `&CommitId`.	2024-04-04 09:31:17 -04:00
Yuya Nishihara	bb87fac1a4	revset: parse "all:" prefix rule by pest I had to use negative lookahead !":" because we still support a dummy ":" operator to provide a suggestion.	2024-04-03 08:59:42 +09:00
Yuya Nishihara	13dadadcdc	revset: add ParseState constructor	2024-04-03 08:59:42 +09:00
Christoph Koehler	7bde6ddc29	revset: add working_copies() function It includes the working copy commit of every workspace of the repo. Implements #3384	2024-04-01 19:36:53 -06:00
Martin von Zweigbergk	bbe906b426	repo: merge rewrite state into single `parent_mapping` with enum This simplifies the code and reduces the risk of inconsistencies in the data. Thanks to Yuya for the suggestion.	2024-03-30 09:35:45 -07:00
Yuya Nishihara	a6615bf36d	cli: render string pattern suggestion as a hint Templater doesn't have the one yet, but I think it belongs to the same category. For clap::Error, we could use clap's own mechanism to render suggestions as "tip: ...", but I feel "Hint: ..." looks better because our error/hint message is capitalized.	2024-03-30 23:53:17 +09:00
Yuya Nishihara	d759ba11f1	revset: don't stringify StringPatternParseError This helps to add hint at the CLI layer.	2024-03-30 23:53:17 +09:00
Yuya Nishihara	c4d48c5139	revset: add constructor for InvalidFunctionArguments error Inlined some of the make_error() closures instead. I'll make string pattern handler preserve the source error object.	2024-03-30 23:53:17 +09:00
Yuya Nishihara	b09732f4f8	revset, templater: split parse error constructor that sets source error object I'm going to add RevsetParseError constructor for InvalidFunctionArguments, with/without a source error, and I don't want to duplicate code for all combinations. The templater change is just for consistency. I couldn't find a good naming convention for the builder-like API, so it's called .with_source(mut self, _). Another option was .source_set(source). Apparently, it's not uncommon to name consuming constructor as with_<something>().	2024-03-30 23:53:17 +09:00
Yuya Nishihara	73b60903ce	tree: flatten TreeMergeError into BackendError	2024-03-30 22:40:05 +09:00
Yuya Nishihara	916014dc1e	tree: consolidate read error variants There isn't much difference between BackendError::ReadObject of file type and TreeMergeError::ReadError. They are both caused by the backend.	2024-03-30 22:40:05 +09:00
Martin von Zweigbergk	bfa43d16f9	rewrite: don't collect set of heads to add unnecessarily	2024-03-30 05:21:48 -07:00
Martin von Zweigbergk	c40949208b	rewrite: all rewritten commits are no longer heads Now that we no longer bother to keep the set of heads to add and remove updated while we rewrite descendants, we can simplify how we find the set of heads to remove - it's simply all commits that have been marked rewritten, divergent, or abandoned, i.e. the keys in `parent_mapping`.	2024-03-30 05:21:48 -07:00
Martin von Zweigbergk	bb1fef3258	rewrite: drop redundant unioning of old commits with abandoned commits We always add abandoned commits as key in `parent_mapping`.	2024-03-30 05:21:48 -07:00
Martin von Zweigbergk	db4b905bc9	repo: when setting rewritten or divergent, remove from abandoned I don't think we have any transactions that mark commit as abandoned and then later mark it as rewritten or divergent. But if we ever do, I think it should be considered just rewritten/divergent. So let's enforce that invariant by removing the old value from the set of abandoned commits.	2024-03-30 05:21:48 -07:00
Yuya Nishihara	f20004fffe	git_backend: classify "merge with root" as user error Perhaps, there will be more error types that hold BackendError internally, but this change is good enough to handle a merge error.	2024-03-30 11:14:25 +09:00
Yuya Nishihara	1e83faf4f8	tree: remove useless "Backend error" message from TreeMergeError I don't think it adds any contextual information. TreeMergeError is somewhat similar to BackendError.	2024-03-30 11:14:25 +09:00
Evan Mesterhazy	dd1def02e4	Move parse_string_pattern in cli to StringPattern::parse in lib This commit moves the parse_string_pattern helper function into the str_util module in jj lib and adds tests for it. I'd like to reuse this code in a function defined by `UserSettings`, which is part of the jj lib crate and cannot use functions from the cli crate.	2024-03-29 08:48:09 -04:00
Yuya Nishihara	916dc30828	revset: use common argument error instead of FsPathParseError It's not special compared to the other argument errors, and we can now track the error source separately.	2024-03-28 10:53:06 +09:00
Yuya Nishihara	074e6e12bc	revset, templater: include short parse error description in summary line This makes the summary line more informative. Even though it just duplicates the message printed later, I think it's easier to follow. This patch also adjusts some RevsetParseError messages because it seemed redundant to repeat "revset function", "argument", etc.	2024-03-28 10:53:06 +09:00
Yuya Nishihara	d17166628f	revset, templater: simplify parse error impls by using thiserror This patch moves all "source" errors to the source field to conform to thiserror API. It will probably help to keep ErrorKind enums comparable.	2024-03-28 10:53:06 +09:00
Yuya Nishihara	2cd70bdf14	revset, templater: render parse error as usual error chain Because the CLI error handler now prints error sources in multi-line format, it doesn't make much sense to render Revset/TemplateParseError differently. This patch also fixes the source() of the SyntaxError kind. It should be self.pest_error.source() (= None), not self.pest_error.	2024-03-28 10:53:06 +09:00
Yuya Nishihara	844d3d0ff0	revset, templater: allow any kind of error as parse error source I'm going to make TemplateParseError hold RevsetParseError as Box<dyn _>, but Box<dyn std::error::Error ..> doesn't implement Eq. I could remove Eq from ErrorKind enums, but it's handly if these enums remain as value types. This change will also simplify fmt::Display and error::Error impls.	2024-03-28 10:53:06 +09:00
Yuya Nishihara	32efb4034d	revset: make span of parse error mandatory, remove Option<_> Since all callers of RevsetParseError have some reasonable span, we don't need a special case for WorkingCopyWithoutWorkspace error.	2024-03-28 10:53:06 +09:00
Yuya Nishihara	8ad0a703d4	repo_path: accept from_relative_path("."), make "".to_fs_path("") return "." It's common to normalize an empty directory path as ".". This change unblocks the use of from_relative_path() in edit_sparse(). There are a couple of callers who do to_fs_path(Path::new("")), but they all translate non-directory paths, which should never be empty.	2024-03-28 10:52:51 +09:00
Martin von Zweigbergk	9c382fd8c6	rewrite: exclude already rewritten commits from set to rebase We currently include the commits in `parent_mapping` and `abandoned` in the set of commits to visit when rebasing descendants. The reason was that we used to update branches and working copies when we visited these commits. Since we started updating refs after rebasing all commits, there's no need to even visit these commits.	2024-03-26 09:50:50 -07:00
Martin von Zweigbergk	49ff818e97	rewrite: calculate `branches` later, remove it from state	2024-03-26 09:50:50 -07:00
Martin von Zweigbergk	718e54b01a	rewrite: calculate `heads_to_add` later, remove it from state Similar to the previous two commits.	2024-03-26 09:50:50 -07:00
Martin von Zweigbergk	2ee1147145	rewrite: calculate `heads_to_remove` later, remove it from state Similar to the previous commit.	2024-03-26 09:50:50 -07:00
Martin von Zweigbergk	b3dd038907	rewrite: calculate `new_commits` later, remove it from state We only use `new_commits` in `update_heads()`, so let's calculate it there. It should also be more correct in case other commits were created after we initialized `DescendantRebaser`.	2024-03-26 09:50:50 -07:00
Martin von Zweigbergk	5e7a4a2028	rewrite: update heads outside `update_references()` Now that we only call `update_references()` in one place, there's no reason to have it also update `heads_to_add` and `heads_to_remove`. By moving it out of the function, we can consolidate the logic in one place.	2024-03-26 09:50:50 -07:00
Martin von Zweigbergk	9511de486e	rewrite: extract a function for updating heads	2024-03-26 09:50:50 -07:00
Martin von Zweigbergk	0f7a86d725	rewrite: move `new_parents()` to `MutableRepo` The function only uses state from `MutableRepo`, so it should be implemented on that type.	2024-03-26 09:50:50 -07:00
Martin von Zweigbergk	cfdb341c6b	rewrite: make rebase_commit_with_options() mark abandoned commit When `rebase_commit_with_options()` decides to abandons a commit, it records the new parents in the `MutableRepo`, but it's currently the caller's responsibility to remember to mark it as abandoned. Let's move that logic into the function to reduce the risk of future bugs.	2024-03-26 09:50:50 -07:00
Martin von Zweigbergk	3ddf9f4329	repo: add parents of abandoned commit to parent_mapping By adding the abandoned commit's parents to `parent_mapping`, we can remove a bit more of the special handling of abandoned commitsin `DescendantRebaser`.	2024-03-26 09:50:50 -07:00
Martin von Zweigbergk	0481e67dfd	rewrite: drop now-unnecessary updating of `branches` map Since we update all branches at the end now, we never update them in several steps, so there are no intermediate locations we need to remember.	2024-03-25 23:00:44 -07:00
Martin von Zweigbergk	5e8d7f8c6f	rewrite: update references after rewriting all commits	2024-03-25 23:00:44 -07:00
Martin von Zweigbergk	e55ebd4fe6	rewrite: drop redundant update of parent_mapping after rebasing commit In the normal case when we don't abandon a commit because it became empty, then `CommitBuilder::write()` will have recorded the new commit as a rewrite of the old commit. We don't need to do that again in `rebase_one()`.	2024-03-25 23:00:44 -07:00
Martin von Zweigbergk	4406005dce	rewrite: make `DescendantRebaser` use state stored in `MutableRepo` A subset of the state in `DescendantRebaser` now matches exactly what `MutableRepo` already stores, so we can avoid copying that state and have `DescendantRebaser` use it directly instead. Having a single source of truth for the state will enable further simplifications and improvements.	2024-03-25 23:00:44 -07:00
Martin von Zweigbergk	ad16bec3a6	rewrite: move an assertion a little earlier I'm going to make `DescendantRebaser` share the state about rewritten commits with `MutableRepo` next. That means that the call to `rebase_commit_with_options()` will update that state, which would make this assertion fail. So let's move it a little earlier to avoid that.	2024-03-25 23:00:44 -07:00
Martin von Zweigbergk	a6857a7a8f	repo: rename `abandoned_commits` to `abandoned` This is just to match `DescendantRebaser`, to make the next commit a bit simpler. I think `MutableRepo` still has few enough fields that just `abandoned` is clear enough. Maybe we'll move the three rewrite-related fields into a new struct at some point.	2024-03-25 23:00:44 -07:00
Martin von Zweigbergk	6e3ceb4d1c	repo: store separate `divergent` field, pass into `DescendantRebaser` With this patch, `MutableRepo` has the same tracking of rewritten commits as `DescendantRebaser`, so we can simply pass that state into `DescendantRebaser` when we create it. The next step is to remove the state from `DescendantRebaser`.	2024-03-25 23:00:44 -07:00
Ilya Grigoriev	de0de4013d	hex_utils: fix typo found by clippy	2024-03-25 21:23:09 -07:00
Martin von Zweigbergk	890a8e282f	repo: update working copy to first divergent commit	2024-03-25 06:53:14 -07:00
Martin von Zweigbergk	d2043f069e	repo: delete `record_rewritten_commit()` I don't think we have any callers left that call `record_rewritten_commit()` multiple times within a transaction and expect it to result in divergence. I think we should consider it a bug to do that.	2024-03-25 06:53:14 -07:00
Martin von Zweigbergk	e55168fa3e	repo: make `record_rewritten_commit()` accept only one replacement id All callers now pass a single new commit and I would like to keep it that way.	2024-03-25 06:53:14 -07:00
Martin von Zweigbergk	af7ef4d04e	repo: add a method for explicitly recording divergent rewrite I plan to remove `record_rewritten_commit()` and instead make repeated rewrites replace the rewrite state.	2024-03-25 06:53:14 -07:00
Martin von Zweigbergk	b54ace4954	rewrite: mark divergent commits in `parent_mapping` too When rebasing descendants, we generally move branches, child commits, the working copy to the rewritten commit(s). However, we don't move the working copy to the new rewritten commit (s) if the old commit had been abandoned, and we don't move child commits if the rewriten was divergent. This patch aims to make it clearer that there's only one mapping from old to new parents, and that is in `parent_mapping`. It does so by merging the current `divergent` map into it, and makes the `divergent` just a set instead. When finding the new parents for a child, we leave the existing parent if it's in the set. My longer-term goal is to move `parent_mapping`, `abandoned`, and `divergent` into `MutableRepo` (maybe in a nested struct), so we can do some transformations on descendants as we rebase them. By having the state in a single place (not moving it from `MutableRepo` to `DescendantRebaser` as we currently do), I hope it will be easier to write a `MutableRepo::transform_descendants(callback)`, where the callback gets a `CommitBuilder` and can change parents of the commit, for example.	2024-03-25 06:53:14 -07:00
Martin von Zweigbergk	ba244423e8	rewrite: avoid an unnecessary clone	2024-03-25 06:53:14 -07:00
Yuya Nishihara	c311131ee2	log: encode elided node as None Since elided graph entry has no associated commits, it makes some sense to represent as None?	2024-03-24 10:32:15 +09:00
Benjamin Tan	3034dbba3f	git-push: Display messages from remote The implementation of sideband progress message printing is aligned with Git's implementation. See `43072b4ca1/sideband.c (L178)`. Closes #3236.	2024-03-23 20:17:04 +08:00
Ilya Grigoriev	02a04d0d37	test_conflicts and test_resolve_command: use `indoc!` to indent conflict markers in tests Apart from (IMO) looking nicer, this will also sidestep the potential problem that if the file contains actual jj conflict markers (`>>>>>>>` in the beginning of a line, for example), jj would currently have trouble materializing and subsequently parsing conflicts in the file if it actually became conflicted. I'll demo this bug in either this or a subsequent PR. It's the kind of bug that sounds serious in theory but might never cause a problem in practice. After this PR, only `docs/tutorial.md` has a conflict marker that's not indented. There's only one there, so hopefully it won't be too much of a pain to deal with. I also indented other strings in `test_conflicts.rs`. IMO, this looks nice and more consistent with the `insta::assert_snapshot` output. I didn't spend the time to do the same for `test_resolve_command`.	2024-03-22 23:27:25 -07:00
Anton Älgmyr	e2eb5bddf9	Make node symbols templatable in the graphs. Adds config options * templates.log_graph_node * templates.log_graph_node_elided * templates.op_log_graph_node	2024-03-21 17:41:31 +01:00
dploch	9380f9d529	rewrite: move handling of simplified ancestry into rebase_commit_with_options It seems incorrect that `simplify_ancestor_merge` is ignored when it's part of the helper's input.	2024-03-20 11:57:54 -04:00
Ilya Grigoriev	4fbe6aecc9	clippy: remove some unused code beta clippy/rustc compain about There are still some warnings from (seemingly) clippy bugs. Quoting myself from Discord: > PSA: the latest beta cargo clippy (from Rust 1.78) has some problems > that affect jj: https://github.com/rust-lang/rust-clippy/issues/12467 > and https://github.com/rust-lang/rust-clippy/issues/12377. You could > disable clippy::assigning_clones and clippy::empty_docs as a workaround. > VS Code can disable them in rust-analyzer, you can also use > https://github.com/ericseppanen/cargo-cranky (you can put Cranky.toml in > the per-user gitignore).	2024-03-19 18:33:29 -07:00
Martin von Zweigbergk	f865c1bc5d	index: print a milder "Reindexing..." message on version mismatch Closes #3323.	2024-03-18 13:50:14 -07:00
Yuya Nishihara	50363419fb	revset: substitute '~(::x)' to 'x..' Suppose we have an alias 'immutable()' = '::immutable_heads()', user can express (visible) mutable set as '~immutable()'. 'immutable_heads()..' can terminate early, but a generic difference 'all() & ~immutable()' can't.	2024-03-17 14:50:48 +09:00
Yuya Nishihara	9207314173	revset: add substitution rule for "::x & ~(::y-)" Suppose the generation value is usually small, it should be faster to do bounded range look up first 'y-', then walk ancestors with the unwanted set 'y-..x'.	2024-03-17 14:50:48 +09:00
Yuya Nishihara	39a460a077	revset: extract helper function that substitutes "::x & ~(::y)" I'm going to add a similar substitution rule for "~(::y)".	2024-03-17 14:50:48 +09:00
Yuya Nishihara	3f9ac78215	revset: update legacy range syntax in comment	2024-03-17 14:50:48 +09:00
Yuya Nishihara	a777cfe98e	index: remove topo_order() which is no longer used The same thing can be achieved by evaluating the input as a revset.	2024-03-17 11:44:41 +09:00
Martin von Zweigbergk	c55e08023e	workspace: don't lose sparsed-away paths when recovering workspace When an operation is missing and we recover the workspace, we create a new working-copy commit on top of the desired working-copy commit (per the available head operation). We then reset the working copy to an empty tree because it shouldn't really matter much which commit we reset to. However, when the workspace is sparse, it does matter, as the test case from the previous patch shows. This patch fixes it by replacing the `reset_to_empty()` method by a new `recover(&Commit)`, which effectively resets to the empty tree and then resets to the commit. That way, any subsequent snapshotting will result keep the paths from that tree for paths outside the sparse patterns.	2024-03-16 07:30:36 -07:00
Alexis (Poliorcetics) Bourget	93c707a469	lib: improve error message for invalid string pattern, suggesting to use one of the known one	2024-03-16 14:22:16 +01:00
Evan Mesterhazy	f30857190e	Add more test cases for Index::common_ancestors	2024-03-14 12:54:13 -04:00
Evan Mesterhazy	adaedd5556	Add documentation to lib/src/index.rs and lib/src/default_index/	2024-03-14 12:54:13 -04:00
Yuya Nishihara	5806dbfd32	revset_graph: detach CompositeIndex, reimplement as RevWalk For API consistency. It wouldn't practically matter unless we want to reuse .iter_graph() in lazy event-driven GUI context. I don't see significant performance difference: - jj-0: original impl with look-ahead IndexEntry<'_> buffer - jj-1: this patch With dense graph ``` % hyperfine --sort command --warmup 3 --runs 10 -L bin jj-0,jj-1 \ "target/release-with-debug/{bin} -R ~/mirrors/git --ignore-working-copy log -r.. -T ''" Benchmark 1: target/release-with-debug/jj-0 -R ~/mirrors/git --ignore-working-copy log -r.. -T '' Time (mean ± σ): 1.367 s ± 0.008 s [User: 1.261 s, System: 0.105 s] Range (min … max): 1.357 s … 1.380 s 10 runs Benchmark 2: target/release-with-debug/jj-1 -R ~/mirrors/git --ignore-working-copy log -r.. -T '' Time (mean ± σ): 1.344 s ± 0.017 s [User: 1.245 s, System: 0.099 s] Range (min … max): 1.313 s … 1.369 s 10 runs Relative speed comparison 1.02 ± 0.01 target/release-with-debug/jj-0 -R ~/mirrors/git --ignore-working-copy log -r.. -T '' 1.00 target/release-with-debug/jj-1 -R ~/mirrors/git --ignore-working-copy log -r.. -T '' ``` With sparse graph ``` % hyperfine --sort command --warmup 3 --runs 10 -L bin jj-0,jj-1 \ "target/release-with-debug/{bin} -R ~/mirrors/git --ignore-working-copy log -r'tags()' -T ''" Benchmark 1: target/release-with-debug/jj-0 -R ~/mirrors/git --ignore-working-copy log -r'tags()' -T '' Time (mean ± σ): 1.347 s ± 0.017 s [User: 1.216 s, System: 0.130 s] Range (min … max): 1.321 s … 1.379 s 10 runs Benchmark 2: target/release-with-debug/jj-1 -R ~/mirrors/git --ignore-working-copy log -r'tags()' -T '' Time (mean ± σ): 1.379 s ± 0.023 s [User: 1.238 s, System: 0.140 s] Range (min … max): 1.328 s … 1.403 s 10 runs Relative speed comparison 1.00 target/release-with-debug/jj-0 -R ~/mirrors/git --ignore-working-copy log -r'tags()' -T '' 1.02 ± 0.02 target/release-with-debug/jj-1 -R ~/mirrors/git --ignore-working-copy log -r'tags()' -T '' ```	2024-03-14 10:07:19 +09:00
Yuya Nishihara	3c8f22456b	revset_graph: remove lifetimed IndexEntry<'_> from look_ahead buffer Prepares for removing &CompositeIndex from the RevsetGraphIterator struct. The input iterator will also be changed to position-based. I've turned self.look_ahead.get().unwrap() into assertion, but it's not super important here. It's just for sanity that we've mapped missing edges properly. FWIW, we could say RevsetGraphIterator is an example of iterating and testing membership of the input revset (though the yielded entries are discarded.)	2024-03-14 10:07:19 +09:00
Yuya Nishihara	699707905c	index: reorganize revset_graph_iterator as private module of default_index The RevsetGraphIterator type is hidden so that the Iterator trait can be implemented differently.	2024-03-14 10:07:19 +09:00
Yuya Nishihara	17e46e0932	revset: extend lifetime of CommitId/ChangeId iterators For the same reason as the previous commit. Since self.inner.positions() basically clones the underlying evaluation tree, there is no reason to stick to &self lifetime. Perhaps, some of the CLI utility can be changed to not collect() the iterator. Migrating iter_graph() requires non-trivial changes, so it will be done separately.	2024-03-13 10:47:58 +09:00
Yuya Nishihara	3bf41d0c52	revset: extend lifetime of containing_fn() This allows callers to cache the returned function at 'index lifetime. It's important in templater. It also means the returned function could be 'static if the index were Arc<_> and we had a trait interface to achieve that. Option<Box<dyn ..>> is removed since RevWalk is fused.	2024-03-13 10:47:58 +09:00
Yuya Nishihara	027bd8f03a	revset: extend lifetime of internal evaluation nodes This makes the whole evaluation tree 'static, and we can freely move it without keeping the root RevsetImpl object alive. Perhaps, "Self: 'a" can be replaced with 'static, but let's leave it for now. It's not technically wrong to store lifetimed object in InternalRevset.	2024-03-13 10:47:58 +09:00
Yuya Nishihara	bc49b6b190	revset: make PurePredicateFn clonable Prepares for dropping &self lifetime from to_predicate_fn(). All predicate functions could be wrapped as Box::new(PurePredicateFn(Rc::new(f))) instead, but I don't think the .clone() cost matters.	2024-03-13 10:47:58 +09:00
dploch	6e8f1fb390	extensions_map: create a type-safe container for arbitrary objects	2024-03-12 16:52:49 -04:00
Yuya Nishihara	283907418a	revset: detach index from InternalRevset::positions() Perhaps, union/intersection/difference combinators can be moved to the rev_walk module, but let's think about that later.	2024-03-12 20:59:38 +09:00
Yuya Nishihara	78dbaba4dc	revset: remove entry-based API from InternalRevset Now all source/sink nodes produce/consume IndexPosition, so it doesn't make sense to keep InternalRevset::entries().	2024-03-12 20:59:38 +09:00
Yuya Nishihara	a733b0b052	revset: detach index from predicate fn, turn it into position-based This is the step towards removing &CompositeIndex references from the revset evaluation tree. The filter input is changed from &IndexEntry to IndexPosition to simplify the lifetime thingy. We might want to pass around CommitId or Commit object once it's loaded, but that can be implemented later. I don't see significant performance difference in revset benches.	2024-03-12 20:59:38 +09:00
Yuya Nishihara	97e69d1dcc	index: add filter RevWalk adapter FilterRevset will be built on top.	2024-03-12 20:59:38 +09:00
Yuya Nishihara	cfa067a0a9	index: add peekable RevWalk adapter This helps to migrate union/intersection/difference iterators to RevWalk.	2024-03-12 20:59:38 +09:00
Yuya Nishihara	8f0b9a0e4a	index: add RevWalk wrapper for eagerly evaluated set This serves the same role as templater::Literal. I'm going to add basic RevWalk adapters so that the revset evaluation tree can be constructed without capturing the index. EagerRevWalk will help to write tests for these adapters.	2024-03-12 20:59:38 +09:00
Yuya Nishihara	7d43a5c2c0	tests: alias index.as_composite() in revset combinator/accumulator tests	2024-03-12 20:59:38 +09:00
Aleksey Kuznetsov	6fd15dc7e5	graphlog: refactor out node symbols from GraphLog Now as default and elided node symbols come from the config, the next logical step is to use them directly bypassing GraphLog. Note that commands like `jj op log` and `jj obslog` do not use the elided node symbol at all.	2024-03-12 08:25:58 +05:00
Yuya Nishihara	9c1d5d155e	index: remove HRTB stuff by implementing RevWalkIndex for CompositeIndex	2024-03-11 17:24:10 +09:00
Yuya Nishihara	3d0952b316	index: implement AsCompositeIndex for CompositeIndex, not for &CompositeIndex Just a minor code cleanup. We still need Index for &CompositeIndex because the type is unsized, and unsized type cannot be converted to another dyn reference.	2024-03-11 17:24:10 +09:00
Yuya Nishihara	243675b793	index: turn CompositeIndex into transparent reference type This helps to eliminate higher-ranked trait bounds from RevWalkRevset and RevWalk combinators to be added. Since &CompositeIndex is now a real reference, it can be passed to functions as index: &T.	2024-03-11 17:24:10 +09:00
Yuya Nishihara	c8be8c3edd	index: add type alias for "dyn IndexSegment" to clarify it's 'static This helps to migrate CompositeIndex<'_> wrapper to &CompositeIndex. If the wrapped reference had a lifetimed field, it couldn't be represented as a trivial reference type.	2024-03-11 17:24:10 +09:00
Yuya Nishihara	64e0be2477	revset: consolidate early-return condition of PositionsAccumulator Since consume_to() checks the bottom position yielded from the source iterator, it makes sense to add the same check for the cached positions.	2024-03-11 17:24:01 +09:00
Martin von Zweigbergk	4d42604913	git_backend: write trees involved in conflict in git commit header We haven't used custom Git commit headers for two main reasons: 1. I don't want commits created by jj to be different from any other commits. I don't want Git projects to get annoyed by such commit and reject them. 2. I've been concerned that tools don't know how to handle such headers, perhaps even resulting in crashes. The first argument doesn't apply to commits with conflicts because such commits would never be accepted by a project whether or not they use custom commit headers. The second argument is less relevant for conflicted commits because most tools will be confused by such commits anyway. Storing conflict information in commit headers means that we can transfer them via the regular Git wire protocol. We already include the tree objects nested inside the root-level tree, so they will also be transferred. So, let's start by writing the information redundantly to the commit header and to the existing storage. That way we can roll it back if we realize there's a problem with using commit headers.	2024-03-10 20:51:05 -07:00
Aleksey Kuznetsov	cd3d75ebf6	revset: introduce more performant way to check if a commit is in a revset Initially we were thinking to have `Revset` return something like `CachedRevset`: ``` pub trait CachedRevset { fn iter(&self) -> Box<dyn Iterator<Item = Commit>>; fn contains(&self, &CommitId) -> bool; } ``` But we weren't sure what use case for `iter` would be, so we dropped the `iter` method. `CachedRevset` with single `contains` method needed a better name. We weren't able to come up with one, so we decided instead to have a method on `Revset` that returns a closure to check if a commit is in a revset.	2024-03-11 08:27:35 +05:00
Yuya Nishihara	8a406358af	index: migrate RevWalkRevset to be based off new RevWalk trait "for<'index> RevWalk<CompositeIndex<'index>, .." works as of now, but it won't be composed well. So I'll turn CompositeIndex<'_> into &CompositeIndex in the next batch, and remove "for<'index>".	2024-03-11 11:25:54 +09:00
Yuya Nishihara	4107cad80e	index: migrate RevWalkDescendants to new RevWalk trait Just for consistency. Descendants are always evaluated eagerly, so this change isn't strictly needed.	2024-03-11 11:25:54 +09:00
Yuya Nishihara	b6cbd8b90b	index: add trait and adaptor types to detach index from RevWalk* This eliminates lifetimed fields from RevWalk objects, and the RevWalk object will be embedded directly in RevWalkRevset. This patch adds two separate iterator adapters. They are identical at this point, but I'm going to add detach/reattach methods only to the borrowed version. I'm also planning to change CompositeIndex<'_> to &CompositeIndex to get around higher-ranked trait bound restrictions.	2024-03-11 11:25:54 +09:00
Yuya Nishihara	d780910bec	index: make RevWalk yield IndexPosition instead of IndexEntry This simplifies the RevWalkIndex API. It would probably add fractional msecs of overhead per next() call, but I don't see significant difference in revset benches.	2024-03-11 11:25:54 +09:00
Anton Älgmyr	099f06bf71	Add configuration options for node symbols in the graphs.	2024-03-09 21:16:58 +01:00
Yuya Nishihara	f51c5d7e57	index: consistently use IntoIterator in RevWalk builder API Since the return type is no longer "impl Iterator<..>", there isn't lifetime issue anymore.	2024-03-10 01:45:30 +09:00
Yuya Nishihara	2615fed5be	index: handle cut-off position of RevWalk by queue I'm going to make CompositeIndex<'_> detachable from the RevWalk, and "F: Fn(CompositeIndex) -> Box<dyn Iterator<..>>" of RevWalkRevset<F> will be replaced with "W: RevWalk<CompositeIndex>". This will simplify the code structure, but also means that we can no longer apply .take_while() here and convert it back to RevWalk. Fortunately, ancestors_until_roots() is the only function I need to reimplement.	2024-03-10 01:45:30 +09:00
Yuya Nishihara	34fbaaaad6	index: construct RevWalk queue after item type is settled It doesn't make sense to build BinaryHeap with intermediate type, and I'm going to reimplement take_until_roots() in a way that the queue drops uninteresting items.	2024-03-10 01:45:30 +09:00
Yuya Nishihara	8480ee9e05	index: migrate RevWalk constructors to builder API The current RevWalk constructors insert intermediate items to BinaryHeap and convert them as needed. This is redundant, and I'm going to add another parameter that should be applied to the queue first. That's why I decided to factor out a builder type. I considered adding a few set of factory functions that receive all parameters, but they looked messy because most of the parameters are of [IndexPosition] type. This patch also adds must_use to the builder and its return types, which are all iterator-like.	2024-03-10 01:45:30 +09:00
Yuya Nishihara	008adecf23	index: rename ancestors iterators from RevWalk* to RevWalkAncestors* I'm planning to add RevWalk trait, and this patch frees up the name. It seems also good for consistency as we have RevWalkDescendants*.	2024-03-10 01:45:30 +09:00
Yuya Nishihara	fa60026f25	repo_path: don't panic on invalid UTF-8 path component Although watchman client appears to fail at decoding non-UTF-8 path (somewhere in serde), jj shouldn't panic if watchman could deal with that. The outer error message "path not in the repo" would sounds odd, but I think that's okay because 1. it's unlikely that a user input is not UTF-8, and 2. it's technically correct that a non-UTF-8 path is not contained in the repo.	2024-03-09 11:01:43 +09:00
Yuya Nishihara	a224d0f172	repo_path: show more detailed error if filesystem path failed to parse This should address both use cases: 1. If from_relative_path() is directly called, the error says ".." shouldn't be included in the (normalized) relative path. 2. If parse_fs_path() is used, the error message contains paths relative to cwd. #3216	2024-03-09 11:01:43 +09:00
Yuya Nishihara	a76f716cd1	index: remove RevWalk newtypes that were necessary to hide impl types/traits Some of the RevWalk methods could be generalized, but I decided to not try that for now. I'll probably need to do more cleanup to (hopefully) remove 'index lifetime from these types.	2024-03-08 10:07:40 +09:00
Yuya Nishihara	8451453f3a	index: hide walk_revs() and related types They are now implementation details of the default index backend.	2024-03-08 10:07:40 +09:00

... 2 3 4 5 6 ...

3006 commits