mirrors/jj

mirror of https://github.com/martinvonz/jj.git synced 2025-01-03 18:24:19 +00:00

Author	SHA1	Message	Date
Ilya Grigoriev	7cef879ef6	lib `repo.rs` & `rewrite.rs`: Move clearing of rewritten/abandoned commits This commit is a little out of place in this sequence, but it seems to make more sense for MutRepo to own these maps. @yuja [pointed out] that any tests written using `create_descendant_rebaser` now need to do this cleanup, but there are no longer any such tests after the previous commits and a follow-up commit removes `create_descendant_rebaser` entirely. [pointed out]: https://github.com/martinvonz/jj/pull/2737#discussion_r1435754370	2024-01-01 18:51:36 -08:00
Ilya Grigoriev	4461d61254	test_rewrite: test branches of descendants of divergent commits A TODO left over from a previous PR	2024-01-01 18:51:36 -08:00
Ilya Grigoriev	b2abba07e9	tests: (mostly) stop using soon-to-be-private DescendantRebaser-related APIs This removes uses of `DescendantRebaser::new` or `MutRepo::create_descendant_rebaser` from most tests. The exceptions are the tests having to do with abandoning empty commits on rebase, since adjusting those is a bit more elaborate (see follow-up commits).	2024-01-01 18:51:36 -08:00
Yuya Nishihara	3eafca65ea	op_walk: add support for op_id+ (children) operator A possible use case is when doing some archaeology around a certain operation. The current implementation is quadratic if + is repeated. Suppose op_id is usually close to the current op heads, I think it'll practically work better than building a reverse lookup table.	2024-01-02 10:30:08 +09:00
Yuya Nishihara	ab299a6af5	op_walk: reimplement prefix lookup by using walk_ancestors() and HexPrefix Perhaps, OpStore should provide prefix resolution method, but let's think that later.	2024-01-02 10:30:08 +09:00
Yuya Nishihara	c53748d732	op_walk: allow walk_ancestors() from more than one head operations	2024-01-02 10:30:08 +09:00
Yuya Nishihara	51691ea22c	tests: add lib tests for op id resolution, migrate some from cli CLI testing is slow and harder to set up crafted environment.	2024-01-02 10:30:08 +09:00
Yuya Nishihara	dad890b960	operation: make parent_ids() return slice instead of Vec reference	2024-01-02 02:47:41 +09:00
Yuya Nishihara	c9b581589c	op_walk: simplify arguments passed to high-level "opset" query functions	2024-01-01 10:22:23 +09:00
Yuya Nishihara	26b5f38f45	op_walk: move "opset" query functions from jj_cli	2024-01-01 10:22:23 +09:00
Yuya Nishihara	e4460d5386	op_walk: add error types for fake "opset" expression This removes CommandError dependency from these resolution functions. We might want to refactor the error types again if we introduce a real "opset" evaluator. The error message for unresolved op heads now includes "@" instead of the whole expression.	2024-01-01 10:22:23 +09:00
Yuya Nishihara	94fc32ab47	op_walk: extract walk_ancestors() to new module I'm going to extract fake "opset" resolution functions there, and I think walk_ancestors() belongs to the same category.	2024-01-01 10:22:23 +09:00
Yuya Nishihara	6dd936f72f	op_heads: let caller decide resolve_op_heads() error type The resolver callback usually returns wider error type, which I don't think is a variant of OpHeadResolutionError. To help type inference, resolver's error type is E, not E1 where E: From<E1>.	2024-01-01 10:22:23 +09:00
Martin von Zweigbergk	90744fb770	working copy: read files ahead when updating If the commit backend has high latency, it can make a big difference to read files concurrently. This patch updates the working copy code to do that in the update code (when reading files from the backend to write to the working copy). Because our backend at Google reads files from a local daemon process that already does a lot of prefetching, this patch doesn't actually help us. I think it's still the right thing to do for backends that don't do the same kind of prefetching. It speeds up `jj sparse set --add` by >10x when I disable the prefetching in our daemon (our `Backend::concurrency()` is 100).	2023-12-29 13:37:13 -08:00
Yuya Nishihara	f9e9058b9b	index: show bad operation id if commit lookup failed during reindexing My jj repo contains such head commits, and "jj debug reindex" fails. To address this problem, we'll probably need to implement GC, and the user will discard operations before the first bad op id.	2023-12-29 13:05:58 +09:00
Yuya Nishihara	43e016a7d1	index: add explicit reindexing method that can propagate error	2023-12-29 13:05:58 +09:00
Yuya Nishihara	ab1c8656a4	index: rename private index_at_operation methods, reorder arguments I'm going to add a public method that rebuilds index, and its return type will be different. I also added "build_" because "index" could be misinterpreted as noun. The method arguments are reordered to follow the public IndexStore interface.	2023-12-29 13:05:58 +09:00
Yuya Nishihara	3abe6be384	index: propagate DefaultIndexStore::init/reinit() errors	2023-12-29 13:05:58 +09:00
Yuya Nishihara	955f6e356a	repo: add error propagation path to IndexStore initialization and loading The error types are shared with the commit store backend. We could add per-store error types, but it's unlikely that the caller needs to discriminate them.	2023-12-29 13:05:58 +09:00
Yuya Nishihara	bb73cd491f	clenaup: don't use debug format to embed ObjectId in error message Also fixed typo, s/a/an/.	2023-12-29 13:05:58 +09:00
Martin von Zweigbergk	d06764eb7c	op heads: remove now-unused methods for adding/removing op heads	2023-12-28 09:17:42 -08:00
Martin von Zweigbergk	65a6aa61db	op heads: replace last use of remove_op_head() by update_op_heads()	2023-12-28 09:17:42 -08:00
Martin von Zweigbergk	76516bb46b	op heads: inline handle_ancestor_ops() This gets us closer to being able to use the new `update_op_heads()` function here (without calling it multiple times).	2023-12-28 09:17:42 -08:00
Martin von Zweigbergk	4221c7cf5c	op heads: remove handle_ancestor_ops() from trait I think the idea behind `handle_ancestor_ops()` was to let our backend at Google delegate the work to the server, which could then avoid walking ancestors. However, we're now thinking that we're going to make our server resolve divergent operations on its own instead, so the client will never see more than one op head, unless it manually creates the second op head itself (e.g. because the user ran two concurrent commands). In those cases it should be fine to do the walk. So let's simplify the trait by removing the function.	2023-12-28 09:17:42 -08:00
Martin von Zweigbergk	f969f4b0b0	op heads: remove lifetime from OpHeadsStoreLock	2023-12-28 09:17:42 -08:00
Martin von Zweigbergk	c304777a35	op heads: remove promote_new_op() `OpHeadsStoreLock::promote_new_op()` doesn't add much over the new `update_op_heads()`, so let's switch to the latter.	2023-12-28 09:17:42 -08:00
Martin von Zweigbergk	b8e45d196f	op heads: add a new trait method combining add and remove of op heads Consider how one would implment the current `OpHeadsStore` interface for a cloud-based backend. After `OpHeadsStore::add_op_head()` is called, the set of op heads temporarily contains two heads (typically) until `OpHeadsStore::remove_op_head()` is called. That's not invalid, but it's annoying to have to deal with that state more than necessary. Also, it's unnecessarily inefficient to send the addition and removal of op heads as separate RPCs. This patch therefore adds a `update_op_heads()` method that takes a list of old heads to remove and a single new head to add. Coming patches will start migrating to that method.	2023-12-28 09:17:42 -08:00
Martin von Zweigbergk	8137975785	op heads: drop support for old location/format We move `.jj/repo/op_heads/` into `.jj/repo/op_heads/heads/` almost a year ago, in commits `90a66ec262` and `37ba17589d`. We said we would drop support for it in 0.9+. I think we said that before we started doing monthly releases, but I we're still past the goal of 6 months (which is what I think we were aiming for).	2023-12-28 09:17:42 -08:00
Yuya Nishihara	dde42b9c05	index: rename resolve_prefix() to resolve_commit_id_prefix() I'll probably add change id lookup methods to CompositeIndex. The Index trait won't gain resolve_change_id_prefix(), but I also renamed its resolve_prefix() for consistency.	2023-12-26 01:03:10 +09:00
Yuya Nishihara	0f2f566188	index: remove "segment_" prefix from IndexSegment methods Since Readonly/MutableIndexSegment no longer implement Index trait, there's no ambiguity between segment-local and index-global operations. Let's shorten the method names.	2023-12-26 01:03:10 +09:00
Yuya Nishihara	c9b9e2864e	index: introduce newtype that represents segment-local position I'm thinking of changing some IndexSegment methods to return LocalPosition instead of global IndexPosition, and using u32 there would be a source of bugs.	2023-12-26 01:03:10 +09:00
Yuya Nishihara	ee8d5e279a	index: make segment-level lookup return neighbor commit ids instead of positions Both readonly and mutable segments know the commit ids to return, and the caller only needs the ids. Since segment_commit_id(local_pos) scans the graph entries, doing that would increase the chance of cache miss.	2023-12-26 01:03:10 +09:00
Yuya Nishihara	0e7834feb9	index: inline segment_entry_by_pos() There's no reasonable way to abstract the IndexEntry construction.	2023-12-26 01:03:10 +09:00
Ilya Grigoriev	1fb9df252b	`split.rs`: stop using DescendantRebaser::new This requires creating a new public API as a substitute. I took the opportunity to also add some comments to the `MutRepo::record_rewritten_commit`/`record_abandoned_commit` functions. I imade the simplest possible addition to the API; it is not a very elegant one. Eventually, the entire `record_rewritten_commit` API should probably be refactored again. I also added some comments explaining what these functions do.	2023-12-24 19:25:16 -08:00
Ilya Grigoriev	6bfd09009f	`move.rs`: remove use of `MutRepo::create_descenant_rebaser`. After this, the internal function is only used in tests.	2023-12-24 19:25:16 -08:00
Ilya Grigoriev	cde8ea8985	Make CommitBuilder constructors private to the library crate The implementation of `CommitBuilder::write` is tightly bound to the MutRepo, so only MutRepo should construct CommitBuilder-s.	2023-12-24 19:25:16 -08:00
Yuya Nishihara	b954bab0ca	index: fix partial reindexing to not lose commits only reachable from one side Spotted while adding error propagation there. This wouldn't likely be a real problem because "jj debug reindex" removes all of the operation links. The "} else {" condition is removed because it doesn't make sense to exclude only the exact parent_op_id operation. This can be optimized to not walk ancestors of the parent_op_id operation, but I don't see a motivation to add tests covering such scenarios. It's pretty rare that an intermediate operation link is missing.	2023-12-24 23:31:16 +09:00
Yuya Nishihara	320d15412b	index: let caller of segment-level save-in() squash segments explicitly There are many unit tests that call mutable_segment.save_in(), but I don't think these callers expect that the segment file could be squashed depending on the size. Let's make it caller's responsibility. maybe_squash_with_ancestors() should be cheap if segment_num_commits() == 0, so it's okay to call it before checking the emptiness.	2023-12-24 00:22:47 +09:00
Yuya Nishihara	1d80bbb70a	index: leverage ancestor iterator to collect segments to be squashed I think "for" loop is easier to follow. Maybe it could be rewritten further to .find_map() loop, but that would be too clever. I also made ancestor_index_segments() pub(super) since it doesn't make sense to only provide ancestor_files_without_local().	2023-12-24 00:22:47 +09:00
Yuya Nishihara	55b4f69fb6	repo: propagate store error from add_heads()	2023-12-24 00:22:30 +09:00
Yuya Nishihara	0f6a7418f2	index: propagate store error from reindexing function If the error is permanent (because the repo predates the no-gc-ref fix for example), there's no easy way to recover. Still, panicking in this function seems wrong.	2023-12-24 00:22:30 +09:00
Yuya Nishihara	7a44e590dc	lock: remove byteorder dependency from tests, use fs helper functions This is the last use of Read/WriteBytesExt. The byteorder crate is great, but we don't need an abstraction of endianness. Let's simply use the std functions.	2023-12-23 00:14:17 +09:00
Yuya Nishihara	9de6273e10	index, stacked_table: inline read_u32::<LittleEndian>() There aren't many callers of ReadBytesExt::read_u32().	2023-12-23 00:14:17 +09:00
Yuya Nishihara	21c22be96e	stacked_table: use u32::from_le_bytes() to reinterpret bytes as integer Apparently, I forgot to update this in `fb06e89649`.	2023-12-23 00:14:17 +09:00
Yuya Nishihara	6f5096e266	index, stacked_table: use u32::try_from() instead of numeric cast These .unwrap()s wouldn't be compiled out, but I don't think they would have measurable impact. Let's use the safer method.	2023-12-22 09:03:50 +09:00
Yuya Nishihara	9ec89bcf86	index, stacked_table: use u32::to_le_bytes() to reinterpret as bytes	2023-12-22 09:03:50 +09:00
Yuya Nishihara	392539fa29	index, stacked_table: simply extend Vec<u8> to not use .write_all() I'm going to remove use of .write_u32() there. It's not super important, but fewer .unwrap()s, the code looks slightly better.	2023-12-22 09:03:50 +09:00
Yuya Nishihara	fb06e89649	index: use u32::from_le_bytes() to reinterpret bytes as integer It's less abstract than going through io::Read, so is probably easier for compiler to optimize out. I also feel it's a bit more readable.	2023-12-22 09:03:50 +09:00
Yuya Nishihara	38ce914321	index: reindex on content-related I/O errors If read_exact() or read_u32() reached to EOF, the index file should be considered corrupted. File not found error is also treated as data corruption because an invalid file name could be read from the child segment file. It can't handle special file names like "..", though.	2023-12-21 08:05:30 +09:00
Yuya Nishihara	e98104d6f0	index: add file name to both io/corrupt errors, combine these variants Index file name also applies to io::Error. New error type reuses io::Error to represent data corruption. We could add an inner Corrupt\|Io enum instead, but we'll need to remap some io::Error variants (e.g. UnexpectedEof) to Corrupt anyway.	2023-12-21 08:05:30 +09:00
Yuya Nishihara	88f3085bb1	index: extract function that opens file and loads index segments	2023-12-21 08:05:30 +09:00
Yuya Nishihara	eccb9b7a44	index: propagate index load errors from DefaultIndexStore	2023-12-19 07:41:57 +09:00
Yuya Nishihara	dd8e686127	index: don't reload parent files after saving new segment file This should be cheaper, and more importantly, we no longer need to propagate ReadonlyIndexLoadError to the caller.	2023-12-19 07:41:57 +09:00
Yuya Nishihara	fb07749291	index: split load function into header and local parts as well	2023-12-19 07:41:57 +09:00
Yuya Nishihara	616a8c7f54	index: split serialization function into header and local parts The idea is that we don't have to reload parent files as we already have the chain of the parent segments. The resulting readonly index will be constructed from the loaded parent segments + local entries blob.	2023-12-19 07:41:57 +09:00
Yuya Nishihara	31b6e93c6e	index: move IndexLoadError to "readonly" module, rename accordingly I thought IndexLoadError and DefaultIndexStoreError would represent "load" and "store" failures respectively, but they aren't. Actually, DefaultIndexStoreError is the store-level error, and IndexLoadError should be wrapped in it.	2023-12-19 07:41:57 +09:00
Yuya Nishihara	b5de16007e	index: add stub IndexReadError type This is needed to remove .unwrap()s from DefaultIndexStore.	2023-12-19 07:41:57 +09:00
Yuya Nishihara	d49b079494	index: update file format comment about ReadonlyIndexSegment Also made it a doc comment. I think 4-byte alignment is a nice property, so added note about that.	2023-12-19 07:41:34 +09:00
Yuya Nishihara	8909647d86	index: pass base directory path by reference	2023-12-18 08:49:21 +09:00
Yuya Nishihara	b733d52557	index: split DefaultIndexStoreError::Io variant, extract save helper Since OpStoreError can also include io::Error, it doesn't make much sense to have Io variant at this level. Let's split it to context-specific errors, and extract helper method that maps io::Error.	2023-12-18 08:49:21 +09:00
Yuya Nishihara	bf4a4e70b1	index: use DefaultMutableIndex wrapper when reconstructing missing index This allows us to extract helper method that writes index file and associates it with the operation.	2023-12-18 08:49:21 +09:00
Yuya Nishihara	50164bb36f	index: have IndexWriteError carry opaque error type instead of string I'm going to remove some .unwrap()s from DefaultIndexStore, and the inner error type will be consolidated to DefaultIndexStoreError.	2023-12-18 08:49:21 +09:00
Yuya Nishihara	87a8238bee	git: turn git.auto-local-branch off by default As far as I can see in the chat, there's no objection to changing the default, and git.auto-local-branch = false is generally preferred. docs/branches.md isn't updated as it would otherwise conflict with #2625. I think the "Remotes" section will need a non-trivial rewrite. #1136, #1862	2023-12-17 08:30:24 +09:00
Yuya Nishihara	6971ec239a	tests: set git_settings.auto_local_branch where it matters	2023-12-17 08:30:24 +09:00
Yuya Nishihara	ac99145a28	working_copy: drop open file instance from PersistError For the same reason as the file_util change.	2023-12-17 08:20:07 +09:00
Yuya Nishihara	c6df0ba4c3	file_util: don't try to overwrite existing content-addressed file on Windows The doc says persist() replaces the destination file as rename() would do on Unix. persist_noclobber() doesn't, and is probably more reliable on Windows. I don't know if persist() is completely atomic on Windows, but if it isn't, it might be the source of the "permission denied" error under highly contended situation. https://docs.rs/tempfile/latest/tempfile/struct.NamedTempFile.html#method.persist https://github.com/Stebalien/tempfile/blob/v3.8.0/src/file/imp/windows.rs#L77 We could use persist_noclobber() on all platforms, but it's more involved on Unix. https://github.com/Stebalien/tempfile/blob/v3.8.0/src/file/imp/unix.rs#L107	2023-12-17 08:20:07 +09:00
Yuya Nishihara	dd325c089c	file_util: drop open file instance from PersistError PersistError is basically a pair of io::Error and NamedTempFile instance. It's unlikely that we would want to propagate the open file instance to the CLI error handler, leaving the temporary file alive.	2023-12-17 08:20:07 +09:00
Yuya Nishihara	4d91e4c196	revset: simplify type constraints on combination iterators Just a minor cleanup to remove lifetime parameter from the types. I tried to reimplement them by using itertools, but I couldn't find a simple way to encode short-circuiting at the end of either left or right iterator.	2023-12-16 07:50:04 +09:00
Yuya Nishihara	6d59156858	revset: parameterize candidates set of FilterRevset as well	2023-12-16 07:50:04 +09:00
Yuya Nishihara	a36368bb88	revset: make revset combinators generic over set types, merge UnionPredicate UnionRevset and UnionPredicate are conceptually the same. Let's unify them.	2023-12-16 07:50:04 +09:00
Yuya Nishihara	af6047a655	lib: forbid unsafe_code at all	2023-12-15 16:10:28 +09:00
Yuya Nishihara	9990c41a90	repo: remove unsafe lifetime hack from change_id_index()	2023-12-15 16:10:28 +09:00
Yuya Nishihara	d9e8297059	index: add 'static version of evaluate_revset() to ReadonlyIndex We'll probably need a better abstraction, but a separate method is good enough to remove unsafe code from ReadonlyRepo. I'm not sure if this is feasible for the other backends, but I guess there would be less lifetimed variables than DefaultReadonlyIndex.	2023-12-15 16:10:28 +09:00
Yuya Nishihara	2ba50c76c7	revset: abstract evaluated RevsetImpl over owned/borrowed index types	2023-12-15 16:10:28 +09:00
Yuya Nishihara	72d9cd019b	index: extract as_composite() to trait method The revset engine will accept abstract AsCompositeIndex type, and the evaluated revset can be 'static if the index is behind Arc<T>.	2023-12-15 16:10:28 +09:00
Yuya Nishihara	8fdf9db6e0	revset: remove 'index lifetime from InternalRevset	2023-12-15 14:58:12 +09:00
Yuya Nishihara	c426d34c11	revset: pass in index to PurePredicateFn as an argument to make it 'static	2023-12-15 14:58:12 +09:00
Yuya Nishihara	71070e85d7	revset: add helper that coerces closure to PurePredicateFn Also renamed the boxed version to discriminate it from the cast helper.	2023-12-15 14:58:12 +09:00
Yuya Nishihara	a9a7de4a5e	revset: store RevWalk factory function in RevWalkRevset The returned iterator is boxed by caller due to the limitation of the type system. There's a workaround, but it's super ugly. https://users.rust-lang.org/t/hrtb-on-multiple-generics/34255/3	2023-12-15 14:58:12 +09:00
Yuya Nishihara	575d3dc7bf	revset: store IndexPosition in EagerRevset to drop 'index lifetime This adds overhead to re-look up IndexEntry, but I don't think that would have significant impact on performance.	2023-12-15 14:58:12 +09:00
Yuya Nishihara	261bf848a9	revset: pass in index to InternalRevset as an argument The idea is that InternalRevset will store a 'static boilerplate function that borrows an 'index passed by function argument. This way, we can abstract the index type over Arc<T> and &T without introducing too much generics.	2023-12-15 14:58:12 +09:00
Yuya Nishihara	e332d39375	revset: extract inner method that constructs IndexEntry iterator	2023-12-15 14:58:12 +09:00
Yuya Nishihara	b8f60c4dd6	cargo: bump gix to 0.56.0 I don't know why the dependabot didn't catch this, but there are things to fix manually. EntryMode was changed to a u16 wrapper, and the enum was renamed to EntryKind. Other than that, I don't find anything breaking our codebase.	2023-12-15 14:17:02 +09:00
Yuya Nishihara	95a0cceb97	index: use loaded readonly data without splitting into vecs Since lookup data isn't typically small, .split_off() can take a few milliseconds to memcpy().	2023-12-14 08:43:50 +09:00
Yuya Nishihara	5121e1f4e9	index: move IndexSegment trait to "composite" module Perhaps, this is the most controversial part. It could be moved to new "segment" module (or something like "common"), but I think IndexSegment can be considered a trait that enables the CompositeIndex abstraction.	2023-12-14 08:43:40 +09:00
Yuya Nishihara	b89ae7c0b5	index: use IndexEntry::position() instead of direct field access	2023-12-14 08:43:40 +09:00
Yuya Nishihara	9fb0f00f2d	index: add IndexEntry constructor instead of pub(super)-ing fields	2023-12-14 08:43:40 +09:00
Yuya Nishihara	771f447d99	index: split IndexEntry and related types to "entry" module Added pub(super) or pub where needed. I won't implement accessor methods on IndexPositionByGeneration and IndexPosition as they are purely value types, and protecting the inner values wouldn't make sense.	2023-12-14 08:43:40 +09:00
Martin von Zweigbergk	60fae3114e	transaction: take description at end instead of start It seems better to have the caller pass the transaction description when we finish the transaction than when we start it. That way we have all the information we want to include more readily available.	2023-12-13 08:12:49 -08:00
Ilya Grigoriev	316ab8efb8	rewrite.rs: refactor `new_parents` to depend only on `parent_mapping` Previously, the function relied on both the `self.parent_mapping` and `self.rebased`. If `(A,B)` was in `parent_mapping` and `(B,C)` was in `rebased`, `new_parents` would map `A` to `C`. Now, `self.rebased` is ignored by `new_parents`. In the same situation, DescendantRebaser is changed so that both `(A,B)` and `(B,C)` are in `parent_mapping` before. `new_parents` now applies `parent_mapping` repeatedly, and will map `A` to `C` in this situation. ## Cons - The semantics are changed; `new_parents` now panics if `self.parent_mapping` contain cycles. AFAICT, such cycles never happen in `jj` anyway, except for one test that I had to fix. I think it's a sensible restriction to live with; if you do want to swap children of two commits, you can call `rebase_descendants` twice. ## Pros - I find the new logic much easier to reason about. I plan to extract it into a function, to be used in refactors for `jj rebase -r` and `jj new --after`. It will make it much easier to have a correct implementation of `jj rebase -r --after`, even when rebasing onto a descendant. - The de-duplication is no longer O(n^2). I tried to keep the common case fast. ## Alternatives - We could make `jj rebase` and `jj new` use a separate function with the algorithm shown here, without changing DescendantRebaser. I believe that the new algorithm makes DescendatRebaser easier to understand, though, and it feels more elegant to reduce code duplication. - The de-duplication optimization here is independent of other changes, and could be used on its own.	2023-12-12 19:35:51 -08:00
Yuya Nishihara	2abbb637e3	index: add wrapper functions to DefaultReadonlyIndex to remove pub(super) field	2023-12-13 08:09:48 +09:00
Yuya Nishihara	c0a12a7cbc	index: add methods that provides commit/change_id_length We could add Layout struct holding these parameters, but I don't think that's needed just for two parameters.	2023-12-13 08:09:48 +09:00
Yuya Nishihara	3831ad423c	index: use as_composite().num_commits() instead of direct field access	2023-12-13 08:09:48 +09:00
Yuya Nishihara	30984b1505	index: use name() instead of direct field access	2023-12-13 08:09:48 +09:00
Yuya Nishihara	e5c8252fb4	index: use segment_parent_file() instead of direct field access	2023-12-13 08:09:48 +09:00
Yuya Nishihara	402e36bab7	index: split readonly index types to "readonly" module Added pub(super) where needed. There are a few pub(super) fields that look suspicious, which will be fixed by the subsequent patches.	2023-12-13 08:09:48 +09:00
Yuya Nishihara	fbec16b49f	index: add wrapper functions to DefaultMutableIndex to remove pub(super) field into_segment() could be added instead of save_in(), but I decided to wrap save_in(). save_in() may squash ancestor files, so it could be considered an index-level operation.	2023-12-13 08:09:48 +09:00
Yuya Nishihara	5aeeb5f723	index: split mutable index types to "mutable" module Added pub(super) where needed or makes sense.	2023-12-13 08:09:48 +09:00
Yuya Nishihara	ab2742f2c9	index: split RevWalk types to "rev_walk" module Added pub(super) where needed.	2023-12-12 08:07:52 +09:00
Yuya Nishihara	caa1b99c24	index: add CompositeIndex constructor instead of pub(super)-ing field This wouldn't matter, but seemed slightly better.	2023-12-12 08:07:52 +09:00
Yuya Nishihara	679518fdf2	index: split CompositeIndex and stats types to "composite" module Added pub(super) where needed or makes sense.	2023-12-12 08:07:52 +09:00
Yuya Nishihara	2423558e68	index: split DefaultIndexStore and Load/StoreError types to "store" module IndexLoadError isn't store-specific, but I think it's better to put I/O stuff in the store module.	2023-12-12 08:07:52 +09:00
Yuya Nishihara	cdcd465c79	index: move default_index_store.rs to sub directory named default_index default_index_store.rs is relatively big, and it contains types and impls in arbitrary order. Let's split them into sub modules. After everything moved, mod.rs will only contain tests.	2023-12-12 08:07:52 +09:00
Yuya Nishihara	f86b338681	revset: inline walk_ancestors()	2023-12-11 09:14:03 +09:00
Yuya Nishihara	cd0b24ef14	revset: inline walk_children() There's only one caller, and we have common code at the call site.	2023-12-11 09:14:03 +09:00
Yuya Nishihara	d28bd8fa0f	revset: inline collect_dag_range()	2023-12-11 09:14:03 +09:00
Yuya Nishihara	73fb922517	index: reimplement collect_dag_range() of revset engine as iterator I'm going to remove 'index lifetime from InternalRevset so Revset<'static> can be easily constructed from DefaultReadonlyIndex. As the first step, this series removes some lifetime complexity from EvaluationContext methods. We don't need an descendant iterator API, but it helps to add separate function to collect into HashSet<IndexPosition> instead of returning a pair of ordered vec and set.	2023-12-11 09:14:03 +09:00
Yuya Nishihara	cbbe38ba7b	index: rename MutableIndexImpl to MutableIndexSegment	2023-12-10 11:03:07 +09:00
Yuya Nishihara	c94e1de6d2	index: add DefaultMutableIndex wrapper, move Index impls to it The wrapper type isn't needed for the mutable layer, but this mirrors the readonly type structure. Test cases are also migrated to be using the index wrapper so long as we don't have to care for the nesting of the segment files.	2023-12-10 11:03:07 +09:00
Yuya Nishihara	ce312ae288	index: duplicate add_commit() to MutableIndexImpl	2023-12-10 11:03:07 +09:00
Yuya Nishihara	e0206a82f2	index: extract merge_in() function that works on segment types Prepares for splitting MutableIndexImpl into segment and index wrapper types.	2023-12-10 11:03:07 +09:00
Yuya Nishihara	a110ec6d95	cli: print failed git export reason for each ref Not all reasons are actionable, but we print hint in common cryptic cases.	2023-12-09 23:37:00 +09:00
Yuya Nishihara	990edcefc9	index: impl Index for DefaultReadonlyIndex instead of ReadonlyIndexSegment The idea is that the ReadonlyIndexSegment is a sub component of the index. The Index trait could be implemented for any Segment type, but we don't need a public interface to access sub segment as an index.	2023-12-09 15:18:36 +09:00
Yuya Nishihara	1cbd2ddb4b	index: rename ReadonlyIndexImpl to ReadonlyIndexSegment I'm going to split the internal Segment types and the public Index types in order to clarify the layering concept. The public Index types will be wrappers like DefaultReadonlyIndex. Strictly speaking, ReadonlyIndexImpl is a segment + parent pointer pair, but I think calling it a segment is pretty okay. It could be called a ReadonlyIndexFile, but "File" can't apply to the mutable part.	2023-12-09 15:18:36 +09:00
Yuya Nishihara	172043e968	index: make ReadonlyIndexImpl private There are no external callers.	2023-12-09 15:18:36 +09:00
Yuya Nishihara	6c57ba7f21	index: rename ReadonlyIndexWrapper to DefaultReadonlyIndex This matches the store naming: impl IndexStore for DefaultIndexStore. I also added minimal doc comment and Debug.	2023-12-09 15:18:36 +09:00
Yuya Nishihara	cee69d1665	tests: remove index downcast helpers called only by as_<type>_composite() I'm going to rename the impl types, and I don't want to think about the names of these downcast functions.	2023-12-09 15:18:36 +09:00
Yuya Nishihara	5f6e28c8cf	git: migrate export_refs() to gix::Repository FailedToDelete/Set reasons are boxed because gix error types aren't small. They could be casted to std::error::Error if needed.	2023-12-09 15:18:19 +09:00
Yuya Nishihara	2d76907048	git: unimplement PartialEq on FailedRefExportReason Gitoxide errors don't implement PartialEq. We could instead stringify the errors, but there aren't many callers who expect FailedRefExportReason to be comparable.	2023-12-09 15:18:19 +09:00
Yuya Nishihara	9f8831e825	git: unimplement PartialEq on GitExportError Gitoxide errors don't implement PartialEq, and I don't think it makes sense to test equality of InternalGitError objects.	2023-12-09 15:18:19 +09:00
Yuya Nishihara	a77eed648b	git: have export_refs() obtain git2::Repository instance from store	2023-12-09 15:18:19 +09:00
Yuya Nishihara	0f37027646	index: remove unneeded Any trait bound from MutableIndex We use .as_any() to downcast to the backend impl instead.	2023-12-08 23:30:35 +09:00
Yuya Nishihara	c197add39b	git_backend: do not try to resolve git_target path as working directory path The git_target path is normalized and managed by jj, so we don't need a fallback mechanism. Let's make it stricter.	2023-12-07 08:43:49 +09:00
Yuya Nishihara	77c811163f	tests: make sure to specify external git repository path including ".git"	2023-12-07 08:43:49 +09:00
Yuya Nishihara	25fcc3e403	workspace: consider .git symlink when generating relative git_target path Before, an absolute path would be saved in the git_target file if .git is a symlink. That's not wrong, but seemed a bit weird. Let's consolidate the behavior across .git file types.	2023-12-05 14:23:59 -08:00
Yuya Nishihara	787fa1340b	workspace: remove redundant cloning from init_external_git() Apparently, I forgot to update it in `1db033504c` "repo, workspace: remove 'static lifetime bound from initializer functions."	2023-12-05 14:23:59 -08:00
Yuya Nishihara	899c6375a0	git_backend: don't fully canonicalize .git symlink Apparently, libgit2 doesn't deduce "core.bare" config from the directory name, but gitoxide implements it correctly. So we shouldn't blindly canonicalize the Git repository path. Fortunately, the saved git_target path isn't a fully- canonicalized form (unless user explicitly sepcified "--git-repo ./.git"), so we don't need a hack to remap git_target back to the symlink path. is_colocated_git_workspace() is adjusted since the git_workdir is no longer resolved from the fully-canonicalized repo path, at least in our code. Still we have the ".git/.." fallback because test_init_git_colocated_symlink_gitlink() would otherwise fail. I haven't figured out why, and the test might be actually wrong compared to the git CLI behavior, but let's not change that for now. Fixes #2668	2023-12-05 14:23:59 -08:00
Martin von Zweigbergk	1cc271441f	gc: implement basic GC for Git backend This adds an initial `jj util gc` command, which simply calls `git gc` when using the Git backend. That should already be useful in non-colocated repos because it's not obvious how to GC (repack) such repos. In my own jj repo, it shrunk `.jj/repo/store/` from 2.4 GiB to 780 MiB, and `jj log --ignore-working-copy` was sped up from 157 ms to 86 ms. I haven't added any tests because the functionality depends on having `git` binary on the PATH, which we don't yet depend on anywhere else. I think we'll still be able to test much of the future parts of garbage collection without a `git` binary because the interesting parts are about manipulating the Git repo before calling `git gc` on it.	2023-12-03 07:40:12 -08:00
Yuya Nishihara	35f718f212	merged_tree: remove canceling terms prior to resolving file-level conflict I think this is a variant of the problem fixed by `7fda80fc22` "tree: simplify conflict before resolving at hunk level." We need to simplify() the conflict before and after extracting file ids because the source conflict values may contain trees to be cancelled out, and the file values may differ only in exec bits. Since the legacy tree passes a simplified conflict in to this function, I made the merged tree do the same. Fixes #2654	2023-12-03 07:44:58 +09:00
Yuya Nishihara	4ffbf40c82	merged_tree: do not propagate conflicting empty tree value to parent Otherwise an empty subtree would be added to the parent tree. If the stored tree contained an empty subtree, simplify() wouldn't work against new "absent" subtree representation. I don't know if there's a such code path, but I believe it's very rare to encounter the problem. #2654	2023-12-03 07:44:58 +09:00
Yuya Nishihara	1db033504c	repo, workspace: remove 'static lifetime bound from initializer functions	2023-12-03 07:44:41 +09:00
Yuya Nishihara	d747879aee	signing: pass SigningFn by reference write_commit() doesn't need ownership of the signing function.	2023-12-01 22:55:04 +09:00
Anton Bulakh	eb1c0ab4a2	sign: Implement a test signing backend and add a few basic tests	2023-11-30 23:36:56 +02:00
Anton Bulakh	d7229a3f90	sign: Define signing backend API and integrate it Finished everything except actual signing backend implementation(s) and the UI.	2023-11-30 23:36:56 +02:00
Yuya Nishihara	076b49b610	merged_tree: use merged_tree_entry_diff() in stream version	2023-12-01 00:05:06 +09:00
Yuya Nishihara	97a260b1bf	merged_tree: reimplement TreeEntryDiffIterator by using iterator adapter We don't need a named type anymore.	2023-12-01 00:05:06 +09:00
Yuya Nishihara	fd1c03d037	merged_tree: use sync get_tree() in TreeDiffIterator This basically backs out the change `1b9a3e27e0` "merged_tree: read before/after trees concurrently." As we decided to add a separate impl for async access, it doesn't make sense to read before/after pair in parallel. The async single_tree() is moved to TreeDiffStreamImpl. It will help remove the sync version when the performance problem is solved.	2023-12-01 00:05:06 +09:00
Yuya Nishihara	601be0d480	working_copy: narrow file_states recursively while visiting directories This saves another ~10ms. Without watchman: ``` % hyperfine --sort command --warmup 3 --runs 20 -L bin jj-1,jj-2 \ "target/release-with-debug/{bin} -R ~/mirrors/linux files ~/mirrors/linux/no-match" Benchmark 2: target/release-with-debug/jj-1 -R ~/mirrors/linux files ~/mirrors/linux/no-match Time (mean ± σ): 327.7 ms ± 24.9 ms [User: 1059.1 ms, System: 654.3 ms] Range (min … max): 296.0 ms … 385.4 ms 20 runs Benchmark 3: target/release-with-debug/jj-2 -R ~/mirrors/linux files ~/mirrors/linux/no-match Time (mean ± σ): 311.0 ms ± 24.8 ms [User: 960.0 ms, System: 643.1 ms] Range (min … max): 274.9 ms … 358.5 ms 20 runs ```	2023-11-30 12:09:31 +09:00
Yuya Nishihara	a935a4f70c	working_copy: use proto file states without rebuilding BTreeMap In snapshot(), changed_file_states are received in arbitrary order. For the other callers, entries are in diff_stream order, so we don't have to sort them. With watchman enabled, we can see the cost of sorting the sorted proto entries. I don't think this is significant, but we can mitigate it by adding is_file_states_sorted flag to the proto message if needed: ``` % hyperfine --sort command --warmup 3 --runs 20 -L bin jj-0,jj-1 \ "target/release-with-debug/{bin} -R ~/mirrors/linux files ~/mirrors/linux/no-match" Benchmark 1: target/release-with-debug/jj-0 -R ~/mirrors/linux files ~/mirrors/linux/no-match Time (mean ± σ): 164.8 ms ± 16.6 ms [User: 50.2 ms, System: 111.7 ms] Range (min … max): 148.1 ms … 195.0 ms 20 runs Benchmark 2: target/release-with-debug/jj-1 -R ~/mirrors/linux files ~/mirrors/linux/no-match Time (mean ± σ): 171.8 ms ± 13.6 ms [User: 61.7 ms, System: 109.0 ms] Range (min … max): 159.5 ms … 192.1 ms 20 runs ``` Without watchman: ``` % hyperfine --sort command --warmup 3 --runs 20 -L bin jj-0,jj-1 \ "target/release-with-debug/{bin} -R ~/mirrors/linux files ~/mirrors/linux/no-match" Benchmark 1: target/release-with-debug/jj-0 -R ~/mirrors/linux files ~/mirrors/linux/no-match Time (mean ± σ): 367.3 ms ± 30.3 ms [User: 1415.2 ms, System: 633.8 ms] Range (min … max): 325.4 ms … 421.7 ms 20 runs Benchmark 2: target/release-with-debug/jj-1 -R ~/mirrors/linux files ~/mirrors/linux/no-match Time (mean ± σ): 327.7 ms ± 24.9 ms [User: 1059.1 ms, System: 654.3 ms] Range (min … max): 296.0 ms … 385.4 ms 20 runs ``` I haven't measured snapshotting against dirty working copy, but I don't think it would be slower than the original implementation.	2023-11-30 12:09:31 +09:00
Yuya Nishihara	fca3690dda	working_copy: add file states wrapper that provides map-like API I'll replace the current lazy loading mechanism with this. Read-only methods are implemented on the borrowed type so that we can narrow lookup scope recursively.	2023-11-30 12:09:31 +09:00
Yuya Nishihara	9292af5e52	working_copy: update file states in bulk This helps migrate BTreeMap<RepoPath, _> to sorted Vec.	2023-11-30 12:09:31 +09:00
Yuya Nishihara	c9150d02fc	working_copy: don't look up file state twice while visiting directories	2023-11-30 12:09:31 +09:00
Yuya Nishihara	6ce7bd5338	repo_path: replace .contains() with .starts_with(), flipping the arguments self.contains(other) means that the self tree contains the other tree (i.e. the self path is prefix of the other), but it could be confused the other way around if we were thinking about the path literal, not the tree. Let's add .starts_with() instead by copying the std::path::Path definition.	2023-11-29 08:41:23 +09:00
Yuya Nishihara	266690a46b	repo_path: make strip_prefix() public function returning &RepoPath There are no external callers, but I think it's useful.	2023-11-29 08:41:23 +09:00
Yuya Nishihara	73690ed54e	matchers: clean up .walk_to(dir) to yield &RepoPath instead of iterator	2023-11-29 08:41:23 +09:00
Yuya Nishihara	bc9725c73c	working_copy: use RepoPath::parent() which no longer allocates temporary object	2023-11-29 08:41:23 +09:00
Yuya Nishihara	016fc2b5cc	repo_path: change .split() and .parent() to return &RepoPath	2023-11-29 08:41:23 +09:00
Yuya Nishihara	28ab9593c3	repo_path: split RepoPath into owned and borrowed types This enables cheap str-to-RepoPath cast, which is useful when sorting and filtering a large Vec<(String, _)> list by using matcher for example. It will also eliminate temporary allocation by repo_path.parent().	2023-11-28 07:33:28 +09:00
Yuya Nishihara	0a1bc2ba42	repo_path: add stub RepoPathBuf type, update callers Most RepoPath::from_internal_string() callers will be migrated to the function that returns &RepoPath, and cloning &RepoPath won't work.	2023-11-28 07:33:28 +09:00
Yuya Nishihara	f5938985f0	repo_path: make RepoPath::from_internal_string() accept owned string I'm going to add borrowed RepoPath type, and most from_internal_string() callers will be migrated to it. For the remaining callers, it makes more sense to move the ownership of String to RepoPathBuf.	2023-11-28 07:33:28 +09:00
Yuya Nishihara	d322df0c8d	matchers: make Files/PrefixMatcher constructors accept slice of borrowed paths RepoPath will become slice type (like str), and it doesn't make sense to require &[RepoPathBuf] here.	2023-11-28 07:33:28 +09:00
Yuya Nishihara	a23bb5b958	matchers: in tests, use alias to RepoPath::from_internal_string() It looked verbose to fully spell the function name.	2023-11-28 07:33:28 +09:00
Ilya Grigoriev	6aef4bb52e	cli rebase: do not allow `-r --skip-empty` This follows up on `3967f63` (see that commit's description for more motivation) and `e79c8b6`. In a discussion linked below, it was decided that forbidding `-r --skip-empty` entirely is preferable to the mixed behavior introduced in `3967f63`. `3967f637dc (commitcomment-133539911)`	2023-11-27 10:16:36 -08:00
Yuya Nishihara	55f75278bc	repo_path: make to_internal_file_string() return &str, rename accordingly	2023-11-27 08:42:09 +09:00
Yuya Nishihara	12d7f8be16	repo_path: turn RepoPath into String wrapper RepoPath::from_components() is removed since it is no longer a primitive function. The components iterator could be implemented on top of str::split(), but it's not as we'll probably want to add components.as_path() -> &RepoPath. Tree walking and tree_states map construction get slightly faster thanks to fewer allocations and/or better cache locality. If we add a borrowed RepoPath type, we can also implement a cheap &str to &RepoPath conversion on top. Then, we can get rid of BTreeMap<RepoPath, FileState> construction at all. Snapshot without watchman: ``` % hyperfine --sort command --warmup 3 --runs 10 -L bin jj-0,jj-1 \ "target/release-with-debug/{bin} -R ~/mirrors/linux status" Benchmark 1: target/release-with-debug/jj-0 -R ~/mirrors/linux status Time (mean ± σ): 950.1 ms ± 24.9 ms [User: 1642.4 ms, System: 681.1 ms] Range (min … max): 913.8 ms … 990.9 ms 10 runs Benchmark 2: target/release-with-debug/jj-1 -R ~/mirrors/linux status Time (mean ± σ): 872.1 ms ± 14.5 ms [User: 1922.3 ms, System: 625.8 ms] Range (min … max): 853.2 ms … 895.9 ms 10 runs Relative speed comparison 1.09 ± 0.03 target/release-with-debug/jj-0 -R ~/mirrors/linux status 1.00 target/release-with-debug/jj-1 -R ~/mirrors/linux status ``` Tree walk: ``` % hyperfine --sort command --warmup 3 --runs 10 -L bin jj-0,jj-1 \ "target/release-with-debug/{bin} -R ~/mirrors/linux files --ignore-working-copy" Benchmark 1: target/release-with-debug/jj-0 -R ~/mirrors/linux files --ignore-working-copy Time (mean ± σ): 375.3 ms ± 15.4 ms [User: 223.3 ms, System: 151.8 ms] Range (min … max): 359.4 ms … 394.1 ms 10 runs Benchmark 2: target/release-with-debug/jj-1 -R ~/mirrors/linux files --ignore-working-copy Time (mean ± σ): 357.1 ms ± 16.2 ms [User: 214.7 ms, System: 142.6 ms] Range (min … max): 341.6 ms … 378.9 ms 10 runs Relative speed comparison 1.05 ± 0.06 target/release-with-debug/jj-0 -R ~/mirrors/linux files --ignore-working-copy 1.00 target/release-with-debug/jj-1 -R ~/mirrors/linux files --ignore-working-copy ```	2023-11-27 08:42:09 +09:00
Yuya Nishihara	974a6870b3	repo_path: make RepoPath::components() return iterator This allows us to change the backing type from Vec<String> to String.	2023-11-27 08:42:09 +09:00
Yuya Nishihara	aba8c640be	repo_path: capture current Vec<String> ordering by tests The added test would fail if paths were purely ordered by concatenated strings. I'm not sure if we want to preserve the current ordering, but let's not break it for the moment.	2023-11-27 08:42:09 +09:00
Ilya Grigoriev	3967f637dc	cli rebase: do not allow `-r --skip-empty` to drop emptied descendants This follows up on @matts1 's #2609. We still allow the `-r` commit to become empty. I would be more comfortable if there was a test for that, but I haven't done that (yet?) and it seems pretty safe. If that's a problem, I'm happy to forbid `-r --skip-empty` entirely, since it is far less useful than `-s --skip-empty` or `-b --skip-empty`. I think it is undesired to abandon emptied descendants. As far as descendants of `A` are concerned, `jj rebase -r A` should be equivalent to `jj abandon A`, and `jj abandon` does not remove emptied commits. It also doesn't seem very useful to do that, since I think descendant commits of an abandoned (or moved with `-r`) commit only become empty in pathological cases. Additionally, if we did want -r to empty descendants of `A`, we'd have to add thorough tests and possibly improve the algorithm. I want to refactor `rebase -r` and add features to it, and having to consider cases of commits becoming abandoned makes everything harder. For example, if we have ``` root -> A -> B -> C ``` and `jj rebase -r A -d C` empties commit `B` (or `C`), I do not know whether the current algorithm will work correctly. It seems possible that it would, but that depends on the fact that empty merge commits are not abandoned for descendants. That seems dangerous to rely on without tests. I hope (but can't promise) that in the near future, making DescendantRebaser return more information should help make it possible to create such functionality in a more robust way. I am likely to attempt this as part of implementing `-r --after`.	2023-11-26 10:56:58 -08:00
Yuya Nishihara	59ef3f0023	repo_path: split RepoPathComponent into owned and borrowed types This is a step towards introducing a borrowed RepoPath type. The current RepoPath type is inefficient as each component String is usually short. We could apply short-string optimization, but still each inlined component would consume 24 bytes just for e.g. "src", and increase the chance of random memory access. If the owned RepoPath type is backed by String, we can implement cheap cast from &str to borrowed &RepoPath type.	2023-11-26 18:21:40 +09:00
Yuya Nishihara	f2096da2d6	repo_path: add stub type to introduce borrowed RepoPathComponent type The current RepoPathComponent will be renamed to RepoPathComponentBuf, and new str wrapper will be added as RepoPathComponent.	2023-11-26 18:21:40 +09:00
Yuya Nishihara	e14b31a033	repo_path: reject leading slash and empty path components Leading/trailing slashes would introduce a bit of complexity if we migrate the backing type from Vec<String> to String. Empty components are okay, but let's reject them as they are cryptic and invalid.	2023-11-26 18:21:40 +09:00
Yuya Nishihara	755af75c30	repo_path: in tests, use alias to RepoPath::from_internal_string() It seemed too verbose to spell the full function name in tests.	2023-11-26 18:21:40 +09:00
Yuya Nishihara	b5b01f4dd7	cargo: add ref-cast dependency It helps to implement transparent conversion from &str to &Wrapped(str). We could instead wrap the reference as Wrapped<'a>(&'a str), but it has various drawbacks. Notably we can't implement Borrow and Deref because these traits require a reference in return position. Since the unsafe bits are pretty small, we can instead implement cast functions without using the ref-cast crate. However, I believe we'll trust ref-cast more than hand-crafted unsafe code. https://crates.io/crates/ref-cast https://docs.rs/ref-cast/1.0.20/ref_cast/attr.ref_cast_custom.html	2023-11-26 18:21:40 +09:00
Yuya Nishihara	b7543f8a08	rewrite: fix check for newly-empty commit in optimized path 'old_base_tree_id == None' means the rebased tree is unchanged, so the commit shouldn't be considered newly-empty.	2023-11-26 14:42:17 +09:00
Yuya Nishihara	2f93de9299	rewrite: flatten mapping from EmptyBehaviour to desired action I think this is slightly easier to follow.	2023-11-26 14:42:17 +09:00
Ilya Grigoriev	c32847696d	rewrite.rs: rename `new_parents` to `parent_mapping` The function `new_parents` makes sense, but I found the mapping being named `new_parents` confusing.	2023-11-25 21:36:35 -08:00
Yuya Nishihara	6344cd56b3	repo_path: remove RepoPathJoin trait, just implement join() on the type I don't think we'll add join() that takes different types.	2023-11-26 07:14:47 +09:00
Yuya Nishihara	d7df2516c5	repo_path: remove RepoPathComponent::string(), use as_str() instead There are only two callers, and one does further conversion to BString.	2023-11-26 07:14:47 +09:00
Martin von Zweigbergk	6d54afa60e	revset: make `evaluate_programmatic()` optimize expression It seems generally useful to optimize revset expressions in `evaluate_programmatic()` so the caller doesn't have to remember to do it. It should generally be cheap to do so even if it's often not needed.	2023-11-24 21:13:58 -10:00
Martin von Zweigbergk	550164209c	revset: add a `RevsetExpression::evaluate_programmatic()` We often resolve a programmatic revset and then immediately evaluate it. This patch adds a convenience method for those two steps.	2023-11-24 21:13:58 -10:00
Martin von Zweigbergk	f2602f78cf	revset: make `resolve_programmatic()` not return a `Result` I think it's always a programming error if `resolve_programmatic()` returns a `Result`, so it shouldn't have to return a `Result`.	2023-11-24 21:13:58 -10:00
Martin von Zweigbergk	f27f52984e	revset: rename `resolve()` to `resolve_programmatic()` `RevsetExpression::resolve()` is meant for programmatically created expressions. In particular, it may not contain symbols. Let's try to clarify that by renaming the function and documenting it.	2023-11-24 21:13:58 -10:00
Yuya Nishihara	b37293fa68	tests: add upper bound to test_concurrent_read_write_commit() loop Hopefully this will fix the unfinished Windows CI issue. A possible scenario is that recent migration to gitoxide made this test flaky on Windows. For example, gitoxide might have in-memory object cache that relies on file mtime, and occasionally fails to detect new object on Windows.	2023-11-24 18:07:35 +09:00
Matt Stark	0a95e20ebe	lib: Implement skipping of empty commits	2023-11-24 14:48:06 +11:00
Matt Stark	dc89566039	lib: Create struct RebaseOptions	2023-11-24 14:48:06 +11:00
Anton Bulakh	5c3c0e9f6e	sign: Implement generic commit signing on the backend	2023-11-23 22:52:20 +02:00
Anton Bulakh	5ab00e197a	backend: Inline gix::Repository::commit_as to prepare for signing Additional bonus is that this allows us to avoid creating keep refs for the intermediate commits in the data race preventing loop.	2023-11-23 22:52:20 +02:00
Yuya Nishihara	042d26049c	working_copy: lazily construct file_states BTreeMap While it got faster to build a large BTreeMap<RepoPath, _>, there's still a measurable cost. Let's eliminate it if watchman is enabled and the working copy is clean. Perhaps, we should introduce new serialization format that supports instant loading and lookup, but this hack works for the moment. I'm not sure if the new tree_state format should be flat (RepoPath, _) list, or tree like the backend storage btw. In my "linux" repo (watchman enabled): % hyperfine --sort command --warmup 3 --runs 10 -L bin jj-0,jj-1 \ "target/release-with-debug/{bin} -R ~/mirrors/linux status" Benchmark 1: target/release-with-debug/jj-0 -R ~/mirrors/linux status Time (mean ± σ): 768.9 ms ± 14.2 ms [User: 630.7 ms, System: 131.2 ms] Range (min … max): 742.3 ms … 783.1 ms 10 runs Benchmark 2: target/release-with-debug/jj-1 -R ~/mirrors/linux status Time (mean ± σ): 713.0 ms ± 16.8 ms [User: 587.9 ms, System: 116.2 ms] Range (min … max): 681.5 ms … 731.1 ms 10 runs Relative speed comparison 1.08 ± 0.03 target/release-with-debug/jj-0 -R ~/mirrors/linux status 1.00 target/release-with-debug/jj-1 -R ~/mirrors/linux status	2023-11-23 18:48:14 +09:00
Yuya Nishihara	12cd657837	working_copy: extract file_states_to_proto() helper Just minimizing the changes in the next commit. As we already have file_states_from_proto(), it makes sense to extract the "to" function.	2023-11-23 18:48:14 +09:00
Yuya Nishihara	74c4ef32aa	fsmonitor: exclude .git and .jj directories from changed files This ensures that the root fsmonitor_matcher matches nothing if there are no working-copy changes. The query result can be observed by "jj debug watchman query-changed-files". I don't have expertise on watchman query language, but using the watchman API is probably better than .filter()-ing the result manually.	2023-11-23 18:48:14 +09:00
Yuya Nishihara	1ddcaa43b3	fsmonitor: don't apply prefix matching to paths obtained from watchman If I understand it, watchman returns changed files and directories, and a directory change doesn't mean we need to scan all files under the directory.	2023-11-23 10:06:00 +09:00
Yuya Nishihara	767e94f5af	fsmonitor: drop unneeded mut from make_fsmonitor_matcher() We only need &self.working_copy_path here.	2023-11-23 10:06:00 +09:00
Yuya Nishihara	c16c89bc27	fsmonitor: keep paths relative to the workspace root Since the caller wants repo-relative paths, it doesn't make sense to convert them back and forth.	2023-11-23 10:06:00 +09:00
Yuya Nishihara	a4f6e0de0b	repo_path: extract helper that converts Path to RepoPath literally	2023-11-23 10:06:00 +09:00
Yuya Nishihara	31def4b131	cleanup: don't use debug format to print source errors	2023-11-23 10:05:37 +09:00
Yuya Nishihara	16620e0e4c	merged_tree: drop legacy tree handling from ConflictsDirItem constructor No callers pass in a legacy tree.	2023-11-21 07:45:30 +09:00
Yuya Nishihara	4ad3db2e84	merged_tree: extract value() function of non-legacy trees	2023-11-21 07:45:30 +09:00
Yuya Nishihara	ca3f549c9e	merged_tree: remove redundant clone() from ConflictIterator construction	2023-11-21 07:45:30 +09:00
Martin von Zweigbergk	acc35a89a8	merged_tree: inline non-recursive entry iterator	2023-11-19 20:29:40 -10:00
Martin von Zweigbergk	426f6d0cdd	merged_tree: inline non-recursive conflict iterator The abstraction is no longer useful since we made the types not self-referential.	2023-11-19 20:29:40 -10:00
Yuya Nishihara	5186066cf5	working_copy: simply collect() proto file states into BTreeMap Suppose the input list is presorted, sorting a sorted vec would be cheaper than .insert()-ing sorted items one by one. In my "linux" repo (watchman eanbled): - jj-0: baseline - jj-1: previous (don't randomize by HashMap) - jj-2: this % hyperfine --sort command --warmup 3 --runs 10 -L bin jj-0,jj-1,jj-2 \ "target/release-with-debug/{bin} -R ~/mirrors/linux status" Benchmark 1: target/release-with-debug/jj-0 -R ~/mirrors/linux status Time (mean ± σ): 1.034 s ± 0.020 s [User: 0.881 s, System: 0.212 s] Range (min … max): 1.011 s … 1.068 s 10 runs Benchmark 2: target/release-with-debug/jj-1 -R ~/mirrors/linux status Time (mean ± σ): 849.3 ms ± 13.8 ms [User: 710.7 ms, System: 199.3 ms] Range (min … max): 821.7 ms … 870.2 ms 10 runs Benchmark 3: target/release-with-debug/jj-2 -R ~/mirrors/linux status Time (mean ± σ): 786.2 ms ± 16.7 ms [User: 650.7 ms, System: 204.1 ms] Range (min … max): 760.8 ms … 805.2 ms 10 runs Relative speed comparison 1.32 ± 0.04 target/release-with-debug/jj-0 -R ~/mirrors/linux status 1.08 ± 0.03 target/release-with-debug/jj-1 -R ~/mirrors/linux status 1.00 target/release-with-debug/jj-2 -R ~/mirrors/linux status	2023-11-20 08:29:33 +09:00
Yuya Nishihara	ee6a1e2c0a	working_copy: don't build intermediate HashMap from proto file states According to the doc, this is compatible with the map syntax. https://protobuf.dev/programming-guides/proto3/#maps This change means that the serialized file states are sorted by RepoPath, so BTreeMap<RepoPath, _> can be reconstructed with fewer cache misses. In my "linux" repo (watchman enabled): - jj-0: baseline - jj-1: this % hyperfine --sort command --warmup 3 --runs 10 -L bin jj-0,jj-1,jj-2 \ "target/release-with-debug/{bin} -R ~/mirrors/linux status" Benchmark 1: target/release-with-debug/jj-0 -R ~/mirrors/linux status Time (mean ± σ): 1.034 s ± 0.020 s [User: 0.881 s, System: 0.212 s] Range (min … max): 1.011 s … 1.068 s 10 runs Benchmark 2: target/release-with-debug/jj-1 -R ~/mirrors/linux status Time (mean ± σ): 849.3 ms ± 13.8 ms [User: 710.7 ms, System: 199.3 ms] Range (min … max): 821.7 ms … 870.2 ms 10 runs Relative speed comparison 1.32 ± 0.04 target/release-with-debug/jj-0 -R ~/mirrors/linux status 1.08 ± 0.03 target/release-with-debug/jj-1 -R ~/mirrors/linux status Cache-misses got reduced: % perf stat -e task-clock,cycles,instructions,cache-references,cache-misses \ -- ./target/release-with-debug/jj-0 -R ~/mirrors/linux --no-pager status 1,091.68 msec task-clock # 1.032 CPUs utilized 4,179,596,978 cycles # 3.829 GHz 6,166,231,489 instructions # 1.48 insn per cycle 134,032,047 cache-references # 122.776 M/sec 29,322,707 cache-misses # 21.88% of all cache refs 1.057474164 seconds time elapsed 0.897042000 seconds user 0.194819000 seconds sys % perf stat -e task-clock,cycles,instructions,cache-references,cache-misses \ -- ./target/release-with-debug/jj-1 -R ~/mirrors/linux --no-pager status 927.05 msec task-clock # 1.083 CPUs utilized 3,451,299,198 cycles # 3.723 GHz 6,222,418,272 instructions # 1.80 insn per cycle 98,499,363 cache-references # 106.251 M/sec 11,998,523 cache-misses # 12.18% of all cache refs 0.855938336 seconds time elapsed 0.720568000 seconds user 0.207924000 seconds sys	2023-11-20 08:29:33 +09:00
Yuya Nishihara	56047cb7ec	working_copy: don't pass all proto data to from_proto() functions Just a code cleanup. This allows us to consume proto fields if needed. I also removed redundant .clone() and .as_str().	2023-11-20 08:29:33 +09:00
Yuya Nishihara	d7e63837f4	index: use extend_wanted/unwanted() to initialize RevWalk	2023-11-17 21:38:56 +09:00
Yuya Nishihara	a9f3dd95e5	index: remove index field from RevWalkQueue, move it to caller	2023-11-17 21:38:56 +09:00
Yuya Nishihara	ff1bb3133e	index: move index-dependent operations out of RevWalkQueue RevWalkQueue no longer depend on the RevWalkIndex trait, and the index field can be removed.	2023-11-17 21:38:56 +09:00
Yuya Nishihara	32a7b33e56	index: optimize RevWalk to store IndexPosition instead of IndexEntry The idea is the same as the heads_pos() change in `9832ee205d`. While IndexEntry::position() should be cheap, saving 20 bytes per entry appears to improve the performance in mid-size repos. In my "linux" repo: revsets/all() ------------- baseline 1.24 156.0±1.06ms this 1.00 126.0±0.51ms I don't see significant difference in small-sized repos like "jj" or "git". IndexEntryByPosition isn't removed since it's still used by the revset engine.	2023-11-17 21:38:56 +09:00
Martin von Zweigbergk	9be24db051	tree: make TreeEntriesDirItem not self-referential This removes the last use of `ouroboros`. Since `TreeEntriesDirItem` is only used in "legacy trees" (before tree-level conflicts), I didn't bother to check the performance impact. I also didn't bother to check the matcher before adding the entries to the list, instead leaving that where it is in `Iterator::next()`.	2023-11-17 03:50:34 -08:00
Martin von Zweigbergk	c0295c5dbc	merged_tree: make ConflictsDirItem not self-referential This removes the last use of `ouroboros` in `merged_tree.rs`. The set of conflicts to iterate is usually so small that I didn't bother checking the performance impact.	2023-11-17 03:50:34 -08:00
Martin von Zweigbergk	e1a02c5c5b	merged_tree: make TreeDiffDirItem not self-referential This removes another dependency on `ouroboros`, for a small performance hit: ``` ❯ hyperfine --warmup 3 --runs 30 \ '/tmp/jj-before --ignore-working-copy diff -s --from v5.0 --to v6.0' \ '/tmp/jj-after --ignore-working-copy diff -s --from v5.0 --to v6.0' Benchmark 1: /tmp/jj-before --ignore-working-copy diff -s --from v5.0 --to v6.0 Time (mean ± σ): 689.7 ms ± 23.9 ms [User: 400.0 ms, System: 289.8 ms] Range (min … max): 666.9 ms … 759.2 ms 30 runs Benchmark 2: /tmp/jj-after --ignore-working-copy diff -s --from v5.0 --to v6.0 Time (mean ± σ): 710.9 ms ± 19.2 ms [User: 420.4 ms, System: 290.6 ms] Range (min … max): 688.5 ms … 752.0 ms 30 runs Summary '/tmp/jj-before --ignore-working-copy diff -s --from v5.0 --to v6.0' ran 1.03 ± 0.05 times faster than '/tmp/jj-after --ignore-working-copy diff -s --from v5.0 --to v6.0' ```	2023-11-17 03:50:34 -08:00
Martin von Zweigbergk	61d87fe296	merged_tree: make `TreeEntriesIterator` not self-referential While importing the `ouroboros` crate and the `aliasable` crate it depends on, the "unsafe Rust reviewer" expressed some concern that they contain a lot of unsafe code that's hard to review. We can avoid the unsafe code altogether by making `TreeEntriesIterator` not self-refential. Instead, we can collect the matching entries in an individual tree up front. It does have some performance cost: ``` ❯ hyperfine --warmup 3 --runs 30 \ '/tmp/jj-before --ignore-working-copy files -r v6.0' \ '/tmp/jj-after --ignore-working-copy files -r v6.0' Benchmark 1: /tmp/jj-before --ignore-working-copy files -r v6.0 Time (mean ± σ): 461.4 ms ± 14.3 ms [User: 232.1 ms, System: 229.4 ms] Range (min … max): 443.4 ms … 496.3 ms 30 runs Benchmark 2: /tmp/jj-after --ignore-working-copy files -r v6.0 Time (mean ± σ): 482.0 ms ± 14.3 ms [User: 257.2 ms, System: 224.9 ms] Range (min … max): 461.8 ms … 513.3 ms 30 runs Summary '/tmp/jj-before --ignore-working-copy files -r v6.0' ran 1.04 ± 0.04 times faster than '/tmp/jj-after --ignore-working-copy files -r v6.0' ``` I think that's acceptable.	2023-11-17 03:50:34 -08:00
Yuya Nishihara	93fbcec2f7	index: use BinaryHeap instead of BTreeSet in common_ancestors_pos() For the same reason as the heads_pos() change. We just want to omit duplicated items.	2023-11-16 08:27:59 +09:00
Yuya Nishihara	d4059520a9	index: cache generation numbers during common_ancestors_pos() computation I'm not sure if this is better, but common_ancestors_pos() would have a similar property to heads_pos().	2023-11-16 08:27:59 +09:00
Yuya Nishihara	ea4bdd718d	index: use "while let" in common_ancestors_pos()	2023-11-16 08:27:59 +09:00
Yuya Nishihara	02c84a8596	index: remove stale "allow(unstable_name_collisions)" I think this is remainder of nightly shims.	2023-11-16 08:27:59 +09:00
Yuya Nishihara	6399c392fd	index: make heads_pos() deduplicate entries without building separate set This is much faster (maybe because of better cache locality?) Another option is to use BTreeSet, but the BinaryHeap version is slightly faster. "bench revset" result in my linux repo: revsets/heads(tags()) --------------------- baseline 3.28 560.6±4.01ms 1 2.92 500.0±2.99ms 2 1.98 339.6±1.64ms 3 (this) 1.00 171.2±0.30ms	2023-11-16 08:27:59 +09:00
Yuya Nishihara	9832ee205d	index: optimize heads_pos() to cache generation numbers during computation Apparently, IndexEntry::generation_number() isn't cheap probably because it involves random access to larger memory region, and the u32 value might not be aligned. Let's instead store the generation numbers in BinaryHeap. Also, heads_pos() becomes slightly faster by keeping the BinaryHeap entries small, so I've removed the IndexEntry at all. This makes the default log and disambiguation revsets fast, which evaluate 'heads(immutable_heads())'. "bench revset" result in my linux repo: revsets/heads(tags()) --------------------- baseline 3.28 560.6±4.01ms 1 2.92 500.0±2.99ms 2 (this) 1.98 339.6±1.64ms	2023-11-16 08:27:59 +09:00
Yuya Nishihara	1e933b84dd	index: make IndexEntry::parents() lazy instead of collecting to Vec All callers just iterate over the parent entries. "bench revset" result in my linux repo: revsets/heads(tags()) --------------------- baseline 3.28 560.6±4.01ms 1 (this) 2.92 500.0±2.99ms	2023-11-16 08:27:59 +09:00
Yuya Nishihara	39b065f7ab	git: on import_refs(), exclude uninteresting dirs such as refs/jj/keep For loose refs, uninteresting directories can be just skipped. For packed refs, gix will have to do binary search for each prefix to find the starting point. Still it's better overall if the repository contains tons of refs/jj/keep refs. With my linux repo containing ~5k loose jj refs, this saves ~40ms: % hyperfine --warmup 3 --runs 10 \ "/tmp/jj-gix --ignore-working-copy git import -R ~/mirrors/linux" \ "/tmp/jj-gix-iter --ignore-working-copy git import -R ~/mirrors/linux" Benchmark 1: /tmp/jj-gix --ignore-working-copy git import -R ~/mirrors/linux Time (mean ± σ): 151.6 ms ± 11.4 ms [User: 38.8 ms, System: 111.6 ms] Range (min … max): 129.8 ms … 159.5 ms 10 runs Benchmark 2: /tmp/jj-gix-iter --ignore-working-copy git import -R ~/mirrors/linux Time (mean ± σ): 109.9 ms ± 11.6 ms [User: 27.5 ms, System: 82.4 ms] Range (min … max): 89.4 ms … 117.8 ms 10 runs	2023-11-14 17:35:27 +09:00
Yuya Nishihara	044716ee40	git: migrate import_refs() to gix::Repository Gitoxide errors are boxed since there are various error types and they tend to exceed the clippy size limit. Apparently, gitoxide is faster than git2: % hyperfine --warmup 3 --runs 10 \ "/tmp/jj-baseline --ignore-working-copy git import -R ~/mirrors/linux" \ "/tmp/jj-gix --ignore-working-copy git import -R ~/mirrors/linux" Benchmark 1: /tmp/jj-baseline --ignore-working-copy git import -R ~/mirrors/linux Time (mean ± σ): 205.4 ms ± 15.7 ms [User: 59.6 ms, System: 144.6 ms] Range (min … max): 189.7 ms … 223.9 ms 10 runs Benchmark 2: /tmp/jj-gix --ignore-working-copy git import -R ~/mirrors/linux Time (mean ± σ): 176.2 ms ± 13.7 ms [User: 41.2 ms, System: 134.0 ms] Range (min … max): 155.4 ms … 186.5 ms 10 runs	2023-11-14 17:35:27 +09:00
Yuya Nishihara	6c98dfcdcb	git: have import_refs() obtain git2::Repository instance from store This helps gitoxide migration. It's theoretically possible to import Git refs from non-Git backend, but I don't think such API flexibility is needed.	2023-11-14 17:35:27 +09:00
Yuya Nishihara	dbb1adaf0a	git: move import-related types close to import_refs() function	2023-11-14 17:35:27 +09:00
Yuya Nishihara	f991705e47	tests: add test for importing missing ancestor of HEAD If a commit pointed to by HEAD or ref is missing, the ref is considered invalid and excluded by import_refs(). The current test behavior appears to depend on some in-memory cache of git2::Repository.	2023-11-14 17:35:27 +09:00
Yuya Nishihara	8e143541a5	operation: propagate OpStoreError from parents() We need to .collect_vec() the parents iterator to temporary buffer since the borrowed iterator can't be returned back to the dag_walk functions. Another option is to clone op_store and parent ids to remove &self lifetime from the iterator, but that also means a temporary Vec is created.	2023-11-14 07:16:39 +09:00
Yuya Nishihara	8ddad859e8	dag_walk: add fallible topo_order_reverse_lazy() Unlike dfs_ok(), this function short-circuits at an Err as we use non-lazy topo_order_forward() internally. I think that's good enough. If we implement GC on operation log, deleted parents will be excluded (or mapped to tombstone) by caller. An Err shouldn't mean it's GC-ed.	2023-11-14 07:16:39 +09:00
Yuya Nishihara	3d5a07e86a	dag_walk: add fallible dfs(), topo_order(), heads(), and closest_common_node() This unblocks the use of Result<T, E> in op.parents(). There are two ways to encode errors: a. impl IntoIterator<Item = Result<T, E>> b. Result<V, E> where V: FromIterator<Item = T> I think (a) is more natural to algorithms like dfs(), which can process error nodes transparently. Still the caller might have to collect the source iterator to temporary Vec to conform to the neighbors_fn signature. It's not easy for neighbors_fn to return an iterator borrowing the input node. We already have GAT, but doesn't have return-position impl Trait in trait yet.	2023-11-14 07:16:39 +09:00
Yuya Nishihara	e5a9a26911	dag_walk: remove unused and untested leaves() function	2023-11-14 07:16:39 +09:00
Anton Bulakh	e3a1e5b80e	sign: Implement storage for digital commit signatures Recognize signature metadata from git commit objects, implement a basic version of that for the native backend. Extract the signed data (a commit binary repr without the signature) to be verified later.	2023-11-12 03:37:13 +02:00
Yuya Nishihara	b42a69db6d	git_backend: configure committer (and author) of gix::Repository Otherwise, ref updates would fail if we port git::export_refs() to gitoxide. This change isn't strictly needed for the backend itself, but we'll reuse the gix::Repository instance created by the backend when importing and exporting Git refs.	2023-11-11 22:35:54 +09:00
Yuya Nishihara	ea32c0cb9e	git_backend: pass UserSettings to GitBackend constructors	2023-11-11 22:35:54 +09:00
Yuya Nishihara	8a2048a0e5	repo: pass UserSettings to store factories and initializers GitBackend will use it to configure gix::Repository. I think UserSettings is generally useful to pass store-specific parameters, so I've updated all factory functions.	2023-11-11 22:35:54 +09:00
Yuya Nishihara	6125fb160e	op_store: embed details in operation/view not found error This is basically a copy of BackendError::ObjectNotFound. The failed id may be either view or operation id.	2023-11-11 22:35:40 +09:00
Yuya Nishihara	ea96513fd1	op_store: deduplicate functions that map io::Error to OpStoreError io_to_read_error() also translates ErrorKind::NotFound.	2023-11-11 22:35:40 +09:00
Martin von Zweigbergk	120115a20d	cli: pass `MaterializedTreeValue` into `git_diff_part()` Just a little preparation for reading the materialized values concurrently.	2023-11-10 04:54:47 -08:00
Waleed Khan	a60733f632	tree: remove unsafe with `ouroboros` for self-referential iterators	2023-11-09 21:50:29 -08:00
Yuya Nishihara	6ff3a4f3df	repo: reimplement DirtyCell without using unsafe While the safe implementation is a bit more complex (and probably more branchy), I don't think the runtime overhead would matter here. Let's remove one more unsafe for better code maintainability.	2023-11-10 07:42:45 +09:00
Martin von Zweigbergk	9b24d24612	conflicts: add another helper for materializing a tree value We have a few places where we have a `MergedTreeValue` and need to read the data associated with it so we can write to the working copy or include it in a diff. Let's extract some of that shared logic to a function so we can reuse it. I plan to use it for reading file contents in advance while streaming a diff in `local_working_copy` soon (and probably in `jj diff` thereafter), but I think it seems like an improvement on its own.	2023-11-08 21:21:38 -08:00
Martin von Zweigbergk	65bd5cacba	working copy: on checkout, move read from store out of `write_()` functions I'd like to read N files ahead from the backend, to avoid serializing too many server calls on backends that are backed by a server. Moving the reads a little earlier is a little step towards that. The `TreeState::write_()` functions can now be made into free/static functions if we prefer.	2023-11-08 21:21:38 -08:00
Yuya Nishihara	084b99e1e2	index: rewrite CompositeIndex::entry_by_pos() by leveraging ancestors iterator We no longer have "unsafe" in this function, so let's use the iterator API instead of recursion. Apparently I haven't pushed this change before because unsafe in .find_map() looked scary.	2023-11-08 12:09:33 +09:00
Anton Bulakh	d27351b978	misc: drop a few low-hanging unsafes Remove a couple of unnecessary unsafes: - The NonZeroUsize is a constant where the unwrap will optimize away anyway and we don't have an unsafe without any good reason there :) - The other two were simply not needed, lifetimes worked fine, maybe Rust became better since that code was written? NLL? Anyway, they're gone now	2023-11-08 02:16:08 +02:00
Yuya Nishihara	2ac9865ce7	revset: exclude @git branches from remote_branches() As discussed in Discord, it's less useful if remote_branches() included Git-tracking branches. Users wouldn't consider the backing Git repo as a remote. We could allow explicit 'remote_branches(remote=exact:"git")' query by changing the default remote pattern to something like 'remote=~exact:"git"'. I don't know which will be better overall, but we don't have support for negative patterns anyway.	2023-11-08 07:34:30 +09:00
Yuya Nishihara	59640496aa	cargo: sort dependencies list alphabetically	2023-11-07 23:46:05 +09:00
Yuya Nishihara	d1b0c4cc48	merge: relax input type of Merge::from_removes_adds()	2023-11-07 17:10:12 +09:00
Yuya Nishihara	e0c35684af	merge: rename Merge::new() to Merge::from_removes_adds() Since (removes, adds) pair is no longer the canonical representation of Merge, the name Merge::new() seems too generic. Let's give more verbose name.	2023-11-07 17:10:12 +09:00
Yuya Nishihara	2c128f1b61	merged_tree: convert from legacy conflicts through interleaved list This is basically the same change as the previous commit.	2023-11-07 17:10:12 +09:00
Yuya Nishihara	a734f46130	merged_tree: build unresolved Merge<Tree> from interleaved list We no longer need to iterate removes and adds separately.	2023-11-07 17:10:12 +09:00
Yuya Nishihara	dd26b7be40	merge: add Merge constructor that accepts interleaved values Also migrated some callers of 3-way merge, where [left, base, right] order looks okay.	2023-11-07 17:10:12 +09:00
Yuya Nishihara	803b41c426	merge: load legacy Merge values without allocating intermediate buffers	2023-11-07 17:10:12 +09:00
Yuya Nishihara	09987c1d27	merge: micro-optimize allocation of Merge object for resolved value It's super common that a Merge object holds a resolved value, so let's inline up to 1 element. T of Merge<T> usually consists of a couple of pointer-sized fields. I don't see any measurable speed up, but it's no worse than the original.	2023-11-07 17:10:12 +09:00
Martin von Zweigbergk	1140295829	merged_tree: extract polling of tree futures into a function	2023-11-07 00:03:50 -08:00
Martin von Zweigbergk	c77417d4e4	merged_tree: drop outer loop in `TreeDiffStreamImpl::poll_next()` As suggested by Yuya. I also added a comment and an assertion in the case where return `Poll::Pending`.	2023-11-07 00:03:50 -08:00
Martin von Zweigbergk	d989d4093d	merged_tree: let backend influence whether to use new diff algo Since the concurrent diff algorithm is significantly slower when using the Git backend, I think we'll have to use switch between the two algorithms depending on backend. Even if the concurrent version always performed as well as the sequential version, exactly how concurrent it should be probably still depends on the backend. This commit therefore adds a function to the `Backend` trait, so each backend can say how much concurrency they deal well with. I then use that number for choosing between the sequential and concurrent versions in `MergedTree::diff_stream()`, and also to decide the number of concurrent reads to do in the concurrent version.	2023-11-06 23:12:02 -08:00
Martin von Zweigbergk	f40adb84fc	merged_tree: add a `Stream` for concurrent diff off trees When diffing two trees, we currently start at the root and diff those trees. Then we diff each subtree, one at a time, recursively. When using a commit backend that uses remote storage, like our backend at Google does, diffing the subtrees one at a time gets very slow. We should be able to diff subtrees concurrently. That way, the number of roundtrips to a server becomes determined by the depth of the deepest difference instead of by the number of differing trees (times 2, even). This patch implements such an algorithm behind a `Stream` interface. It's not hooked in to `MergedTree::diff_stream()` yet; that will happen in the next commit. I timed the new implementation by updating `jj diff -s` to use the new diff stream and then ran it on the Linux repo with `jj diff --ignore-working-copy -s --from v5.0 --to v6.0`. That slowed down by ~20%, from ~750 ms to ~900 ms. Maybe we can get some of that performance back but I think it'll be hard to match `MergedTree::diff()`. We can decide later if we're okay with the difference (after hopefully reducing the gap a bit) or if we want to keep both implementations. I also timed the new implementation on our cloud-based repo at Google. As expected, it made some diffs much faster (I'm not sure if I'm allowed to share figures).	2023-11-06 23:12:02 -08:00
Martin von Zweigbergk	9af09ec236	test_meregd_tree: test diffing with a matcher We didn't have any tests at all for `MergedTree::diff()` with a matcher other than `EverythingMatcher`. This patch adds a few.	2023-11-06 23:12:02 -08:00
Martin von Zweigbergk	16aa8e8f10	test_merged_tree: nest each part of `test_diff_dir_file()` I'm about to add a few more checks for diffing with a matcher. I think it will help make it readable and reduce the risk of mixing up variables between each part of the test if we use some nested blocks. I also removed some unnecessary `.clone()` calls while at it.	2023-11-06 23:12:02 -08:00
Martin von Zweigbergk	c9ce80a82a	merged_tree: extract function for merged iterator of basenames in diff I'm going to reuse this for stream/async diffing.	2023-11-06 23:12:02 -08:00
Martin von Zweigbergk	b72f04ba61	merged_tree: rename `all_tree_conflict_names()` since it's not about conflicts	2023-11-06 23:12:02 -08:00
Yuya Nishihara	3fddc31da8	merge: remove Merge::take() which is no longer used Merge::take() is no longer a cheap function. We can add into_vec() if needed.	2023-11-07 06:52:35 +09:00
Yuya Nishihara	92dfe59ade	refs: run non-trivial merge of ref targets without destructuring Merge object	2023-11-07 06:52:35 +09:00
Yuya Nishihara	93601541cb	refs: use swap_remove() in non-trivial merge of ref targets I'm going to add a Merge method that removes negative/positive terms pair, and swap_remove() is the easiest option. The order of the conflicted ref targets doesn't matter.	2023-11-07 06:52:35 +09:00

... 3 4 5 6 7 ...

2667 commits