mirrors/jj

mirror of https://github.com/martinvonz/jj.git synced 2025-01-03 18:24:19 +00:00

Author	SHA1	Message	Date
Martin von Zweigbergk	502150b2f4	conflicts: test materialization with negative snapshots We didn't have any tests with negative snapshots (after a `-------` line). I initially thought we couldn't produce such conflict markers anymore. I'm not sure we want to render conflicts like the one in the test like this. I don't think I intended for `add_index` in the code to be able to be two steps ahead of the remove. Maybe we should rewrite the algorithm to not do that and thus never produce negative snapshots.	2024-01-23 07:18:54 -08:00
Ilya Grigoriev	d168fd2b09	`test_rebase_abandoning_empty`: add children of an empty `@` to the test case This demonstrates the minor bug discussed in https://github.com/martinvonz/jj/pull/2766#discussion_r1442365389 AKA https://github.com/martinvonz/jj/issues/2869. It's also interesting whether changing the definition of "discardable" commit would affect this test, see https://github.com/martinvonz/jj/issues/2859#issuecomment-1903275884 (I think it won't, but still)	2024-01-22 18:36:49 -08:00
Jonathan Tan	0bc1341fd0	revset: add count_estimate() to Revset trait The count() function in this trait is used by "jj branch" to determine (and then report) how many commits a certain branch is ahead/behind another branch. This is currently implemented by walking all commits in the revset, counting how many were encountered. But this could be improved: if the number is large, it is probably sufficient to report "at least N" (instead of walking all the way), and this does not scale well to jj backends that may not have all commits present locally (which may prefer to return an estimate, rather than access the network). Therefore, add a function that is explicitly documented to be O(1) and that can return a range of values if the backend so chooses. Also remove count(), as it is not immediately obvious that it is an expensive call, and callers that are willing to pay the cost can obtain the exact same functionality through iter().count() anyway. (In this commit, all users of count() are migrated to iter().count() to preserve all existing functionality; they will be migrated to count_estimate() in a subsequent commit.) "branch" needed to be updated due to this change. Although jj is currently only available in English, I have attempted to keep user-visible text from being assembled piece by piece, so that if we later decide to translate jj into other languages, things will be easier for translators.	2024-01-22 15:07:00 -08:00
Yuya Nishihara	c7be4d019c	index: add all_heads_for_gc() that iterates heads of all indexed commits GitBackend::gc() will recreate no-gc refs for the indexed heads. We could collect all historical heads by traversing operation log, but it isn't enough because there may be predecessor links to hidden commits, and "git gc" isn't aware of predecessors.	2024-01-17 23:07:14 +09:00
Yuya Nishihara	afa72ff496	git_backend: inline prevent_gc() to bulk-update refs	2024-01-17 10:43:25 +09:00
Yuya Nishihara	96ee9bdb9f	git_backend: ensure no-gc refs are created for all imported head commits This also means that we can implement GC without taking care of extra metadata. I haven't tried, but it wouldn't be easy to keep Git refs and extra table in sync.	2024-01-17 10:43:25 +09:00
Yuya Nishihara	2e1aa6c49c	git_backend: remove fast path testing imported commits, filter them by caller The idea is that GC, if implemented, will clean up objects based on the Index knowledge. It's probably okay to leave some extra metadata of unreachable objects, but GC-ed refs should be recreated if the corresponding heads get reimported. See also the next patch.	2024-01-17 10:43:25 +09:00
Yuya Nishihara	48c4985e34	git_backend: ensure that no-gc ref target never conflicts	2024-01-17 10:43:25 +09:00
Yuya Nishihara	f66c859fe4	git_backend: use lower-level API to create no-gc refs This will allow us to issue multiple prevent_gc() requests all at once. It's not important here, but will be unavoidable when implementing GC. Deleting tons of refs from packed refs is super slow if the requests were processed one by one.	2024-01-17 10:43:25 +09:00
Yuya Nishihara	34956f17e5	op_walk: assert that virtual root op is not reparented This is enforced by the caller, but it's scary if it weren't.	2024-01-16 21:46:54 +09:00
Yuya Nishihara	fb3e006a45	op_store: add special case for root id resolution	2024-01-16 21:46:54 +09:00
Yuya Nishihara	660806ffed	tests: set up unparented operations for id prefix tests Otherwise we can't easily pick i to create operation id starting with "0".	2024-01-16 21:46:54 +09:00
Yuya Nishihara	df1be14aa8	tests: split op id resolution tests, don't require merged op for prefix tests This makes it easy to set up crafted environment for prefix resolution tests.	2024-01-16 21:46:54 +09:00
Essien Ita Essien	dc074363d1	no-op: Move external git repo canonicalization into Workspace::init_git_external * Move canonicalization of the external git repo path into the Workspace::init_git_external(). This keeps necessary code together. * Add a new variant of WorkspaceInitError for reporting path not found errors. The user error string is written to pass existing tests.	2024-01-16 10:46:02 +00:00
Yuya Nishihara	da218d19db	repo: optimize enforce_view_invariants() to not traverse ancestors until root Because the default index cuts off the traversal at min(generations), including the root id means all ancestors will be visited. This could be worked around at the index side, but I think it's the repo/view's responsibility. That being said, it's not uncommon to pad a revset with "root()", so it might make sense for the index to special case the root id. I also removed the redundant .clone().	2024-01-15 09:57:02 +09:00
Martin von Zweigbergk	6e302bb3a2	op_store: add a virtual root operation, similar to root commit It seems obvious in hindsight to have a virtual root operation just like we have a virtual root commit. It removes the same kind of problems by making sure there's always a common ancestor (or multiple) between any two commits. I think the reason I didn't add a root operation from the beginning was that there used to be a mandatory working-copy commit in the view (this was before support for multiple workspaces). Perhaps we should remove the "initialize repo" operation now. The only difference between their view objects is that the "initialize repo" operation adds the root commit as a head. We could add that to the root operation, but then the root operation's value depends on the commit backend.	2024-01-14 10:15:14 -08:00
Martin von Zweigbergk	c9af8bf43a	view: drop tracking of public heads We've had the public_heads for as long as we've had the View object, IIRC (I didn't check), but we still don't use it for anything. I don't have any concrete plans for using it either. Maybe our config for immutable commits is good enough, or maybe we'll want something more generic (like Mercurial's phases). For now, I think we should simplify by removing the storage for public heads.	2024-01-13 22:23:57 -08:00
Martin von Zweigbergk	a66e2a0a6d	working_copy: mark commit_id field in proto reserved By marking it reserved, we prevent accidental use. We can still read working copy protos that have the field.	2024-01-12 17:38:23 -08:00
Yuya Nishihara	543036c753	cli: run "op log" without loading repo or merging concurrent ops When debugging behavior of badly-GCed repos, I find it's annoying that "op log" fails because the index can't be loaded. Since "op log" doesn't need a repo, I think it's better to display the exact op-heads state without merging.	2024-01-13 10:38:10 +09:00
Yuya Nishihara	831a530283	op_walk: make walk_ancestors() sort head ops to stabilize output I thought this would be done by dag_walk::topo_order_reverse_lazy_ok(), but apparently I made it preserve the input order in a way topo_order_reverse() would do.	2024-01-13 10:38:10 +09:00
Yuya Nishihara	b7eb551cf7	index: fix reindexing to scan all referenced commits such as hidden remote refs Since hidden commits can be looked up by remote_branches() revset for example, reindexing should traverse ancestors from all named refs in addition to the visible heads.	2024-01-12 12:53:16 +09:00
Yuya Nishihara	805046ceba	op_walk: extract function that resolves op expression with preloaded head op I'm going to make "op abandon" not load the repo, and this function will be used there instead of resolve_op_with_repo().	2024-01-12 08:01:13 +09:00
Yuya Nishihara	83ede241e3	op_walk: don't resolve heads beyond @ operation Since `jj undo --at-op=OP @` resolves @ to OP, I think OP should be the head in that context, and the descendants of OP shouldn't be accessible by @+.	2024-01-12 08:01:13 +09:00
Yuya Nishihara	ba42b37a67	operation: remove operation::View wrapper in favor of view::View view::View doesn't track ViewId, but there are no callers of cheap Eq/Hash functions.	2024-01-12 08:01:02 +09:00
Yuya Nishihara	d5a98df046	git_backend: teach "format.tree-level-conflicts" config by constructor Since GitBackend constructors now depend on &UserSettings, it makes sense to initialize the formatting options there.	2024-01-10 08:57:51 +09:00
Yuya Nishihara	e5286aed08	index: move lifetimed change_id_index() to MutableIndex, rename 'static version change_id_index() is only used by Readonly/MutableRepo, so we don't need an abstraction at Index. evaluate_revset() is somewhat similar, but the callers rely on &dyn Repo.	2024-01-09 10:38:00 +09:00
Yuya Nishihara	dc68f1eeb2	revset: remove unused lifetime parameter from Revset<'index>	2024-01-09 10:37:43 +09:00
Yuya Nishihara	e9d31177cb	op_store: implement GC of unreachable operations and views Since new operations and views may be added concurrently by another process, there's a risk of data corruption. The keep_newer parameter is a mitigation for this problem. It's set to preserve files modified within the last 2 weeks, which is the default of "git gc". Still, a concurrent process may replace an existing view which is about to be deleted by the gc process, and the view file would be lost. #12	2024-01-09 10:37:03 +09:00
Yuya Nishihara	5894f3dfba	operation: add shorthand for &store_operation().view_id	2024-01-09 10:37:03 +09:00
Martin von Zweigbergk	c98b0d76af	index: move Revset::change_id_index() to Index We currently have `Revset::change_id_index()` for creating a `ChangeIdIndex` for a given revset. I think it will be hard to make it performant for general revsets, especially in very large repos and with custom index implementations, like the one we have at Google. If we instead restrict it to including all ancestors of a set of heads, I think it will be much easier to implement. We only use `Revset::change_id_index()` with revsets including all visible commits today, so we won't lose any current functionality by making it more restricted.	2024-01-08 06:06:47 -08:00
Martin von Zweigbergk	2f4594540a	tests: move ChangeIdIndex test from test_revset to test_index	2024-01-08 06:06:47 -08:00
Martin von Zweigbergk	1508f28567	tests: update ChangeIdIndex test to include ancestors in set I plan to replace `Revset::change_id_index()` by `Index::change_id_index(heads)`, but one of the tests currently uses a set of commits that does not include ancestors. This patch updates it to include ancestors (and changes the set of heads to keep the set small enough for the test).	2024-01-08 06:06:47 -08:00
Martin von Zweigbergk	f9dc00704d	index: specialize evaluate_revset_static() to change_id_index_static() I'd like to move `change_id_index()` from `Revset` to `Index` (and make it take the set of visible heads as argument). We currently use `evaluate_revset_static()` only to get a `ChangeIdIndex`, so a good place to start is to convert that into `change_id_index_static()`.	2024-01-08 06:06:47 -08:00
Martin von Zweigbergk	b549090acc	index: adopt ChangeIdIndex and relatives from revset module The `ChangeIdIndex` type is currently defined in the `revset` module because that's the only place it's used. However, I'd like to start using it directly from `index`. The idea is to make it possible to create a `ChangeIdIndex` given a set of heads, without first creating a `Revset`.	2024-01-08 06:06:47 -08:00
Martin von Zweigbergk	f0182ad4b8	default_index: adopt revset engine and graph iterator modules The revset engine and the graph iterator are specific to the default index implementation, so they belong in the same module.	2024-01-07 05:37:47 -08:00
Yuya Nishihara	a6616e9cea	object_id: don't allow ObjectId::from_hex() a dynamically allocated string This isn't technically needed, but it prevents API misuse. Another option is to do some compile-time substitution, but most callers are tests and the runtime performance wouldn't matter.	2024-01-06 00:26:36 +09:00
Yuya Nishihara	837ac15052	op_store: add resolve_operation_id_prefix() trait method that uses readdir() The OpStore backends should have a better way to look up operation by id than traversing from the op heads. The added method is similar to the commit Index one, but returns an OpStoreResult because the backend operation can fail. FWIW, if we want .shortest() in the op log template, we'll probably need a trait method that returns an OpIndex instead.	2024-01-05 23:36:57 +09:00
Yuya Nishihara	95ea352b0a	object_id: add fallible version of ObjectId::from_hex()	2024-01-05 23:36:57 +09:00
Yuya Nishihara	95d83cbfe5	object_id: make ObjectId constructors non-trait methods I'm going to add try_from_hex(), which requires Self: Sized. Such trait bound could be added, but I don't think we'll need abstracted ObjectId constructors at all.	2024-01-05 23:36:57 +09:00
Yuya Nishihara	31b236a70d	object_id: move HexPrefix and PrefixResolution from index module	2024-01-05 10:20:57 +09:00
Yuya Nishihara	fa5e40719c	object_id: extract ObjectId trait and macros to separate module I'm going to add a prefix resolution method to OpStore, but OpStore is unrelated to the index. I think ObjectId, HexPrefix, and PrefixResolution can be extracted to this module.	2024-01-05 10:20:57 +09:00
Yuya Nishihara	dbaee198e6	hex_util: move common_hex_len() from backend module This function predates the hex_util module. If there were hex_util, I would add it there.	2024-01-05 10:20:57 +09:00
Yuya Nishihara	e5255135bb	op_walk: add function that reparents (and abandons) operation range This will be used in "jj op abandon ..op_id" command. The "op_id..@" range will be reparented onto the root operation. The current implementation is good enough for local repos, but it won't scale. We might want to extract it as a trait method or introduce OpIndex for efficient DAG operation.	2024-01-04 11:44:36 +09:00
Yuya Nishihara	392e83be42	op_heads: ensure that update_op_heads([id], id) fails The doc states it's invalid, but I made such a bug.	2024-01-04 11:44:36 +09:00
Matt Stark	3f0a49dafe	Ensure you never drop the working commit with --skip-empty See #2766 for discussions	2024-01-04 13:33:24 +11:00
Matt Stark	a4aed2391f	Rewrite instead of abandoning empty commits. Fixes #2760 Given the tree: ``` A-B-C \ B2 ``` And the command `jj rebase -s B -d B2` We were previously marking B as abandoned, despite the comment stating that we were marking it as being succeeded by B2. This resulted in a call to `rewrite(rewrites={}, abandoned={B})` instead of `rewrite(rewrites={B=>B2}, abandoned={})`, which then made the new parent of `C` into `A` instead of `B2`	2024-01-04 13:33:24 +11:00
Ilya Grigoriev	6edaa97517	DescendantRebaser: change `rebased()` method to `into_map()` that consumes the rebaser This prevents a clone and does not affect the public API, as suggested in https://github.com/martinvonz/jj/pull/2738#discussion_r1438903463.	2024-01-01 21:55:18 -08:00
Ilya Grigoriev	ddec3f91b2	lib: mild refactoring made possible by previous commit Inline `create_descendant_commits`, move some functionality of `DescendantRebaser::rebase_next` to `rebase_all`, a seemingly more logical location.	2024-01-01 18:51:36 -08:00
Ilya Grigoriev	277b81ff6f	lib: make `DescendantRebaser`-related APIs private. Finally, there are no test uses of these APIs. `DescendantRebaser` is made `pub(crate)`, since it is used by `MutRepo`. Other functions are made private.	2024-01-01 18:51:36 -08:00
Ilya Grigoriev	45cd0bf11b	test_rewrite.rs: stop using DescendantRebaser when testing EmptyBehavior This completes the process of removing DescendantRebaser-related APIs from tests. It requires creating some new test utils and a new `rebase_descendants_with_option_return_map`.	2024-01-01 18:51:36 -08:00
Ilya Grigoriev	7cef879ef6	lib `repo.rs` & `rewrite.rs`: Move clearing of rewritten/abandoned commits This commit is a little out of place in this sequence, but it seems to make more sense for MutRepo to own these maps. @yuja [pointed out] that any tests written using `create_descendant_rebaser` now need to do this cleanup, but there are no longer any such tests after the previous commits and a follow-up commit removes `create_descendant_rebaser` entirely. [pointed out]: https://github.com/martinvonz/jj/pull/2737#discussion_r1435754370	2024-01-01 18:51:36 -08:00
Ilya Grigoriev	4461d61254	test_rewrite: test branches of descendants of divergent commits A TODO left over from a previous PR	2024-01-01 18:51:36 -08:00
Ilya Grigoriev	b2abba07e9	tests: (mostly) stop using soon-to-be-private DescendantRebaser-related APIs This removes uses of `DescendantRebaser::new` or `MutRepo::create_descendant_rebaser` from most tests. The exceptions are the tests having to do with abandoning empty commits on rebase, since adjusting those is a bit more elaborate (see follow-up commits).	2024-01-01 18:51:36 -08:00
Yuya Nishihara	3eafca65ea	op_walk: add support for op_id+ (children) operator A possible use case is when doing some archaeology around a certain operation. The current implementation is quadratic if + is repeated. Suppose op_id is usually close to the current op heads, I think it'll practically work better than building a reverse lookup table.	2024-01-02 10:30:08 +09:00
Yuya Nishihara	ab299a6af5	op_walk: reimplement prefix lookup by using walk_ancestors() and HexPrefix Perhaps, OpStore should provide prefix resolution method, but let's think that later.	2024-01-02 10:30:08 +09:00
Yuya Nishihara	c53748d732	op_walk: allow walk_ancestors() from more than one head operations	2024-01-02 10:30:08 +09:00
Yuya Nishihara	51691ea22c	tests: add lib tests for op id resolution, migrate some from cli CLI testing is slow and harder to set up crafted environment.	2024-01-02 10:30:08 +09:00
Yuya Nishihara	dad890b960	operation: make parent_ids() return slice instead of Vec reference	2024-01-02 02:47:41 +09:00
Yuya Nishihara	c9b581589c	op_walk: simplify arguments passed to high-level "opset" query functions	2024-01-01 10:22:23 +09:00
Yuya Nishihara	26b5f38f45	op_walk: move "opset" query functions from jj_cli	2024-01-01 10:22:23 +09:00
Yuya Nishihara	e4460d5386	op_walk: add error types for fake "opset" expression This removes CommandError dependency from these resolution functions. We might want to refactor the error types again if we introduce a real "opset" evaluator. The error message for unresolved op heads now includes "@" instead of the whole expression.	2024-01-01 10:22:23 +09:00
Yuya Nishihara	94fc32ab47	op_walk: extract walk_ancestors() to new module I'm going to extract fake "opset" resolution functions there, and I think walk_ancestors() belongs to the same category.	2024-01-01 10:22:23 +09:00
Yuya Nishihara	6dd936f72f	op_heads: let caller decide resolve_op_heads() error type The resolver callback usually returns wider error type, which I don't think is a variant of OpHeadResolutionError. To help type inference, resolver's error type is E, not E1 where E: From<E1>.	2024-01-01 10:22:23 +09:00
Martin von Zweigbergk	90744fb770	working copy: read files ahead when updating If the commit backend has high latency, it can make a big difference to read files concurrently. This patch updates the working copy code to do that in the update code (when reading files from the backend to write to the working copy). Because our backend at Google reads files from a local daemon process that already does a lot of prefetching, this patch doesn't actually help us. I think it's still the right thing to do for backends that don't do the same kind of prefetching. It speeds up `jj sparse set --add` by >10x when I disable the prefetching in our daemon (our `Backend::concurrency()` is 100).	2023-12-29 13:37:13 -08:00
Yuya Nishihara	f9e9058b9b	index: show bad operation id if commit lookup failed during reindexing My jj repo contains such head commits, and "jj debug reindex" fails. To address this problem, we'll probably need to implement GC, and the user will discard operations before the first bad op id.	2023-12-29 13:05:58 +09:00
Yuya Nishihara	43e016a7d1	index: add explicit reindexing method that can propagate error	2023-12-29 13:05:58 +09:00
Yuya Nishihara	ab1c8656a4	index: rename private index_at_operation methods, reorder arguments I'm going to add a public method that rebuilds index, and its return type will be different. I also added "build_" because "index" could be misinterpreted as noun. The method arguments are reordered to follow the public IndexStore interface.	2023-12-29 13:05:58 +09:00
Yuya Nishihara	3abe6be384	index: propagate DefaultIndexStore::init/reinit() errors	2023-12-29 13:05:58 +09:00
Yuya Nishihara	955f6e356a	repo: add error propagation path to IndexStore initialization and loading The error types are shared with the commit store backend. We could add per-store error types, but it's unlikely that the caller needs to discriminate them.	2023-12-29 13:05:58 +09:00
Yuya Nishihara	bb73cd491f	cleanup: don't use debug format to embed ObjectId in error message Also fixed typo, s/a/an/.	2023-12-29 13:05:58 +09:00
Martin von Zweigbergk	d06764eb7c	op heads: remove now-unused methods for adding/removing op heads	2023-12-28 09:17:42 -08:00
Martin von Zweigbergk	65a6aa61db	op heads: replace last use of remove_op_head() by update_op_heads()	2023-12-28 09:17:42 -08:00
Martin von Zweigbergk	76516bb46b	op heads: inline handle_ancestor_ops() This gets us closer to being able to use the new `update_op_heads()` function here (without calling it multiple times).	2023-12-28 09:17:42 -08:00
Martin von Zweigbergk	4221c7cf5c	op heads: remove handle_ancestor_ops() from trait I think the idea behind `handle_ancestor_ops()` was to let our backend at Google delegate the work to the server, which could then avoid walking ancestors. However, we're now thinking that we're going to make our server resolve divergent operations on its own instead, so the client will never see more than one op head, unless it manually creates the second op head itself (e.g. because the user ran two concurrent commands). In those cases it should be fine to do the walk. So let's simplify the trait by removing the function.	2023-12-28 09:17:42 -08:00
Martin von Zweigbergk	f969f4b0b0	op heads: remove lifetime from OpHeadsStoreLock	2023-12-28 09:17:42 -08:00
Martin von Zweigbergk	c304777a35	op heads: remove promote_new_op() `OpHeadsStoreLock::promote_new_op()` doesn't add much over the new `update_op_heads()`, so let's switch to the latter.	2023-12-28 09:17:42 -08:00
Martin von Zweigbergk	b8e45d196f	op heads: add a new trait method combining add and remove of op heads Consider how one would implement the current `OpHeadsStore` interface for a cloud-based backend. After `OpHeadsStore::add_op_head()` is called, the set of op heads temporarily contains two heads (typically) until `OpHeadsStore::remove_op_head()` is called. That's not invalid, but it's annoying to have to deal with that state more than necessary. Also, it's unnecessarily inefficient to send the addition and removal of op heads as separate RPCs. This patch therefore adds an `update_op_heads()` method that takes a list of old heads to remove and a single new head to add. Coming patches will start migrating to that method.	2023-12-28 09:17:42 -08:00
Martin von Zweigbergk	8137975785	op heads: drop support for old location/format We moved `.jj/repo/op_heads/` into `.jj/repo/op_heads/heads/` almost a year ago, in commits `90a66ec262` and `37ba17589d`. We said we would drop support for it in 0.9+. I think we said that before we started doing monthly releases, but we're still past the goal of 6 months (which is what I think we were aiming for).	2023-12-28 09:17:42 -08:00
Yuya Nishihara	dde42b9c05	index: rename resolve_prefix() to resolve_commit_id_prefix() I'll probably add change id lookup methods to CompositeIndex. The Index trait won't gain resolve_change_id_prefix(), but I also renamed its resolve_prefix() for consistency.	2023-12-26 01:03:10 +09:00
Yuya Nishihara	0f2f566188	index: remove "segment_" prefix from IndexSegment methods Since Readonly/MutableIndexSegment no longer implement Index trait, there's no ambiguity between segment-local and index-global operations. Let's shorten the method names.	2023-12-26 01:03:10 +09:00
Yuya Nishihara	c9b9e2864e	index: introduce newtype that represents segment-local position I'm thinking of changing some IndexSegment methods to return LocalPosition instead of global IndexPosition, and using u32 there would be a source of bugs.	2023-12-26 01:03:10 +09:00
Yuya Nishihara	ee8d5e279a	index: make segment-level lookup return neighbor commit ids instead of positions Both readonly and mutable segments know the commit ids to return, and the caller only needs the ids. Since segment_commit_id(local_pos) scans the graph entries, doing that would increase the chance of cache miss.	2023-12-26 01:03:10 +09:00
Yuya Nishihara	0e7834feb9	index: inline segment_entry_by_pos() There's no reasonable way to abstract the IndexEntry construction.	2023-12-26 01:03:10 +09:00
Ilya Grigoriev	1fb9df252b	`split.rs`: stop using DescendantRebaser::new This requires creating a new public API as a substitute. I took the opportunity to also add some comments to the `MutRepo::record_rewritten_commit`/`record_abandoned_commit` functions. I made the simplest possible addition to the API; it is not a very elegant one. Eventually, the entire `record_rewritten_commit` API should probably be refactored again. I also added some comments explaining what these functions do.	2023-12-24 19:25:16 -08:00
Ilya Grigoriev	6bfd09009f	`move.rs`: remove use of `MutRepo::create_descendant_rebaser`. After this, the internal function is only used in tests.	2023-12-24 19:25:16 -08:00
Ilya Grigoriev	cde8ea8985	Make CommitBuilder constructors private to the library crate The implementation of `CommitBuilder::write` is tightly bound to the MutRepo, so only MutRepo should construct CommitBuilder-s.	2023-12-24 19:25:16 -08:00
Yuya Nishihara	b954bab0ca	index: fix partial reindexing to not lose commits only reachable from one side Spotted while adding error propagation there. This wouldn't likely be a real problem because "jj debug reindex" removes all of the operation links. The "} else {" condition is removed because it doesn't make sense to exclude only the exact parent_op_id operation. This can be optimized to not walk ancestors of the parent_op_id operation, but I don't see a motivation to add tests covering such scenarios. It's pretty rare that an intermediate operation link is missing.	2023-12-24 23:31:16 +09:00
Yuya Nishihara	320d15412b	index: let caller of segment-level save-in() squash segments explicitly There are many unit tests that call mutable_segment.save_in(), but I don't think these callers expect that the segment file could be squashed depending on the size. Let's make it caller's responsibility. maybe_squash_with_ancestors() should be cheap if segment_num_commits() == 0, so it's okay to call it before checking the emptiness.	2023-12-24 00:22:47 +09:00
Yuya Nishihara	1d80bbb70a	index: leverage ancestor iterator to collect segments to be squashed I think "for" loop is easier to follow. Maybe it could be rewritten further to .find_map() loop, but that would be too clever. I also made ancestor_index_segments() pub(super) since it doesn't make sense to only provide ancestor_files_without_local().	2023-12-24 00:22:47 +09:00
Yuya Nishihara	55b4f69fb6	repo: propagate store error from add_heads()	2023-12-24 00:22:30 +09:00
Yuya Nishihara	0f6a7418f2	index: propagate store error from reindexing function If the error is permanent (because the repo predates the no-gc-ref fix for example), there's no easy way to recover. Still, panicking in this function seems wrong.	2023-12-24 00:22:30 +09:00
Yuya Nishihara	7a44e590dc	lock: remove byteorder dependency from tests, use fs helper functions This is the last use of Read/WriteBytesExt. The byteorder crate is great, but we don't need an abstraction of endianness. Let's simply use the std functions.	2023-12-23 00:14:17 +09:00
Yuya Nishihara	9de6273e10	index, stacked_table: inline read_u32::<LittleEndian>() There aren't many callers of ReadBytesExt::read_u32().	2023-12-23 00:14:17 +09:00
Yuya Nishihara	21c22be96e	stacked_table: use u32::from_le_bytes() to reinterpret bytes as integer Apparently, I forgot to update this in `fb06e89649`.	2023-12-23 00:14:17 +09:00
Yuya Nishihara	6f5096e266	index, stacked_table: use u32::try_from() instead of numeric cast These .unwrap()s wouldn't be compiled out, but I don't think they would have measurable impact. Let's use the safer method.	2023-12-22 09:03:50 +09:00
Yuya Nishihara	9ec89bcf86	index, stacked_table: use u32::to_le_bytes() to reinterpret as bytes	2023-12-22 09:03:50 +09:00
Yuya Nishihara	392539fa29	index, stacked_table: simply extend Vec<u8> to not use .write_all() I'm going to remove use of .write_u32() there. It's not super important, but with fewer .unwrap()s, the code looks slightly better.	2023-12-22 09:03:50 +09:00
Yuya Nishihara	fb06e89649	index: use u32::from_le_bytes() to reinterpret bytes as integer It's less abstract than going through io::Read, so is probably easier for compiler to optimize out. I also feel it's a bit more readable.	2023-12-22 09:03:50 +09:00
Yuya Nishihara	38ce914321	index: reindex on content-related I/O errors If read_exact() or read_u32() reached EOF, the index file should be considered corrupted. File not found error is also treated as data corruption because an invalid file name could be read from the child segment file. It can't handle special file names like "..", though.	2023-12-21 08:05:30 +09:00
Yuya Nishihara	e98104d6f0	index: add file name to both io/corrupt errors, combine these variants Index file name also applies to io::Error. New error type reuses io::Error to represent data corruption. We could add an inner Corrupt\|Io enum instead, but we'll need to remap some io::Error variants (e.g. UnexpectedEof) to Corrupt anyway.	2023-12-21 08:05:30 +09:00
Yuya Nishihara	88f3085bb1	index: extract function that opens file and loads index segments	2023-12-21 08:05:30 +09:00
Yuya Nishihara	eccb9b7a44	index: propagate index load errors from DefaultIndexStore	2023-12-19 07:41:57 +09:00
Yuya Nishihara	dd8e686127	index: don't reload parent files after saving new segment file This should be cheaper, and more importantly, we no longer need to propagate ReadonlyIndexLoadError to the caller.	2023-12-19 07:41:57 +09:00
Yuya Nishihara	fb07749291	index: split load function into header and local parts as well	2023-12-19 07:41:57 +09:00
Yuya Nishihara	616a8c7f54	index: split serialization function into header and local parts The idea is that we don't have to reload parent files as we already have the chain of the parent segments. The resulting readonly index will be constructed from the loaded parent segments + local entries blob.	2023-12-19 07:41:57 +09:00
Yuya Nishihara	31b6e93c6e	index: move IndexLoadError to "readonly" module, rename accordingly I thought IndexLoadError and DefaultIndexStoreError would represent "load" and "store" failures respectively, but they aren't. Actually, DefaultIndexStoreError is the store-level error, and IndexLoadError should be wrapped in it.	2023-12-19 07:41:57 +09:00
Yuya Nishihara	b5de16007e	index: add stub IndexReadError type This is needed to remove .unwrap()s from DefaultIndexStore.	2023-12-19 07:41:57 +09:00
Yuya Nishihara	d49b079494	index: update file format comment about ReadonlyIndexSegment Also made it a doc comment. I think 4-byte alignment is a nice property, so added note about that.	2023-12-19 07:41:34 +09:00
Yuya Nishihara	8909647d86	index: pass base directory path by reference	2023-12-18 08:49:21 +09:00
Yuya Nishihara	b733d52557	index: split DefaultIndexStoreError::Io variant, extract save helper Since OpStoreError can also include io::Error, it doesn't make much sense to have Io variant at this level. Let's split it to context-specific errors, and extract helper method that maps io::Error.	2023-12-18 08:49:21 +09:00
Yuya Nishihara	bf4a4e70b1	index: use DefaultMutableIndex wrapper when reconstructing missing index This allows us to extract helper method that writes index file and associates it with the operation.	2023-12-18 08:49:21 +09:00
Yuya Nishihara	50164bb36f	index: have IndexWriteError carry opaque error type instead of string I'm going to remove some .unwrap()s from DefaultIndexStore, and the inner error type will be consolidated to DefaultIndexStoreError.	2023-12-18 08:49:21 +09:00
Yuya Nishihara	87a8238bee	git: turn git.auto-local-branch off by default As far as I can see in the chat, there's no objection to changing the default, and git.auto-local-branch = false is generally preferred. docs/branches.md isn't updated as it would otherwise conflict with #2625. I think the "Remotes" section will need a non-trivial rewrite. #1136, #1862	2023-12-17 08:30:24 +09:00
Yuya Nishihara	6971ec239a	tests: set git_settings.auto_local_branch where it matters	2023-12-17 08:30:24 +09:00
Yuya Nishihara	ac99145a28	working_copy: drop open file instance from PersistError For the same reason as the file_util change.	2023-12-17 08:20:07 +09:00
Yuya Nishihara	c6df0ba4c3	file_util: don't try to overwrite existing content-addressed file on Windows The doc says persist() replaces the destination file as rename() would do on Unix. persist_noclobber() doesn't, and is probably more reliable on Windows. I don't know if persist() is completely atomic on Windows, but if it isn't, it might be the source of the "permission denied" error under highly contended situation. https://docs.rs/tempfile/latest/tempfile/struct.NamedTempFile.html#method.persist https://github.com/Stebalien/tempfile/blob/v3.8.0/src/file/imp/windows.rs#L77 We could use persist_noclobber() on all platforms, but it's more involved on Unix. https://github.com/Stebalien/tempfile/blob/v3.8.0/src/file/imp/unix.rs#L107	2023-12-17 08:20:07 +09:00
Yuya Nishihara	dd325c089c	file_util: drop open file instance from PersistError PersistError is basically a pair of io::Error and NamedTempFile instance. It's unlikely that we would want to propagate the open file instance to the CLI error handler, leaving the temporary file alive.	2023-12-17 08:20:07 +09:00
Yuya Nishihara	4d91e4c196	revset: simplify type constraints on combination iterators Just a minor cleanup to remove lifetime parameter from the types. I tried to reimplement them by using itertools, but I couldn't find a simple way to encode short-circuiting at the end of either left or right iterator.	2023-12-16 07:50:04 +09:00
Yuya Nishihara	6d59156858	revset: parameterize candidates set of FilterRevset as well	2023-12-16 07:50:04 +09:00
Yuya Nishihara	a36368bb88	revset: make revset combinators generic over set types, merge UnionPredicate UnionRevset and UnionPredicate are conceptually the same. Let's unify them.	2023-12-16 07:50:04 +09:00
Yuya Nishihara	af6047a655	lib: forbid unsafe_code at all	2023-12-15 16:10:28 +09:00
Yuya Nishihara	9990c41a90	repo: remove unsafe lifetime hack from change_id_index()	2023-12-15 16:10:28 +09:00
Yuya Nishihara	d9e8297059	index: add 'static version of evaluate_revset() to ReadonlyIndex We'll probably need a better abstraction, but a separate method is good enough to remove unsafe code from ReadonlyRepo. I'm not sure if this is feasible for the other backends, but I guess there would be less lifetimed variables than DefaultReadonlyIndex.	2023-12-15 16:10:28 +09:00
Yuya Nishihara	2ba50c76c7	revset: abstract evaluated RevsetImpl over owned/borrowed index types	2023-12-15 16:10:28 +09:00
Yuya Nishihara	72d9cd019b	index: extract as_composite() to trait method The revset engine will accept abstract AsCompositeIndex type, and the evaluated revset can be 'static if the index is behind Arc<T>.	2023-12-15 16:10:28 +09:00
Yuya Nishihara	8fdf9db6e0	revset: remove 'index lifetime from InternalRevset	2023-12-15 14:58:12 +09:00
Yuya Nishihara	c426d34c11	revset: pass in index to PurePredicateFn as an argument to make it 'static	2023-12-15 14:58:12 +09:00
Yuya Nishihara	71070e85d7	revset: add helper that coerces closure to PurePredicateFn Also renamed the boxed version to discriminate it from the cast helper.	2023-12-15 14:58:12 +09:00
Yuya Nishihara	a9a7de4a5e	revset: store RevWalk factory function in RevWalkRevset The returned iterator is boxed by caller due to the limitation of the type system. There's a workaround, but it's super ugly. https://users.rust-lang.org/t/hrtb-on-multiple-generics/34255/3	2023-12-15 14:58:12 +09:00
Yuya Nishihara	575d3dc7bf	revset: store IndexPosition in EagerRevset to drop 'index lifetime This adds overhead to re-look up IndexEntry, but I don't think that would have significant impact on performance.	2023-12-15 14:58:12 +09:00
Yuya Nishihara	261bf848a9	revset: pass in index to InternalRevset as an argument The idea is that InternalRevset will store a 'static boilerplate function that borrows an 'index passed by function argument. This way, we can abstract the index type over Arc<T> and &T without introducing too much generics.	2023-12-15 14:58:12 +09:00
Yuya Nishihara	e332d39375	revset: extract inner method that constructs IndexEntry iterator	2023-12-15 14:58:12 +09:00
Yuya Nishihara	b8f60c4dd6	cargo: bump gix to 0.56.0 I don't know why the dependabot didn't catch this, but there are things to fix manually. EntryMode was changed to a u16 wrapper, and the enum was renamed to EntryKind. Other than that, I don't find anything breaking our codebase.	2023-12-15 14:17:02 +09:00
Yuya Nishihara	95a0cceb97	index: use loaded readonly data without splitting into vecs Since lookup data isn't typically small, .split_off() can take a few milliseconds to memcpy().	2023-12-14 08:43:50 +09:00
Yuya Nishihara	5121e1f4e9	index: move IndexSegment trait to "composite" module Perhaps, this is the most controversial part. It could be moved to new "segment" module (or something like "common"), but I think IndexSegment can be considered a trait that enables the CompositeIndex abstraction.	2023-12-14 08:43:40 +09:00
Yuya Nishihara	b89ae7c0b5	index: use IndexEntry::position() instead of direct field access	2023-12-14 08:43:40 +09:00
Yuya Nishihara	9fb0f00f2d	index: add IndexEntry constructor instead of pub(super)-ing fields	2023-12-14 08:43:40 +09:00
Yuya Nishihara	771f447d99	index: split IndexEntry and related types to "entry" module Added pub(super) or pub where needed. I won't implement accessor methods on IndexPositionByGeneration and IndexPosition as they are purely value types, and protecting the inner values wouldn't make sense.	2023-12-14 08:43:40 +09:00
Martin von Zweigbergk	60fae3114e	transaction: take description at end instead of start It seems better to have the caller pass the transaction description when we finish the transaction than when we start it. That way we have all the information we want to include more readily available.	2023-12-13 08:12:49 -08:00
Ilya Grigoriev	316ab8efb8	rewrite.rs: refactor `new_parents` to depend only on `parent_mapping` Previously, the function relied on both the `self.parent_mapping` and `self.rebased`. If `(A,B)` was in `parent_mapping` and `(B,C)` was in `rebased`, `new_parents` would map `A` to `C`. Now, `self.rebased` is ignored by `new_parents`. In the same situation, DescendantRebaser is changed so that both `(A,B)` and `(B,C)` are in `parent_mapping` before. `new_parents` now applies `parent_mapping` repeatedly, and will map `A` to `C` in this situation. ## Cons - The semantics are changed; `new_parents` now panics if `self.parent_mapping` contains cycles. AFAICT, such cycles never happen in `jj` anyway, except for one test that I had to fix. I think it's a sensible restriction to live with; if you do want to swap children of two commits, you can call `rebase_descendants` twice. ## Pros - I find the new logic much easier to reason about. I plan to extract it into a function, to be used in refactors for `jj rebase -r` and `jj new --after`. It will make it much easier to have a correct implementation of `jj rebase -r --after`, even when rebasing onto a descendant. - The de-duplication is no longer O(n^2). I tried to keep the common case fast. ## Alternatives - We could make `jj rebase` and `jj new` use a separate function with the algorithm shown here, without changing DescendantRebaser. I believe that the new algorithm makes DescendantRebaser easier to understand, though, and it feels more elegant to reduce code duplication. - The de-duplication optimization here is independent of other changes, and could be used on its own.	2023-12-12 19:35:51 -08:00
Yuya Nishihara	2abbb637e3	index: add wrapper functions to DefaultReadonlyIndex to remove pub(super) field	2023-12-13 08:09:48 +09:00
Yuya Nishihara	c0a12a7cbc	index: add methods that provides commit/change_id_length We could add Layout struct holding these parameters, but I don't think that's needed just for two parameters.	2023-12-13 08:09:48 +09:00
Yuya Nishihara	3831ad423c	index: use as_composite().num_commits() instead of direct field access	2023-12-13 08:09:48 +09:00
Yuya Nishihara	30984b1505	index: use name() instead of direct field access	2023-12-13 08:09:48 +09:00
Yuya Nishihara	e5c8252fb4	index: use segment_parent_file() instead of direct field access	2023-12-13 08:09:48 +09:00
Yuya Nishihara	402e36bab7	index: split readonly index types to "readonly" module Added pub(super) where needed. There are a few pub(super) fields that look suspicious, which will be fixed by the subsequent patches.	2023-12-13 08:09:48 +09:00
Yuya Nishihara	fbec16b49f	index: add wrapper functions to DefaultMutableIndex to remove pub(super) field into_segment() could be added instead of save_in(), but I decided to wrap save_in(). save_in() may squash ancestor files, so it could be considered an index-level operation.	2023-12-13 08:09:48 +09:00
Yuya Nishihara	5aeeb5f723	index: split mutable index types to "mutable" module Added pub(super) where needed or makes sense.	2023-12-13 08:09:48 +09:00
Yuya Nishihara	ab2742f2c9	index: split RevWalk types to "rev_walk" module Added pub(super) where needed.	2023-12-12 08:07:52 +09:00
Yuya Nishihara	caa1b99c24	index: add CompositeIndex constructor instead of pub(super)-ing field This wouldn't matter, but seemed slightly better.	2023-12-12 08:07:52 +09:00
Yuya Nishihara	679518fdf2	index: split CompositeIndex and stats types to "composite" module Added pub(super) where needed or makes sense.	2023-12-12 08:07:52 +09:00
Yuya Nishihara	2423558e68	index: split DefaultIndexStore and Load/StoreError types to "store" module IndexLoadError isn't store-specific, but I think it's better to put I/O stuff in the store module.	2023-12-12 08:07:52 +09:00
Yuya Nishihara	cdcd465c79	index: move default_index_store.rs to sub directory named default_index default_index_store.rs is relatively big, and it contains types and impls in arbitrary order. Let's split them into sub modules. After everything moved, mod.rs will only contain tests.	2023-12-12 08:07:52 +09:00
Yuya Nishihara	f86b338681	revset: inline walk_ancestors()	2023-12-11 09:14:03 +09:00
Yuya Nishihara	cd0b24ef14	revset: inline walk_children() There's only one caller, and we have common code at the call site.	2023-12-11 09:14:03 +09:00
Yuya Nishihara	d28bd8fa0f	revset: inline collect_dag_range()	2023-12-11 09:14:03 +09:00
Yuya Nishihara	73fb922517	index: reimplement collect_dag_range() of revset engine as iterator I'm going to remove 'index lifetime from InternalRevset so Revset<'static> can be easily constructed from DefaultReadonlyIndex. As the first step, this series removes some lifetime complexity from EvaluationContext methods. We don't need a descendant iterator API, but it helps to add a separate function to collect into HashSet<IndexPosition> instead of returning a pair of ordered vec and set.	2023-12-11 09:14:03 +09:00
Yuya Nishihara	cbbe38ba7b	index: rename MutableIndexImpl to MutableIndexSegment	2023-12-10 11:03:07 +09:00
Yuya Nishihara	c94e1de6d2	index: add DefaultMutableIndex wrapper, move Index impls to it The wrapper type isn't needed for the mutable layer, but this mirrors the readonly type structure. Test cases are also migrated to be using the index wrapper so long as we don't have to care for the nesting of the segment files.	2023-12-10 11:03:07 +09:00
Yuya Nishihara	ce312ae288	index: duplicate add_commit() to MutableIndexImpl	2023-12-10 11:03:07 +09:00
Yuya Nishihara	e0206a82f2	index: extract merge_in() function that works on segment types Prepares for splitting MutableIndexImpl into segment and index wrapper types.	2023-12-10 11:03:07 +09:00
Yuya Nishihara	a110ec6d95	cli: print failed git export reason for each ref Not all reasons are actionable, but we print hint in common cryptic cases.	2023-12-09 23:37:00 +09:00
Yuya Nishihara	990edcefc9	index: impl Index for DefaultReadonlyIndex instead of ReadonlyIndexSegment The idea is that the ReadonlyIndexSegment is a sub component of the index. The Index trait could be implemented for any Segment type, but we don't need a public interface to access sub segment as an index.	2023-12-09 15:18:36 +09:00
Yuya Nishihara	1cbd2ddb4b	index: rename ReadonlyIndexImpl to ReadonlyIndexSegment I'm going to split the internal Segment types and the public Index types in order to clarify the layering concept. The public Index types will be wrappers like DefaultReadonlyIndex. Strictly speaking, ReadonlyIndexImpl is a segment + parent pointer pair, but I think calling it a segment is pretty okay. It could be called a ReadonlyIndexFile, but "File" can't apply to the mutable part.	2023-12-09 15:18:36 +09:00
Yuya Nishihara	172043e968	index: make ReadonlyIndexImpl private There are no external callers.	2023-12-09 15:18:36 +09:00
Yuya Nishihara	6c57ba7f21	index: rename ReadonlyIndexWrapper to DefaultReadonlyIndex This matches the store naming: impl IndexStore for DefaultIndexStore. I also added minimal doc comment and Debug.	2023-12-09 15:18:36 +09:00
Yuya Nishihara	cee69d1665	tests: remove index downcast helpers called only by as_<type>_composite() I'm going to rename the impl types, and I don't want to think about the names of these downcast functions.	2023-12-09 15:18:36 +09:00
Yuya Nishihara	5f6e28c8cf	git: migrate export_refs() to gix::Repository FailedToDelete/Set reasons are boxed because gix error types aren't small. They could be cast to std::error::Error if needed.	2023-12-09 15:18:19 +09:00
Yuya Nishihara	2d76907048	git: unimplement PartialEq on FailedRefExportReason Gitoxide errors don't implement PartialEq. We could instead stringify the errors, but there aren't many callers who expect FailedRefExportReason to be comparable.	2023-12-09 15:18:19 +09:00
Yuya Nishihara	9f8831e825	git: unimplement PartialEq on GitExportError Gitoxide errors don't implement PartialEq, and I don't think it makes sense to test equality of InternalGitError objects.	2023-12-09 15:18:19 +09:00
Yuya Nishihara	a77eed648b	git: have export_refs() obtain git2::Repository instance from store	2023-12-09 15:18:19 +09:00
Yuya Nishihara	0f37027646	index: remove unneeded Any trait bound from MutableIndex We use .as_any() to downcast to the backend impl instead.	2023-12-08 23:30:35 +09:00
Yuya Nishihara	c197add39b	git_backend: do not try to resolve git_target path as working directory path The git_target path is normalized and managed by jj, so we don't need a fallback mechanism. Let's make it stricter.	2023-12-07 08:43:49 +09:00
Yuya Nishihara	77c811163f	tests: make sure to specify external git repository path including ".git"	2023-12-07 08:43:49 +09:00
Yuya Nishihara	25fcc3e403	workspace: consider .git symlink when generating relative git_target path Before, an absolute path would be saved in the git_target file if .git is a symlink. That's not wrong, but seemed a bit weird. Let's consolidate the behavior across .git file types.	2023-12-05 14:23:59 -08:00
Yuya Nishihara	787fa1340b	workspace: remove redundant cloning from init_external_git() Apparently, I forgot to update it in `1db033504c` "repo, workspace: remove 'static lifetime bound from initializer functions."	2023-12-05 14:23:59 -08:00
Yuya Nishihara	899c6375a0	git_backend: don't fully canonicalize .git symlink Apparently, libgit2 doesn't deduce "core.bare" config from the directory name, but gitoxide implements it correctly. So we shouldn't blindly canonicalize the Git repository path. Fortunately, the saved git_target path isn't a fully-canonicalized form (unless user explicitly specified "--git-repo ./.git"), so we don't need a hack to remap git_target back to the symlink path. is_colocated_git_workspace() is adjusted since the git_workdir is no longer resolved from the fully-canonicalized repo path, at least in our code. Still we have the ".git/.." fallback because test_init_git_colocated_symlink_gitlink() would otherwise fail. I haven't figured out why, and the test might be actually wrong compared to the git CLI behavior, but let's not change that for now. Fixes #2668	2023-12-05 14:23:59 -08:00
Martin von Zweigbergk	1cc271441f	gc: implement basic GC for Git backend This adds an initial `jj util gc` command, which simply calls `git gc` when using the Git backend. That should already be useful in non-colocated repos because it's not obvious how to GC (repack) such repos. In my own jj repo, it shrunk `.jj/repo/store/` from 2.4 GiB to 780 MiB, and `jj log --ignore-working-copy` was sped up from 157 ms to 86 ms. I haven't added any tests because the functionality depends on having `git` binary on the PATH, which we don't yet depend on anywhere else. I think we'll still be able to test much of the future parts of garbage collection without a `git` binary because the interesting parts are about manipulating the Git repo before calling `git gc` on it.	2023-12-03 07:40:12 -08:00
Yuya Nishihara	35f718f212	merged_tree: remove canceling terms prior to resolving file-level conflict I think this is a variant of the problem fixed by `7fda80fc22` "tree: simplify conflict before resolving at hunk level." We need to simplify() the conflict before and after extracting file ids because the source conflict values may contain trees to be cancelled out, and the file values may differ only in exec bits. Since the legacy tree passes a simplified conflict in to this function, I made the merged tree do the same. Fixes #2654	2023-12-03 07:44:58 +09:00
Yuya Nishihara	4ffbf40c82	merged_tree: do not propagate conflicting empty tree value to parent Otherwise an empty subtree would be added to the parent tree. If the stored tree contained an empty subtree, simplify() wouldn't work against new "absent" subtree representation. I don't know if there's such a code path, but I believe it's very rare to encounter the problem. #2654	2023-12-03 07:44:58 +09:00
Yuya Nishihara	1db033504c	repo, workspace: remove 'static lifetime bound from initializer functions	2023-12-03 07:44:41 +09:00
Yuya Nishihara	d747879aee	signing: pass SigningFn by reference write_commit() doesn't need ownership of the signing function.	2023-12-01 22:55:04 +09:00
Anton Bulakh	eb1c0ab4a2	sign: Implement a test signing backend and add a few basic tests	2023-11-30 23:36:56 +02:00
Anton Bulakh	d7229a3f90	sign: Define signing backend API and integrate it Finished everything except actual signing backend implementation(s) and the UI.	2023-11-30 23:36:56 +02:00
Yuya Nishihara	076b49b610	merged_tree: use merged_tree_entry_diff() in stream version	2023-12-01 00:05:06 +09:00
Yuya Nishihara	97a260b1bf	merged_tree: reimplement TreeEntryDiffIterator by using iterator adapter We don't need a named type anymore.	2023-12-01 00:05:06 +09:00
Yuya Nishihara	fd1c03d037	merged_tree: use sync get_tree() in TreeDiffIterator This basically backs out the change `1b9a3e27e0` "merged_tree: read before/after trees concurrently." As we decided to add a separate impl for async access, it doesn't make sense to read before/after pair in parallel. The async single_tree() is moved to TreeDiffStreamImpl. It will help remove the sync version when the performance problem is solved.	2023-12-01 00:05:06 +09:00
Yuya Nishihara	601be0d480	working_copy: narrow file_states recursively while visiting directories This saves another ~10ms. Without watchman: ``` % hyperfine --sort command --warmup 3 --runs 20 -L bin jj-1,jj-2 \ "target/release-with-debug/{bin} -R ~/mirrors/linux files ~/mirrors/linux/no-match" Benchmark 2: target/release-with-debug/jj-1 -R ~/mirrors/linux files ~/mirrors/linux/no-match Time (mean ± σ): 327.7 ms ± 24.9 ms [User: 1059.1 ms, System: 654.3 ms] Range (min … max): 296.0 ms … 385.4 ms 20 runs Benchmark 3: target/release-with-debug/jj-2 -R ~/mirrors/linux files ~/mirrors/linux/no-match Time (mean ± σ): 311.0 ms ± 24.8 ms [User: 960.0 ms, System: 643.1 ms] Range (min … max): 274.9 ms … 358.5 ms 20 runs ```	2023-11-30 12:09:31 +09:00
Yuya Nishihara	a935a4f70c	working_copy: use proto file states without rebuilding BTreeMap In snapshot(), changed_file_states are received in arbitrary order. For the other callers, entries are in diff_stream order, so we don't have to sort them. With watchman enabled, we can see the cost of sorting the sorted proto entries. I don't think this is significant, but we can mitigate it by adding is_file_states_sorted flag to the proto message if needed: ``` % hyperfine --sort command --warmup 3 --runs 20 -L bin jj-0,jj-1 \ "target/release-with-debug/{bin} -R ~/mirrors/linux files ~/mirrors/linux/no-match" Benchmark 1: target/release-with-debug/jj-0 -R ~/mirrors/linux files ~/mirrors/linux/no-match Time (mean ± σ): 164.8 ms ± 16.6 ms [User: 50.2 ms, System: 111.7 ms] Range (min … max): 148.1 ms … 195.0 ms 20 runs Benchmark 2: target/release-with-debug/jj-1 -R ~/mirrors/linux files ~/mirrors/linux/no-match Time (mean ± σ): 171.8 ms ± 13.6 ms [User: 61.7 ms, System: 109.0 ms] Range (min … max): 159.5 ms … 192.1 ms 20 runs ``` Without watchman: ``` % hyperfine --sort command --warmup 3 --runs 20 -L bin jj-0,jj-1 \ "target/release-with-debug/{bin} -R ~/mirrors/linux files ~/mirrors/linux/no-match" Benchmark 1: target/release-with-debug/jj-0 -R ~/mirrors/linux files ~/mirrors/linux/no-match Time (mean ± σ): 367.3 ms ± 30.3 ms [User: 1415.2 ms, System: 633.8 ms] Range (min … max): 325.4 ms … 421.7 ms 20 runs Benchmark 2: target/release-with-debug/jj-1 -R ~/mirrors/linux files ~/mirrors/linux/no-match Time (mean ± σ): 327.7 ms ± 24.9 ms [User: 1059.1 ms, System: 654.3 ms] Range (min … max): 296.0 ms … 385.4 ms 20 runs ``` I haven't measured snapshotting against dirty working copy, but I don't think it would be slower than the original implementation.	2023-11-30 12:09:31 +09:00
Yuya Nishihara	fca3690dda	working_copy: add file states wrapper that provides map-like API I'll replace the current lazy loading mechanism with this. Read-only methods are implemented on the borrowed type so that we can narrow lookup scope recursively.	2023-11-30 12:09:31 +09:00
Yuya Nishihara	9292af5e52	working_copy: update file states in bulk This helps migrate BTreeMap<RepoPath, _> to sorted Vec.	2023-11-30 12:09:31 +09:00
Yuya Nishihara	c9150d02fc	working_copy: don't look up file state twice while visiting directories	2023-11-30 12:09:31 +09:00
Yuya Nishihara	6ce7bd5338	repo_path: replace .contains() with .starts_with(), flipping the arguments self.contains(other) means that the self tree contains the other tree (i.e. the self path is prefix of the other), but it could be confused the other way around if we were thinking about the path literal, not the tree. Let's add .starts_with() instead by copying the std::path::Path definition.	2023-11-29 08:41:23 +09:00
Yuya Nishihara	266690a46b	repo_path: make strip_prefix() public function returning &RepoPath There are no external callers, but I think it's useful.	2023-11-29 08:41:23 +09:00
Yuya Nishihara	73690ed54e	matchers: clean up .walk_to(dir) to yield &RepoPath instead of iterator	2023-11-29 08:41:23 +09:00
Yuya Nishihara	bc9725c73c	working_copy: use RepoPath::parent() which no longer allocates temporary object	2023-11-29 08:41:23 +09:00
Yuya Nishihara	016fc2b5cc	repo_path: change .split() and .parent() to return &RepoPath	2023-11-29 08:41:23 +09:00
Yuya Nishihara	28ab9593c3	repo_path: split RepoPath into owned and borrowed types This enables cheap str-to-RepoPath cast, which is useful when sorting and filtering a large Vec<(String, _)> list by using matcher for example. It will also eliminate temporary allocation by repo_path.parent().	2023-11-28 07:33:28 +09:00
Yuya Nishihara	0a1bc2ba42	repo_path: add stub RepoPathBuf type, update callers Most RepoPath::from_internal_string() callers will be migrated to the function that returns &RepoPath, and cloning &RepoPath won't work.	2023-11-28 07:33:28 +09:00
Yuya Nishihara	f5938985f0	repo_path: make RepoPath::from_internal_string() accept owned string I'm going to add borrowed RepoPath type, and most from_internal_string() callers will be migrated to it. For the remaining callers, it makes more sense to move the ownership of String to RepoPathBuf.	2023-11-28 07:33:28 +09:00
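
The `rewrite.rs` commit above (316ab8efb8) describes `new_parents` applying `parent_mapping` repeatedly, so that `(A,B)` and `(B,C)` together map `A` to `C`, and panicking if the mapping contains a cycle. The following is only a minimal sketch of that idea, not the code from the commit: the `Id` alias, the `resolve_parent` name, and the one-parent-per-entry map are all assumptions made for illustration, and the real function works with jj's own commit id type and may handle several parents per commit.

```rust
use std::collections::{HashMap, HashSet};

/// Hypothetical stand-in for a commit id (an assumption for this sketch).
type Id = &'static str;

/// Follows `mapping` from `start` until reaching an id that has no entry.
/// Panics if the chain revisits an id, i.e. if the mapping contains a cycle,
/// which mirrors the restriction described in the commit message.
fn resolve_parent(mapping: &HashMap<Id, Id>, start: Id) -> Id {
    let mut seen: HashSet<Id> = HashSet::new();
    let mut current = start;
    while let Some(&next) = mapping.get(&current) {
        // `insert` returns false if `current` was already visited.
        assert!(seen.insert(current), "cycle detected in parent mapping");
        current = next;
    }
    current
}
```

With a mapping containing `A -> B` and `B -> C`, `resolve_parent(&mapping, "A")` yields `"C"`, while an id without an entry is returned unchanged.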

2667 commits