mirrors/jj

mirror of https://github.com/martinvonz/jj.git synced 2025-01-03 18:24:19 +00:00

Author	SHA1	Message	Date
Yuya Nishihara	815437598f	revset: disable parsing rules of legacy dag range operator The legacy parsing rules are turned into compatibility errors. The x:y rule is temporarily enabled when parsing string patterns. It's weird, but we can't isolate the parsing function because a string pattern may be defined in an alias.	2024-02-14 10:04:56 +09:00
Yuya Nishihara	2905a70b18	doc, tests: drop use of deprecated revset dag range operator	2024-02-14 10:04:56 +09:00
Yuya Nishihara	1f6d1de62d	index: on reindexing, print error details to stderr It's not ideal to print the error there, but using stderr should be slightly better. It could be a tracing message, but tracing won't be displayed by default.	2024-02-12 19:38:36 +09:00
Yuya Nishihara	b0e8e2a1af	index: move segment files to sub directory, add version number I'm going to introduce breaking changes in index format. Some of them will affect the file size, so version number or signature won't be needed. However, I think it's safer to detect the format change as early as possible. I have no idea if embedded version number is the best way. Because segment files are looked up through the operation links, the version number could be stored there and/or the "segments" directory could be versioned. If we want to support multiple format versions and clients, it might be better to split the tables into data chunks (e.g. graph entries, commit id table, change id table), and add per-chunk version/type tag. I choose the per-file version just because it's simple and would be non-controversial. As I'm going to introduce format change pretty soon, this patch doesn't implement data migration. The existing index files will be deleted and new files will be created from scratch. Planned index format changes include: 1. remove unused "flags" field 2. inline commit parents up to two 3. add sorted change ids table	2024-02-12 19:38:36 +09:00
Yuya Nishihara	4b541e6c93	index: on reinit(), don't remove "operations" directory itself This should be slightly safer as the store may be accessed concurrently from another process.	2024-02-12 19:38:36 +09:00
Yuya Nishihara	81837897dc	index: extract dir.join("operations") to private method	2024-02-12 19:38:36 +09:00
Martin von Zweigbergk	48a9f9ef56	repo: use Transaction for creating repo-init operation Since the operation log has a root operation, we don't need to create the repo-initialization operation in order to create a valid `ReadonlyRepo` instance. I think it's conceptually simpler to create the instance at the root operation id and then add the initial operation using the usual `Transaction` API. That's what this patch does. Doing that also brought two issues to light: 1. The empty view object doesn't have the root commit as head. 2. The initialized `OpHeadsStore` doesn't have the root operation as head. Both of those seem somewhat reasonable, but maybe we should change them. For now, I just made the initial repo (before the initial operation) have a single op head (to compensate for (2)). It might be worth addressing both issues so the repo is in a better state before we create the initial operation. Until we do, we probably shouldn't drop the initial operation.	2024-02-11 21:19:30 -08:00
Martin von Zweigbergk	305a507ae3	repo: move creation of repo-init operation to end of init() Since we now have a root operation, we don't need the repo-initialization operation to create the repo. Let's move it later to clarify that.	2024-02-11 21:19:30 -08:00
Ilya Grigoriev	a9c3af8153	test_local_working_copy: use std::fs:write instead of OpenOptions	2024-02-10 16:06:28 -08:00
Ilya Grigoriev	b2e37d448b	clippy: add `truncate` option as suggested by clippy In the next commit, I replace the whole thing with std::fs::write, but I'll leave this here in case the next commit is somhow incorrect	2024-02-10 16:06:28 -08:00
Ilya Grigoriev	a88c06068e	clippy: new nightly fixes For some reason, clippy also suggested surrounding `self.value` with parentheses. Not sure whether that's a clippy bug. Cc: https://github.com/rust-lang/rust-clippy/issues/12268	2024-02-10 16:06:28 -08:00
dependabot[bot]	6d1faf9b03	Update strsim (changes tests), clap, clap_complete This is #3002 with tests rerun to account for changes to `strsim`, as @thoughtpolice noticed in https://github.com/martinvonz/jj/pull/3002#issuecomment-1936763101 The string similarity changes include an example that seems better and one that seems worse. Decreasing the threshold definitely makes things worse.	2024-02-10 00:01:47 -08:00
Yuya Nishihara	e908bd9a17	simple_op_store: use TryFrom<i32> instead of deprecated from_i32()	2024-02-10 09:15:30 +09:00
Yuya Nishihara	421ab592be	cargo: bump gix to 0.58.0, migrate to ObjectId::try_from() The panicking conversion function appears to be renamed, and try_from() is added instead.	2024-02-10 09:15:30 +09:00
Austin Seipp	5b517b542e	rust: bump MSRV to 1.76.0 Signed-off-by: Austin Seipp <aseipp@pobox.com>	2024-02-09 15:48:01 -06:00
Martin von Zweigbergk	6c1aeff7a9	working copy: materialize symlinks on Windows as regular files I was a bit surprised to learn (or be reminded?) that checking out symlinks on Windows leads to a panic. This patch fixes the crash by materializing symlinks from the repo as regular files. It also updates the snapshotting code so we preserve the symlink-ness of a path. The user can update the symlink in the repo by updating the regular file in the working copy. This seems to match Git's behavior on Windows when symlinks are disabled.	2024-02-09 09:20:24 -08:00
Martin von Zweigbergk	b253a28788	merge: add as_normal(), taken from RefTarget The `RefTarget::as_normal()` function is not specific to `RefTarget`, and I plan to use it from `local_working_copy`.	2024-02-09 09:20:24 -08:00
Martin von Zweigbergk	5a898b16a8	working_copy: handle symlink outside write_path_to_store() The `write_path_to_store()` has almost no overlapping code between the handling of symlinks and regular files, which suggests that we should move out the handling of symlinks to the caller (there's only one).	2024-02-09 09:20:24 -08:00
Jonathan Tan	33f3a420a1	workspace: recover from missing operation If the operation corresponding to a workspace is missing for some reason (the specific situation in the test in this commit is that an operation was abandoned and garbage-collected from another workspace), currently, jj fails with a 255 error code. Teach jj a way to recover from this situation. When jj detects such a situation, it prints a message and stops operation, similar to when a workspace is stale. The message tells the user what command to run. When that command is run, jj loads the repo at the @ operation (instead of the operation of the workspace), creates a new commit on the @ commit with an empty tree, and then proceeds as usual - in particular, including the auto-snapshotting of the working tree, which creates another commit that obsoletes the newly created commit. There are several design points I considered. 1) Whether the recovery should be automatic, or (as in this commit) manual in that the user should be prompted to run a command. The user might prefer to recover in another way (e.g. by simply deleting the workspace) and this situation is (hopefully) rare enough that I think it's better to prompt the user. 2) Which command the user should be prompted to run (and thus, which command should be taught to perform the recovery). I chose "workspace update-stale" because the circumstances are very similar to it: it's symptom is that the regular jj operation is blocked somewhere at the beginning, and "workspace update-stale" already does some special work before the blockage (this commit adds more of such special work). But it might be better for something more explicitly named, or even a sequence of commands (e.g. "create a new operation that becomes @ that no workspace points to", "low-level command that makes a workspace point to the operation @") but I can see how this can be unnecessarily confusing for the user. 3) How we recover. I can think of several ways: a) Always create a commit, and allow the automatic snapshotting to create another commit that obsoletes this commit. b) Create a commit but somehow teach the automatic snapshotting to replace the created commit in-place (so it has no predecessor, as viewed in "obslog"). c) Do either a) or b), with the added improvement that if there is no diff between the newly created commit and the former @, to behave as if no new commit was created (@ remains as the former @). I chose a) since it was the simplest and most easily reasoned about, which I think is the best way to go when recovering from a rare situation.	2024-02-09 00:38:47 -08:00
Ilya Grigoriev	12c3be70f4	lib refs.rs: rename `TrackingRefPair` to `LocalAndRemoteRef` As discussed in https://github.com/martinvonz/jj/pull/2962#discussion_r1479384841, the previous name is confusing since the struct is used for pairs where the remote branch is not tracked by the local branch.	2024-02-07 17:06:28 -08:00
jyn	d66fcf2ca0	compile integration tests as a single binary this greatly speeds up the time to run all tests, at the cost of slightly larger recompile times for individual tests. this unfortunately adds the requirement that all tests are listed in `runner.rs` for the crate. to avoid forgetting, i've added a new test that ensures the directory is in sync with the file. ## benchmarks before this change, recompiling all tests took 32-50 seconds and running a single test took 3.5 seconds: ``` ; hyperfine 'touch lib/src/lib.rs && cargo t --test test_working_copy' Time (mean ± σ): 3.543 s ± 0.168 s [User: 2.597 s, System: 1.262 s] Range (min … max): 3.400 s … 3.847 s 10 runs ``` after this change, recompiling all tests take 4 seconds: ``` ; hyperfine 'touch lib/src/lib.rs ; cargo t --test runner --no-run' Time (mean ± σ): 4.055 s ± 0.123 s [User: 3.591 s, System: 1.593 s] Range (min … max): 3.804 s … 4.159 s 10 runs ``` and running a single test takes about the same: ``` ; hyperfine 'touch lib/src/lib.rs && cargo t --test runner -- test_working_copy' Time (mean ± σ): 4.129 s ± 0.120 s [User: 3.636 s, System: 1.593 s] Range (min … max): 3.933 s … 4.346 s 10 runs ``` about 1.4 seconds of that is the time for the runner, of which .4 is the time for the linker. so there may be room for further improving the times.	2024-02-06 18:19:41 -08:00
Ilya Grigoriev	1741ab22e4	view.rs: clarify some internal function docstrings Mostly, I was a bit confused that some of these functions return a `TrackingRefPair` but don't seem to take into account whether the remote branch is being tracked or not.	2024-02-06 17:52:01 -08:00
Martin von Zweigbergk	b343289238	working_copy: make reset() take a commit instead of a tree Our virtual file system at Google (CitC) would like to know the commit so it can scan backwards and find the closest mainline tree based on it. Since we always record an operation id (which resolves to a working-copy commit) when we write the working-copy state, it doesn't seem like a restriction to require a commit.	2024-02-06 12:41:09 -08:00
Yuya Nishihara	77ceadbfd0	cleanup: remove remaining ": {source}" from error message templates	2024-02-04 09:13:21 +09:00
Yuya Nishihara	1efadd96c8	git: remove ": {source}" from FailedRefExportReason, walk chain by caller The error output gets more verbose because all gix error sources are printed. Maybe we'll need a better formatting, but changing to multi-line output doesn't look nice either.	2024-02-04 09:13:21 +09:00
Yuya Nishihara	a0cefb8b7b	revset, template: remove ": {source}" from parse error message template These error types are special because the message is embedded in ASCII art. I think it would be a source of bugs if some error types had ": {source}" but others don't. So I'm going to remove all ": {source}"s, and let the callers concatenate them when needed.	2024-02-04 09:13:21 +09:00
Ilya Grigoriev	d439de073d	rewrite.rs: revert commits `cfcc7c5e` and `becbc889` This mostly reverts https://github.com/martinvonz/jj/pull/2901 as well as its fixup https://github.com/martinvonz/jj/pull/2903. The related bug is reopened, see https://github.com/martinvonz/jj/issues/2869#issuecomment-1920367932. The problem is that while the fix did fix #2869 in most cases, it did reintroduce the more severe bug https://github.com/martinvonz/jj/issues/2760 in one case, if the working copy is the commit being rebased. For example, suppose you have the tree ``` root -> A -> B -> @ (empty) -> C ``` ### Before this commit #### Case 1 `jj rebase -s B -d root --skip-empty` would work perfectly before this commit, resulting in ``` root -> A \-------B -> C \- @ (new, empty) ``` #### Case 2 Unfortunately, if you run `jj rebase -s @ -d A --skip-empty`, you'd have the following result (before this commit), which shows the reintroduction of #2760: ``` root -> A @ -> C \-- B ``` with the working copy at `A`. The reason for this is explained in https://github.com/martinvonz/jj/pull/2901#issuecomment-1920043560. ### After this commit After this commit, both case 1 and case 2 will be wrong in the sense of #2869, but it will no longer exhibit the worse bug #2760 in the second case. Case 1 would result in: ``` root -> A \-------B -> @ (empty) -> C ``` Case 2 would result in: ``` root -> A -> @ -> C \-- B ``` with the working copy remaining a descendant of A	2024-02-03 15:56:44 -08:00
Essien Ita Essien	8423c63a04	cli: Refactor workspace root directory creation * Add file_util::create_or_reuse_dir() which is needed by all init functionality regardless of the backend.	2024-02-03 14:15:05 +00:00
Yuya Nishihara	ec0f2753ae	repo: mark inner error of EditCommitError as source	2024-02-01 16:59:44 +09:00
Martin von Zweigbergk	7c87fe243c	backends: implement as_any() on OpStore and OpHeadsStore too It's useful for custom commands to be able to downcast to custom backend types.	2024-01-31 00:15:29 -08:00
Ilya Grigoriev	cfcc7c5e34	test_rewrite: Fixup test comment after `becbc88`	2024-01-30 23:43:05 -08:00
Martin von Zweigbergk	9efa66e8c9	rewrite: remove return value from `rebase_next()` `rebase_next()` returns an `Option<RebasedDescendant>`, but the only way we use it is to decide whether to terminate the loop over `to_visit`. Let's simplify by making the caller iterate over `to_visit` instead.	2024-01-30 23:27:48 -08:00
Martin von Zweigbergk	881d75e899	rewrite: drop TODO about changing the API The `rebase_next()` method is private, so I think we've addressed the TODO.	2024-01-30 23:27:48 -08:00
Ilya Grigoriev	becbc88915	rewrite.rs: fix working copy position after `jj rebase --abandon-empty` Fixes #2869	2024-01-30 22:53:55 -08:00
Ilya Grigoriev	1fff6e37a1	rewrite.rs DescendantRebaser: rename variable for clarity The `edit` argument seems to be true if and only if the old commit was not abandoned. So, I flipped its value and renamed it to `abandoned_old_commit`.	2024-01-30 22:53:55 -08:00
Yuya Nishihara	976b801208	index: on reinit(), delete all segment files to save disk space Perhaps, reinit() will evolve to gc() function? It's basically a gc() with empty operation set.	2024-01-31 09:40:52 +09:00
Yuya Nishihara	3d68601c01	index: remove redundant stat() of operation link file, handle error instead This wouldn't matter in practice, but the operation link file could be deleted after testing the existence.	2024-01-31 09:40:52 +09:00
Yuya Nishihara	3d0b3d57d8	git_backend: on gc(), remove unreachable no-gc refs and compact them With my jj repo, the number of jj/keep refs went down from 87887 to 27733. The .git directory size is halved, but we'll need to clean up extra and index files to save disk space. "git gc --prune=now && jj debug reindex" passed, so the repo wouldn't be corrupted. #12	2024-01-27 10:18:11 +09:00
Yuya Nishihara	351487b9f5	backend: pass Index and keep_newer timestamp parameters to gc() GitBackend::gc() will need to check if a commit is reachable from any historical operations. This could be calculated from the view and commit objects, but the Index will do a better job.	2024-01-27 10:18:11 +09:00
Yuya Nishihara	845eb4ce01	git_backend: when running "git gc", chdir instead of specifying it by GIT_DIR Hopefully this will be more reliable on Windows where path/environment stuff is messy.	2024-01-27 10:18:11 +09:00
Yuya Nishihara	4e54021930	backend: have gc() return BackendError instead of opaque error type The gc() implementation is likely to call other backend functions, which return BackendError.	2024-01-27 10:18:11 +09:00
Yuya Nishihara	84949dd551	backend: mark BackendError::Other as transparent The inner error should be the source, and I don't think the "Error:" prefix gives additional context.	2024-01-27 10:18:11 +09:00
Yuya Nishihara	8a67191d25	git: simplify import_head() as it doesn't have to process multiple head commits	2024-01-27 00:01:59 +09:00
Yuya Nishihara	fc114ef217	git: extract Git HEAD handling bits from import_some_refs() I'm going to make WorkspaceCommandHelper::maybe_snapshot() snapshot the working copy before importing refs. git::import_some_refs() can rebase the working copy branch and therefore @ can be moved. git::import_head() doesn't, and it should be invoked before snapshotting. git::import_head() is inserted to some of the git:import_refs() callers where HEAD seems to matter. I feel it's a bit odd that the HEAD ref is imported to non-colocated repo, but "jj init --git-repo" relies on that, and I think the existence of HEAD@git is harmless. It's merely a ref to the revision checked out somewhere else.	2024-01-27 00:01:59 +09:00
Yuya Nishihara	5a88180720	git_backend: fix import_head_commits() to not issue duplicated ref edits This was broken at `afa72ff496` "git_backend: inline prevent_gc() to bulk-update refs." Since no-gc refs are created within a transaction, duplicated edits are no longer allowed.	2024-01-27 00:00:57 +09:00
Ilya Grigoriev	dff440c4a8	`clippy`: Fix nightly warnings about "useless use of vec!"	2024-01-25 22:00:26 -08:00
Daniel Ploch	20cbe77bf5	workspace: support creating shares of custom workspaces	2024-01-25 11:46:07 -08:00
Daniel Ploch	cb889f0b45	workspace: combine working copy functions into a trait	2024-01-25 11:46:07 -08:00
Yuya Nishihara	5a7d8ac596	working_copy: don't follow symlinks when visiting files in gitignored directory Fixes #2878	2024-01-24 16:38:48 +09:00
Yuya Nishihara	d0d4496258	tests: add executable files and symlinks to gitignored directory test	2024-01-24 16:38:48 +09:00
Martin von Zweigbergk	502150b2f4	conflicts: test materialization with with negative snapshots We didn't have any tests with negative snapshots (after a `-------` line). I initially thought we couldn't produce such conflict markers anymore. I'm not sure we want to render conflicts like the one in the test like this. I don't think I intended for `add_index` in the code to be able to be two steps ahead of the remove. Maybe we should rewrite the algorithm to not do that and thus never produce negative snapshots.	2024-01-23 07:18:54 -08:00
Ilya Grigoriev	d168fd2b09	`test_rebase_abandoning_empty`: add children of an empty `@` to the test case This demonstrates the minor bug discussed in https://github.com/martinvonz/jj/pull/2766#discussion_r1442365389 AKA https://github.com/martinvonz/jj/issues/2869. It's also interesting whether changing the definition of "discardable" commit would affect this test, see https://github.com/martinvonz/jj/issues/2859#issuecomment-1903275884 (I think it won't, but still)	2024-01-22 18:36:49 -08:00
Jonathan Tan	0bc1341fd0	revset: add count_estimate() to Revset trait The count() function in this trait is used by "jj branch" to determine (and then report) how many commits a certain branch is ahead/behind another branch. This is currently implemented by walking all commits in the revset, counting how many were encountered. But this could be improved: if the number is large, it is probably sufficient to report "at least N" (instead of walking all the way), and this does not scale well to jj backends that may not have all commits present locally (which may prefer to return an estimate, rather than access the network). Therefore, add a function that is explicitly documented to be O(1) and that can return a range of values if the backend so chooses. Also remove count(), as it is not immediately obvious that it is an expensive call, and callers that are willing to pay the cost can obtain the exact same functionality through iter().count() anyway. (In this commit, all users of count() are migrated to iter().count() to preserve all existing functionality; they will be migrated to count_estimate() in a subsequent commit.) "branch" needed to be updated due to this change. Although jj is currently only available in English, I have attempted to keep user-visible text from being assembled piece by piece, so that if we later decide to translate jj into other languages, things will be easier for translators.	2024-01-22 15:07:00 -08:00
Yuya Nishihara	c7be4d019c	index: add all_heads_for_gc() that iterates heads of all indexed commits GitBackend::gc() will recreate no-gc refs for the indexed heads. We could collect all historical heads by traversing operation log, but it isn't enough because there may be predecessor links to hidden commits, and "git gc" isn't aware of predecessors.	2024-01-17 23:07:14 +09:00
Yuya Nishihara	afa72ff496	git_backend: inline prevent_gc() to bulk-update refs	2024-01-17 10:43:25 +09:00
Yuya Nishihara	96ee9bdb9f	git_backend: ensure no-gc refs are created for all imported head commits This also means that we can implement GC without taking care of extra metadata. I haven't tried, but it wouldn't be easy to keep Git refs and extra table in sync.	2024-01-17 10:43:25 +09:00
Yuya Nishihara	2e1aa6c49c	git_backend: remove fast path testing imported commits, filter them by caller The idea is that GC, if implemented, will clean up objects based on the Index knowledge. It's probably okay to leave some extra metadata of unreachable objects, but GC-ed refs should be recreated if the corresponding heads get reimported. See also the next patch.	2024-01-17 10:43:25 +09:00
Yuya Nishihara	48c4985e34	git_backend: ensure that no-gc ref target never conflicts	2024-01-17 10:43:25 +09:00
Yuya Nishihara	f66c859fe4	git_backend: use lower-level API to create no-gc refs This will allow us to issue multiple prevent_gc() requests all at once. It's not important here, but will be unavoidable when implementing GC. Deleting tons of refs from packed refs is super slow if the requests were processed one by one.	2024-01-17 10:43:25 +09:00
Yuya Nishihara	34956f17e5	op_walk: assert that virtual root op is not reparented This is enforced by the caller, but it's scary if it weren't.	2024-01-16 21:46:54 +09:00
Yuya Nishihara	fb3e006a45	op_store: add special case for root id resolution	2024-01-16 21:46:54 +09:00
Yuya Nishihara	660806ffed	tests: set up unparented operations for id prefix tests Otherwise we can't easily pick i to create operation id starting with "0".	2024-01-16 21:46:54 +09:00
Yuya Nishihara	df1be14aa8	tests: split op id resolution tests, don't require merged op for prefix tests This makes it easy to set up crafted environment for prefix resolution tests.	2024-01-16 21:46:54 +09:00
Essien Ita Essien	dc074363d1	no-op: Move external git repo canonicalization into Workspace::init_git_external * Move canonicalization of the external git repo path into the Workspace::init_git_external(). This keeps necessary code together. * Add a new variant of WorkspaceInitError for reporting path not found errors. The user error string is written to pass existing tests.	2024-01-16 10:46:02 +00:00
Yuya Nishihara	da218d19db	repo: optimize enforce_view_invariants() to not traverse ancestors until root Because the default index cuts off the traversal at min(generations), including the root id means all ancestors will be visited. This could be worked around at the index side, but I think it's the repo/view's responsibility. That being said, it's not uncommon to pad a revset with "root()", so it might make sense for the index to special case the root id. I also removed the redundant .clone().	2024-01-15 09:57:02 +09:00
Martin von Zweigbergk	6e302bb3a2	op_store: add a virtual root operation, similar to root commit It seems obvious in hindsight to have a virtual root operation just like we have a virtual root commit. It removes the same kind of problems by making sure there's always a common ancestor (or multiple) between any two commits. I think the reason I didn't add a root operation from the beginning was that there used to be a mandatory working-copy commit in the view (this was before support for multiple workspaces). Perhaps we should remove the "initialize repo" operation now. The only difference between their view objects is that the "initialize repo" operation adds the root commit as a head. We could add that to the root operation, but then the root operation's value depends on the commit backend.	2024-01-14 10:15:14 -08:00
Martin von Zweigbergk	c9af8bf43a	view: drop tracking of public heads We've had the public_heads for as long as we've had the View object, IIRC (I didn't check), but we still don't use it for anything. I don't have any concrete plans for using it either. Maybe our config for immutable commits is good enough, or maybe we'll want something more generic (like Mercurial's phases). For now, I think we should simplify by removing it the storage for public heads.	2024-01-13 22:23:57 -08:00
Martin von Zweigbergk	a66e2a0a6d	working_copy: mark commit_id field in proto reserved By marking it reserved, we prevent accidental use. We can still read working copy protos that have the field.	2024-01-12 17:38:23 -08:00
Yuya Nishihara	543036c753	cli: run "op log" without loading repo or merging concurrent ops When debugging behavior of badly-GCed repos, I find it's annoying that "op log" fails because the index can't be loaded. Since "op log" doesn't need a repo, I think it's better to display the exact op-heads state without merging.	2024-01-13 10:38:10 +09:00
Yuya Nishihara	831a530283	op_walk: make walk_ancestors() sort head ops to stabilize output I thought this would be done by dag_walk::topo_order_reverse_lazy_ok(), but apparently I made it preserve the input order in a way topo_order_reverse() would do.	2024-01-13 10:38:10 +09:00
Yuya Nishihara	b7eb551cf7	index: fix reindexing to scan all referenced commits such as hidden remote refs Since hidden commits can be looked up by remote_branches() revset for example, reindexing should traverse ancestors from all named refs in addition to the visible heads.	2024-01-12 12:53:16 +09:00
Yuya Nishihara	805046ceba	op_walk: extract function that resolves op expression with preloaded head op I'm going to make "op abandon" not load the repo, and this function will be used there instead of resolve_op_with_repo().	2024-01-12 08:01:13 +09:00
Yuya Nishihara	83ede241e3	op_walk: don't resolve heads beyond @ operation Since `jj undo --at-op=OP @` resolves @ to OP, I think OP should be the head in that context, and the descendants of OP shouldn't be accessible by @+.	2024-01-12 08:01:13 +09:00
Yuya Nishihara	ba42b37a67	operation: remove operation::View wrapper in favor of view::View view::View doesn't track ViewId, but there are no callers of cheap Eq/Hash functions.	2024-01-12 08:01:02 +09:00
Yuya Nishihara	d5a98df046	git_backend: teach "format.tree-level-conflicts" config by constructor Since GitBackend constructors now depend on &UserSettings, it makes sense to initialize the formatting options there.	2024-01-10 08:57:51 +09:00
Yuya Nishihara	e5286aed08	index: move lifetimed change_id_index() to MutableIndex, rename 'static version change_id_index() is only used by Readonly/MutableRepo, so we don't need an abstraction at Index. evaluate_revset() is somewhat similar, but the callers rely on &dyn Repo.	2024-01-09 10:38:00 +09:00
Yuya Nishihara	dc68f1eeb2	revset: remove unused lifetime parameter from Revset<'index>	2024-01-09 10:37:43 +09:00
Yuya Nishihara	e9d31177cb	op_store: implement GC of unreachble operations and views Since new operations and views may be added concurrently by another process, there's a risk of data corruption. The keep_newer parameter is a mitigation for this problem. It's set to preserve files modified within the last 2 weeks, which is the default of "git gc". Still, a concurrent process may replace an existing view which is about to be deleted by the gc process, and the view file would be lost. #12	2024-01-09 10:37:03 +09:00
Yuya Nishihara	5894f3dfba	operation: add shorthand for &store_operation().view_id	2024-01-09 10:37:03 +09:00
Martin von Zweigbergk	c98b0d76af	index: move Revset::change_id_index() to Index We current have `Revset::change_id_index()` for creating a `ChangeIdIndex` for a given revset. I think it will be hard to make it performant for general revsets, especially in very large repos and with custom index implementations, like the one we have at Google. If we instead restrict it to including all ancestors of a set of heads, I think it will be much easier to implement. We only use `Revset::change_id_index()` with revsets including all visible commits today, so we won't lose any current functionality by making it more restricted.	2024-01-08 06:06:47 -08:00
Martin von Zweigbergk	2f4594540a	tests: move ChangeIdIndex test from test_revset to test_index	2024-01-08 06:06:47 -08:00
Martin von Zweigbergk	1508f28567	tests: update ChangeIdIndex test to include ancestors in set I plan to replace `Revset::change_id_index()` by `Index::change_id_index(heads)`, but one of the tests currently uses a set of commits that does not include ancestors. This patch updates it to include ancestors (and changes the set of heads to keep the set small enough for the test).	2024-01-08 06:06:47 -08:00
Martin von Zweigbergk	f9dc00704d	index: specialize evaluate_revset_static() to change_id_index_static() I'd like to move `change_id_index()` from `Revset` to `Index` (and make it take the set of visible heads as argument). We currently use `evaluate_revset_static()` only to get a `ChangeIdIndex`, so a good place to start is to convert that into `change_id_index_static()`.	2024-01-08 06:06:47 -08:00
Martin von Zweigbergk	b549090acc	index: adopt ChangeIdIndex and relatives from revset module The `ChangeIdIndex` type is currently in defined in the `revset` module because that's the only placed it's used. However, I'd like to start using it directly from `index`. The idea is to make it possible to create a `ChangeIdIndex` given a set of heads, without first creating a `Revset`.	2024-01-08 06:06:47 -08:00
Martin von Zweigbergk	f0182ad4b8	default_index: adopt revset engine and graph iterator modules The revset engine and the graph iterator are specific to the default index implementation, so they belong in the same module.	2024-01-07 05:37:47 -08:00
Yuya Nishihara	a6616e9cea	object_id: don't allow ObjectId::from_hex() a dynamically allocated string This isn't technically needed, but it prevents API misuse. Another option is to do some compile-time substitution, but most callers are tests and the runtime performance wouldn't matter.	2024-01-06 00:26:36 +09:00
Yuya Nishihara	837ac15052	op_store: add resolve_operation_id_prefix() trait method that uses readdir() The OpStore backends should have a better way to look up operation by id than traversing from the op heads. The added method is similar to the commit Index one, but returns an OpStoreResult because the backend operation can fail. FWIW, if we want .shortest() in the op log template, we'll probably need a trait method that returns an OpIndex instead.	2024-01-05 23:36:57 +09:00
Yuya Nishihara	95ea352b0a	object_id: add fallible version of ObjectId::from_hex()	2024-01-05 23:36:57 +09:00
Yuya Nishihara	95d83cbfe5	object_id: make ObjectId constructors non-trait methods I'm going to add try_from_hex(), which requires Self: Sized. Such trait bound could be added, but I don't think we'll need abstracted ObjectId constructors at all.	2024-01-05 23:36:57 +09:00
Yuya Nishihara	31b236a70d	object_id: move HexPrefix and PrefixResolution from index module	2024-01-05 10:20:57 +09:00
Yuya Nishihara	fa5e40719c	object_id: extract ObjectId trait and macros to separate module I'm going to add a prefix resolution method to OpStore, but OpStore is unrelated to the index. I think ObjectId, HexPrefix, and PrefixResolution can be extracted to this module.	2024-01-05 10:20:57 +09:00
Yuya Nishihara	dbaee198e6	hex_util: move common_hex_len() from backend module This function predates the hex_util module. If there were hex_util, I would add it there.	2024-01-05 10:20:57 +09:00
Yuya Nishihara	e5255135bb	op_walk: add function that reparents (and abandons) operation range This will be used in "jj op abandon ..op_id" command. The "op_id..@" range will be reparented onto the root operation. The current implementation is good enough for local repos, but it won't scale. We might want to extract it as a trait method or introduce OpIndex for efficient DAG operation.	2024-01-04 11:44:36 +09:00
Yuya Nishihara	392e83be42	op_heads: ensure that update_op_heads([id], id) fails The doc states it's invalid, but I made such bug.	2024-01-04 11:44:36 +09:00
Matt Stark	3f0a49dafe	Ensure you never drop the working commit with --skip-empty See #2766 for discussions	2024-01-04 13:33:24 +11:00
Matt Stark	a4aed2391f	Rewrite instead of abandoning empty commits. Fixes #2760 Given the tree: ``` A-B-C \ B2 ``` And the command `jj rebase -s B -d B2` We were previously marking B as abandoned, despite the comment stating that we were marking it as being succeeded by B2. This resulted in a call to `rewrite(rewrites={}, abandoned={B})` instead of `rewrite(rewrites={B=>B2}, abandoned={})`, which then made the new parent of `C` into `A` instead of `B2`	2024-01-04 13:33:24 +11:00
Ilya Grigoriev	6edaa97517	DescendantRebaser: change `rebased()` method to `into_map()` that consumes the rebaser This prevents a clone and does not affect the public API, as suggested in https://github.com/martinvonz/jj/pull/2738#discussion_r1438903463.	2024-01-01 21:55:18 -08:00
Ilya Grigoriev	ddec3f91b2	lib: mild refactoring made possible by previous commit Inline `create_descendant_commits`, move some functionality of `DescendantRebaser::rebase_next` to `rebase_all`, a seemingly more logical location.	2024-01-01 18:51:36 -08:00
Ilya Grigoriev	277b81ff6f	lib: make `DescendantRebaser`-related APIs private. Finally, there are no test uses of these APIs. `DescendantRebaser` is made `pub(crate)`, since it is used by `MutRepo`. Other functions are made private.	2024-01-01 18:51:36 -08:00
Ilya Grigoriev	45cd0bf11b	test_rewrite.rs: stop using DescendantRebaser when testing EmptyBehavior This completes the process of removing DescendantRebaser-related APIs from tests. It requires creating some new test utils and a new `rebase_descendants_with_option_return_map`.	2024-01-01 18:51:36 -08:00
Ilya Grigoriev	7cef879ef6	lib `repo.rs` & `rewrite.rs`: Move clearing of rewritten/abandoned commits This commit is a little out of place in this sequence, but it seems to make more sense for MutRepo to own these maps. @yuja [pointed out] that any tests written using `create_descendant_rebaser` now need to do this cleanup, but there are no longer any such tests after the previous commits and a follow-up commit removes `create_descendant_rebaser` entirely. [pointed out]: https://github.com/martinvonz/jj/pull/2737#discussion_r1435754370	2024-01-01 18:51:36 -08:00
Ilya Grigoriev	4461d61254	test_rewrite: test branches of descendants of divergent commits A TODO left over from a previous PR	2024-01-01 18:51:36 -08:00
Ilya Grigoriev	b2abba07e9	tests: (mostly) stop using soon-to-be-private DescendantRebaser-related APIs This removes uses of `DescendantRebaser::new` or `MutRepo::create_descendant_rebaser` from most tests. The exceptions are the tests having to do with abandoning empty commits on rebase, since adjusting those is a bit more elaborate (see follow-up commits).	2024-01-01 18:51:36 -08:00
Yuya Nishihara	3eafca65ea	op_walk: add support for op_id+ (children) operator A possible use case is when doing some archaeology around a certain operation. The current implementation is quadratic if + is repeated. Suppose op_id is usually close to the current op heads, I think it'll practically work better than building a reverse lookup table.	2024-01-02 10:30:08 +09:00
Yuya Nishihara	ab299a6af5	op_walk: reimplement prefix lookup by using walk_ancestors() and HexPrefix Perhaps, OpStore should provide prefix resolution method, but let's think that later.	2024-01-02 10:30:08 +09:00
Yuya Nishihara	c53748d732	op_walk: allow walk_ancestors() from more than one head operations	2024-01-02 10:30:08 +09:00
Yuya Nishihara	51691ea22c	tests: add lib tests for op id resolution, migrate some from cli CLI testing is slow and harder to set up crafted environment.	2024-01-02 10:30:08 +09:00
Yuya Nishihara	dad890b960	operation: make parent_ids() return slice instead of Vec reference	2024-01-02 02:47:41 +09:00
Yuya Nishihara	c9b581589c	op_walk: simplify arguments passed to high-level "opset" query functions	2024-01-01 10:22:23 +09:00
Yuya Nishihara	26b5f38f45	op_walk: move "opset" query functions from jj_cli	2024-01-01 10:22:23 +09:00
Yuya Nishihara	e4460d5386	op_walk: add error types for fake "opset" expression This removes CommandError dependency from these resolution functions. We might want to refactor the error types again if we introduce a real "opset" evaluator. The error message for unresolved op heads now includes "@" instead of the whole expression.	2024-01-01 10:22:23 +09:00
Yuya Nishihara	94fc32ab47	op_walk: extract walk_ancestors() to new module I'm going to extract fake "opset" resolution functions there, and I think walk_ancestors() belongs to the same category.	2024-01-01 10:22:23 +09:00
Yuya Nishihara	6dd936f72f	op_heads: let caller decide resolve_op_heads() error type The resolver callback usually returns wider error type, which I don't think is a variant of OpHeadResolutionError. To help type inference, resolver's error type is E, not E1 where E: From<E1>.	2024-01-01 10:22:23 +09:00
Martin von Zweigbergk	90744fb770	working copy: read files ahead when updating If the commit backend has high latency, it can make a big difference to read files concurrently. This patch updates the working copy code to do that in the update code (when reading files from the backend to write to the working copy). Because our backend at Google reads files from a local daemon process that already does a lot of prefetching, this patch doesn't actually help us. I think it's still the right thing to do for backends that don't do the same kind of prefetching. It speeds up `jj sparse set --add` by >10x when I disable the prefetching in our daemon (our `Backend::concurrency()` is 100).	2023-12-29 13:37:13 -08:00
Yuya Nishihara	f9e9058b9b	index: show bad operation id if commit lookup failed during reindexing My jj repo contains such head commits, and "jj debug reindex" fails. To address this problem, we'll probably need to implement GC, and the user will discard operations before the first bad op id.	2023-12-29 13:05:58 +09:00
Yuya Nishihara	43e016a7d1	index: add explicit reindexing method that can propagate error	2023-12-29 13:05:58 +09:00
Yuya Nishihara	ab1c8656a4	index: rename private index_at_operation methods, reorder arguments I'm going to add a public method that rebuilds index, and its return type will be different. I also added "build_" because "index" could be misinterpreted as noun. The method arguments are reordered to follow the public IndexStore interface.	2023-12-29 13:05:58 +09:00
Yuya Nishihara	3abe6be384	index: propagate DefaultIndexStore::init/reinit() errors	2023-12-29 13:05:58 +09:00
Yuya Nishihara	955f6e356a	repo: add error propagation path to IndexStore initialization and loading The error types are shared with the commit store backend. We could add per-store error types, but it's unlikely that the caller needs to discriminate them.	2023-12-29 13:05:58 +09:00
Yuya Nishihara	bb73cd491f	clenaup: don't use debug format to embed ObjectId in error message Also fixed typo, s/a/an/.	2023-12-29 13:05:58 +09:00
Martin von Zweigbergk	d06764eb7c	op heads: remove now-unused methods for adding/removing op heads	2023-12-28 09:17:42 -08:00
Martin von Zweigbergk	65a6aa61db	op heads: replace last use of remove_op_head() by update_op_heads()	2023-12-28 09:17:42 -08:00
Martin von Zweigbergk	76516bb46b	op heads: inline handle_ancestor_ops() This gets us closer to being able to use the new `update_op_heads()` function here (without calling it multiple times).	2023-12-28 09:17:42 -08:00
Martin von Zweigbergk	4221c7cf5c	op heads: remove handle_ancestor_ops() from trait I think the idea behind `handle_ancestor_ops()` was to let our backend at Google delegate the work to the server, which could then avoid walking ancestors. However, we're now thinking that we're going to make our server resolve divergent operations on its own instead, so the client will never see more than one op head, unless it manually creates the second op head itself (e.g. because the user ran two concurrent commands). In those cases it should be fine to do the walk. So let's simplify the trait by removing the function.	2023-12-28 09:17:42 -08:00
Martin von Zweigbergk	f969f4b0b0	op heads: remove lifetime from OpHeadsStoreLock	2023-12-28 09:17:42 -08:00
Martin von Zweigbergk	c304777a35	op heads: remove promote_new_op() `OpHeadsStoreLock::promote_new_op()` doesn't add much over the new `update_op_heads()`, so let's switch to the latter.	2023-12-28 09:17:42 -08:00
Martin von Zweigbergk	b8e45d196f	op heads: add a new trait method combining add and remove of op heads Consider how one would implment the current `OpHeadsStore` interface for a cloud-based backend. After `OpHeadsStore::add_op_head()` is called, the set of op heads temporarily contains two heads (typically) until `OpHeadsStore::remove_op_head()` is called. That's not invalid, but it's annoying to have to deal with that state more than necessary. Also, it's unnecessarily inefficient to send the addition and removal of op heads as separate RPCs. This patch therefore adds a `update_op_heads()` method that takes a list of old heads to remove and a single new head to add. Coming patches will start migrating to that method.	2023-12-28 09:17:42 -08:00
Martin von Zweigbergk	8137975785	op heads: drop support for old location/format We move `.jj/repo/op_heads/` into `.jj/repo/op_heads/heads/` almost a year ago, in commits `90a66ec262` and `37ba17589d`. We said we would drop support for it in 0.9+. I think we said that before we started doing monthly releases, but I we're still past the goal of 6 months (which is what I think we were aiming for).	2023-12-28 09:17:42 -08:00
Yuya Nishihara	dde42b9c05	index: rename resolve_prefix() to resolve_commit_id_prefix() I'll probably add change id lookup methods to CompositeIndex. The Index trait won't gain resolve_change_id_prefix(), but I also renamed its resolve_prefix() for consistency.	2023-12-26 01:03:10 +09:00
Yuya Nishihara	0f2f566188	index: remove "segment_" prefix from IndexSegment methods Since Readonly/MutableIndexSegment no longer implement Index trait, there's no ambiguity between segment-local and index-global operations. Let's shorten the method names.	2023-12-26 01:03:10 +09:00
Yuya Nishihara	c9b9e2864e	index: introduce newtype that represents segment-local position I'm thinking of changing some IndexSegment methods to return LocalPosition instead of global IndexPosition, and using u32 there would be a source of bugs.	2023-12-26 01:03:10 +09:00
Yuya Nishihara	ee8d5e279a	index: make segment-level lookup return neighbor commit ids instead of positions Both readonly and mutable segments know the commit ids to return, and the caller only needs the ids. Since segment_commit_id(local_pos) scans the graph entries, doing that would increase the chance of cache miss.	2023-12-26 01:03:10 +09:00
Yuya Nishihara	0e7834feb9	index: inline segment_entry_by_pos() There's no reasonable way to abstract the IndexEntry construction.	2023-12-26 01:03:10 +09:00
Ilya Grigoriev	1fb9df252b	`split.rs`: stop using DescendantRebaser::new This requires creating a new public API as a substitute. I took the opportunity to also add some comments to the `MutRepo::record_rewritten_commit`/`record_abandoned_commit` functions. I imade the simplest possible addition to the API; it is not a very elegant one. Eventually, the entire `record_rewritten_commit` API should probably be refactored again. I also added some comments explaining what these functions do.	2023-12-24 19:25:16 -08:00
Ilya Grigoriev	6bfd09009f	`move.rs`: remove use of `MutRepo::create_descenant_rebaser`. After this, the internal function is only used in tests.	2023-12-24 19:25:16 -08:00
Ilya Grigoriev	cde8ea8985	Make CommitBuilder constructors private to the library crate The implementation of `CommitBuilder::write` is tightly bound to the MutRepo, so only MutRepo should construct CommitBuilder-s.	2023-12-24 19:25:16 -08:00
Yuya Nishihara	b954bab0ca	index: fix partial reindexing to not lose commits only reachable from one side Spotted while adding error propagation there. This wouldn't likely be a real problem because "jj debug reindex" removes all of the operation links. The "} else {" condition is removed because it doesn't make sense to exclude only the exact parent_op_id operation. This can be optimized to not walk ancestors of the parent_op_id operation, but I don't see a motivation to add tests covering such scenarios. It's pretty rare that an intermediate operation link is missing.	2023-12-24 23:31:16 +09:00
Yuya Nishihara	320d15412b	index: let caller of segment-level save-in() squash segments explicitly There are many unit tests that call mutable_segment.save_in(), but I don't think these callers expect that the segment file could be squashed depending on the size. Let's make it caller's responsibility. maybe_squash_with_ancestors() should be cheap if segment_num_commits() == 0, so it's okay to call it before checking the emptiness.	2023-12-24 00:22:47 +09:00
Yuya Nishihara	1d80bbb70a	index: leverage ancestor iterator to collect segments to be squashed I think "for" loop is easier to follow. Maybe it could be rewritten further to .find_map() loop, but that would be too clever. I also made ancestor_index_segments() pub(super) since it doesn't make sense to only provide ancestor_files_without_local().	2023-12-24 00:22:47 +09:00
Yuya Nishihara	55b4f69fb6	repo: propagate store error from add_heads()	2023-12-24 00:22:30 +09:00
Yuya Nishihara	0f6a7418f2	index: propagate store error from reindexing function If the error is permanent (because the repo predates the no-gc-ref fix for example), there's no easy way to recover. Still, panicking in this function seems wrong.	2023-12-24 00:22:30 +09:00
Yuya Nishihara	7a44e590dc	lock: remove byteorder dependency from tests, use fs helper functions This is the last use of Read/WriteBytesExt. The byteorder crate is great, but we don't need an abstraction of endianness. Let's simply use the std functions.	2023-12-23 00:14:17 +09:00
Yuya Nishihara	9de6273e10	index, stacked_table: inline read_u32::<LittleEndian>() There aren't many callers of ReadBytesExt::read_u32().	2023-12-23 00:14:17 +09:00
Yuya Nishihara	21c22be96e	stacked_table: use u32::from_le_bytes() to reinterpret bytes as integer Apparently, I forgot to update this in `fb06e89649`.	2023-12-23 00:14:17 +09:00
Yuya Nishihara	6f5096e266	index, stacked_table: use u32::try_from() instead of numeric cast These .unwrap()s wouldn't be compiled out, but I don't think they would have measurable impact. Let's use the safer method.	2023-12-22 09:03:50 +09:00
Yuya Nishihara	9ec89bcf86	index, stacked_table: use u32::to_le_bytes() to reinterpret as bytes	2023-12-22 09:03:50 +09:00
Yuya Nishihara	392539fa29	index, stacked_table: simply extend Vec<u8> to not use .write_all() I'm going to remove use of .write_u32() there. It's not super important, but fewer .unwrap()s, the code looks slightly better.	2023-12-22 09:03:50 +09:00
Yuya Nishihara	fb06e89649	index: use u32::from_le_bytes() to reinterpret bytes as integer It's less abstract than going through io::Read, so is probably easier for compiler to optimize out. I also feel it's a bit more readable.	2023-12-22 09:03:50 +09:00
Yuya Nishihara	38ce914321	index: reindex on content-related I/O errors If read_exact() or read_u32() reached to EOF, the index file should be considered corrupted. File not found error is also treated as data corruption because an invalid file name could be read from the child segment file. It can't handle special file names like "..", though.	2023-12-21 08:05:30 +09:00
Yuya Nishihara	e98104d6f0	index: add file name to both io/corrupt errors, combine these variants Index file name also applies to io::Error. New error type reuses io::Error to represent data corruption. We could add an inner Corrupt\|Io enum instead, but we'll need to remap some io::Error variants (e.g. UnexpectedEof) to Corrupt anyway.	2023-12-21 08:05:30 +09:00

1 2 3 4 5 ...

2667 commits