ok/jj - ok.software

Author	SHA1	Message	Date
Martin von Zweigbergk	33d27ed09f	working copy: start defining a working copy trait This just extracts a trait for the trivial bits to start with.	2023-10-12 16:10:38 -07:00
Martin von Zweigbergk	187ba9430a	working_copy: rename to local_working_copy It's about time we make the working copy a pluggable backend like we have for the other storage. We will use it at Google for at least two reasons: * To support our virtual file system. That will be a completely separate working copy backend, which will interact with the virtual file system to update and snapshot the working copy. * On local disk, we need to tell our build system where to find the paths that are not in the sparse patterns. We plan to do that by wrapping the standard local working copy backend (the one moved in this commit), writing a symlink that points to the mainline commit where the "background" files can be read from. Let's start by renaming the existing implementation to `local_working_copy`.	2023-10-07 08:19:03 -07:00
Martin von Zweigbergk	9d8be29d94	watchman: use single-threaded async runtime The `#[tokio::main]` annotation uses a multi-threaded runtime by default. We don't need that for querying watchman. Switching to the single-threaded runtime saves about 20 ms.	2023-09-24 15:46:13 -07:00
Martin von Zweigbergk	b8f71a4b30	working_copy: in `LockedWorkingCopy::drop()`, discard unsaved changes In `LockedWorkingCopy::drop()`, we panic if the caller had not called `finish()`. IIRC, the idea was both to find bugs where we forgot to call `finish()` and to prevent continuing with a modified `WorkingCopy` instance. I don't think the former has been a problem in practice. It has been a problem in practice to call `discard()` to avoid the panic, though. To address that, we can make the `Drop` implementation discard the changes (forcing a reload of the state if the working copy is accessed again).	2023-09-01 12:25:47 -07:00
Martin von Zweigbergk	26581750fe	store: add an `empty_merged_tree_id()` helper Many (most?) callers of `Store::empty_tree_id()` really want a `MergedTreeId`, so let's create a helper for that. It returns the `Legacy` variant, which is what all current callers used. That should be all we need since the two variants compare equal these days, and since trees built based on the legacy variant can get promoted to the new variant on write if the config is enabled.	2023-08-30 19:58:42 -07:00
Martin von Zweigbergk	f54b456e64	working_copy: leverage `Store::get_root_tree()` in `current_tree()` I guess I forgot this when I added `Store::get_root_tree()`.	2023-08-29 08:32:04 -07:00
Martin von Zweigbergk	67832a3940	merged_tree: take store argument to `write_tree()` instead of `new()` The store isn't needed until we write the trees, so I think it makes more sense to pass it there.	2023-08-29 08:32:04 -07:00
Martin von Zweigbergk	1895a55157	working_copy: make `old_checkout` argument be `MergedTreeId` I think this was the last piece for making the working copy handle tree-level conflicts.	2023-08-27 06:49:45 -07:00
Martin von Zweigbergk	abf3853717	working_copy: return `MergedTreeId` on snapshot	2023-08-27 06:49:45 -07:00
Martin von Zweigbergk	88e9933462	working_copy: enable storing multiple tree ids in state file	2023-08-27 06:49:45 -07:00
Martin von Zweigbergk	36674e8f7e	merged_tree: make `id()` return a `MergedTreeId` We will rarely want to use the tree id without knowing whether it can contain `TreeValue::Conflict` values, so let's make the callers check.	2023-08-27 06:49:45 -07:00
Martin von Zweigbergk	389f27f042	working_copy: move writing of conflict objects into new tree builder This introduces a `MergedTreeBuilder` type, which takes a set of base trees and overrides. The idea is that it will be able to write multiple trees or a legacy tree. For now, it's only able to write legacy trees. To show that it works, the working copy's snapshotting code has been updated to use it.	2023-08-26 08:16:57 -07:00
Martin von Zweigbergk	2fe4372121	tree_builder: remove unnecessary `has_overrides()` method It's easy to instead check if the new tree id is different from the tree id.	2023-08-26 07:02:04 -07:00
Martin von Zweigbergk	0b27c33a13	working_copy: remove last use of `current_tree()` We were using `current_tree()` only for an assertion where we were walking its entries. Now that `MergedTree` supports that, we can replace `current_tree()` by `current_merged_tree()`. There's more work needed before the working copy can fully work with tree-level conflicts. We still need to be able to store multiple tree ids in the `tree_state` file, and we need to be able to create multiple trees instead of writing conflict objects to the backend.	2023-08-25 07:06:20 -07:00
Martin von Zweigbergk	85bdba5bea	working_copy: use `MergedTree` for diffing in `reset()`	2023-08-25 07:06:20 -07:00
Martin von Zweigbergk	23509e939e	working_copy: get diff from `MergedTree`s To support tree-level conflicts, we're going to need to update the working copy from one `MergedTree` to another. We're going to need to store multiple tree ids in the `tree_state` file. This patch gets us closer to that by getting the diff from `MergedTree`s, even though we assume that they are legacy trees for now, so we can write to the single-tree `tree_state` file.	2023-08-25 06:40:36 -07:00
Martin von Zweigbergk	5610525c29	working_copy: pass in `Merge` arguments to closure in `update()` When we do an update between two `MergedTree` instances, we'll get diffs between two `Merge<Option<TreeValue>>`. This commit prepares for that by changing the type of the `before` and `after` arguments we pass into the closure in `update()`.	2023-08-25 06:40:36 -07:00
Martin von Zweigbergk	c65fcabdf8	working_copy: collect code for updating update stats in one place I think it's a little easier to follow if we don't update the stats in the large callback. It also reduces the risk of forgetting to update the stats in some case (like in the exec-bit-optimization case I just removed).	2023-08-25 06:40:36 -07:00
Martin von Zweigbergk	2151fd8930	working_copy: drop optimization for exec-bit-only change When updating the working copy from one tree to another, if only the executable bit has changed between the two trees, we set the executable bit on the file without touching its contents. The optimization probably gets used quite rarely. Maybe it's even so rare that it's a pessimization overall. Perhaps its value lies more in that we avoid updating the file's mtime unnecessarily. Either way, I'm about to change this code to use `Merge<Option<TreeValue>>` and that will make this block more complex. I don't think it's worth the complexity even if it provides some small benefit sometimes.	2023-08-25 06:40:36 -07:00
Piotr Kufel	2109a7b488	Fix .gitignore handling of ignored directories - Ignore .gitignore files from untracked directories - Do not allow un-ignoring files within ignored directories	2023-08-22 22:08:32 -07:00
Martin von Zweigbergk	49fb26fdae	working_copy: write state file even if only mtimes changed When the main `TreeState::snapshot()` thread doesn't receive any updated tree entries over the channel, it correctly doesn't write a new tree. However, it also doesn't write the working copy state file (`.jj/working_copy/tree_state`). This resulted in a performance regression in `3f97a6da78`. From that commit, repeated snapshotting would have to re-read all files from disk because it didn't remember the updated mtime from the previous time. This patch fixes the bug by also writing the file if there were any new file states.	2023-08-22 14:45:52 -07:00
Martin von Zweigbergk	5641ef9a42	working_copy: don't send unchanged file states over channel This doesn't seem to make any difference right now, but it will if we write the state file when there are mtime-only changes, which we currently don't do.	2023-08-22 14:45:52 -07:00
Benjamin Saunders	6c4b8a7383	settings: support human-readable byte sizes for max-new-file-size	2023-08-17 19:29:38 -07:00
Ben Saunders	351e7feef5	working_copy: don't snapshot new files larger than 1MiB by default	2023-08-17 19:29:38 -07:00
Martin von Zweigbergk	7ad2270c05	working_copy: pass `Merge`, not `ConflictId`, to `write_conflict()` This is another small step towards making this code work with tree-level conflicts.	2023-08-16 22:59:12 -07:00
Martin von Zweigbergk	1571541214	working_copy: combine blocks for updating added/modified paths There's a lot of duplication between the blocks of code for updating modified and added paths. This commit combines them.	2023-08-16 22:59:12 -07:00
Martin von Zweigbergk	01a6578ada	working_copy: move up special case for exec-bit-only change This is also just to make the next change simpler.	2023-08-16 22:59:12 -07:00
Martin von Zweigbergk	8ded5ae03b	working_copy: convert `Diff` into `Options` for matching This is just a little refactoring to make the next step of sharing code between `Modified` and `Added` simpler.	2023-08-16 22:59:12 -07:00
Martin von Zweigbergk	5b8c1e013f	working_copy: add a helper for getting the current tree The code for getting the current tree object was repeated a few times over. I'm going to soon make it return a `MergedTree` and I don't want to repeat that code (it's more complicated than the current code).	2023-08-16 22:59:12 -07:00
Martin von Zweigbergk	9138bb5517	working_copy: use `MergedTree` for current tree when snapshotting We now have all the pieces in place to read the current tree as a `MergedTree` when snapshotting the working copy. For now, it's still always a legacy tree. We'll need to update the working copy state file to support storing multiple trees before we can create a `MergedTree` with multiple sides here.	2023-08-15 07:56:55 -07:00
Martin von Zweigbergk	c126e75b2b	working_copy: make `write_path_to_store()` work with merged values For tree-level conflicts, we're going to be getting `Merge<Option<TreeValue>>` from the current tree and produce a new such value if the contents change on disk. This commit gets us a little closer to that by passing in a value of that type into `write_path_to_store()`. This seems to have a small but measurable performance impact. Snapshotting the working copy in the git repo with all files `touch`ed went from 2.36 s to 2.43 s (3%). I think that's okay, especially since most files' mtimes rarely change, and we only pay the price when they do.	2023-08-15 07:56:55 -07:00
Martin von Zweigbergk	3f97a6da78	working_copy: avoid adding unchanged values to tree builder If the value at a path hasn't changed, there's no need to send it over the channel and have the receiver add it to `TreeBuilder`. I couldn't measure any performance impact. Now we should no longer send `TreeValue::Conflict` variants over the tree entry channel.	2023-08-14 23:32:52 -07:00
Martin von Zweigbergk	eacdad3ebd	working_copy: move writing of conflicts to receiver side of channel When writing tree-level conflicts, we're going to be writing multiple trees (maybe using some new `MergedTreeBuilder`), so we'll need the full `Merge<Option<TreeValue>>` object. This gets us closer to that by sending such objects over the channel and having the receiver write the conflict object. Note that we still sometimes send `TreeValue::Conflict` variants over the channel. That only happens if they're unchanged.	2023-08-14 23:32:52 -07:00
Martin von Zweigbergk	03f00bbf30	working_copy: return `Merge<Option<TreeValue>>` over channel When writing tree-level conflicts, we won't pass `TreeValue::Conflict` over the `tree_entries` channel. Instead, we're going to pass possibly unresolved `Merge<Option<TreeValue>>` instances. This commit prepares for that by changing the type even though we'll only pass `Merge::normal()` over the channel at this point. I did this partly to see what the performance impact is. I tested that by touching all files in the git.git repo to force the trees (and files) to be rewritten. There was no measurable impact at all (best-of-10 time was 2.44 s before and 2.40 s after, but I assume that was a fluke).	2023-08-14 23:32:52 -07:00
Martin von Zweigbergk	6c5d6d7e39	working_copy: delete duplicate comment I copied a comment that I should have just moved in `37a770e8b4`.	2023-08-14 23:32:52 -07:00
Martin von Zweigbergk	4eadb06251	working_copy: propagate errors from writing conflict parts to store	2023-08-14 23:32:52 -07:00
Martin von Zweigbergk	f1b817e8ca	cleanup: fix warnings from nightly clippy	2023-08-14 22:11:56 -07:00
Martin von Zweigbergk	e414f3b73c	cleanup: use `fs::read()` instead of `File::open().read_to_end()`	2023-08-13 14:04:59 +00:00
Martin von Zweigbergk	f9e0feaaf8	working_copy: return early from `write_path_to_store()` for non-files Almost the entire method deals with `FileType::Normal`, so we can reduce indentation and repeated matching on the file type by doing it early and returning in the non-normal-file cases.	2023-08-13 01:00:31 +00:00
Martin von Zweigbergk	23f54b8151	working_copy: propagate errors when reading conflicted file	2023-08-13 01:00:31 +00:00
Martin von Zweigbergk	33a93b6d2d	working_copy: reduce scope of a `content` variable This also avoids reading non-file conflict from disk.	2023-08-13 01:00:31 +00:00
Martin von Zweigbergk	585c212617	working_copy: reduce scope of an `executable` variable	2023-08-13 01:00:31 +00:00
Martin von Zweigbergk	2102de94b0	working_copy: inline `write_conflict_to_store()` For tree-level conflicts, we're eventually not going to have `ConflictId`. We'd want to make `write_conflict_to_store()` take a `Merge<Option<TreeValue>>` and return an updated such value. That would leave very little logic in the function, so let's just inline it instead.	2023-08-13 01:00:31 +00:00
Martin von Zweigbergk	4c46398b1c	conflicts: make `update_from_content()` write resolved content to store `update_from_content()` already writes file content for each term of an unresolved merge, so it seems consistent for it to also write the file content for resolved merges. I think this should simplify further refactoring for tree-level conflicts and for preserving the executable bit.	2023-08-11 23:59:44 +00:00
Martin von Zweigbergk	0b85f06e3d	conflicts: make `update_from_content()` work with only `FileId`s Since `update_from_content()` only works with file contents and not the executable bit or other kinds of paths, I think it makes more sense for it to deal with `FileId`s instead of `TreeValue`s.	2023-08-11 23:59:44 +00:00
Martin von Zweigbergk	a995c66635	merge: move some methods back to `conflicts` as free functions I think I moved way too many functions onto `Merge<Option<TreeValue>>` in `82883e648d`. This effectively reverts almost all of that commit. The `Merge<T>` type is a simple container and it seems like it should be at a fairly low level in the dependency graph. By moving functions off of it, we can get rid of the back-dependencies from the `merge` module to the `conflict` module that I introduced when I moved `Merge` to the `merge` module. I'm thinking the `conflict` module can focus on materialized conflicts.	2023-08-11 21:11:25 +00:00
Martin von Zweigbergk	abc7312dbc	working_copy: avoid an unused variable on Windows	2023-08-11 01:14:52 +00:00
Martin von Zweigbergk	14ddd17673	working_copy: add debug assertion that tree and file states match Perhaps the most important invariant in `.jj/working_copy/tree_state` is that the set of files in it matches the files in its tree. In particular, if a file that exists in the tree doesn't exist in the file state and doesn't exist on disk either, we won't notice that it's gone, and we will therefore not delete it from the tree on future rounds of snapshotting either.	2023-08-06 22:17:18 +00:00
Martin von Zweigbergk	6cce5e758b	working_copy: reduce scope of some variables With the recent refactorings, we don't need the `tree_builder` and `deleted_files` until a bit later.	2023-08-06 22:17:18 +00:00
Martin von Zweigbergk	16d00581f6	working_copy: add trace scope to tree-writing call Writing the tree can probably take a bit of time when the working copy has changed.	2023-08-06 22:17:18 +00:00
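
Commit b8f71a4b30 above describes making `LockedWorkingCopy::drop()` discard unsaved changes instead of panicking when `finish()` was not called. The following is a minimal sketch of that pattern only, not the actual jj code: the field names (`tree_id`, `needs_reload`, `pending_tree_id`, `finished`) and the methods `start_mutation()` and `set_tree_id()` are hypothetical and chosen purely for illustration.

```rust
// Hypothetical, simplified stand-in for a working copy; not the jj API.
#[derive(Debug)]
struct WorkingCopy {
    tree_id: String,
    // Set when a lock was dropped without `finish()`, so that the state
    // would be reloaded the next time the working copy is accessed.
    needs_reload: bool,
}

// Guard that holds pending changes until `finish()` commits them.
struct LockedWorkingCopy<'a> {
    wc: &'a mut WorkingCopy,
    pending_tree_id: Option<String>,
    finished: bool,
}

impl WorkingCopy {
    fn new(tree_id: &str) -> Self {
        WorkingCopy { tree_id: tree_id.to_string(), needs_reload: false }
    }

    fn start_mutation(&mut self) -> LockedWorkingCopy<'_> {
        LockedWorkingCopy { wc: self, pending_tree_id: None, finished: false }
    }
}

impl LockedWorkingCopy<'_> {
    // Record a change; nothing is applied until `finish()`.
    fn set_tree_id(&mut self, id: &str) {
        self.pending_tree_id = Some(id.to_string());
    }

    // Apply the pending change and mark the guard as finished.
    fn finish(mut self) {
        if let Some(id) = self.pending_tree_id.take() {
            self.wc.tree_id = id;
        }
        self.finished = true;
    }
}

impl Drop for LockedWorkingCopy<'_> {
    fn drop(&mut self) {
        // Instead of panicking when `finish()` was never called, discard
        // the pending change and force a reload on the next access.
        if !self.finished {
            self.pending_tree_id = None;
            self.wc.needs_reload = true;
        }
    }
}
```

In this sketch, letting a guard go out of scope after `set_tree_id()` leaves `tree_id` untouched and sets `needs_reload`, while calling `finish()` applies the new id. A caller that wants to abandon its changes therefore needs no explicit `discard()` call.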

241 commits