This is simpler than carefully tracking mutation through old/new git refs and
merged local branches. There are two subtle behavior changes:
a. unimported git refs excluded by git_ref_filter() are not pinned.
b. unexported branches are pinned (so a fetched deletion doesn't abandon the
branch if it's referenced by another branch).
I think (a) is okay (and even more correct) since such refs aren't known to jj
yet. (b) is desired.
This appears to be a bit slower (1.170s -> 1.211s with "log -R git -r 'tags()'
-Tcommit_id --ignore-working-copy"), but it seemed better than keeping a
growing cache.
This improves `jj status` time by a factor of ~2x on my machine (M1 MacBook Pro 2021 16-inch, with an SSD):
```sh
$ hyperfine --parameter-list hash before,after --parameter-list repo nixpkgs,gecko-dev --setup 'git checkout {hash} && cargo build --profile release-with-debug' --warmup 3 './target/release-with-debug/jj -R ../{repo} st'
Benchmark 1: ./target/release-with-debug/jj -R ../nixpkgs st (hash = before)
Time (mean ± σ): 1.640 s ± 0.019 s [User: 0.580 s, System: 1.044 s]
Range (min … max): 1.621 s … 1.673 s 10 runs
Benchmark 2: ./target/release-with-debug/jj -R ../nixpkgs st (hash = after)
Time (mean ± σ): 760.0 ms ± 5.4 ms [User: 812.9 ms, System: 2214.6 ms]
Range (min … max): 751.4 ms … 768.7 ms 10 runs
Benchmark 3: ./target/release-with-debug/jj -R ../gecko-dev st (hash = before)
Time (mean ± σ): 11.403 s ± 0.648 s [User: 4.546 s, System: 5.932 s]
Range (min … max): 10.553 s … 12.718 s 10 runs
Benchmark 4: ./target/release-with-debug/jj -R ../gecko-dev st (hash = after)
Time (mean ± σ): 5.974 s ± 0.028 s [User: 5.387 s, System: 11.959 s]
Range (min … max): 5.937 s … 6.024 s 10 runs
$ hyperfine --parameter-list repo nixpkgs,gecko-dev --warmup 3 'git -C ../{repo} status'
Benchmark 1: git -C ../nixpkgs status
Time (mean ± σ): 865.4 ms ± 8.4 ms [User: 119.4 ms, System: 1401.2 ms]
Range (min … max): 852.8 ms … 879.1 ms 10 runs
Benchmark 2: git -C ../gecko-dev status
Time (mean ± σ): 2.892 s ± 0.029 s [User: 0.458 s, System: 14.244 s]
Range (min … max): 2.837 s … 2.934 s 10 runs
```
Conclusions:
- ~2x improvement over the previous `jj status` time.
- Slightly faster than Git on nixpkgs.
- Still 2x slower than Git on gecko-dev, not sure why.
For reference, Git's default number of threads is defined in the `online_cpus` function: ee48e70a82/thread-utils.c (L21-L66). We are using whatever the Rayon default is.
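If we ever want to make that explicit (or cap it), Rayon's global pool can be configured up front. A minimal sketch, assuming the `rayon` crate as a dependency; this is a suggestion, not something the code currently does:
```rust
use std::thread;

fn main() {
    // Rayon's default pool already uses roughly the number of logical CPUs,
    // similar in spirit to Git's online_cpus(). To pin it explicitly:
    let n = thread::available_parallelism().map(|n| n.get()).unwrap_or(1);
    rayon::ThreadPoolBuilder::new()
        .num_threads(n)
        .build_global()
        .expect("global Rayon pool already initialized");
    println!("using {n} threads");
}
```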
In preparation for traversing the filesystem in parallel, send updates via `channel`.
An alternative is to modify shared mutable state, e.g. put `self.file_states` behind a mutex or use a concurrent hash-map. This risks leaving the `TreeState` in an invalid state if an error occurs, and makes invariants harder to reason about.
Using a channel introduces a small performance regression. (I didn't try out the concurrent hash-map approach.)
```sh
$ hyperfine --parameter-list hash before,after --setup 'git checkout {hash} && cargo build --profile release-with-debug' --warmup 3 './target/release-with-debug/jj -R ../nixpkgs st'
Benchmark 1: ./target/release-with-debug/jj -R ../nixpkgs st (hash = before)
Time (mean ± σ): 1.533 s ± 0.013 s [User: 0.587 s, System: 0.926 s]
Range (min … max): 1.510 s … 1.559 s 10 runs
Benchmark 2: ./target/release-with-debug/jj -R ../nixpkgs st (hash = after)
Time (mean ± σ): 1.563 s ± 0.021 s [User: 0.607 s, System: 0.936 s]
Range (min … max): 1.518 s … 1.595 s 10 runs
Summary
./target/release-with-debug/jj -R ../nixpkgs st (hash = before) ran
1.02 ± 0.02 times faster than ./target/release-with-debug/jj -R ../nixpkgs st (hash = after)
```
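For reference, the shape of the channel-based approach looks roughly like this (a self-contained sketch using `std::sync::mpsc` and scoped threads; the types and paths are stand-ins, not jj's actual `TreeState` code):
```rust
use std::collections::BTreeMap;
use std::path::PathBuf;
use std::sync::mpsc;
use std::thread;

// Stand-in for the per-file data jj records; not the real FileState.
#[derive(Debug)]
struct FileState {
    size: u64,
}

fn main() {
    let dirs: Vec<PathBuf> = vec!["a".into(), "b".into(), "c".into()];
    let (tx, rx) = mpsc::channel::<(PathBuf, FileState)>();

    thread::scope(|s| {
        for dir in &dirs {
            let tx = tx.clone();
            // Each worker walks one directory and sends updates over the
            // channel instead of mutating shared state behind a mutex.
            s.spawn(move || {
                let path = dir.join("file.txt");
                tx.send((path, FileState { size: 42 })).unwrap();
            });
        }
        drop(tx); // close the channel so the receiver loop below terminates

        // A single receiver applies all updates, so `file_states` is never
        // observed in a half-updated state and errors stay easy to handle.
        let mut file_states: BTreeMap<PathBuf, FileState> = BTreeMap::new();
        for (path, state) in rx {
            file_states.insert(path, state);
        }
        println!("{} files snapshotted", file_states.len());
    });
}
```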
`.gitignores` in ignored directories should be ignored. Before this
commit, we would visit ignored directories like any others if there
were any ignored paths in them.
I've done a lot of preparation for this commit, but there's still a
bit of duplication between the new code and the existing code. I don't
mind improving it if anyone has suggestions. Otherwise I might end up
doing that when I get back to working on snapshotting tree-level
conflicts soon.
This fixes #1785.
The `sub_path` is created by joining `dir` to a basename. I think
calling it just `path` is clear, especially since it's the main path
involved in each iteration of the loop.
We currently use the same code path for handling changes to tracked
paths inside ignored directories as for those outside them, but I'm
about to change that.
I also updated the assertion in the test to compare all entries
instead of just the tree id, so it's easier to spot errors if it
fails.
The `FileStateUpdate` enum now looks very similar to `Option`, so
let's just use that. I also renamed `get_updated_file_state()` to
`get_updated_tree_value()` since it returns a `TreeValue`.
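As a toy illustration of that shape (the types and variant names here are stand-ins, not jj's real `FileStateUpdate`/`TreeValue`):
```rust
// Stand-in type; jj's real TreeValue is richer than this.
#[derive(Debug, PartialEq)]
struct TreeValue(String);

// Before: a dedicated enum carrying exactly the same information as Option
// (variant names are illustrative).
#[allow(dead_code)]
enum FileStateUpdate {
    Changed(TreeValue),
    Unchanged,
}

// After: return Option<TreeValue> directly; None means "nothing to update".
fn get_updated_tree_value(on_disk: &str, in_tree: &str) -> Option<TreeValue> {
    if on_disk == in_tree {
        None
    } else {
        Some(TreeValue(on_disk.to_owned()))
    }
}

fn main() {
    assert_eq!(get_updated_tree_value("a", "a"), None);
    assert_eq!(
        get_updated_tree_value("b", "a"),
        Some(TreeValue("b".to_owned()))
    );
}
```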
We currently remove the file state for deleted files after walking the
working copy and noticing that the file is not there. However, in the
case of files that have been replaced by special files like Unix
sockets, we delete the file state inside the loop. Let's simplify a
tiny bit by not doing that.
If we don't have a recorded state for a file, we assume that it's new,
so we add it to the tree as the type it appears on disk. That means we
won't check if it exists as a conflict in the current tree. As another
step towards making the file state just a cache, let's instead treat
this case as a dirty file, so we look up the current value from the
tree. That means that adding files will be a tiny bit slower, but I
doubt it will be noticeable (we need to read the file from disk and
write it to the backend anyway).
I want to replace the `DirEntry` argument to
`get_updated_file_state()` by a `PathBuf` and a `Metadata`. To avoid
always reading the metadata, we need to check for ignored files
outside of `get_updated_file_state()`. I also think that gives the
call site a nice symmetry in how we use the `git_ignore` for
directories (`.matches_all_files_in()`) and files
(`.matches_file()`).
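The call-site symmetry I have in mind looks roughly like this (a self-contained sketch; `GitIgnore` and its two methods only borrow their names from the text above, and the matching logic is a trivial placeholder):
```rust
use std::fs;
use std::io;
use std::path::Path;

// Stand-in for jj's gitignore matcher; placeholder logic only.
struct GitIgnore;

impl GitIgnore {
    fn matches_all_files_in(&self, dir: &str) -> bool {
        dir.contains("target/")
    }
    fn matches_file(&self, path: &str) -> bool {
        path.ends_with(".tmp")
    }
}

fn visit(dir: &Path, git_ignore: &GitIgnore) -> io::Result<()> {
    for entry in fs::read_dir(dir)? {
        let entry = entry?;
        let path = entry.path();
        let name = path.to_string_lossy().into_owned();
        let file_type = entry.file_type()?;
        if file_type.is_dir() {
            // Symmetric ignore check for directories...
            if git_ignore.matches_all_files_in(&format!("{name}/")) {
                continue;
            }
            visit(&path, git_ignore)?;
        } else {
            // ...and for files, done *before* reading the metadata.
            // (jj also needs to handle ignored-but-tracked files; omitted here.)
            if git_ignore.matches_file(&name) {
                continue;
            }
            let metadata = entry.metadata()?;
            println!("{} ({} bytes)", name, metadata.len());
        }
    }
    Ok(())
}

fn main() -> io::Result<()> {
    visit(Path::new("."), &GitIgnore)
}
```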
This removes another little bit (literally) of dependency on the
cached file state by reading the old executable bit from the current
tree instead. That helps make it possible to discard the file states
without affecting the resulting snapshot, as we may want to do with
Watchman.
With this change, `write_path_to_store()` contains all the logic for
reading a file from disk and writing it to a `TreeBuilder`, making the
code for added and modified files more similar.
The `--allow-large-revsets` flag we have on `jj rebase` and `jj new`
allows the user to do e.g. `jj rebase --allow-large-revsets -b
main.. -d main` to rebase all commits that are not in main onto
main. The reason we don't allow these revsets to resolve to multiple
commits by default is that we think users might specify multiple
commits by mistake. That's probably not much of a problem with `jj
rebase -b` (maybe we should always allow that to resolve to multiple
commits), but the user might want to know if `jj rebase -d @-`
resolves to multiple commits.
One problem with having a flag to allow multiple commits is that it
needs to be added to every command where we want to allow multiple
commits but default to one. Also, it should probably apply to each
revset argument those commands take. For example, even if the user
meant `-b main..` to resolve to multiple commits, they might not have
meant `-d main` to resolve to multiple commits (which it will in case
of a conflicted branch), so we might want separate
`--allow-large-revsets-in-destination` and
`--allow-large-revsets-in-source`, which gets quite cumbersome. It
seems better to have some syntax in the individual revsets for saying
that multiple commits are allowed.
One proposal I had was to use a `multiple()` revset function which
would have no effect in general but would be used as a marker if used
at the top level (e.g. `jj rebase -d 'multiple(@-)'`). After some
discussion on the PR adding that function (#1911), it seems that the
consensus is to instead use a prefix like `many:` or `all:`. That
avoids the problem with having a function that has no effect unless
it's used at the top level (`jj rebase -d 'multiple(x)|y'` would have
no effect).
Since we already have the `:` operator for DAG ranges, we need to
change it to make room for `many:`/`all:` syntax. This commit starts
that by allowing both `:` and `::`.
I have tried to update the documentation in this commit to either
mention both forms, or just the new and preferred `::` form. However,
it's useless to search for `:` in Rust code, so I'm sure I've missed
many instances. We'll have to address those as we notice them. I'll
let most tests use `:` until we deprecate it or delete it.
It seemed awkward that a merged PR was sometimes rendered as the first branch.
Instead of sorting edges in index order, let's build a HashSet only when
deduplication is needed.
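The idea, sketched with a generic edge type (not jj's actual graph code, and with a plain length check standing in for the real "only when needed" condition):
```rust
use std::collections::HashSet;
use std::hash::Hash;

// Keep edges in their original order; only pay for a HashSet when there are
// enough edges that duplicates are even possible.
fn dedup_edges<T: Eq + Hash + Clone>(edges: &mut Vec<T>) {
    if edges.len() <= 1 {
        return; // nothing to deduplicate, no HashSet allocated
    }
    let mut seen = HashSet::with_capacity(edges.len());
    edges.retain(|edge| seen.insert(edge.clone()));
}

fn main() {
    let mut edges = vec!["b", "a", "b", "c", "a"];
    dedup_edges(&mut edges);
    assert_eq!(edges, vec!["b", "a", "c"]); // original order preserved
}
```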
It's faster to add only files matched by the Watchman matcher to the
set of deleted files than to add all files and then remove the files
that aren't matched. This speeds up `jj diff` with Watchman in the
Linux repo from
~530 ms to ~460 ms.
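The difference is roughly between these two shapes (a sketch in which a closure stands in for the Watchman-backed matcher; not jj's actual types):
```rust
use std::collections::HashSet;
use std::path::PathBuf;

// Slower shape: insert every tracked file, then drop the ones the matcher
// doesn't cover.
fn candidates_insert_then_remove(
    tracked: &[PathBuf],
    matches: &dyn Fn(&PathBuf) -> bool,
) -> HashSet<PathBuf> {
    let mut set: HashSet<PathBuf> = tracked.iter().cloned().collect();
    set.retain(|path| matches(path));
    set
}

// Faster shape: only ever insert the files the matcher actually selects.
fn candidates_filtered(
    tracked: &[PathBuf],
    matches: &dyn Fn(&PathBuf) -> bool,
) -> HashSet<PathBuf> {
    tracked.iter().filter(|p| matches(*p)).cloned().collect()
}

fn main() {
    let tracked: Vec<PathBuf> = vec!["src/lib.rs".into(), "README.md".into()];
    let matches = |p: &PathBuf| p.starts_with("src");
    assert_eq!(
        candidates_insert_then_remove(&tracked, &matches),
        candidates_filtered(&tracked, &matches)
    );
}
```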
The intention in this and some future commits is to have `update_file_state` accept `&self` instead of `&mut self` to make clear what data is updated by `update_file_state` and to ensure transactional safety of the `TreeState` contents.
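The rough direction, illustrated with stand-in types (not jj's real `TreeState`/`FileState`):
```rust
use std::collections::BTreeMap;

// Stand-in types; jj's real TreeState and FileState are richer than this.
#[derive(Clone, Debug, PartialEq)]
struct FileState {
    mtime: u64,
}

struct TreeState {
    file_states: BTreeMap<String, FileState>,
}

impl TreeState {
    // Takes &self: it computes what *would* change but mutates nothing, so an
    // error partway through a snapshot can't leave file_states half-updated.
    fn updated_file_state(&self, path: &str, disk_mtime: u64) -> Option<FileState> {
        match self.file_states.get(path) {
            Some(old) if old.mtime == disk_mtime => None,
            _ => Some(FileState { mtime: disk_mtime }),
        }
    }

    // All mutation happens in one place, after the walk has succeeded.
    fn apply_updates(&mut self, updates: impl IntoIterator<Item = (String, FileState)>) {
        self.file_states.extend(updates);
    }
}

fn main() {
    let mut tree_state = TreeState {
        file_states: BTreeMap::new(),
    };
    if let Some(state) = tree_state.updated_file_state("src/lib.rs", 123) {
        tree_state.apply_updates([("src/lib.rs".to_owned(), state)]);
    }
    assert_eq!(tree_state.file_states.len(), 1);
}
```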
It's currently a bit complicated to snapshot the working copy and
there's a lot of duplication in tests. This commit introduces a
function to simplify it. I made the function snapshot the working copy
and save the updated state. Some of the tests I changed previously
discarded the changes instead of saving them, but I think they all did
so because it was simpler. I left a few call sites unchanged because
they make concurrent changes.
This is a breaking change. An old jj binary will panic if it sees a view
saved by a new jj. Alternatively, we could store both new and legacy data
for backward compatibility.
This also lets us compare the resulting tree because the working copy
now exactly matches the tree (it used to be that the `.gitignore` file
wasn't initially snapshotted).