ok/jj - ok.software

ok/jj

Author	SHA1	Message	Date
Martin von Zweigbergk	bfa0573cab	repo/workspace: drop support for old repo formats It's been more than 6 months since we added support for dynamically selecting the working copy implementation. This patch drops support for selecting the default implementation of that and other stores.	2024-06-11 22:03:20 +09:00
Yuya Nishihara	243675b793	index: turn CompositeIndex into transparent reference type This helps to eliminate higher-ranked trait bounds from RevWalkRevset and RevWalk combinators to be added. Since &CompositeIndex is now a real reference, it can be passed to functions as index: &T.	2024-03-11 17:24:10 +09:00
Yuya Nishihara	f5eb172769	tests: remove last use of walk_revs() from integration tests	2024-03-08 10:07:40 +09:00
Yuya Nishihara	3c7aa75b9b	index: switch to persistent change id index The shortest change id prefix will become a few digits longer, but I think that's acceptable. Entries included in the "revsets.short-prefixes" set are unaffected. The reachable set is calculated eagerly, but this is still faster as we no longer need to sort the reachable entries by change id. The lazy version will save another ~100ms in mid-size repos. "jj log" without working copy snapshot: ``` % hyperfine --sort command --warmup 3 --runs 20 -L bin jj-0,jj-1,jj-2 \ -s "target/release-with-debug/{bin} -R ~/mirrors/linux debug reindex" \ "target/release-with-debug/{bin} -R ~/mirrors/linux \ --ignore-working-copy log -r.. -l100 --config-toml='revsets.short-prefixes=\"\"'" Benchmark 1: target/release-with-debug/jj-0 -R ~/mirrors/linux --ignore-working-copy log -r.. -l100 --config-toml='revsets.short-prefixes=""' Time (mean ± σ): 353.6 ms ± 11.9 ms [User: 266.7 ms, System: 87.0 ms] Range (min … max): 329.0 ms … 365.6 ms 20 runs Benchmark 2: target/release-with-debug/jj-1 -R ~/mirrors/linux --ignore-working-copy log -r.. -l100 --config-toml='revsets.short-prefixes=""' Time (mean ± σ): 271.3 ms ± 9.9 ms [User: 183.8 ms, System: 87.7 ms] Range (min … max): 250.5 ms … 282.7 ms 20 runs Relative speed comparison 1.99 ± 0.16 target/release-with-debug/jj-0 -R ~/mirrors/linux --ignore-working-copy log -r.. -l100 --config-toml='revsets.short-prefixes=""' 1.53 ± 0.12 target/release-with-debug/jj-1 -R ~/mirrors/linux --ignore-working-copy log -r.. -l100 --config-toml='revsets.short-prefixes=""' ``` "jj status" with working copy snapshot (watchman enabled): ``` % hyperfine --sort command --warmup 3 --runs 20 -L bin jj-0,jj-1,jj-2 \ -s "target/release-with-debug/{bin} -R ~/mirrors/linux debug reindex" \ "target/release-with-debug/{bin} -R ~/mirrors/linux \ status --config-toml='revsets.short-prefixes=\"\"'" Benchmark 1: target/release-with-debug/jj-0 -R ~/mirrors/linux status --config-toml='revsets.short-prefixes=""' Time (mean ± σ): 396.6 ms ± 10.1 ms [User: 300.7 ms, System: 94.0 ms] Range (min … max): 373.6 ms … 408.0 ms 20 runs Benchmark 2: target/release-with-debug/jj-1 -R ~/mirrors/linux status --config-toml='revsets.short-prefixes=""' Time (mean ± σ): 318.6 ms ± 12.6 ms [User: 219.1 ms, System: 94.1 ms] Range (min … max): 294.2 ms … 333.0 ms 20 runs Relative speed comparison 1.85 ± 0.14 target/release-with-debug/jj-0 -R ~/mirrors/linux status --config-toml='revsets.short-prefixes=""' 1.48 ± 0.12 target/release-with-debug/jj-1 -R ~/mirrors/linux status --config-toml='revsets.short-prefixes=""' ```	2024-02-18 09:44:57 +09:00
Yuya Nishihara	8cdf6d752c	index: move change ids to sstable, build change-id-to-pos lookup table This basically means that the change ids are interned. We'll implement binary search over the sorted change ids table. The table could be sorted differently for better cache locality, but it is in lexicographical order for simplicity. With my testing, the cost of the id lookup isn't dominant. Unlike the parent entries, the size of the per-id overflow items isn't saved. That's s because the number of the same-change-id commits is either 1 or many. It doesn't make sense to allocate 8 bytes for each change id. Instead, we'll pay extra indirection cost to determine the size.	2024-02-18 09:44:57 +09:00
Yuya Nishihara	e2c8a8fabd	index: fix change id resolution test to not depend on deterministic order Since IdIndex sorts the entries by using .sort_unstable_by_key(), the order of the same-key elements is undefined. Perhaps, it's stable for short arrays, and the test passes because of that.	2024-02-14 23:22:23 +09:00
Yuya Nishihara	b0e8e2a1af	index: move segment files to sub directory, add version number I'm going to introduce breaking changes in index format. Some of them will affect the file size, so version number or signature won't be needed. However, I think it's safer to detect the format change as early as possible. I have no idea if embedded version number is the best way. Because segment files are looked up through the operation links, the version number could be stored there and/or the "segments" directory could be versioned. If we want to support multiple format versions and clients, it might be better to split the tables into data chunks (e.g. graph entries, commit id table, change id table), and add per-chunk version/type tag. I choose the per-file version just because it's simple and would be non-controversial. As I'm going to introduce format change pretty soon, this patch doesn't implement data migration. The existing index files will be deleted and new files will be created from scratch. Planned index format changes include: 1. remove unused "flags" field 2. inline commit parents up to two 3. add sorted change ids table	2024-02-12 19:38:36 +09:00
Yuya Nishihara	b7eb551cf7	index: fix reindexing to scan all referenced commits such as hidden remote refs Since hidden commits can be looked up by remote_branches() revset for example, reindexing should traverse ancestors from all named refs in addition to the visible heads.	2024-01-12 12:53:16 +09:00
Yuya Nishihara	e5286aed08	index: move lifetimed change_id_index() to MutableIndex, rename 'static version change_id_index() is only used by Readonly/MutableRepo, so we don't need an abstraction at Index. evaluate_revset() is somewhat similar, but the callers rely on &dyn Repo.	2024-01-09 10:38:00 +09:00
Martin von Zweigbergk	c98b0d76af	index: move Revset::change_id_index() to Index We current have `Revset::change_id_index()` for creating a `ChangeIdIndex` for a given revset. I think it will be hard to make it performant for general revsets, especially in very large repos and with custom index implementations, like the one we have at Google. If we instead restrict it to including all ancestors of a set of heads, I think it will be much easier to implement. We only use `Revset::change_id_index()` with revsets including all visible commits today, so we won't lose any current functionality by making it more restricted.	2024-01-08 06:06:47 -08:00
Martin von Zweigbergk	2f4594540a	tests: move ChangeIdIndex test from test_revset to test_index	2024-01-08 06:06:47 -08:00
Yuya Nishihara	fa5e40719c	object_id: extract ObjectId trait and macros to separate module I'm going to add a prefix resolution method to OpStore, but OpStore is unrelated to the index. I think ObjectId, HexPrefix, and PrefixResolution can be extracted to this module.	2024-01-05 10:20:57 +09:00
Yuya Nishihara	f9e9058b9b	index: show bad operation id if commit lookup failed during reindexing My jj repo contains such head commits, and "jj debug reindex" fails. To address this problem, we'll probably need to implement GC, and the user will discard operations before the first bad op id.	2023-12-29 13:05:58 +09:00
Yuya Nishihara	b954bab0ca	index: fix partial reindexing to not lose commits only reachable from one side Spotted while adding error propagation there. This wouldn't likely be a real problem because "jj debug reindex" removes all of the operation links. The "} else {" condition is removed because it doesn't make sense to exclude only the exact parent_op_id operation. This can be optimized to not walk ancestors of the parent_op_id operation, but I don't see a motivation to add tests covering such scenarios. It's pretty rare that an intermediate operation link is missing.	2023-12-24 23:31:16 +09:00
Yuya Nishihara	55b4f69fb6	repo: propagate store error from add_heads()	2023-12-24 00:22:30 +09:00
Yuya Nishihara	72d9cd019b	index: extract as_composite() to trait method The revset engine will accept abstract AsCompositeIndex type, and the evaluated revset can be 'static if the index is behind Arc<T>.	2023-12-15 16:10:28 +09:00
Martin von Zweigbergk	60fae3114e	transaction: take description at end instead of start It seems better to have the caller pass the transaction description when we finish the transaction than when we start it. That way we have all the information we want to include more readily available.	2023-12-13 08:12:49 -08:00
Yuya Nishihara	cdcd465c79	index: move default_index_store.rs to sub directory named default_index default_index_store.rs is relatively big, and it contains types and impls in arbitrary order. Let's split them into sub modules. After everything moved, mod.rs will only contain tests.	2023-12-12 08:07:52 +09:00
Yuya Nishihara	c94e1de6d2	index: add DefaultMutableIndex wrapper, move Index impls to it The wrapper type isn't needed for the mutable layer, but this mirrors the readonly type structure. Test cases are also migrated to be using the index wrapper so long as we don't have to care for the nesting of the segment files.	2023-12-10 11:03:07 +09:00
Yuya Nishihara	6c57ba7f21	index: rename ReadonlyIndexWrapper to DefaultReadonlyIndex This matches the store naming: impl IndexStore for DefaultIndexStore. I also added minimal doc comment and Debug.	2023-12-09 15:18:36 +09:00
Yuya Nishihara	cee69d1665	tests: remove index downcast helpers called only by as_<type>_composite() I'm going to rename the impl types, and I don't want to think about the names of these downcast functions.	2023-12-09 15:18:36 +09:00
Martin von Zweigbergk	380e204e73	test: use test backend in most remaining tests too I don't think the backend should matter for any of these tests, so let's test with only one, and let's make that the strictest one - the new test backend. This reduces the number of tests by 74 (from 974 to 900), but saves no measurable run time.	2023-09-24 21:24:01 -07:00
Martin von Zweigbergk	9c30d7500b	testutils: delete bool-typed `init()` in favor of enum-typed version It makes the call sites clearer if we pass the `TestRepoBackend` enum instead of the boolean `use_git` value. It's also more extensible (I plan to add another backend for tests).	2023-09-18 07:15:37 -07:00
Martin von Zweigbergk	aac5b7aa25	cargo: rename crates from `jujutsu`/`jujutsu-lib` to `jj-cli`/`jj-lib` Almost everyone calls the project "jj", and there seeems to be consensus that we should rename the crates. I originally wanted the crates to be called `jj` and `jj-lib`, but `jj` was already taken. `jj-cli` is probably at least as good for it anyway. Once we've published a 0.8.0 under the new names, we'll release 0.7.1 versions under the old names with pointers to the new crates names.	2023-07-09 06:40:43 +02:00
Yuya Nishihara	a67d8b5a65	index: turn CompositeIndex::walk_revs() into position-based API This gets rid of round-trip conversion from queries like "(main..)-". I have such expression in my default log/disambiguation revset, and the query could take ~150ms to convert head positions back and forth if the repository had tons of unmerged commits.	2023-06-19 13:41:43 +09:00
Yuya Nishihara	cf5cb380bb	index: implement Index for CompositeIndex We can't get rid of the other "impl Index"es because .as_composite() must return a real reference type. Maybe we could turn CompositeIndex into an owned wrapper, but I don't know if that would be worth the effort.	2023-05-29 08:15:40 +09:00
Yuya Nishihara	5989bdf781	index: move Index::as_any() to MutableIndex, obtain CompositeIndex from there It might sound scary to add public .mutable_index() accessor, but I think it's okay because immutable MutableIndex reference has no more power than Index. This allows us to implement Index for lifetime-bound type such as CompositeIndex<'_>.	2023-05-29 08:15:40 +09:00
Yuya Nishihara	93284a153f	index: obtain CompositeIndex from ReadonlyIndexWrapper I'll remove Index::as_any() so that Index can be implemented for reference wrapper.	2023-05-29 08:15:40 +09:00
Yuya Nishihara	fb77c55268	index: use as_composite() to access to index stats The idea is that .as_composite() is equivalent to .as_index(), but for the implementation type. I'm going to add "impl Index for CompositeIndex" to clean up index references passed to revset engine.	2023-05-29 08:15:40 +09:00
Yuya Nishihara	e24fe817c9	tests: invoke .walk_revs() through CompositeIndex Prepares for removal of the index trait method.	2023-05-24 01:02:37 +09:00
Martin von Zweigbergk	c60f14899a	index: remove entry_by_id() from trait It no longer needs to be on the `Index` trait, thereby removing the last direct use of `IndexEntry` in the trait (it's still used indirectly in `walk_revs()`).	2023-04-18 18:32:23 -07:00
Martin von Zweigbergk	baea314fc0	index: get generation number from specific impl in test	2023-03-24 10:09:40 -07:00
Martin von Zweigbergk	0a7de2540f	tests: call num_commits() on specific implementation This removes the last calls to `Index::num_commits()`.	2023-03-12 22:08:31 -07:00
Martin von Zweigbergk	5423feb8e1	tests: call stats() on specific implementation This removes the remaining calls to `Index::stats()`.	2023-03-12 22:08:31 -07:00
Martin von Zweigbergk	37151e0ff9	index: load store based on type recorded in .jj/repo/index/type This is another step towards allowing a custom `jj` binary to have its own index type. We're going to have a server-backed index implementation at Google, for example.	2023-03-11 22:22:46 -08:00
Martin von Zweigbergk	491ecc6b2e	repo: replace load_at_head() by helper in tests I'm about to make `RepoLoader::init()` return a `Result`, and I don't want to have to wrap that in a new error in `ReadonlyRepo::load_at_head()` since that's only used in tests.	2023-02-27 09:44:28 -08:00
Martin von Zweigbergk	f6a4cb57da	repo: extract a `Repo` trait for `Arc<ReadonlyRepo>` and `MutableRepo` This will soon replace the `RepoRef` enum, just like how the `Index` trait replaced the `IndexRef` enum.	2023-02-15 19:15:17 -08:00
Martin von Zweigbergk	8a067282c8	repo: make `ReadonlyRepo::index()` return a `&dyn Index` This is just a little preparation for extracting a `Repo` trait that's implemented by both `ReadonlyRepo` and `MutableRepo`. The `index()` function in that trait will of course have to return the same type in both implementations, and that type will be `&dyn Index`.	2023-02-15 19:15:17 -08:00
Martin von Zweigbergk	b955e3de03	index: extract a trait for the index Even though we don't know the details yet, we know that we want to make the index pluggable like the commit and opstore backends. Defining a trait for it should be a good step. We can refine the trait later.	2023-02-14 06:51:49 -08:00
Martin von Zweigbergk	a474c688a8	index: simplify a test helper by specializing it We apparently always have an `&Arc<ReadonlyIndex>` where we call the `generation_number()` function.	2023-02-14 06:51:49 -08:00
Martin von Zweigbergk	812ef97adb	repo: add `MutableRepo::new_commit()` returning `CommitBuilder` Since `CommitBuilder` now has a reference to `MutableRepo`, it's convenient to create instances of it by calling a method on `MutableRepo`.	2022-12-26 23:30:52 -08:00
Martin von Zweigbergk	f3208f59c4	store: propagate error from `Backend::write_commit()`	2022-12-26 23:30:52 -08:00
Martin von Zweigbergk	f1d7bbe508	testutils: create a function for writing a random commit to `MutableRepo` We already have `create_random_commit()`, which returns a `CommitBuilder`. Most callers directly write that to a `MutableRepo`. That currently returns a `Commit`, but I'm about to make it propagate errors from the backend. That would add an `unwrap()` to this sequence, making it longer. Let's create a simple helper for these callers to simplify this common pattern.	2022-12-26 23:30:52 -08:00
Martin von Zweigbergk	49b2f3b6ca	commit_builder: keep MutableRepo reference When you're done with the `CommitBuilder`, you're going to have to call `write_to_repo()`, passing it a mutable `MutableRepo` reference. It's a bit simpler to pass that reference when we create the `CommitBuilder` instead, so that's what this patch does. A drawback of passing in the mutable reference when we create the builder is that we can't have multiple unfinished `CommitBuilder` instance live at the same time. We don't have any such use cases yet, and it's not hard to work around them, so I think this change is worth it.	2022-12-26 23:30:52 -08:00
Daniel Ploch	7cbea42a24	repo: rename BackendFactories to StoreFactories	2022-12-14 14:10:30 -08:00
Yuya Nishihara	4a889b986c	index: implement generation filter on RevWalkGenerationRange This will be a building block of 'parents(base)' revset. 'base---' will be .filter_by_generation(3..4) for example. I think 'ancestors(base)' can also have an optional generation parameter, but I haven't considered any particular syntax yet.	2022-12-11 13:14:19 +09:00
Martin von Zweigbergk	d8feed9be4	copyright: change from "Google LLC" to "The Jujutsu Authors" Let's acknowledge everyone's contributions by replacing "Google LLC" in the copyright header by "The Jujutsu Authors". If I understand correctly, it won't have any legal effect, but maybe it still helps reduce concerns from contributors (though I haven't heard any concerns). Google employees can read about Google's policy at go/releasing/contributions#copyright.	2022-11-28 06:05:45 -10:00
Martin von Zweigbergk	9502d84872	operations: make hostname and username configurable We currently get the hostname and username from the `whoami` crate. We do that in lib crate, without giving the caller a way to override them. That seems wrong since it might be used in a server and performing operations on behalf of some other user. This commit makes the hostname and username configurable, so the calling crate can pass them in. If they have not been passed in, we still default to the values from the `whoami` crate.	2022-11-14 10:02:04 -08:00
Martin von Zweigbergk	eb89f6b6ca	tests: consistently import `create_random_tree()` These calls often appear in expressions long enough that not having to qualify it means that we can sometimes avoid wrapping a line. I noticed because IntelliJ told me that `test_git.rs` had some unnecessary qualificiations (the function was already imported there).	2022-11-13 15:06:10 -08:00
Martin von Zweigbergk	3c7c4e9f5c	tests: move `testutils` module into separate crate The `testutils` module should ideally not be part of the library dependencies. Since they're used by the integration tests (and the CLI tests), we need to move them to a separate crate to achieve that.	2022-11-08 07:29:35 -08:00

1 2

92 commits