mirrors/jj

mirror of https://github.com/martinvonz/jj.git synced 2024-12-27 06:27:43 +00:00

Author	SHA1	Message	Date
dploch	57a5d7dd64	cli_util: support multiple extensions consistently If we ever implement some sort of ABI for dynamic extension loading, we'll need these underlying APIs to support multiple extensions, so we might as well do that first.	2024-04-12 14:07:33 -04:00
Yuya Nishihara	97024e5be4	cli: extract CommandError and helper functions to new module The cli_util module is big enough to slow down Emacs, so let's split it up. This change is an easy one.	2024-03-03 01:11:46 +09:00
Yuya Nishihara	351487b9f5	backend: pass Index and keep_newer timestamp parameters to gc() GitBackend::gc() will need to check if a commit is reachable from any historical operations. This could be calculated from the view and commit objects, but the Index will do a better job.	2024-01-27 10:18:11 +09:00
Yuya Nishihara	4e54021930	backend: have gc() return BackendError instead of opaque error type The gc() implementation is likely to call other backend functions, which return BackendError.	2024-01-27 10:18:11 +09:00
Martin von Zweigbergk	1cc271441f	gc: implement basic GC for Git backend This adds an initial `jj util gc` command, which simply calls `git gc` when using the Git backend. That should already be useful in non-colocated repos because it's not obvious how to GC (repack) such repos. In my own jj repo, it shrank `.jj/repo/store/` from 2.4 GiB to 780 MiB, and `jj log --ignore-working-copy` was sped up from 157 ms to 86 ms. I haven't added any tests because the functionality depends on having a `git` binary on the PATH, which we don't yet depend on anywhere else. I think we'll still be able to test much of the future parts of garbage collection without a `git` binary because the interesting parts are about manipulating the Git repo before calling `git gc` on it.	2023-12-03 07:40:12 -08:00
Martin von Zweigbergk	a0cbe7ced0	cli: rename `Commands` enums to `Command` Each instance of the enum represents a single command, so singular `*Command` seems better. That also seems to match the examples in clap's documentation.	2023-12-01 16:53:54 -08:00
Yuya Nishihara	d747879aee	signing: pass SigningFn by reference write_commit() doesn't need ownership of the signing function.	2023-12-01 22:55:04 +09:00
Anton Bulakh	eb1c0ab4a2	sign: Implement a test signing backend and add a few basic tests	2023-11-30 23:36:56 +02:00
Anton Bulakh	5c3c0e9f6e	sign: Implement generic commit signing on the backend	2023-11-23 22:52:20 +02:00
Yuya Nishihara	ea32c0cb9e	git_backend: pass UserSettings to GitBackend constructors	2023-11-11 22:35:54 +09:00
Yuya Nishihara	8a2048a0e5	repo: pass UserSettings to store factories and initializers GitBackend will use it to configure gix::Repository. I think UserSettings is generally useful to pass store-specific parameters, so I've updated all factory functions.	2023-11-11 22:35:54 +09:00
Martin von Zweigbergk	d989d4093d	merged_tree: let backend influence whether to use new diff algo Since the concurrent diff algorithm is significantly slower when using the Git backend, I think we'll have to switch between the two algorithms depending on backend. Even if the concurrent version always performed as well as the sequential version, exactly how concurrent it should be probably still depends on the backend. This commit therefore adds a function to the `Backend` trait, so each backend can say how much concurrency it deals well with. I then use that number for choosing between the sequential and concurrent versions in `MergedTree::diff_stream()`, and also to decide the number of concurrent reads to do in the concurrent version.	2023-11-06 23:12:02 -08:00
Martin von Zweigbergk	cfcdd71865	backend: make `read_conflict` synchronous again This avoids https://github.com/rust-lang/futures-rs/issues/2090. I don't think we need to worry about reading legacy conflicts asynchronously - async is really only useful for Google's backend right now, and we don't use the legacy format at Google. In particular, I don't want `MergedTree::value()` to have to be async.	2023-10-28 16:45:40 -07:00
Martin von Zweigbergk	7c8a0a18f9	repo: define types for backend initializer functions `ReadonlyRepo::init()` takes callbacks for initializing each kind of backend. We called these things like `op_store_initializer`. I found that confusing because it is not an `OpStoreFactory` (which is for loading an existing backend). This patch tries to clarify that by renaming the arguments and adding types for each kind of callback function.	2023-10-16 22:33:44 -07:00
Martin von Zweigbergk	5174489959	backend: make read functions async The commit backend at Google is cloud-based (and so are the other backends); it reads and writes commits from/to a server, which stores them in a database. That makes latency much higher than for disk-based backends. To reduce the latency, we have a local daemon process that caches and prefetches objects. There are still many cases where latency is high, such as when diffing two uncached commits. We can improve that by changing some of our (jj's) algorithms to read many objects concurrently from the backend. In the case of tree-diffing, we can fetch one level (depth) of the tree at a time. There are several ways of doing that: * Make the backend methods `async` * Use many threads for reading from the backend * Add backend methods for batch reading I don't think we typically need CPU parallelism, so it's wasteful to have hundreds of threads running in order to fetch hundreds of objects in parallel (especially when using a synchronous backend like the Git backend). Batching would work well for the tree-diffing case, but it's not as composable as `async`. For example, if we wanted to fetch some commits at the same time as we were doing a diff, it's hard to see how to do that with batching. Using async seems like our best bet. I didn't make the backend interface's write functions async because writes are already async with the daemon we have at Google. That daemon will hash the object and immediately return, and then send the object to the server in the background. I think any cloud-based solution will need a similar daemon process. However, we may need to reconsider this if/when jj gets used on a server with a custom backend that writes directly to a database (i.e. no async daemon in between). I've tried to measure the performance impact. 
The largest difference I've been able to measure was on `jj diff --ignore-working-copy -s --from v5.0 --to v6.0` in the Linux repo, which increases from 749 ms to 773 ms (3.3%). In most cases I've tested, there's no measurable difference. I've tried diffing from the root commit, as well as `jj --ignore-working-copy log --no-graph -r '::v3.0 & author(torvalds)' -T 'commit_id ++ "\n"'` (to test a commit-heavy load).	2023-10-08 23:36:49 -07:00
Martin von Zweigbergk	d575aaeca8	backend: move constant functions first `root_commit_id()`, `root_change_id()`, and `empty_tree_id()` were strangely ordered between `write_symlink()` and `read_tree()`.	2023-09-19 05:24:51 -07:00
Martin von Zweigbergk	cc335a9970	cargo: move `examples/` into `cli/` so they are part of the build again	2023-08-07 21:49:45 +00:00

17 commits