ok/jj - ok.software

Author	SHA1	Message	Date
Yuya Nishihara	3d0b3d57d8	git_backend: on gc(), remove unreachable no-gc refs and compact them With my jj repo, the number of jj/keep refs went down from 87887 to 27733. The .git directory size is halved, but we'll need to clean up extra and index files to save disk space. "git gc --prune=now && jj debug reindex" passed, so the repo wouldn't be corrupted. #12	2024-01-27 10:18:11 +09:00
Yuya Nishihara	351487b9f5	backend: pass Index and keep_newer timestamp parameters to gc() GitBackend::gc() will need to check if a commit is reachable from any historical operations. This could be calculated from the view and commit objects, but the Index will do a better job.	2024-01-27 10:18:11 +09:00
Yuya Nishihara	845eb4ce01	git_backend: when running "git gc", chdir instead of specifying it by GIT_DIR Hopefully this will be more reliable on Windows where path/environment stuff is messy.	2024-01-27 10:18:11 +09:00
Yuya Nishihara	4e54021930	backend: have gc() return BackendError instead of opaque error type The gc() implementation is likely to call other backend functions, which return BackendError.	2024-01-27 10:18:11 +09:00
Yuya Nishihara	84949dd551	backend: mark BackendError::Other as transparent The inner error should be the source, and I don't think the "Error:" prefix gives additional context.	2024-01-27 10:18:11 +09:00
Yuya Nishihara	8a67191d25	git: simplify import_head() as it doesn't have to process multiple head commits	2024-01-27 00:01:59 +09:00
Yuya Nishihara	fc114ef217	git: extract Git HEAD handling bits from import_some_refs() I'm going to make WorkspaceCommandHelper::maybe_snapshot() snapshot the working copy before importing refs. git::import_some_refs() can rebase the working copy branch and therefore @ can be moved. git::import_head() doesn't, and it should be invoked before snapshotting. git::import_head() is inserted to some of the git::import_refs() callers where HEAD seems to matter. I feel it's a bit odd that the HEAD ref is imported to non-colocated repo, but "jj init --git-repo" relies on that, and I think the existence of HEAD@git is harmless. It's merely a ref to the revision checked out somewhere else.	2024-01-27 00:01:59 +09:00
Yuya Nishihara	5a88180720	git_backend: fix import_head_commits() to not issue duplicated ref edits This was broken at `afa72ff496` "git_backend: inline prevent_gc() to bulk-update refs." Since no-gc refs are created within a transaction, duplicated edits are no longer allowed.	2024-01-27 00:00:57 +09:00
Ilya Grigoriev	dff440c4a8	`clippy`: Fix nightly warnings about "useless use of vec!"	2024-01-25 22:00:26 -08:00
Daniel Ploch	20cbe77bf5	workspace: support creating shares of custom workspaces	2024-01-25 11:46:07 -08:00
Daniel Ploch	cb889f0b45	workspace: combine working copy functions into a trait	2024-01-25 11:46:07 -08:00
Yuya Nishihara	5a7d8ac596	working_copy: don't follow symlinks when visiting files in gitignored directory Fixes #2878	2024-01-24 16:38:48 +09:00
Yuya Nishihara	d0d4496258	tests: add executable files and symlinks to gitignored directory test	2024-01-24 16:38:48 +09:00
Martin von Zweigbergk	502150b2f4	conflicts: test materialization with negative snapshots We didn't have any tests with negative snapshots (after a `-------` line). I initially thought we couldn't produce such conflict markers anymore. I'm not sure we want to render conflicts like the one in the test like this. I don't think I intended for `add_index` in the code to be able to be two steps ahead of the remove. Maybe we should rewrite the algorithm to not do that and thus never produce negative snapshots.	2024-01-23 07:18:54 -08:00
Ilya Grigoriev	d168fd2b09	`test_rebase_abandoning_empty`: add children of an empty `@` to the test case This demonstrates the minor bug discussed in https://github.com/martinvonz/jj/pull/2766#discussion_r1442365389 AKA https://github.com/martinvonz/jj/issues/2869. It's also interesting whether changing the definition of "discardable" commit would affect this test, see https://github.com/martinvonz/jj/issues/2859#issuecomment-1903275884 (I think it won't, but still)	2024-01-22 18:36:49 -08:00
Jonathan Tan	0bc1341fd0	revset: add count_estimate() to Revset trait The count() function in this trait is used by "jj branch" to determine (and then report) how many commits a certain branch is ahead/behind another branch. This is currently implemented by walking all commits in the revset, counting how many were encountered. But this could be improved: if the number is large, it is probably sufficient to report "at least N" (instead of walking all the way), and this does not scale well to jj backends that may not have all commits present locally (which may prefer to return an estimate, rather than access the network). Therefore, add a function that is explicitly documented to be O(1) and that can return a range of values if the backend so chooses. Also remove count(), as it is not immediately obvious that it is an expensive call, and callers that are willing to pay the cost can obtain the exact same functionality through iter().count() anyway. (In this commit, all users of count() are migrated to iter().count() to preserve all existing functionality; they will be migrated to count_estimate() in a subsequent commit.) "branch" needed to be updated due to this change. Although jj is currently only available in English, I have attempted to keep user-visible text from being assembled piece by piece, so that if we later decide to translate jj into other languages, things will be easier for translators.	2024-01-22 15:07:00 -08:00
Yuya Nishihara	c7be4d019c	index: add all_heads_for_gc() that iterates heads of all indexed commits GitBackend::gc() will recreate no-gc refs for the indexed heads. We could collect all historical heads by traversing operation log, but it isn't enough because there may be predecessor links to hidden commits, and "git gc" isn't aware of predecessors.	2024-01-17 23:07:14 +09:00
Yuya Nishihara	afa72ff496	git_backend: inline prevent_gc() to bulk-update refs	2024-01-17 10:43:25 +09:00
Yuya Nishihara	96ee9bdb9f	git_backend: ensure no-gc refs are created for all imported head commits This also means that we can implement GC without taking care of extra metadata. I haven't tried, but it wouldn't be easy to keep Git refs and extra table in sync.	2024-01-17 10:43:25 +09:00
Yuya Nishihara	2e1aa6c49c	git_backend: remove fast path testing imported commits, filter them by caller The idea is that GC, if implemented, will clean up objects based on the Index knowledge. It's probably okay to leave some extra metadata of unreachable objects, but GC-ed refs should be recreated if the corresponding heads get reimported. See also the next patch.	2024-01-17 10:43:25 +09:00
Yuya Nishihara	48c4985e34	git_backend: ensure that no-gc ref target never conflicts	2024-01-17 10:43:25 +09:00
Yuya Nishihara	f66c859fe4	git_backend: use lower-level API to create no-gc refs This will allow us to issue multiple prevent_gc() requests all at once. It's not important here, but will be unavoidable when implementing GC. Deleting tons of refs from packed refs is super slow if the requests were processed one by one.	2024-01-17 10:43:25 +09:00
Yuya Nishihara	34956f17e5	op_walk: assert that virtual root op is not reparented This is enforced by the caller, but it's scary if it weren't.	2024-01-16 21:46:54 +09:00
Yuya Nishihara	fb3e006a45	op_store: add special case for root id resolution	2024-01-16 21:46:54 +09:00
Yuya Nishihara	660806ffed	tests: set up unparented operations for id prefix tests Otherwise we can't easily pick i to create operation id starting with "0".	2024-01-16 21:46:54 +09:00
Yuya Nishihara	df1be14aa8	tests: split op id resolution tests, don't require merged op for prefix tests This makes it easy to set up crafted environment for prefix resolution tests.	2024-01-16 21:46:54 +09:00
Essien Ita Essien	dc074363d1	no-op: Move external git repo canonicalization into Workspace::init_git_external * Move canonicalization of the external git repo path into the Workspace::init_git_external(). This keeps necessary code together. * Add a new variant of WorkspaceInitError for reporting path not found errors. The user error string is written to pass existing tests.	2024-01-16 10:46:02 +00:00
Yuya Nishihara	da218d19db	repo: optimize enforce_view_invariants() to not traverse ancestors until root Because the default index cuts off the traversal at min(generations), including the root id means all ancestors will be visited. This could be worked around at the index side, but I think it's the repo/view's responsibility. That being said, it's not uncommon to pad a revset with "root()", so it might make sense for the index to special case the root id. I also removed the redundant .clone().	2024-01-15 09:57:02 +09:00
Martin von Zweigbergk	6e302bb3a2	op_store: add a virtual root operation, similar to root commit It seems obvious in hindsight to have a virtual root operation just like we have a virtual root commit. It removes the same kind of problems by making sure there's always a common ancestor (or multiple) between any two commits. I think the reason I didn't add a root operation from the beginning was that there used to be a mandatory working-copy commit in the view (this was before support for multiple workspaces). Perhaps we should remove the "initialize repo" operation now. The only difference between their view objects is that the "initialize repo" operation adds the root commit as a head. We could add that to the root operation, but then the root operation's value depends on the commit backend.	2024-01-14 10:15:14 -08:00
Martin von Zweigbergk	c9af8bf43a	view: drop tracking of public heads We've had the public_heads for as long as we've had the View object, IIRC (I didn't check), but we still don't use it for anything. I don't have any concrete plans for using it either. Maybe our config for immutable commits is good enough, or maybe we'll want something more generic (like Mercurial's phases). For now, I think we should simplify by removing the storage for public heads.	2024-01-13 22:23:57 -08:00
Martin von Zweigbergk	a66e2a0a6d	working_copy: mark commit_id field in proto reserved By marking it reserved, we prevent accidental use. We can still read working copy protos that have the field.	2024-01-12 17:38:23 -08:00
Yuya Nishihara	543036c753	cli: run "op log" without loading repo or merging concurrent ops When debugging behavior of badly-GCed repos, I find it's annoying that "op log" fails because the index can't be loaded. Since "op log" doesn't need a repo, I think it's better to display the exact op-heads state without merging.	2024-01-13 10:38:10 +09:00
Yuya Nishihara	831a530283	op_walk: make walk_ancestors() sort head ops to stabilize output I thought this would be done by dag_walk::topo_order_reverse_lazy_ok(), but apparently I made it preserve the input order in a way topo_order_reverse() would do.	2024-01-13 10:38:10 +09:00
Yuya Nishihara	b7eb551cf7	index: fix reindexing to scan all referenced commits such as hidden remote refs Since hidden commits can be looked up by remote_branches() revset for example, reindexing should traverse ancestors from all named refs in addition to the visible heads.	2024-01-12 12:53:16 +09:00
Yuya Nishihara	805046ceba	op_walk: extract function that resolves op expression with preloaded head op I'm going to make "op abandon" not load the repo, and this function will be used there instead of resolve_op_with_repo().	2024-01-12 08:01:13 +09:00
Yuya Nishihara	83ede241e3	op_walk: don't resolve heads beyond @ operation Since `jj undo --at-op=OP @` resolves @ to OP, I think OP should be the head in that context, and the descendants of OP shouldn't be accessible by @+.	2024-01-12 08:01:13 +09:00
Yuya Nishihara	ba42b37a67	operation: remove operation::View wrapper in favor of view::View view::View doesn't track ViewId, but there are no callers of cheap Eq/Hash functions.	2024-01-12 08:01:02 +09:00
Yuya Nishihara	d5a98df046	git_backend: teach "format.tree-level-conflicts" config by constructor Since GitBackend constructors now depend on &UserSettings, it makes sense to initialize the formatting options there.	2024-01-10 08:57:51 +09:00
Yuya Nishihara	e5286aed08	index: move lifetimed change_id_index() to MutableIndex, rename 'static version change_id_index() is only used by Readonly/MutableRepo, so we don't need an abstraction at Index. evaluate_revset() is somewhat similar, but the callers rely on &dyn Repo.	2024-01-09 10:38:00 +09:00
Yuya Nishihara	dc68f1eeb2	revset: remove unused lifetime parameter from Revset<'index>	2024-01-09 10:37:43 +09:00
Yuya Nishihara	e9d31177cb	op_store: implement GC of unreachable operations and views Since new operations and views may be added concurrently by another process, there's a risk of data corruption. The keep_newer parameter is a mitigation for this problem. It's set to preserve files modified within the last 2 weeks, which is the default of "git gc". Still, a concurrent process may replace an existing view which is about to be deleted by the gc process, and the view file would be lost. #12	2024-01-09 10:37:03 +09:00
Yuya Nishihara	5894f3dfba	operation: add shorthand for &store_operation().view_id	2024-01-09 10:37:03 +09:00
Martin von Zweigbergk	c98b0d76af	index: move Revset::change_id_index() to Index We currently have `Revset::change_id_index()` for creating a `ChangeIdIndex` for a given revset. I think it will be hard to make it performant for general revsets, especially in very large repos and with custom index implementations, like the one we have at Google. If we instead restrict it to including all ancestors of a set of heads, I think it will be much easier to implement. We only use `Revset::change_id_index()` with revsets including all visible commits today, so we won't lose any current functionality by making it more restricted.	2024-01-08 06:06:47 -08:00
Martin von Zweigbergk	2f4594540a	tests: move ChangeIdIndex test from test_revset to test_index	2024-01-08 06:06:47 -08:00
Martin von Zweigbergk	1508f28567	tests: update ChangeIdIndex test to include ancestors in set I plan to replace `Revset::change_id_index()` by `Index::change_id_index(heads)`, but one of the tests currently uses a set of commits that does not include ancestors. This patch updates it to include ancestors (and changes the set of heads to keep the set small enough for the test).	2024-01-08 06:06:47 -08:00
Martin von Zweigbergk	f9dc00704d	index: specialize evaluate_revset_static() to change_id_index_static() I'd like to move `change_id_index()` from `Revset` to `Index` (and make it take the set of visible heads as argument). We currently use `evaluate_revset_static()` only to get a `ChangeIdIndex`, so a good place to start is to convert that into `change_id_index_static()`.	2024-01-08 06:06:47 -08:00
Martin von Zweigbergk	b549090acc	index: adopt ChangeIdIndex and relatives from revset module The `ChangeIdIndex` type is currently defined in the `revset` module because that's the only place it's used. However, I'd like to start using it directly from `index`. The idea is to make it possible to create a `ChangeIdIndex` given a set of heads, without first creating a `Revset`.	2024-01-08 06:06:47 -08:00
Martin von Zweigbergk	f0182ad4b8	default_index: adopt revset engine and graph iterator modules The revset engine and the graph iterator are specific to the default index implementation, so they belong in the same module.	2024-01-07 05:37:47 -08:00
Yuya Nishihara	a6616e9cea	object_id: don't allow ObjectId::from_hex() a dynamically allocated string This isn't technically needed, but it prevents API misuse. Another option is to do some compile-time substitution, but most callers are tests and the runtime performance wouldn't matter.	2024-01-06 00:26:36 +09:00
Yuya Nishihara	837ac15052	op_store: add resolve_operation_id_prefix() trait method that uses readdir() The OpStore backends should have a better way to look up operation by id than traversing from the op heads. The added method is similar to the commit Index one, but returns an OpStoreResult because the backend operation can fail. FWIW, if we want .shortest() in the op log template, we'll probably need a trait method that returns an OpIndex instead.	2024-01-05 23:36:57 +09:00
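
The count_estimate() commit listed above (0bc1341fd0) describes a cheap estimate that may return a range of values, with the user-visible text formatted as a whole rather than assembled piece by piece. The following is a minimal sketch of that idea only. The trait name `CountEstimate`, the two struct types, the `(lower, Option<upper>)` return shape, and the message wording are all assumptions made for illustration and are not taken from the jj source.

```rust
/// Hypothetical stand-in for a revset; not the actual jj trait.
trait CountEstimate {
    /// Returns (lower bound, optional upper bound) without walking every commit.
    fn count_estimate(&self) -> (usize, Option<usize>);
}

/// Assumed case: all commits are available locally, so the count is exact.
struct LocalRevset {
    commits: Vec<u32>,
}

impl CountEstimate for LocalRevset {
    fn count_estimate(&self) -> (usize, Option<usize>) {
        let n = self.commits.len();
        (n, Some(n))
    }
}

/// Assumed case: a backend that only knows it has seen at least `seen` commits.
struct PartialRevset {
    seen: usize,
}

impl CountEstimate for PartialRevset {
    fn count_estimate(&self) -> (usize, Option<usize>) {
        (self.seen, None)
    }
}

/// Formats each message as one complete string, so no sentence is
/// assembled from fragments (illustrative wording only).
fn describe_ahead(estimate: (usize, Option<usize>)) -> String {
    match estimate {
        (lower, Some(upper)) if lower == upper => format!("ahead by {lower} commits"),
        (lower, Some(upper)) => format!("ahead by {lower} to {upper} commits"),
        (lower, None) => format!("ahead by at least {lower} commits"),
    }
}

fn main() {
    let local = LocalRevset { commits: vec![1, 2, 3] };
    let partial = PartialRevset { seen: 10 };
    println!("{}", describe_ahead(local.count_estimate()));
    println!("{}", describe_ahead(partial.count_estimate()));
}
```

A caller that needs the exact number can still walk the revset and count, as the commit message notes with iter().count(); the estimate is for callers that prefer a bounded-cost answer.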

2530 commits