ok/jj - ok.software

Author	SHA1	Message	Date
Yuya Nishihara	0532301e03	revset: add latest(candidates, count) predicate This serves the role of limit() in Mercurial. Since a revset in jj is (conceptually) an unordered set, a "limit" predicate should define its ordering criteria. That's why the added predicate is named "latest". Closes #1110	2023-03-25 23:48:50 +09:00
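A minimal sketch of what a `latest(candidates, count)` predicate has to decide, namely an explicit ordering before truncating. The `Commit` struct, its fields, and the choice of committer timestamp as the ordering key (with the id as a tie-breaker) are assumptions made for illustration; the commit message itself only says that the predicate must define its ordering criteria.

```rust
/// Hypothetical, simplified commit record (not the jj library type).
#[derive(Clone, Debug, PartialEq, Eq)]
struct Commit {
    id: String,
    committer_timestamp: i64,
}

/// Return at most `count` candidates, newest first.
/// Assumption: "latest" is defined by committer timestamp; ties are broken
/// by id so that the result is deterministic.
fn latest(candidates: &[Commit], count: usize) -> Vec<Commit> {
    let mut sorted = candidates.to_vec();
    sorted.sort_by(|a, b| {
        b.committer_timestamp
            .cmp(&a.committer_timestamp)
            .then_with(|| a.id.cmp(&b.id))
    });
    sorted.truncate(count);
    sorted
}
```

Because the input is treated as an unordered set, the same candidates in any order give the same result.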
Martin von Zweigbergk	baea314fc0	index: get generation number from specific impl in test	2023-03-24 10:09:40 -07:00
Martin von Zweigbergk	75605e36af	revset: iterate over commit ids instead of index entries There are no remaining places where we iterate over a revset and need the `IndexEntry`s, so we can now make `Revset::iter()` yield `CommitId`s instead.	2023-03-23 21:58:15 -07:00
Martin von Zweigbergk	b5ea79f32e	revset: add new graph iterator function for tests I'm about to make `Revset::iter()` yield just `CommitId`s, but the tests in `test_default_revset_graph_iterator.rs` need an `IndexEntry` iterator so they can pass it into `RevsetGraphIterator::new()`. This commit prepares for the change by adding a `RevsetImpl::iter_graph_impl()` that returns `RevsetGraphIterator`, keeping `InternalRevset` still hidden within the revset engine. We could instead have made that (and `ToPredicateFn`) visible to tests. I can't say which is better.	2023-03-23 21:58:15 -07:00
Martin von Zweigbergk	c8f387d5b3	revset: pass IndexEntry iterator to graph iterator The graph iterator is specific to the index implementation, and it needs access to `IndexEntry`, which `Revset::iter()` will soon not yield.	2023-03-23 21:58:15 -07:00
Martin von Zweigbergk	27a7fccefa	revset: add a method returning a change id index One of the remaining places we depend on index positions is when creating a `ChangeIdIndex`. This moves that into the revset engine (which is coupled to the commit index implementation) by adding a `Revset::change_id_index()` method. We will also use this function later when we add support for resolving change id prefixes within a small revset. The current implementation simply creates an in-memory index using the existing `IdIndex` we have in `repo.rs`. The custom implementation at Google might do the same for small revsets that are available on the client, but for revsets involving many commits on the server, it might use a suboptimal implementation that uses longer-than-necessary prefixes for performance reasons. That can be done by querying a server-side index including changes not in the revset, and then verifying that the resulting commits are actually in the revset.	2023-03-23 20:49:15 -07:00
Martin von Zweigbergk	d3cf543abc	revset: move `revset_for_commits()` to test The function is only used in tests, so it doesn't belong in `default_revset_engine`. Also, it's not specific to that implementation, so I rewrote as a revset evaluation.	2023-03-23 04:50:33 -07:00
Martin von Zweigbergk	5f74dd5db3	repo: implement `Repo` on `ReadonlyRepo` instead of its `Arc` I'd like to be able to pass a `self` of type `&ReadonlyRepo` to functions that take a `&dyn Repo`. For that, we need `ReadonlyRepo` itself to implement `Repo` instead of having `Arc<ReadonlyRepo>` implement it. I could have solved it in a different way, but the `Arc` requirement seems like an unnecessary constraint.	2023-03-21 21:43:44 -07:00
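The motivation above can be sketched with a toy trait. The `Repo` trait here has a single made-up method and `ReadonlyRepo` a single made-up field; only the names `Repo`, `ReadonlyRepo`, and the `&dyn Repo` parameter come from the commit message. The point is that once the struct itself implements the trait, a method taking `&self` can hand `self` to a function expecting `&dyn Repo`, and callers holding an `Arc` still work by dereferencing.

```rust
use std::sync::Arc;

/// Toy stand-in for the real trait; `name()` is a hypothetical method.
trait Repo {
    fn name(&self) -> &str;
}

/// Toy stand-in for the real struct; the field is hypothetical.
struct ReadonlyRepo {
    name: String,
}

// Implemented on the struct itself, not on `Arc<ReadonlyRepo>`.
impl Repo for ReadonlyRepo {
    fn name(&self) -> &str {
        &self.name
    }
}

/// Any function that only needs the trait object.
fn describe(repo: &dyn Repo) -> String {
    format!("repo: {}", repo.name())
}

impl ReadonlyRepo {
    /// `self` is a plain `&ReadonlyRepo` here; it coerces to `&dyn Repo`.
    /// This would not compile if only `Arc<ReadonlyRepo>` implemented `Repo`.
    fn describe_self(&self) -> String {
        describe(self)
    }
}

/// Callers that hold an `Arc` reborrow the inner value explicitly.
fn describe_shared(repo: &Arc<ReadonlyRepo>) -> String {
    describe(&**repo)
}
```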
Martin von Zweigbergk	01d7239732	revset: make graph iterator yield commit ids (not index entries) We only need `CommitId`s, and `IndexEntry` is specific to the default index implementation.	2023-03-20 01:45:54 -07:00
Martin von Zweigbergk	2f876861ae	graphlog: key by commit id (not index position) The index position is specific to the default index implementation and we don't want to use it outside of there. This commit removes the use of it as a key for nodes in the graphlog. I timed it on the git.git repo using `jj log -r 'all()' -T commit_id` (the worst case I can think of) and it slowed down from ~2.02 s to ~2.20 s (~9%).	2023-03-20 01:45:54 -07:00
Martin von Zweigbergk	91df7ec4c5	revset: rename graph iterator test to match implementation	2023-03-20 01:45:54 -07:00
Martin von Zweigbergk	f758b646a9	commit_builder: add accessors for most fields I'd like to be able to access the current committer on a `CommitBuilder`.	2023-03-19 00:48:05 -07:00
Martin von Zweigbergk	70d4a0f42e	revset: remove context parameter from evaluate() The `RevsetWorkspaceContext` argument is now instead used by the new `resolve_symbol()` function.	2023-03-17 22:42:41 -07:00
Martin von Zweigbergk	d971148e4e	revset: move resolve_symbol() back to revset module The only caller is now in `revset.rs`.	2023-03-17 22:42:41 -07:00
Martin von Zweigbergk	94aec90bee	revset: resolve symbols earlier, before passing to revset engine For large repos, it's useful to be able to use shorter change id and commit id prefixes by resolving the prefix in a limited subset of the repo (typically the same subset that you'd want to see in your default log output). For very large repos, like Google's internal one, the shortest unique prefix evaluated within the whole repo is practically useless because it's long enough that the user would want to copy and paste it anyway. Mercurial supports this with its `revisions.disambiguatewithin` config (added in https://www.mercurial-scm.org/repo/hg/rev/503f936489dd). I'd like to add the same feature to jj. Mercurial's implementation works by attempting to resolve the prefix in the whole repo and then, if the prefix was ambiguous, it resolves it in the configured subset instead. The advantage of doing it that way is that there's no extra cost of resolving the revset defining the subset if the prefix was not ambiguous within the whole repo. However, there are two important reasons to do it differently in jj: * We support very large repos using custom backends, and it's probably cheaper to resolve a prefix within the subset because it can all be cached on the client. Resolving the prefix within the whole repo requires a roundtrip to the server. * We want to be able to resolve change id prefixes, which is always done in some revset. That revset is currently `all()`, i.e. all visible commits. Even on local disk, it's probably cheaper to resolve a small revset first and then resolve the prefix within that than it is to build up the index of all visible change ids. We could achieve the goal by letting each revset engine respect the configured subset, but since the solution proposed above makes sense also for local-disk repos, I think it's better to do it outside of the revset engine, so all revset engines can share the code. This commit prepares for the new functionality by moving the symbol resolution out of `Index::evaluate_revset()`.	2023-03-17 22:42:41 -07:00
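A rough sketch of the "resolve within the subset first" ordering described above, using plain string ids. The `PrefixResolution` enum, the function names, and the fallback to the full id list when the subset has no match are all assumptions made for illustration; the commit message only argues that resolving in the small subset first is likely cheaper than starting with the whole repo.

```rust
/// Hypothetical result type for a prefix lookup.
#[derive(Debug, PartialEq, Eq)]
enum PrefixResolution {
    NoMatch,
    SingleMatch(String),
    AmbiguousMatch,
}

/// Resolve `prefix` against one list of ids (assumed to contain no duplicates).
fn resolve_in(ids: &[&str], prefix: &str) -> PrefixResolution {
    let mut matches = ids.iter().filter(|id| id.starts_with(prefix));
    match (matches.next(), matches.next()) {
        (None, _) => PrefixResolution::NoMatch,
        (Some(id), None) => PrefixResolution::SingleMatch(id.to_string()),
        (Some(_), Some(_)) => PrefixResolution::AmbiguousMatch,
    }
}

/// Try the small, locally available subset first. Only when nothing in the
/// subset matches do we consult the full list (assumed to be the expensive
/// lookup, e.g. one needing a server roundtrip).
fn resolve_prefix(subset: &[&str], all: &[&str], prefix: &str) -> PrefixResolution {
    match resolve_in(subset, prefix) {
        PrefixResolution::NoMatch => resolve_in(all, prefix),
        resolution => resolution,
    }
}
```

With `all = ["abc123", "abd456", "ffe000"]` and `subset = ["abc123"]`, the prefix `ab` is ambiguous in the whole list but resolves to `abc123` within the subset.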
Martin von Zweigbergk	5afe5091a0	revset: add `default_` prefix to graph iterator module The current revset graph iterator is the default one, which the default revset engine provides.	2023-03-14 05:32:02 -07:00
Martin von Zweigbergk	3871efd2f9	revset: move `ReverseRevsetGraphIterator` into `revset` module The iterator is not specific to the implementation in `revset_graph_iterator`, so it belongs in the standard `revset` module.	2023-03-14 05:32:02 -07:00
Martin von Zweigbergk	f62fac24ac	revset: move graph iteration onto Revset trait We want to allow custom revset engines to define their own graph iterator. This commit helps with that by adding a `Revset::iter_graph()` function that returns an abstract iterator. The current `RevsetGraphIterator` can be configured to skip or include transitive edges. It skips them by default and we don't expose that option in the CLI. I didn't bother including that functionality in the new `iter_graph()` either. At least for now, it will be up to the implementation whether it includes such edges (it would of course be free to ignore the caller's request even if we added an option for it in the API).	2023-03-14 05:32:02 -07:00
Martin von Zweigbergk	eed0b23009	revset: move current implementation to new module We want to allow customization of the revset engine, so it can query server indexes, for example. The current revset implementation will be our default implementation for now. What's left in the `revset` module after this commit is mostly parsing code.	2023-03-14 05:32:02 -07:00
Martin von Zweigbergk	ada48c6f71	revset: rename file() test for consistency This should have been part of `bbd6ef0c7b`.	2023-03-13 07:20:35 -07:00
Martin von Zweigbergk	0a7de2540f	tests: call num_commits() on specific implementation This removes the last calls to `Index::num_commits()`.	2023-03-12 22:08:31 -07:00
Martin von Zweigbergk	5423feb8e1	tests: call stats() on specific implementation This removes the remaining calls to `Index::stats()`.	2023-03-12 22:08:31 -07:00
Martin von Zweigbergk	59b40d8380	tests: avoid some unnecessary calls to index().stats() The tests adding and removing heads to the repo mostly want to verify that the set of heads is expected. Some of them also check that commits are available in the index. But they shouldn't care about the exact index stats.	2023-03-12 22:08:31 -07:00
Martin von Zweigbergk	121b86c89c	tests: remove an obsolete TODO about unreachable commits in index I don't think there's much to gain from making the index match exactly what's reachable from the view. FWIW, our cloud-based implementation at Google will probably make everyone's commits visible in the index regardless of which operation they're at.	2023-03-12 22:08:31 -07:00
Martin von Zweigbergk	37151e0ff9	index: load store based on type recorded in .jj/repo/index/type This is another step towards allowing a custom `jj` binary to have its own index type. We're going to have a server-backed index implementation at Google, for example.	2023-03-11 22:22:46 -08:00
Samuel Tardieu	182919ff6f	git: add function to import a selection of the git refs	2023-03-02 10:09:08 +01:00
Samuel Tardieu	0ca4e2dad2	git: absence of globs is None rather than &[] In `git_fetch()`, any glob present in `globs` is an "allow" mark. Using `&[]` to represent an "allow-all" may be misleading, as it could indicate that no branch (only the git HEAD) should be fetched. By using an `Option<&[&str]>`, it is clearer that `None` means that all branches are fetched.	2023-03-02 10:09:08 +01:00
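The `None` versus `&[]` distinction can be sketched as follows. The function names and the very limited glob matching (exact names or a single trailing `*`) are assumptions made for illustration and are not how `git_fetch()` matches branches; only the `Option<&[&str]>` shape and its meaning come from the commit message.

```rust
/// Simplified matcher for illustration: supports an exact name or a
/// trailing `*` wildcard only.
fn simple_glob_match(glob: &str, name: &str) -> bool {
    match glob.strip_suffix('*') {
        Some(prefix) => name.starts_with(prefix),
        None => glob == name,
    }
}

/// `None` means "no filter, fetch every branch".
/// `Some(&[])` means "an empty allow-list", so no branch is selected.
fn branch_is_selected(branch: &str, globs: Option<&[&str]>) -> bool {
    match globs {
        None => true,
        Some(globs) => globs.iter().any(|glob| simple_glob_match(glob, branch)),
    }
}
```

With this signature the two cases that `&[]` used to conflate are spelled differently at the call site.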
Martin von Zweigbergk	bbd6ef0c7b	revset: remove filter_by_diff(), have caller intersect expression To be able to make e.g. `jj log some/path` perform well on cloud-based repos, a custom revset engine needs to be able to see the paths to filter by. That way it is able to pass those to a server-side index. This commit helps with that by effectively converting `jj log -r foo some/path` into `jj log -r 'foo & file(some/path)'`.	2023-02-28 17:45:34 -08:00
Martin von Zweigbergk	346e3c849b	repo: propagate error when failing to look up backend type	2023-02-27 09:44:28 -08:00
Martin von Zweigbergk	491ecc6b2e	repo: replace load_at_head() by helper in tests I'm about to make `RepoLoader::init()` return a `Result`, and I don't want to have to wrap that in a new error in `ReadonlyRepo::load_at_head()` since that's only used in tests.	2023-02-27 09:44:28 -08:00
Yuya Nishihara	da16bf340c	conflicts: fix off-by-one error in materialize_merge_result() This should fix #1304. I think the added test simulates the behavior of multiple rebase conflicts, but I don't have expertise around this. add_index could be replaced with a peekable iterator, but the iterator version wouldn't be as readable as the current implementation.	2023-02-24 19:58:10 +09:00
Ilya Grigoriev	30d03a66e6	cmd: `--branch` option for `git fetch`. Thanks to @samueltardieu for noticing a subtle bug in the refspecs, providing the fix, as well as the two `conflicting_branches` tests.	2023-02-21 18:33:40 -08:00
Martin von Zweigbergk	bc9f66dad3	revset: replace RevsetIterator wrapper by extension The type doesn't seem to provide any benefit. I don't think I had a good reason for creating it in the first place; it was probably just unfamiliarity with Rust.	2023-02-19 21:37:26 -08:00
Martin von Zweigbergk	30160f4d20	revset: pass revset, not iterator, into `RevsetGraphIterator` I was thinking of replacing `RevsetIterator` by a regular `Iterator<Item=IndexEntry>`. However, that would make it easier to pass in an iterator that produces revisions in a non-topological order into `RevsetGraphIterator`, which would produce unexpected results (it would result in nodes that are not connected to their parents, if their parents had already been emitted). I think it makes sense to instead pass in a revset into `RevsetGraphIterator`. Incidentally, it will also be useful to have the full revset available in `RevsetGraphIterator` if we rewrite the algorithm to be more similar to Mercurial's and Sapling's algorithm, which involves asking the revset if it contains parent revisions.	2023-02-19 21:37:26 -08:00
Martin von Zweigbergk	f70e6987b5	conflicts: preserve order of adds in materialized conflict We write conflicts to the working copy by materializing them as conflict markers in a file. When the file has been modified (or just the mtime has changed), we parse the markers to reconstruct the conflict. For example, let's say we see this conflict marker: ``` <<<<<<< +++++++ b %%%%%%% -a +c >>>>>>> ``` Then we will create a hunk with ["a"] as removed and ["b", "c"] as added. Now, since commit `b84be06c08`, when we materialize conflicts, we minimize the diff part of the marker (the `%%%%%%%` part). The problem is that that minimization may result in a different order of the positive conflict terms. That's particularly bad because we do the minimization per hunk, so we can end up reconstructing an input that never existed. This commit fixes the bug by only considering the next add and the one after that, and emitting either only the first with `%%%%%%%`, or both of them, with the first one in `+++++++` and the second one in `%%%%%%%`. Note that the recent fix to add context to modify/delete conflicts means that when we parse such modified conflicts, we'll always consider them resolved, since the expected adds/removes we pass will not match what's actually in the file. That doesn't seem so bad, and it's not obvious what the fix should be, so I'll leave that for later.	2023-02-18 22:01:25 -08:00
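To make the example marker concrete, here is a simplified parser sketch for a single conflict hunk. It is not the parser in jj; the type and function names are hypothetical, and it assumes each marker sits alone on its line and that file content never looks like a marker. A `+++++++` section contributes one added term verbatim, and a `%%%%%%%` section contributes one removed and one added term reconstructed from its `-`, `+`, and space-prefixed lines.

```rust
/// Which part of the conflict marker the parser is currently inside.
enum Section {
    Outside,
    Add,
    Diff,
}

/// Append `line` (plus a newline) to the most recently started term.
fn append_line(terms: &mut Vec<String>, line: &str) {
    if let Some(term) = terms.last_mut() {
        term.push_str(line);
        term.push('\n');
    }
}

/// Parse one hunk and return `(removes, adds)` in the order they appear.
fn parse_conflict_hunk(text: &str) -> (Vec<String>, Vec<String>) {
    let mut section = Section::Outside;
    let mut removes: Vec<String> = Vec::new();
    let mut adds: Vec<String> = Vec::new();
    for line in text.lines() {
        match line {
            "<<<<<<<" | ">>>>>>>" => section = Section::Outside,
            "+++++++" => {
                // A literal "add" term follows.
                section = Section::Add;
                adds.push(String::new());
            }
            "%%%%%%%" => {
                // A diff follows: it encodes one removed and one added term.
                section = Section::Diff;
                removes.push(String::new());
                adds.push(String::new());
            }
            _ => match section {
                Section::Outside => {}
                Section::Add => append_line(&mut adds, line),
                Section::Diff => {
                    if let Some(rest) = line.strip_prefix('-') {
                        append_line(&mut removes, rest);
                    } else if let Some(rest) = line.strip_prefix('+') {
                        append_line(&mut adds, rest);
                    } else if let Some(rest) = line.strip_prefix(' ') {
                        // Context lines belong to both sides.
                        append_line(&mut removes, rest);
                        append_line(&mut adds, rest);
                    }
                }
            },
        }
    }
    (removes, adds)
}
```

Fed the marker from the commit message, this yields `a` as the only removed term and `b`, `c` as the added terms, in that order, which is why the order in which sections are written matters for roundtripping.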
Martin von Zweigbergk	975350f73b	conflicts: demo bad roundtripping of conflict	2023-02-18 22:01:25 -08:00
Martin von Zweigbergk	fe0eb9137c	conflicts: use snapshot testing for conflict-parsing	2023-02-18 22:01:25 -08:00
Martin von Zweigbergk	a87125d08b	backend: rename `ConflictPart` to `ConflictTerm` It took a while before I realized that conflicts could be modeled as simple algebraic expressions with positive and negative terms (they were modeled as recursive 3-way conflicts initially). We've been thinking of them that way for a while now, so let's make the `ConflictPart` name match that model.	2023-02-17 23:28:50 -08:00
Martin von Zweigbergk	e48ace56d1	conflicts: replace missing files by empty in materialized conflict When we materialize modify/delete conflicts, we currently don't include any context lines. That's because modify/delete conflicts have only two sides, so there's no common base to compare to. Hunks that are unchanged on the "modify" side are therefore not considered conflicting, and since they don't contribute new changes, they're simply skipped (here: `3dfedf5814/lib/src/files.rs (L228-L230)`). It seems more useful to instead pretend that the missing side is an empty file. That way we'll get a conflict in the entire file. We can still decide later to make e.g. `jj resolve` prompt the user on modify/delete conflicts just like `hg resolve` does (or maybe it actually happens earlier there, I don't remember). Closes #1244.	2023-02-17 22:19:04 -08:00
Martin von Zweigbergk	e1d71c3713	conflicts: add test for materializing modify/delete conflict	2023-02-17 22:19:04 -08:00
Martin von Zweigbergk	dfcc7a9cee	conflicts: merge modify/delete and delete/modify tests The two tests only differ in the order of the changes in the input, so let's reuse some of the setup code.	2023-02-17 22:19:04 -08:00
Martin von Zweigbergk	af3f8b6cfd	conflicts: create a helper for creating a `ConflictPart` in test	2023-02-17 22:19:04 -08:00
Martin von Zweigbergk	d8997999f2	repo: replace RepoRef by Repo trait	2023-02-15 19:15:17 -08:00
Martin von Zweigbergk	f6a4cb57da	repo: extract a `Repo` trait for `Arc<ReadonlyRepo>` and `MutableRepo` This will soon replace the `RepoRef` enum, just like how the `Index` trait replaced the `IndexRef` enum.	2023-02-15 19:15:17 -08:00
Martin von Zweigbergk	8a067282c8	repo: make `ReadonlyRepo::index()` return a `&dyn Index` This is just a little preparation for extracting a `Repo` trait that's implemented by both `ReadonlyRepo` and `MutableRepo`. The `index()` function in that trait will of course have to return the same type in both implementations, and that type will be `&dyn Index`.	2023-02-15 19:15:17 -08:00
Martin von Zweigbergk	2d8aa2d90e	index: delete IndexRef, use Index trait I don't know why I didn't create a trait to begin with. Maybe I had trouble with lifetimes or object-safety.	2023-02-14 06:51:49 -08:00
Martin von Zweigbergk	b955e3de03	index: extract a trait for the index Even though we don't know the details yet, we know that we want to make the index pluggable like the commit and opstore backends. Defining a trait for it should be a good step. We can refine the trait later.	2023-02-14 06:51:49 -08:00
Martin von Zweigbergk	a474c688a8	index: simplify a test helper by specializing it We apparently always have an `&Arc<ReadonlyIndex>` where we call the `generation_number()` function.	2023-02-14 06:51:49 -08:00
Martin von Zweigbergk	9261bfe5fc	revset: resolve change ids only using the new hex digits Now that we use the new hex digits when we display change ids, we no longer need to be able to resolve the old (conventional) digits.	2023-02-13 22:49:21 -08:00
Martin von Zweigbergk	39640cc288	revset: allow resolving change id using hex digits from reverse alphabet By separating the value spaces of change ids and commit ids, we can simplify lookup of a prefix. For example, if we know that a prefix is for a change id, we don't have to try to find matching commit ids. I think it might also help new users more quickly understand that change ids are not commit ids. This commit is a step towards that separation. It allows resolving change ids by using hex digits from the back of the alphabet instead of 0-f, so 'z'='0', 'y'='1', etc, and 'k'='f'. Thanks to @ilyagr for the idea. The regular hex digits are still allowed.	2023-02-13 22:49:21 -08:00
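The digit mapping above ('z'='0', 'y'='1', ..., 'k'='f') can be sketched as a pair of conversions. The function names are hypothetical and this is not the code in jj; it only illustrates the mapping stated in the commit message.

```rust
/// Convert a string written in the reverse alphabet ('z' = 0 ... 'k' = f)
/// into conventional lowercase hex. Returns `None` if any character is
/// outside 'k'..='z'.
fn reverse_hex_to_hex(reverse: &str) -> Option<String> {
    reverse
        .chars()
        .map(|c| match c {
            'k'..='z' => {
                let value = b'z' - c as u8; // 'z' -> 0, 'k' -> 15
                char::from_digit(u32::from(value), 16)
            }
            _ => None,
        })
        .collect()
}

/// Convert conventional hex into the reverse alphabet. Returns `None` if
/// any character is not a hex digit.
fn hex_to_reverse_hex(hex: &str) -> Option<String> {
    hex.chars()
        .map(|c| {
            let value = c.to_digit(16)?; // 0..=15
            Some((b'z' - value as u8) as char)
        })
        .collect()
}
```

Since the two alphabets do not overlap, a prefix made only of letters in 'k'..='z' can be recognized as a change id prefix without consulting commit ids.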

465 commits