mirrors/jj

mirror of https://github.com/martinvonz/jj.git synced 2024-10-28 01:16:57 +00:00

Author	SHA1	Message	Date
Martin von Zweigbergk	c6f6498cc9	conflicts: add newline after conflict marker lines Merging is currently done with line-level granularity, so it makes sense to have newlines after the markers. That makes them easier to edit out when resolving conflicts.	2021-04-24 13:53:24 -07:00
Martin von Zweigbergk	9dc18524fc	revsets: add "ancestor difference" range operator (like git's `..`)	2021-04-23 19:10:28 -07:00
Martin von Zweigbergk	49173de423	revsets: add DAG range operator (like hg's infix `::`) This lets you use the same operator as we currently have for ancestors and descendants (`,,`) to also specify a DAG range. That's what Mercurial uses the `::` operator for and what Git has `git log --ancestry-path` for.	2021-04-23 19:10:26 -07:00
Martin von Zweigbergk	d8c209c82a	revsets: give parents/children operators higher precedence than range operators	2021-04-23 18:45:42 -07:00
Martin von Zweigbergk	9de5f94af6	revsets: use same error variant for imcomplete parse as for syntax error	2021-04-23 16:59:04 -07:00
Martin von Zweigbergk	0b0374d401	revsets: make parsed Children and Descendants have roots and heads It seems clearer to let the parsed `RevsetExpression`s have only root and head expression instead of adding the ancestors when building the expression tree.	2021-04-23 16:15:17 -07:00
Martin von Zweigbergk	f209354503	revsets: simplify and clarify description revset slightly	2021-04-23 13:30:31 -07:00
Martin von Zweigbergk	145731ec74	revsets: change operators around a bit to prepare for infix DAG range operator I really liked the idea of having the operators for parents and ancestors (etc.) look similar, but that turned out to be problematic when we want to add an infix operator for a DAG range (hg's `::` revset operator and git's `--ancestry-path` flag). Let's say we chose `::` as the operator. Part of the problem is how to parse `foo::bar` without eagerly parsing the `foo:`. It would also be nicer to use exactly the same operator as prefix, postfix, and infix. Since the "parents" operator can be repeated, we can't have it be just `:` and the "ancestors" operator be `::`. We could make the "ancestors" operator be something like `:` (or anything symmetric with the `:` symbol on the inside). However, at that point, the operator is getting ugly and hard to type. Another option would be to use `:` for ancestors and `::` for parents, but that is counterintuitive and get annoying if you want to repeat it. So it seems that the best option is to simply pick different symbols for parents/children and ancestors/descendants/range. This patch changes the ancestors/descendants operators to both be `,,`. I'm not at all attached to that particular symbol. I suspect we'll change it later.	2021-04-23 11:11:07 -07:00
Martin von Zweigbergk	c894a7435f	revsets: make function arguments always be revset expressions Now that expressions may contain literal strings, we can simply have functions accept only expressions arguments. That simplifies both the grammar and the code. A small drawback is that `description((foo), bar)` is now allowed and does a search for the string "foo" (not "(foo)"). That seems unlikely to trip up users.	2021-04-23 10:51:41 -07:00
Martin von Zweigbergk	5819687237	revsets: accept quoted symbol names Git refs with names containing e.g "-" are currently not accepted symbol names, and I don't plan to change the grammar to accept them. Instead, let's have the user quote symbol names containing unusual characters. That way we can keep these symbols reserved for revset operators. With this patch the user can do e.g. `jj diff -r '"v2.9.0-rc2"'`.	2021-04-23 10:37:42 -07:00
Martin von Zweigbergk	d78fd9e979	revsets: add functions and operators for children and descendants This adds `children(<set>)` and `<set>:` for the children of the given set, and `descendants(<set>)` and `<set>:*` for the descendants of the given set. The children and descendants are filtered to be among ancestors of non-obsolete commits. I haven't added a way of overriding that yet.	2021-04-21 23:34:20 -07:00
Martin von Zweigbergk	618abf4379	revsets: use consistent "_op" suffix for operator rules in the grammar This is especially important now that we leak the rule names into the `SyntaxError` message. For example, the error message when doing `jj diff -r :` will now mention "expected parents_op, ancestors_op, or primary". It seems much clearer with the "_op" suffixes there. Longer term, we should think more about how we can best surface syntax errors from the library crate.	2021-04-21 18:53:58 -07:00
Martin von Zweigbergk	a4ef42962c	revsets: don't crash when given ungrammatical revset I also snuck in some updates to the test cases.	2021-04-21 18:53:58 -07:00
Martin von Zweigbergk	744f209e76	revsets: move parse-tests to to revset module (from separate test module) The tests don't need any complex set up (no repo necessary), so they can be in the `revset` module itself. I'm sure we'll need to split up that module later (at least separate out the parsing), but that's a separate problem.	2021-04-21 18:53:58 -07:00
Martin von Zweigbergk	6bc1361b84	index: make revision walk be by position instead of generation number I don't know why I made it walk by generation number to start with. Walking by position is better in at least two ways: 1) revsets now depend on the walks to be by descending index position (though they could equally well depend on the walks to be by generation number -- it just needs to be consistent), and 2) the log output gets less interleaved. This commit makes the number of bytes in the graphlog output in the git.git repo drop by ~40% due to the reduced amount of interleaving. Also, it reduces the time of `jj bench walkrevs v1.0.0 v2.0.0` in the git.git repo by 32% (9.4ms -> 6.4ms) and `jj bench walkrevs v2.0.0 v1.0.0` by 33% (7.7ms -> 5.1ms).	2021-04-21 18:53:56 -07:00
Martin von Zweigbergk	98f4e24892	cli: make benchmark ids include parameters It makes no sense to compare a run of `jj walkrevs v1.0.0 v2.0.0` with a run of `jj walkrevs v2.0.0 v1.0.0`, for example.	2021-04-21 16:56:45 -07:00
Martin von Zweigbergk	64fcf90c68	view: make root commit public	2021-04-18 23:04:15 -07:00
Martin von Zweigbergk	b52cfc156c	revsets: add a public_heads() revset function	2021-04-18 22:52:31 -07:00
Martin von Zweigbergk	3a65c1d2ab	revsets: add intersection operator	2021-04-18 22:45:12 -07:00
Martin von Zweigbergk	332580918c	revsets: add union operator	2021-04-18 22:45:12 -07:00
Martin von Zweigbergk	2ac5d1f912	revsets: allow spaces in most places (but not after prefix operators)	2021-04-18 22:45:12 -07:00
Martin von Zweigbergk	c04f418e67	revsets: add difference operator	2021-04-18 22:45:12 -07:00
Martin von Zweigbergk	e733b074e1	revsets: restructure grammar to prepare for operator precedences	2021-04-18 22:45:12 -07:00
Martin von Zweigbergk	d9ae7cdd6d	revsets: allow parenthesized expressions We'll clearly want to allow parenthesized expressions once we have infix operators (if not before). Let's prepare by allowing parentheses already now.	2021-04-18 22:45:12 -07:00
Martin von Zweigbergk	d71c083a7f	cli: use revsets also when looking up by description	2021-04-18 22:45:12 -07:00
Martin von Zweigbergk	62f0778942	revsets: add a description() revset The revset is currently eagerly evaluated, which is clearly bad. We'll need to fix that later.	2021-04-18 22:45:12 -07:00
Martin von Zweigbergk	05e9149157	revsets: add a non_obsolete_heads() revset This change adds a `non_obsolete_heads(<set>)` revset, which walks up ancestors of the input set until it gets to a non-obsolete and non-pruned commit. That's what we do by default in `jj log` (i.e. without `--all`). Now we can make `jj log` use revsets and teach it a `-r` option!	2021-04-18 22:45:10 -07:00
Martin von Zweigbergk	30aa459d2a	revsets: add a all_heads() revset function This adds a `all_heads()` revset function, which contains all heads in the view, i.e. including non-public heads and obsolete heads.	2021-04-18 22:31:46 -07:00
Martin von Zweigbergk	88904e2b63	revsets: add support for function syntax This adds `parents(foo)` and `ancestors(foo)` as alternative ways of writing `:foo` and `*:foo`. I haven't added support for for whitespace yet; the parsing is very strict. The error messages will also need to be improved later.	2021-04-18 21:25:58 -07:00
Martin von Zweigbergk	2d6325b0f4	revsets: define grammar in pest	2021-04-18 21:25:58 -07:00
Martin von Zweigbergk	0d62a336af	revsets: initial support for Mercurial-style revsets This patch adds initial support for a DSL for specifying revisions inspired by Mercurial's "revset" language. The initial support includes prefix operators ":" (parents) and ":" (ancestors) with naive parsing of the revsets. Mercurial uses postfix operator "^" for parent 1 just like Git does. It uses prefix operator "::" for ancestors and the same operator as postfix operator for descendants. I did it differently because I like the idea of using the same operator as prefix/postfix depending on desired direction, so I wanted to apply that to parents/children as well (and for predecessors/successors). The "" in the "*:" operator is copied from regular expression syntax. Let's see how it works out. This is an experimental VCS, after all. I've updated the CLI to use the new revset support. The implementation feels a little messy, but you have to start somewhere...	2021-04-18 21:25:51 -07:00
Martin von Zweigbergk	7861968f64	index: make IndexRef::entry_by_id() etc return entry with repo's lifetime It's useful to be able to know that given a `repo: RepoRef<'a>`, the the lifetime of `repo.index().entry_by_id()` will also be `'a`.	2021-04-15 07:00:04 -07:00
Martin von Zweigbergk	4c3d73ff3b	evolution: walk orphans using index This actually seems to make it slightly slower, but it fixes an important bug (we used to evolve only one topological branch per `jj evolve` call). The slowdown seemed to be on the order of 5% when evolving 100 commits on git.git's "what's cooking" branch.	2021-04-14 08:25:14 -07:00
Martin von Zweigbergk	783e1f6512	repo: make MutableRepo have an Arc<ReadonlyRepo> instead of a reference I suspect that at least one reason that I didn't make `MutableRepo::base_repo` by an `Arc<ReadonlyRepo>` before was that I thought that that would mean that `start_transaction()` would need be moved off of `ReadonlyRepo` so it can be given an `&Arc<ReadonlyRepo>`, which would make it much less convenient to use. It turns out that a `self` argument can actually be of type `&Arc<ReadonlyRepo>`.	2021-04-11 13:42:31 -07:00
Martin von Zweigbergk	ce855bccfa	repo: make reload() and reload_at() return a new ReadonlyRepo After this patch `ReadonlyRepo` is even closer to readonly. That makes it easier to reason about. It will allow some further cleanups too.	2021-04-11 10:39:29 -07:00
Martin von Zweigbergk	e3ca27bf77	revsets: support git refs	2021-04-10 10:10:09 -07:00
Martin von Zweigbergk	40f75ec641	revsets: don't crash if given non-hex symbol	2021-04-10 10:08:47 -07:00
Martin von Zweigbergk	9e8a7e2ba6	revsets: move code for resolving symbol to commit to new module	2021-04-10 09:46:27 -07:00
Martin von Zweigbergk	102f7a0416	diff: also recurse into final region after after unchanged regions See test case for details. Before: test bench_diff_10k_lines_reversed ... bench: 36,249,659 ns/iter (+/- 174,455) test bench_diff_10k_modified_lines ... bench: 37,258,890 ns/iter (+/- 803,963) test bench_diff_10k_unchanged_lines ... bench: 4,252 ns/iter (+/- 69) test bench_diff_1k_lines_reversed ... bench: 982,834 ns/iter (+/- 6,467) test bench_diff_1k_modified_lines ... bench: 3,343,469 ns/iter (+/- 23,243) test bench_diff_1k_unchanged_lines ... bench: 231 ns/iter (+/- 2) test bench_diff_git_git_read_tree_c ... bench: 95,559 ns/iter (+/- 816) After: test bench_diff_10k_lines_reversed ... bench: 36,186,715 ns/iter (+/- 196,903) test bench_diff_10k_modified_lines ... bench: 37,511,000 ns/iter (+/- 1,370,476) test bench_diff_10k_unchanged_lines ... bench: 3,099 ns/iter (+/- 8) test bench_diff_1k_lines_reversed ... bench: 986,010 ns/iter (+/- 11,565) test bench_diff_1k_modified_lines ... bench: 3,370,938 ns/iter (+/- 17,041) test bench_diff_1k_unchanged_lines ... bench: 230 ns/iter (+/- 2) test bench_diff_git_git_read_tree_c ... bench: 102,189 ns/iter (+/- 1,052) So this patch makes diffing even slower (but still easily fast enough for all cases I've run into in real life). There's probably a lot that can be done to make things faster, but the first priority is that the diffs are correct and easy to read.	2021-04-08 23:54:54 -07:00
Martin von Zweigbergk	f4a41f3880	trees: make tree diff return an iterator instead of taking a callback This is yet another step towards making it easy to propagate `BrokenPipe` errors. The `jj diff` code (naturally) diffs two trees and prints the diffs. If the printing fails, we shouldn't just crash like we do today. The new code is probably slower since it does more copying (the callback got references to the `FileRepoPath` and `TreeValue`). I hope that won't make a noticeable difference. At least `jj diff -r 334afbc76fbd --summary` didn't seem to get measurably slower.	2021-04-07 23:18:00 -07:00
Martin von Zweigbergk	8b2ce18254	trees: make diff_entries() return an iterator instead of taking a callback The iterator version is easier to use and we get rid of the ugly type parameter for the error type. I also simplified the code by using `Peekable` iterators.	2021-04-07 15:48:11 -07:00
Martin von Zweigbergk	5c10c93e64	diff: fix tests broken by the previous commit Sorry, I forgot to run the automated tests again :(	2021-04-07 11:00:04 -07:00
Martin von Zweigbergk	0dd000d236	diff: do final refinement at byte-level for non-word bytes This results in significantly more readable diffs on commits like `659393bec2` in this repo. Before: test bench_diff_10k_lines_reversed ... bench: 38,122,998 ns/iter (+/- 557,688) test bench_diff_10k_modified_lines ... bench: 32,556,563 ns/iter (+/- 548,114) test bench_diff_10k_unchanged_lines ... bench: 4,231 ns/iter (+/- 15) test bench_diff_1k_lines_reversed ... bench: 958,296 ns/iter (+/- 46,963) test bench_diff_1k_modified_lines ... bench: 3,014,723 ns/iter (+/- 15,830) test bench_diff_1k_unchanged_lines ... bench: 249 ns/iter (+/- 2) test bench_diff_git_git_read_tree_c ... bench: 78,599 ns/iter (+/- 1,079) After: test bench_diff_10k_lines_reversed ... bench: 38,289,493 ns/iter (+/- 413,712) test bench_diff_10k_modified_lines ... bench: 37,352,516 ns/iter (+/- 1,293,950) test bench_diff_10k_unchanged_lines ... bench: 4,238 ns/iter (+/- 13) test bench_diff_1k_lines_reversed ... bench: 967,253 ns/iter (+/- 8,506) test bench_diff_1k_modified_lines ... bench: 3,358,028 ns/iter (+/- 37,154) test bench_diff_1k_unchanged_lines ... bench: 233 ns/iter (+/- 1) test bench_diff_git_git_read_tree_c ... bench: 95,787 ns/iter (+/- 740) So the biggest slowdown is when there are modified lines.	2021-04-07 10:27:17 -07:00
Martin von Zweigbergk	f634ff0e3f	files: make diff() return an iterator instead of using a callback Iterators are generally nicer to work with. My immediate goal is to be able to propagate errors when failing to write to stdout.	2021-04-07 10:07:18 -07:00
Martin von Zweigbergk	d7395cc34a	diff: add copyright header	2021-04-06 21:26:37 -07:00
Martin von Zweigbergk	7e4e43f358	diff: first diff lines, then refine to words, producing better diffs The new diff algorithm produces pretty bad diffs in some cases, such as `cc4b1e9230` in this repo (the parent of this commit). I think the problem there is that many words are repeated over and over. Diffing first at the line level and then refining the diff of the changed ranges at the word level gives much better results. That's what this patch does. After this patch, `jj diff -r cc4b1e923091` looks pretty similar to the diff in GitHub's UI. I hope to get around to doing the same for the merge code soon. Impact on benchmarks: Before: test bench_diff_10k_lines_reversed ... bench: 42,647,532 ns/iter (+/- 765,347) test bench_diff_10k_modified_lines ... bench: 21,407,980 ns/iter (+/- 126,366) test bench_diff_10k_unchanged_lines ... bench: 4,235 ns/iter (+/- 16) test bench_diff_1k_lines_reversed ... bench: 1,190,483 ns/iter (+/- 7,192) test bench_diff_1k_modified_lines ... bench: 1,919,766 ns/iter (+/- 9,665) test bench_diff_1k_unchanged_lines ... bench: 231 ns/iter (+/- 1) test bench_diff_git_git_read_tree_c ... bench: 174,702 ns/iter (+/- 1,199) After: test bench_diff_10k_lines_reversed ... bench: 38,289,509 ns/iter (+/- 129,004) test bench_diff_10k_modified_lines ... bench: 33,140,659 ns/iter (+/- 3,989,339) test bench_diff_10k_unchanged_lines ... bench: 3,099 ns/iter (+/- 14) test bench_diff_1k_lines_reversed ... bench: 973,551 ns/iter (+/- 94,895) test bench_diff_1k_modified_lines ... bench: 3,033,818 ns/iter (+/- 29,513) test bench_diff_1k_unchanged_lines ... bench: 230 ns/iter (+/- 1) test bench_diff_git_git_read_tree_c ... bench: 79,100 ns/iter (+/- 963) So most of them get slower, as expected. The last one, taken from a real diff in the git.git repo, get faster, however (which is also what I would have expected).	2021-04-04 21:50:31 -07:00
Martin von Zweigbergk	cc4b1e9230	test: fix merge tests to expect line-based merging I made a quite late change in a recent patch to make the merge code to merge based on lines instead of words. I forgot to update the tests (and to even run them). Sorry :(	2021-04-01 08:27:27 -07:00
Martin von Zweigbergk	c071d412af	diff: use new diff algorithm for content diff The previous patch switched over the content-merge code to use the new histogram diff code. This patch switches over the content-diff code to use the histogram diff code. As before, the immediate goal is to speed it up. `jj diff -r c28ded83fc` in the git.git repo is a good example of a diff that's extremely slow to calculate with our current LCS-based diff. With this patch, that drops from 35 s to 0.12 s. The diff was slightly better before. I think that's mostly because of our different definition of a "word" in the data. We can improve that later. The speedup we get now is easily worth the slightly worse diff.	2021-03-31 22:22:59 -07:00
Martin von Zweigbergk	3c35dbace6	merge: use new diff algorithm for finding sync regions With the histogram diff code from the previous patch, we can now start using that for finding the "sync regions" in 3-way merge. That helps a lot with the slow merging we had before this patch. `jj diff -r 9d540e9726` in the git.git repo drops from 22 s to 0.15 s with this patch. (That commit is a rather arbitrary merge commit from aroun 5 years ago.) With the new diff algorithm, the output of `jj diff -r 9d540e9726` in git.git looks better if we find unchanged sync regions based on lines than on words, so that's what I'm using in this patch. That's a change compared the the LCS-based diff we used before this patch. I suspect the reason that finding sync regions based on words works worse now is not because of the change from LCS to histogram but because of the change in how we define a word. My goal right now is mostly to make it faster; I'll get back to refining the diff result later.	2021-03-31 22:16:19 -07:00
Martin von Zweigbergk	1e657c5331	diff: add a histogram(-like?) diff algorithm The current diff algorithm does a full LCS on the words of the texts, which is really slow. Diffing the working copy when e.g. `src/commands.py` has changes far apart takes seconds. This patch adds an implementation inspired by JGit's Histogram diff. I say "inspired" because I just didn't quite understand it :P In particular, I didn't understand what it does when it finds non-unique elements. I decided to line up the leading common elements on both sides of the merge. I don't know if that usually gives good enough results in practice. I'm sure this can still be optimized a lot, but this seems good enough as a start. There is also many things to improve about the quality of the diffs.	2021-03-31 22:15:36 -07:00

1 2 3 4 5

245 commits