ok/jj - ok.software

ok/jj

Author	SHA1	Message	Date
Martin von Zweigbergk	2c21ed40d8	working_copy: don't crash if file becomes directory while snapshotting If a file gets replaced by a directory right after list files in a directory but before we stat the file, we currently crash. Let's instead treat it as a missing file, using the mechanism introduced for #258.	2022-06-21 05:38:28 +07:00
Martin von Zweigbergk	29a71c619a	working_copy: ignore special files This patch makes us treat special files (e.g. Unix sockets) as absent when snapshotting the working copy. We can consider later reporting such files back to the caller (possibly via callback) so it can inform the user about them. Closes #258	2022-06-20 09:26:29 +07:00
Martin von Zweigbergk	a42b24c014	working_copy: on checkout, record file state's type and size This patch is essentially `f6a516ff6d` taken further, to also apply to when we write a symlink or a conflict. As with regular files, these races seem very unlikely to happen, but I found these cases while working on #258, so let's fix. Fixing it also means that we don't need to handle these transition cases in the next patch (when `file_states()` can indicate that the file is e.g. a socket).	2022-06-20 09:26:29 +07:00
Martin von Zweigbergk	540f2eb583	errors: avoid using `Debug` formatting on error types The regular `Display` format is (not surprisingly) more user-friendly, as pointed out by @yuja. I also switched to using format strings for these cases, and some nearby strings for consistency.	2022-05-25 19:33:59 -07:00
Martin von Zweigbergk	c9ab0a20d3	working_copy: stat path without traversing from root in one case We can easily make the `DirEntry` available here, so we can call `.metadata()` on that instead of on the `Path`. I think that avoids walking the path. I'm sure this has no significant impact on performance, but it's also almost as readable.	2022-05-25 11:51:18 -07:00
Martin von Zweigbergk	f6a516ff6d	working_copy: record expected written size, not actual size, on checkout When we have just written a file to disk on checkout, let's record the size we expected instead of what we got from `fstat()`. This should address a race where the file was modified between the time we wrote it and the time we requested its stat. I just happened to notice this while going through the code; it seems very unlikely to be noticed in practice.	2022-05-25 11:51:18 -07:00
Martin von Zweigbergk	fa4b5aa2c7	working_copy: propagate most errors on checkout	2022-05-25 11:51:18 -07:00
Martin von Zweigbergk	9aa2009320	working_copy: improve error handling when getting file stats	2022-05-25 11:51:18 -07:00
Martin von Zweigbergk	9feb786a51	working_copy: extract function for getting mtime We had a few lines of duplicated code for this, so I moved it into a function and added better error handling.	2022-05-25 11:51:18 -07:00
Martin von Zweigbergk	23f9e6479f	working_copy: stop suppressing a warning that doesn't happen We mutate the `new_file_state` variable on line 513 regardless of platform these days, so we don't need to suppress the warning on Unix.	2022-05-25 11:51:18 -07:00
Martin von Zweigbergk	ecb86118e5	working_copy: get file metadata using open file descriptor When we have just written a file or conflict, we can get metadata for it via the open file descriptor instead of using the path. That removes the risk of a race where the file got removed or replaced by another file type (at least on Unix).	2022-05-25 11:51:18 -07:00
Martin von Zweigbergk	e7aaf2f95f	working_copy: pass `Metadata` into `file_state()` This is a refactoring to prepare for getting the metadata from an already open file when possible.	2022-05-25 11:51:18 -07:00
Martin von Zweigbergk	d451242746	working_copy: propagate backend errors on checkout	2022-05-25 11:51:18 -07:00
Martin von Zweigbergk	899325e94e	working_copy: use `#[from]` to slightly simplify error handling	2022-05-25 11:51:18 -07:00
Martin von Zweigbergk	45c9d297c2	drop: downgrade two assertions to error messages These assertions were there to catch bugs, but when the bugs happen, the assertions can obsure the underlying error (as @tp-woven found out on #258). Let's just print errors instead.	2022-05-23 15:36:18 -07:00
Martin von Zweigbergk	ffc57310f6	cargo: upgrade protobuf crates to 3.0.1 The biggest difference in the API is that fields are now public. The exception from that is `oneof` fields, which still require setters and getters. I couldn't measure any difference in performance. I didn't expect any difference either, but it's good that it didn't seem to regress. I timed `jj debug operation <some hash prefix>`, which will read the whole operation log (to check that the prefix is unambiguous).	2022-05-04 17:02:11 -07:00
Martin von Zweigbergk	7268e5608e	working_copy: propagate errors when snapshotting Closes #258	2022-05-02 11:23:38 -07:00
Martin von Zweigbergk	96b3e05bc5	working_copy: rename `write_tree()` to `snapshot()` I think I copied the name `write_tree()` from Git, but I find it quite confusing, since it's not clear if it write a tree to the working copy or reads the working copy and writes a tree to the store (it's the former).	2022-05-02 08:00:15 -07:00
Martin von Zweigbergk	5893b52fd1	cleanup: use `while let Some(...)` instead of checking before popping	2022-05-01 13:45:00 -07:00
Martin von Zweigbergk	78da5596b7	working_copy: switch an `if let Some() { } else { }` to a `match`	2022-04-26 21:19:09 -07:00
Martin von Zweigbergk	0fbb1d3971	working_copy: don't visit whole ignored tree even if last in order When committing the working copy, we try to not visit ignored directories (as e.g. `target/` often is), but we need to visit it if there are already tracked files in it. I initially missed that in `c1060610bd` and then fixed it in `a028f33e3b`. The fix works by checking if the next path after the ignored path is inside the ignore path (viewed as a directory). However, I forgot to handle the case where there are no paths at all after the ignored path. So, for example, if the `target/` directory should be ignored and it there were no tracked paths after `target/` in alphabetical order, we would still visit the directory. That's why the bug reproduced in the `git-branchless` repo but not in the `jj` repo (because there are files under `testing/` and `tests/` here). Closes #247.	2022-04-26 20:53:49 -07:00
Martin von Zweigbergk	18a64c365a	working_copy: respect sparse patterns when writing tree (#52 )	2022-04-26 14:52:17 -07:00
Martin von Zweigbergk	0d881de56c	working_copy: allow updating sparse patterns (#52 ) With this patch, we add support for setting the sparse patterns, and we respect it when updating the working copy.	2022-04-26 14:52:17 -07:00
Martin von Zweigbergk	ceb6c152a1	working_copy: record sparse patterns in the tree state (#52 ) This patch makes room for sparse patterns in the `TreeState` proto message. We also start setting that value to a list of just the pattern `.` when we create new working copies. Old working copies without the sparse patterns are also interpreted as having that single pattern. Note that this absence of sparse patterns is different from a present list of no patterns. The latter is a valid state and means that no paths are included in the sparse checkout.	2022-04-26 14:52:17 -07:00
Martin von Zweigbergk	ed2d2f8a4f	working_copy: extract function for updating the working copy files (#52 ) Updating the working copy with new sparse patterns is very similar to updating it with a new tree. We're going to reuse this extracted function soon.	2022-04-26 14:52:17 -07:00
Martin von Zweigbergk	53911b076b	working_copy: fix crash when updating and only executable bit changed	2022-04-14 23:46:28 -07:00
Martin von Zweigbergk	f16d2a237b	backend: pass in path when reading/writing conflicts as well We do it for all the other kinds of objects already. It's useful to have the path for backends that store objects by path (we don't have any such backends yet). I think the reason I didn't do it from the beginning was because we had separate `RepoPath` types for files and directories back then.	2022-03-31 10:23:33 -07:00
Martin von Zweigbergk	d56ae79d3f	working_copy: let caller pass in base Git ignores (#65 , #87 ) The library crate shouldn't look up the user's `$HOME` directory (maybe the library is used by a server process), so let's have the caller pass it into the library crate instead.	2022-03-12 10:48:06 -08:00
Martin von Zweigbergk	51da2a2dc1	gitignore: move function for chaining .gitignore to central place (#65 , #87 ) I want to be able to reuse the code for chaining two `.gitignore` files outside of `working_copy.rs`.	2022-03-12 10:48:06 -08:00
Martin von Zweigbergk	e7a7cb8ea5	gitignores: remove error type that's never instantiated	2022-03-12 10:48:06 -08:00
Martin von Zweigbergk	03e6b8c0e6	working_copy: take `Tree`, not `CommitId`, as argument to `check_out()` We no longer need the commit ID, so we shouldn't make the callers pass it. This lets us simplify several tests, because they no longer to create commits just to check out a tree in the working copy.	2022-02-13 12:14:34 -08:00
Martin von Zweigbergk	315e5e87a3	working_copy: take a tree object instead of ID in `TreeState::check_out()` The callers mostly have the tree object available anyway.	2022-02-13 12:12:08 -08:00
Martin von Zweigbergk	00c9a1ae11	working_copy: stop taking commit ID in `LockedWorkingCopy::finish()` We used to use the value to detect races, but we use the tree ID and the operation ID these days, so we don't need the commit ID. By changing this, we can avoid creating some commit IDs in tests, which is why I tackled this issue now.	2022-02-12 23:48:06 -08:00
Martin von Zweigbergk	cde3609163	working_copy: stop taking initial checkout in constructor We don't use the value anymore.	2022-02-12 23:45:05 -08:00
Martin von Zweigbergk	993de96fc7	working_copy: stop keeping track of commit ID	2022-02-12 17:22:37 -08:00
Martin von Zweigbergk	e098c01935	working_copy: replace commit ID by tree ID for checking for changes What matters for the working copy is the tree ID. We should be able to remove the commit ID. This patch gets us close.	2022-02-12 17:16:19 -08:00
Martin von Zweigbergk	537b1de7d9	working_copy: move check of old commit ID on checkout to higher level There are only two callers of `LockedWorkingCopy::check_out()`. One is in `commands.rs`. That caller already checks after taking the lock that the old commit ID is as expected. The other caller is `WorkingCopy::check_out()`. We can simply move the check to that level since it's the only caller that cares now.	2022-02-12 14:27:40 -08:00
Martin von Zweigbergk	b74851e005	working_copy: make sure discarded update is not visible `LockedWorkingCopy::discard()` shouldn't result in changes to the on-disk state, but `LockedWorkingCopy::check_out()` may have already written a state file, which is surprising. The changes also remain in memory, which is also surprising. Let's fix both of those issues.	2022-02-09 10:40:51 -08:00
Martin von Zweigbergk	c09a4e15c5	workspace: add a function for initializing additional workspace (#13 )	2022-02-02 17:00:03 -08:00
Martin von Zweigbergk	fb8fbdc4b3	working_copy: keep track of workspace ID (#13 ) This patch makes it so the workspace ID can be stored in `.jj/working_copy/checkout`. The workspace ID is still always "default".	2022-02-02 08:15:22 -08:00
Martin von Zweigbergk	38180555de	working_copy: keep track of operation ID (#13 ) When there are concurrent operations that want to update the working copy, it's useful to know which operation was the last to successfully update the working copy. That can help use decide how to resolve a mismatch between the repo view's record and the working copy's record. If we detect such a difference, we can look at the working copy's operation ID to see if it was updated by an operation before or after we loaded the repo. If the working copy's record says that it was updated at operation A and we have loaded the repo at operation B (after A), we know that the working copy is stale, so we can automatically update it (or tell the user to run some command to update it if we think that's more user-friendly). Conversely, if we have loaded the repo at operation A and the working copy's record says that it was updated at operation B, we know that there was some concurrent operation that updated it. We can then decide to print a warning telling the user that we skipped updating because of the conflict. We already have logic for not updating the working copy if the repo is loaded at an earlier operation, but maybe we can drop that if we record the operation in the working copy (as this patch does).	2022-01-19 19:15:29 -08:00
Martin von Zweigbergk	419efb88f9	working_copy: update stale comment on `LockedWorkingCopy` The recent refactoring introducing `WorkgingCopy::start_mutation()` (`25d19e8a65`) made this comment incorrect.	2022-01-19 15:12:45 -08:00
Martin von Zweigbergk	45a00e819d	working_copy: pass only a `TreeId` to `LockedWorkingCopy::check_out()` It only needs a `TreeId`.	2022-01-19 09:04:01 -08:00
Martin von Zweigbergk	fabaf8b608	working_copy: move logic of `check_out()` onto `LockedWorkingCopy` Having the checkout functionality in `LockedWorkingCopy` makes it a little more flexible (one could imagine using it for udating working copy files and then discarding the state changes, for example). It also lets us reuse a few lines of code for locking. I left `WorkingCopy::check_out()` for convenience because that's what all current users want.	2022-01-19 09:04:01 -08:00
Martin von Zweigbergk	9e1869dcef	working_copy: pass in old commit ID to check_out() `WorkingCopy::check_out()` currently fails if the commit recorded on disk has changed since it was last read. It fails with a "concurrent checkout" error. That usually works well in practice, but one can imagine cases where it's not correct. For an example where the current behavior is wrong, consider this sequence of events: 1. Process A loads the repo and working copy. 2. Process B loads the repo at operation A. It has not loaded the working copy yet. 3. Process A writes an operation and updates the working copy. 4. Process B loads the working copy and sees that it is checked out to the commit process B set it to. We don't currently have any checks that the working copy commit matches the view's checkout (though I plan to add that). 5. Process B finishes its operation (which is now divergent with the operation written by process A). It updates the working copy to the checkout set in the repo view by process B. There's no data loss here, but the behavior is surprising because we would usually tell the user that we detected a concurrent update to the working copy. We should instead check that the working copy's commit on disk matches what the previous repo view said, i.e. the view at the start of the operation we just committed. This patch does that by having the caller pass in the expected old commit ID.	2022-01-19 09:04:01 -08:00
Martin von Zweigbergk	8c97fdf5d6	working_copy: remove `untrack()` now that we have more flexible `reset()`	2022-01-19 09:04:01 -08:00
Martin von Zweigbergk	cd4fbd3565	working_copy: add a `reset()` function for Git-like reset We already have two usecases that can be modeled as updating the `TreeState` without touching the working copy: 1. `jj untrack` can be implemented as removing paths from the tree object and then doing a reset of the working copy state. 2. Importing Git HEAD when sharing the working copy with a Git repo. This patch adds that functionality to `TreeState`.	2022-01-19 08:32:59 -08:00
Martin von Zweigbergk	9a640bfe13	working_copy: save `TreeState` later, just before releasing lock I was surprised that we save the `TreeState` before `LockedWorkingCopy::finish()`. That means that even if the caller instead decides to discard the changes, some changes will already have been written.	2022-01-19 08:32:59 -08:00
Martin von Zweigbergk	25d19e8a65	working_copy: start improving interface for mutations This patch changes the interface for making changes to the working copy by replacing `write_tree()` and `untrack()` by a single `start_mutation()` method. The two functions now live on the returned `LockedWorkingCopy` object instead. That is more flexible because the caller can make multiple changes while the working copy is locked. It also helps us reduce the risk of buggy callers that read the commit ID before taking the lock, because we can now make it accessible only on `LockedWorkingCopy`.	2022-01-19 08:32:59 -08:00
Martin von Zweigbergk	2916dc3423	working_copy: make a `&mut self` argument not mutable	2022-01-19 08:32:51 -08:00
Martin von Zweigbergk	1fc19dbbaf	working_copy: take initial commit in init() function The working copy object knows the currently checked out commit ID. It is set to `None` when the object is initialized. It is also set to `None` when an existing working copy is loaded. In that case, it's used only to facilitate lazy loading. However, that means that `WorkingCopy::current_commit_id()` fails if the working copy has been initalized but no checkout has been specified. I've never run into that case, but it's ugly that it can happen. This patch fixes it by having `WorkingCopy::init()` take a `CommitId`.	2022-01-17 14:12:55 -08:00
Martin von Zweigbergk	0fadac38d6	working_copy: remove `current_commit()` (leaving `current_commit_id()` `WorkingCopy::current_commit()` has been there from the beginning. It has made less sense since we made the repo view keep track of the current checkout. Let's remove it.	2022-01-15 17:11:56 -08:00
Martin von Zweigbergk	1f68de64d4	debug: upgrade `Drop` implementations to non-debug `assert!` I think these remaining implementations of `Drop` are for types that are infrequently dropped (unlike `Transaction`), so it be fine to be more strict about them.	2021-12-01 10:32:11 -08:00
Martin von Zweigbergk	8cf5dd286a	backend: make Vec inside CommitId non-public The recent `e5dd93cbf7`, whose description says "cleanup: make Vec inside CommitId etc. non-public", made all ID types in the `backend` module except for `CommitId` non-public :P This patch makes	2021-11-19 23:19:00 -08:00
Martin von Zweigbergk	33b272f5fa	working_copy: make some functions require mutable references We use interior mutability for caching in `WorkingCopy`, but let's still take mutable reference in the functions where the state change is visible.	2021-11-17 10:15:33 -08:00
Martin von Zweigbergk	287602966e	working_copy: add a method for untracking specified paths	2021-11-17 08:41:37 -08:00
Martin von Zweigbergk	e5dd93cbf7	cleanup: make Vec inside CommitId etc. non-public	2021-11-10 10:46:10 -08:00
Martin von Zweigbergk	ea82340654	working_copy: preserve conflicts in the working copy until markers are removed I realized only recently that we can try to parse conflict markers in files and leave them as conflicted if they haven't changed. If they have changed and some conflict markers have been removed, we can even update the conflict with that partial resolution. This change teaches the working copy to write conflicts to the working copy. It used to expect that the caller had already updated the tree by materializing conflicts. With this change, we also start parsing the conflict markers and leave the conflicts unresolved in the working copy if the conflict markers remain. There are some cases that we don't handle yet. For example, we don't even try to set the executable bit correctly when we write conflicts. OTOH, we didn't do that even before this change. We still never actually write conflicts to the working copy (outside of tests) because we currently materialize conflicts in `MutRepo::check_out()`. I'll change that next.	2021-11-07 15:17:51 -08:00
Martin von Zweigbergk	856b219943	working_copy: if a file's recorded mtime is equal to the state's mtime, set to 0 early	2021-11-07 15:17:51 -08:00
Martin von Zweigbergk	033bfe7d5b	working_copy: fix an incorrect comment about timestamps	2021-11-07 15:17:51 -08:00
Martin von Zweigbergk	9ac643ed1a	working_copy: rename confusingly named read_time field	2021-11-07 15:17:51 -08:00
Martin von Zweigbergk	baa403b41e	working_copy: clarify code for preserving current file state On Windows, we preserve the executable bit. I plan to also teach the working copy to preserve conflict state. This refactoring prepares for that by simplifying how we preserve parts of the current file state.	2021-11-07 15:17:51 -08:00
Martin von Zweigbergk	7c12014513	working_copy: extract a function for updating the file state and tree if dirty	2021-11-07 15:17:51 -08:00
Martin von Zweigbergk	730c442462	working_copy: move file_state() off of TreeState The function doesn't use `self`, so let's move it off of `TreeState`. That way we can call it even while holding a mutable reference.	2021-11-07 15:17:51 -08:00
Martin von Zweigbergk	f86ffbd47b	working_copy: take advantage of DirEntry::path()	2021-11-07 15:17:51 -08:00
Martin von Zweigbergk	5dce6e9884	working_copy: avoid using file_state() for getting mtime of tree_state file I want to change the signature of `file_state()` and this caller is different from the others and only needs the mtime.	2021-11-07 15:17:51 -08:00
Martin von Zweigbergk	efe7316354	working_copy: make FileType variants more similar to TreeValue variants This change doesn't make much difference yet, but I think it will enable further improvements.	2021-11-07 15:17:51 -08:00
Martin von Zweigbergk	bf61e9cf3e	working_copy: don't stat ignored files	2021-11-07 15:17:51 -08:00
Martin von Zweigbergk	ce5e95fa80	store: rename Store to Backend and StoreWrapper to Store For what's currently called `Store` in the code, I have been using "backend" in plain text. That probably means that `Backend` is a good name for it.	2021-09-12 12:02:10 -07:00
Martin von Zweigbergk	49e1462fe5	working_copy: delete two obsolete TODOs about ignores We have had support for ignores via `.gitignore` files since `3b326a942c`, and we haven't had the problem with the temporary `.git/` directory created by libgit2 since `88f7f4732b`.	2021-09-03 23:10:45 -07:00
Martin von Zweigbergk	ccdd651953	working_copy: ignore `.git` directory/file when writing tree to store Git doesn't want `.git` entries in its trees, so at least when using the Git backend, we need to ignore such paths. Let's just ignore `.git` paths regardless of backend to keep it simple. Closes #24.	2021-09-01 08:40:28 -07:00
Martin von Zweigbergk	2afed65132	working_copy: move logic for creating commit to caller The auto-rebasing of descendants doesn't work if you have an open commit checked out, which means that you may still end up with orphans in that case (though that's usually a short-lived problem since they get rebased when you close the commit). I'm also about to make branches update to successors, but that also doesn't work when the branch is on a working copy commit that gets rewritten. To fix this problem, I've decided to let the caller of `WorkingCopy::commit()` responsible for the transaction. I expect that some of the code that this change moves from the lib crate to the cli crate will later move back into the lib crate in some form.	2021-08-15 18:55:09 -07:00
Martin von Zweigbergk	7360589246	working_copy: consider it an error if temp file cannot be renamed to target Unlike the other places I fixed in `134940d2bb`, the calls in `working_copy.rs` should not simply use an existing file if the target file was open. They should probably try again instead, but I'll leave that for later.	2021-06-16 10:52:55 -07:00
Martin von Zweigbergk	4c416dd864	cleanup: let Clippy fix a bunch of warnings	2021-06-14 00:27:31 -07:00
Martin von Zweigbergk	134940d2bb	windows: don't fail when concurrent threads/processes fail to rename file On Windows, it seems that you can't rename a file if the target file is open (Stebalien/tempfile#131). I think that's the reason for our failing tests on Windows. This patch adds a simple wrapper around `NamedTempFile::persist()` that returns the existing file instead of failing, if there is one.	2021-06-14 00:09:22 -07:00
Martin von Zweigbergk	f26c9f01ce	working_copy: silence some warnings about unused code on Windows	2021-06-13 22:34:33 -07:00
Martin von Zweigbergk	dd4c47f373	tree: support filtering diff by matcher This change teaches `Tree::diff()` to filter by a matcher. It only filters the result so far; it does not restrict the tree walk to what `Matcher::visit()` says is necessary yet. It also doesn't teach the CLI to create a matcher and pass it in.	2021-06-09 16:26:58 -07:00
Martin von Zweigbergk	fdeb499836	trees: merge into tree module	2021-06-05 14:20:07 -07:00
Martin von Zweigbergk	54f6165ef1	repo_path: replace remaining uses of DirRepoPath by RepoPath	2021-05-19 15:11:04 -07:00
Martin von Zweigbergk	69664f57fe	working_copy: use RepoPath in write_tree()	2021-05-19 15:11:04 -07:00
Martin von Zweigbergk	c66990d3a3	repo_path: rename from() to from_internal_{,dir}_string() Since `RepoPath` can be either a file or a directory, I made its name not include `file`.	2021-05-19 15:11:04 -07:00
Martin von Zweigbergk	ef726be78b	repo_path: rename to_internal_string() to separate names for dir vs file	2021-05-19 15:11:04 -07:00
Martin von Zweigbergk	03ae1b747c	repo_path: remove FileRepoPath in favor of just RepoPath I had initially hoped that the type-safety provided by the separate `FileRepoPath` and `DirRepoPath` types would help prevent bugs. I'm not sure if it has prevented any bugs so far. It has turned out that there are more cases than I had hoped where it's unknown whether a path is for a directory or a file. One such example is for the path of a conflict. Since it can be conflict between a directory and a file, it doesn't make sense to use either. Instead we end up with quite a bit of conversion between the types. I feel like they are not worth the extra complexity. This patch therefore starts simplifying it by replacing uses of `FileRepoPath` by `RepoPath`. `DirRepoPath` is a little more complicated because its string form ends with a '/'. I'll address that in separate patches.	2021-05-19 15:11:04 -07:00
Martin von Zweigbergk	525a5116a2	RepoLoader: stop returning Result since the functions cannot currently fail	2021-05-19 14:12:54 -07:00
Martin von Zweigbergk	1cdac0f902	working copy: use system path separator when creating file system path Hopefully this fixes the failing Windows CI.	2021-05-15 10:04:09 -07:00
Martin von Zweigbergk	a028f33e3b	working copy: don't remove already-tracked files in ignored directories The recent optimization to not walk ignored directories did not account for the case where there already are files in the ignored directory.	2021-05-15 09:15:45 -07:00
Martin von Zweigbergk	cbc3e915b7	working copy: allow overwriting untracked files Before this commit, we would crash when checking out a commit that has a file that's currently ignored in the working copy.	2021-05-15 08:24:56 -07:00
Martin von Zweigbergk	df175f549f	working copy: don't assume that $HOME exists (it doesn't on Windows CI)	2021-05-13 23:02:56 -07:00
Martin von Zweigbergk	c1060610bd	working copy: optimize simple case of entire directory being ignored This makes the workging copy walk skip an entire ignored directory if there are no negative patterns later in the ignore file. That speeds up `jj st` in this repo with ~13k files in `target/` from ~100 ms to ~25 ms (6.0dB). This closes issue #8.	2021-05-13 22:28:38 -07:00
Martin von Zweigbergk	88f7f4732b	gitignores: add own implementation and stop using libgit2's This is to address issue #8. I haven't added the optimization to avoid walking all the files in `target/` yet. Even so, this patch still speeds up `jj st` in this repo, with ~13k files in `target/`, from ~320 ms to ~100 ms (-5.1dB). The time actually checking if paths match gitignores seems to go down from 116 ms to 6 ms. I think that's mostly because libgit2 has to look for `.gitignore` files in every parent directory every time we ask it about a file, while the rewritten code looks for a `.gitignore` file only when visiting a new directory.	2021-05-13 22:23:59 -07:00
Martin von Zweigbergk	70b99c960e	transaction: make commit() return resulting ReadonlyRepo I've wanted the API to look like this for a while. It seems like a good API to me. It means that the caller won't have to reload the repo after committing. The cost seems relatively small. It involves copying potentially a lot of data in memory (at least the View object), but it shouldn't involve reading from disk or any other processing. To reduce the amount of data to copy, it may be worth switching to persistent data types. I've also wanted to do that for the copying we do when start a transaction. I couldn't measure any slowdown caused by this change.	2021-05-08 13:50:59 -07:00
Martin von Zweigbergk	ce855bccfa	repo: make reload() and reload_at() return a new ReadonlyRepo After this patch `ReadonlyRepo` is even closer to readonly. That makes it easier to reason about. It will allow some further cleanups too.	2021-04-11 10:39:29 -07:00
Martin von Zweigbergk	f4a41f3880	trees: make tree diff return an iterator instead of taking a callback This is yet another step towards making it easy to propagate `BrokenPipe` errors. The `jj diff` code (naturally) diffs two trees and prints the diffs. If the printing fails, we shouldn't just crash like we do today. The new code is probably slower since it does more copying (the callback got references to the `FileRepoPath` and `TreeValue`). I hope that won't make a noticeable difference. At least `jj diff -r 334afbc76fbd --summary` didn't seem to get measurably slower.	2021-04-07 23:18:00 -07:00
Martin von Zweigbergk	06df609482	transaction: delete check_out() and set_checkout() helpers	2021-03-16 22:31:28 -07:00
Martin von Zweigbergk	69de4698ac	tests: set $HOME in a few tests to avoid depending in developer's ~/.gitignore I just changed my `~/.gitignore` and some tests started failing because the working copy respects the user's `~/.gitignore`. We should probably not depend on `$HOME` in the library crate. For now, this patch just makes sure we set it to an arbitrary directory in the tests where it matters.	2021-03-16 22:05:36 -07:00
Martin von Zweigbergk	81a0e0bd2a	protobuf: upgrade to version 2.22.0 I only noticed that there was a newer version when running `cargo install --path .`, which resulted in warnings about deprecated functions. There's no other reason I'm aware of to upgrade now.	2021-03-15 17:09:29 -07:00
Jun Wu	eacab648b0	working_copy: clean up ".git" automatically TreeState::write_tree leaves a ".git" file in the working copy. This is undesirable but more problematic on Windows - The second time TreeState::write_tree would panic because Repository::init_opts will fail with a Permission Denied error. This seems to be a libgit2 defect. But for now let's just remove ".git" automatically. This makes `cargo test --test smoke_test` pass on Windows.	2021-03-14 15:49:42 -07:00
Jun Wu	4cd29a2130	working_copy: avoid std::os::unix on Windows std::os::unix::fs::PermissionsExt::mode() does not exist on Windows. Treat files on Windows as regular files.	2021-03-14 15:49:22 -07:00
Martin von Zweigbergk	a7f4f4cf5b	rustfmt: configure to merge imports by module Perhaps we should even set the config to "Item" to reduce merge conflicts.	2021-03-14 10:53:14 -07:00
Martin von Zweigbergk	4b8484e561	rustfmt: configure to group imports	2021-03-14 10:46:25 -07:00
Martin von Zweigbergk	337b15c98d	cleanup: replace #[cfg(not(windows))] by $[cfg(unix)] I didn't realize that the `unix` configuration existed before.	2021-03-12 15:45:55 -08:00
Martin von Zweigbergk	031a39ecba	cleanup: fix lots of issues found in the lib crate by clippy I had forgotten to pass `--workspace` to clippy all this time :P	2021-02-26 23:15:43 -08:00
Martin von Zweigbergk	d80903ce48	index: also index predecessors Evolution needs to have fast access to the predecessors. This change adds that information to the commit index. Evolution also needs fast access to the change id and the bit saying whether a commit is pruned. We'll add those soon. Some tests changed because they previously added commits with predecessors that were not indexed, which is no longer allowed from this change. (We'll probably eventually want to allow that again, so that the user can prune predecessors they no longer care about from the repo.)	2021-02-26 10:33:34 -08:00
Martin von Zweigbergk	302c66825f	working_copy: preserve executable bit on Windows Windows doesn't support recording the executable bit in the file system. Before this commit, the code for reading and writing the executable wouldn't even compile on Windows. This commit at least makes it so we preserve whatever bit has been recorded in the repo. At least I hope that's what it does -- I don't have access to a Windows machine right now.	2021-02-07 00:50:21 -08:00
Martin von Zweigbergk	d4aed83aa6	working_copy: correct comment about stat accuracy We don't take a write lock when writing a file; other processes can modify the file.	2021-02-07 00:48:50 -08:00
Martin von Zweigbergk	3d679de022	working_copy: print warning about ignored symlinks instead of failing build The project doesn't currently build on Windows. One reason is because we had a `unimplemented!()` when trying to write a symlink. Let's print a warning instead, so the project can start building on Windows. (The next patch will fix another build problem on Windows.)	2021-02-07 00:46:17 -08:00
Martin von Zweigbergk	b820eddde3	commands: add an interactive mode for `jj restore` This adds an interactive mode for `jj restore`. It works by first creating two temporary directories with the contents of the subset of files that differ between the two trees, and then letting the user edit the directory representing the right/after side. This has some advantages compared to the interactive modes in Git and Mercurial: * It lets the user edit the final state as opposed to the diff itself (depending on the diff tool, of course). I think most users find it easier to edit the file contents than to edit the patch format. * It delegates the hard work to a tool that is already written (this is a big advantage for an immature tool like Jujube, but it is not an advantage from the user's point of view). Almost all of the work in this commit went into adding a function that takes two trees, lets the user edit the diff, and returns a new tree id. I plan to reuse that function for other interactive commands. One planned command is `jj edit`, which will let the user edit the changes in a commit. `jj edit -r abc123` will be mostly about providing a more intuitive name for `jj restore --source abc123^ --destination abc123`, plus it will be different for merge commits (it will edit only the changes in the merge commit). I also plan to add `jj split` by letting the user edit the full diff, leaving only the parts that should go into the first commit. Perhaps there will also be commands for moving part of a commit out of or into a parent commit.	2020-12-26 01:16:19 -08:00
Martin von Zweigbergk	9ade41078a	working_copy: remove a working_copy_path argument I missed earlier This should clearly have been removed in `4734eb6`.	2020-12-26 00:35:45 -08:00
Martin von Zweigbergk	c7ee24727a	protobuf: generate code at build-time I had tried to generate the protobuf code at build time many months ago, but decided against it because it slowed down the build too much. I didn't realize there was the "cargo:rerun-if-changed=<filename>" feature that time. Given that that exists, it seems like an obvious win to generate the source code at build time. I put the generated sources in `$OUT_DIR` (where [1] says they should be), then include them in the `protos` module by using the `include!` macro. The biggest problem with that is that I couldn't get IntelliJ to understand it, even after enabling the experimental features described in [2]. [1] https://doc.rust-lang.org/cargo/reference/build-script-examples.html#code-generation [2] https://github.com/intellij-rust/intellij-rust/issues/1908#issuecomment-592773865	2020-12-24 01:05:17 -08:00
Martin von Zweigbergk	3b326a942c	working_copy: add support for .gitignore files The project's source of truth is now in Git and I really miss support for anonymous heads and evolution (compared to when the code was in Mercurial). I'm therefore more motivated to make the tool useful for day-to-day work on small repos, so I can use it myself. Until now, I had been more focused on improving performance when it was used as a read-only client for medium-to-large repos. One important feature for my day-to-day work is support for ignores. This commit adds simple and effective, but somewhat hacky support for that. libgit2 requires a repo to check if a file should be ignored (presumably so it can respect `.git/info/excludes`). To work around that, we create a temporary git repo in `/tmp/` whenever the working copy is committed. We set that temporary git repo's working copy to be shared with our own working copy. Due to https://github.com/libgit2/libgit2sharp/issues/1716 (which seems to apply to the non-.NET version as well), this workaround unfortunately leaves a .git file (pointing to the deleted temporary git repo) around in every Jujube repo. That's always ignored by libgit2, so it's not much of a problem.	2020-12-20 00:37:43 -08:00
Martin von Zweigbergk	4734eb6493	working_copy: let WorkingCopy and TreeState have the working copy path I don't know why I didn't do it this way from the beginning.	2020-12-18 23:56:32 -08:00
Martin von Zweigbergk	6b1427cb46	import commit 0f15be02bf4012c116636913562691a0aaa7aed2 from my hg repo	2020-12-12 00:23:38 -08:00

... 2 3 4 5 6

262 commits