ok/jj
1
0
Fork 0
forked from mirrors/jj

WIP docs/design: describe submodule storage ideas and how to judge them

This doc describes what we need to consider in a submodule storage
solution, some possible solutions and what criteria we should use to
decide on a future direction.

This is still a WIP:

- The solutions are still underdescribed
- The actual evaluation of solutions is missing

Suggestions for the above are welcome :)
This commit is contained in:
Glen Choo 2023-04-03 17:02:17 -07:00
parent a19a91bfbc
commit 562eebe213
2 changed files with 83 additions and 2 deletions

View file

@ -0,0 +1,80 @@
# Git submodule storage
## Objective
Decide what approach(es) to Git submodule storage we should pursue.
The decision will be recorded in [./git-submodules.md](./git-submodules.md).
## Use cases to consider
The submodule storage format should support the workflows specified in the
[submodules roadmap](./git-submodules.md). It should be obvious how "Phase 1"
requirements will be supported, and we should have an idea of how "Phases 2,3,X"
might be supported.
Notable use cases and workflows are noted below.
### Fetching submodule commits
Git's protocol is designed for communicating between copies of the same
repository. Notably, a Git fetch calculates the list of required objects by
performing reachability checks between the refs on the local and the remote
side. We should expect that this will only work well if the submodule repository
is stored as a local Git repository.
Rolling our own Git fetch is too complex to be worth the effort.
### "jj op restore" and operation log format
We want `jj op restore` to restore to an "expected" state in the submodule.
There is a potential distinction between running `jj op restore` in the
superproject vs in the submodule, and the expected behavior may be different in
each case, e.g. in the superproject, it might be enough to restore the submodule
working copy, but in the submodule, refs also need to be restored.
Currently, the operation log only references objects and refs in the
superproject, so it is likely that proposed approaches will need to extend this
format. It is also worth considering that submodules may be added, updated or
removed in superproject commits, thus the list of submodules is likely to change
over the repository's lifetime.
### Nested submodules
Git submodules may contain submodules themselves, so our chosen storage schemes
should support that.
We should consider limiting the recursion depth to avoid nasty edge cases (e.g.
cyclical submodules.) that might surprise users.
### Supporting future extensions
There are certain extensions we may want to make in the future, but we don't
have a timeline for them today. Proposed approaches should take these
extensions into account (e.g. the approach should be theoretically extensible),
but a full proposal for implementing them is not necessary.
These extensions are:
- Non-git subrepos
- Colocated Git repos
- Non-git backends
## Possible approaches
### Approach 1: Store Git submodules as full jj repos
This would be somewhere in `.jj` but outside of `.jj/store`. We would then
expose a "submodules" interface that gets hooked up to the relevant machinery
(e.g. updating the working copy).
TODO(chooglen): Discuss operation log
TODO(chooglen): Discuss nested submodules
### Approach 3: Store Git submodules as alternate jj repo backends
This is Approach 3, but instead of storing the submodule in a Git backend,
create a new backend that is backed by a full jj repo (like Approach 2), and
store the Git submodule in its own jj repo backend.
TODO(chooglen): Discuss operation log
TODO(chooglen): Discuss nested submodules

View file

@ -139,9 +139,10 @@ I.e. outcomes we would like to see if there were no constraints whatsoever.
TODO
### On-disk representation
### Storing submodules
TODO
Possible approaches under discussion. See
[./git-submodule-storage.md](./git-submodule-storage.md).
### Snapshotting new submodule changes