ostree/src/libostree
Jasper St. Pierre 20647edcbf repo-pull: Add a queue for scanning
On systems with slow disks, the recursive scanning of directories can
be expensive -- it takes upwards of 2 minutes on our systems. This can
block the main loop for such a long time that it allows the download to
time out...

As such, move all the scanning of objects to a queue, processed from
an idle, to make sure that we don't block the main loop when scanning.

https://bugzilla.gnome.org/show_bug.cgi?id=753336
2015-08-26 12:15:38 -07:00
..
README-deltas.md syntax-check: Remove empty lines at the end of file 2015-02-02 15:07:56 +01:00
README-gpg pull: Verify commits with gpg signatures from detached metadata 2013-09-29 14:49:47 -04:00
README.md core: Associate branches with remotes, move trigger runs into checkout 2012-04-03 23:46:34 -04:00
bupsplit.c core: Import bup's "rollsum" code, add a test case 2014-02-04 10:12:56 -05:00
bupsplit.h core: Import bup's "rollsum" code, add a test case 2014-02-04 10:12:56 -05:00
ostree-1.pc.in Install a shared library 2013-07-26 19:25:07 -04:00
ostree-async-progress.c Remove unnecessary #include "libgsystem.h" 2015-05-06 22:07:11 -04:00
ostree-async-progress.h syntax-check: Remove empty lines at the end of file 2015-02-02 15:07:56 +01:00
ostree-bootconfig-parser.c Use g_autoptr(GHashTable) instead of gs_unref_hashtable 2015-05-06 22:07:10 -04:00
ostree-bootconfig-parser.h sysroot: Read the bootloader configuration with fd-relative API 2015-03-09 14:29:14 -04:00
ostree-bootloader-grub2.c Remove unnecessary #include "libgsystem.h" 2015-05-06 22:07:11 -04:00
ostree-bootloader-grub2.h syntax-check: Remove empty lines at the end of file 2015-02-02 15:07:56 +01:00
ostree-bootloader-syslinux.c Remove unnecessary #include "libgsystem.h" 2015-05-06 22:07:11 -04:00
ostree-bootloader-syslinux.h syntax-check: Remove empty lines at the end of file 2015-02-02 15:07:56 +01:00
ostree-bootloader-uboot.c Remove unnecessary #include "libgsystem.h" 2015-05-06 22:07:11 -04:00
ostree-bootloader-uboot.h libostree: Make OstreeBootloader private 2013-09-15 20:26:13 -04:00
ostree-bootloader.c Remove unnecessary #include "libgsystem.h" 2015-05-06 22:07:11 -04:00
ostree-bootloader.h syntax-check: Remove empty lines at the end of file 2015-02-02 15:07:56 +01:00
ostree-chain-input-stream.c Remove unnecessary #include "libgsystem.h" 2015-05-06 22:07:11 -04:00
ostree-chain-input-stream.h core: Fix all introspection warnings 2013-07-27 10:13:30 -04:00
ostree-checksum-input-stream.c Remove unnecessary #include "libgsystem.h" 2015-05-06 22:07:11 -04:00
ostree-checksum-input-stream.h syntax-check: Remove empty lines at the end of file 2015-02-02 15:07:56 +01:00
ostree-cmdprivate.c Teach fsck about partial commits 2015-05-06 08:07:20 -04:00
ostree-cmdprivate.h syntax-check: Remove empty lines at the end of file 2015-02-02 15:07:56 +01:00
ostree-core-private.h core: Add _ostree_get_default_sysroot_path() 2015-07-16 12:48:11 -04:00
ostree-core.c core: Add _ostree_get_default_sysroot_path() 2015-07-16 12:48:11 -04:00
ostree-core.h pull: verify signature for the summary file 2015-05-07 21:58:04 +02:00
ostree-deployment.c Remove unnecessary #include "libgsystem.h" 2015-05-06 22:07:11 -04:00
ostree-deployment.h deployment: Add an API to get relative origin path 2015-03-10 10:47:47 -04:00
ostree-diff.c Remove unnecessary #include "libgsystem.h" 2015-05-06 22:07:11 -04:00
ostree-diff.h syntax-check: Remove empty lines at the end of file 2015-02-02 15:07:56 +01:00
ostree-fetcher.c pull: Stop using GMainLoop 2015-08-13 22:02:00 -04:00
ostree-fetcher.h pull: Stop using GMainLoop 2015-08-13 22:02:00 -04:00
ostree-gpg-verifier.c repo: Change GPG verification policy 2015-06-10 12:28:57 -04:00
ostree-gpg-verifier.h repo: Change GPG verification policy 2015-06-10 12:28:57 -04:00
ostree-gpg-verify-result-private.h libotutil: Establish a place for GPG utilities 2015-05-01 10:20:34 -04:00
ostree-gpg-verify-result.c Remove unnecessary #include "libgsystem.h" 2015-05-06 22:07:11 -04:00
ostree-gpg-verify-result.h gpg: Add ostree_gpg_verify_result_describe_variant() 2015-05-06 21:04:40 -04:00
ostree-kernel-args.c Remove unnecessary #include "libgsystem.h" 2015-05-06 22:07:11 -04:00
ostree-kernel-args.h syntax-check: Remove empty lines at the end of file 2015-02-02 15:07:56 +01:00
ostree-libarchive-input-stream.c Remove unnecessary #include "libgsystem.h" 2015-05-06 22:07:11 -04:00
ostree-libarchive-input-stream.h syntax-check: Remove empty lines at the end of file 2015-02-02 15:07:56 +01:00
ostree-linuxfsutil.c Remove unnecessary #include "libgsystem.h" 2015-05-06 22:07:11 -04:00
ostree-linuxfsutil.h syntax-check: Remove empty lines at the end of file 2015-02-02 15:07:56 +01:00
ostree-lzma-common.c syntax-check: Remove empty lines at the end of file 2015-02-02 15:07:56 +01:00
ostree-lzma-common.h deltas: Link to liblzma, add internal API to use it 2014-04-29 10:59:57 -04:00
ostree-lzma-compressor.c Fix repeated words. 2015-01-30 15:27:36 +01:00
ostree-lzma-compressor.h deltas: Link to liblzma, add internal API to use it 2014-04-29 10:59:57 -04:00
ostree-lzma-decompressor.c deltas: Link to liblzma, add internal API to use it 2014-04-29 10:59:57 -04:00
ostree-lzma-decompressor.h deltas: Link to liblzma, add internal API to use it 2014-04-29 10:59:57 -04:00
ostree-metalink.c pull: Stop using GMainLoop 2015-08-13 22:02:00 -04:00
ostree-metalink.h pull: Stop using GMainLoop 2015-08-13 22:02:00 -04:00
ostree-mutable-tree.c Remove unnecessary #include "libgsystem.h" 2015-05-06 22:07:11 -04:00
ostree-mutable-tree.h syntax-check: Remove empty lines at the end of file 2015-02-02 15:07:56 +01:00
ostree-repo-checkout.c Use g_autoptr(GVariant) instead of gs_unref_variant 2015-05-06 22:07:10 -04:00
ostree-repo-commit.c repo: fix an incorrect comment 2015-07-28 11:35:33 -04:00
ostree-repo-file-enumerator.c Remove unnecessary #include "libgsystem.h" 2015-05-06 22:07:11 -04:00
ostree-repo-file-enumerator.h syntax-check: Remove empty lines at the end of file 2015-02-02 15:07:56 +01:00
ostree-repo-file.c Use g_autoptr(GVariant) instead of gs_unref_variant 2015-05-06 22:07:10 -04:00
ostree-repo-file.h syntax-check: Remove empty lines at the end of file 2015-02-02 15:07:56 +01:00
ostree-repo-libarchive.c Use g_autoptr(GPtrArray) instead of gs_unref_ptrarray 2015-05-06 22:07:10 -04:00
ostree-repo-private.h repo: Add a "sysroot-path" property 2015-07-16 12:48:56 -04:00
ostree-repo-prune.c Use g_autoptr(GVariant) instead of gs_unref_variant 2015-05-06 22:07:10 -04:00
ostree-repo-pull.c repo-pull: Add a queue for scanning 2015-08-26 12:15:38 -07:00
ostree-repo-refs.c libostree: new API ostree_repo_remote_list_refs 2015-07-24 12:37:42 -04:00
ostree-repo-static-delta-compilation-analysis.c Use g_autoptr(GVariant) instead of gs_unref_variant 2015-05-06 22:07:10 -04:00
ostree-repo-static-delta-compilation.c static-delta: add max-bsdiff-size option 2015-07-28 13:02:27 +02:00
ostree-repo-static-delta-core.c Use g_autoptr(GVariant) instead of gs_unref_variant 2015-05-06 22:07:10 -04:00
ostree-repo-static-delta-private.h pull: add new switch option --disable-static-deltas 2015-05-13 09:38:04 +02:00
ostree-repo-static-delta-processing.c Use g_autoptr(GVariant) instead of gs_unref_variant 2015-05-06 22:07:10 -04:00
ostree-repo-traverse.c Remove unnecessary #include "libgsystem.h" 2015-05-06 22:07:11 -04:00
ostree-repo.c pull: Stop using GMainLoop 2015-08-13 22:02:00 -04:00
ostree-repo.h pull: new option --commit-metadata-only 2015-07-24 12:43:35 -04:00
ostree-rollsum.c rollsum: Fix assertion for CRC matches 2015-08-25 09:26:22 -04:00
ostree-rollsum.h Add bsdiff support to deltas 2015-03-03 12:16:17 -05:00
ostree-sepolicy.c Remove unnecessary #include "libgsystem.h" 2015-05-06 22:07:11 -04:00
ostree-sepolicy.h syntax-check: Remove empty lines at the end of file 2015-02-02 15:07:56 +01:00
ostree-sysroot-cleanup.c Remove unnecessary #include "libgsystem.h" 2015-05-06 22:07:11 -04:00
ostree-sysroot-deploy.c Remove unnecessary #include "libgsystem.h" 2015-05-06 22:07:11 -04:00
ostree-sysroot-private.h sysroot: Add an API to lock 2015-05-05 08:52:44 -04:00
ostree-sysroot-upgrader.c Remove unnecessary #include "libgsystem.h" 2015-05-06 22:07:11 -04:00
ostree-sysroot-upgrader.h syntax-check: Remove empty lines at the end of file 2015-02-02 15:07:56 +01:00
ostree-sysroot.c sysroot: Add an unload() API 2015-08-13 12:43:14 -04:00
ostree-sysroot.h sysroot: Add an unload() API 2015-08-13 12:43:14 -04:00
ostree-tls-cert-interaction.c libostree: Make OstreeTlsCertInteraction explicitly private 2014-06-09 14:04:53 -04:00
ostree-tls-cert-interaction.h libostree: Make OstreeTlsCertInteraction explicitly private 2014-06-09 14:04:53 -04:00
ostree-types.h syntax-check: Remove empty lines at the end of file 2015-02-02 15:07:56 +01:00
ostree-varint.c syntax-check: Remove empty lines at the end of file 2015-02-02 15:07:56 +01:00
ostree-varint.h syntax-check: Remove empty lines at the end of file 2015-02-02 15:07:56 +01:00
ostree.h Include ostree-gpg-verify-result.h in ostree.h 2015-03-31 19:48:55 -04:00

README.md

Repository design

At the heart of OSTree is the repository. It's very similar to git, with the idea of content-addressed storage. However, OSTree is designed to store operating system binaries, not source code. There are several consequences to this. The key difference as compared to git is that the OSTree definition of "content" includes key Unix metadata such as owner uid/gid, as well as all extended attributes.

Essentially OSTree is designed so that if two files have the same OSTree checksum, it's safe to replace them with a hard link. This fundamental design means that an OSTree repository imposes negligible overhead. In contrast, a git repository stores copies of zlib-compressed data.

Key differences versus git

  • As mentioned above, extended attributes and owner uid/gid are versioned
  • Optimized for Unix hardlinks between repository and checkout
  • SHA256 instead of SHA1
  • Support for empty directories

Binary files

While this is still in planning, I plan to heavily optimize OSTree for versioning ELF operating systems. In industry jargon, this would be "content-aware storage".

Trimming history

OSTree will also be optimized to trim intermediate history; in theory one can regenerate binaries from corresponding (git) source code, so we don't need to keep all possible builds over time.

MILESTONE 1

  • Basic pack files (like git)

MILESTONE 2

  • Store checksums as ay
  • Drop version/metadata from tree/dirmeta objects
  • Add index size to superindex, pack size to index
    • So pull can calculate how much we need to download
  • Split pack files into metadata/data
  • pull: Extract all we can from each packfile one at a time, then delete it
  • Restructure repository so that links can be generated as a cache; i.e. objects/raw, pack files are now the canonical
  • For files, checksum combination of metadata variant + raw data
    • i.e. there is only OSTREE_OBJECT_TYPE_FILE (again)

MILESTONE 3

  • Drop archive/raw distinction - archive repositories always generate packfiles per commit
  • Include git packv4 ideas:
    • metadata packfiles have string dictionary (tree filenames and checksums)
    • data packfiles match up similar objects
  • Rolling checksums for partitioning large files? Kernel debuginfo
  • Improved pack clustering
    • file fingerprinting?
  • ELF-x86 aware deltas

git: http://git-scm.com/ Venti: http://plan9.bell-labs.com/magic/man2html/6/venti Elephant FS: http://www.hpl.hp.com/personal/Alistair_Veitch/papers/elephant-hotos/index.html

Compression

xdelta: http://xdelta.org/ Bsdiff: http://www.daemonology.net/bsdiff/ xz: http://tukaani.org/xz/