33.10 Dependencies Git LFS

20210420

The Git large file storage project supports the store of large files within a repository, without downloading the file on a zip download. This is useful for the git owner since the free GitHub account limits downloads to 1GB per month.

For MLHub this presents a small challenge. MLHub relies on files being included in the zip archive. To support this, the package maintainer needs to ensure that the LFS files are included in the zip archive with the appropriate settings. In GitHub, under the repository’s Settings, Scroll down the Options to Archives and check the box for Include Git LFS objects in archives.

If this presents challenges for the limits on the total downloads from a git repository, then the package developer may want to host the large pre-built models on another cloud service and have them downloaded during a configuration, specified as a file dependency in MLHUB.yaml. See Section 33.7 for an example of storing large files on a Nextcloud server.



Your donation will support ongoing availability and give you access to the PDF version of this book. Desktop Survival Guides include Data Science, GNU/Linux, and MLHub. Books available on Amazon include Data Mining with Rattle and Essentials of Data Science. Popular open source software includes rattle, wajig, and mlhub. Hosted by Togaware, a pioneer of free and open source software since 1984. Copyright © 1995-2022 Graham.Williams@togaware.com Creative Commons Attribution-ShareAlike 4.0