18 Package Configuration

A MLHub yaml configuration file is all that is required to turn a git based repository into quickly accessible and ready to run, explore, rebuild, and even deploy, pre-built machine learning models. Git repositories, like github, gitlab, and bitbucket, are where many researchers today publish their algorithms.

By installing direct from git repositories, a package developer can simply add a MLHUB.yaml file and the appropriate Python or R scripts (typcially within a mlhub folder) to their git repository. They now have an MLHub package. The scripts are generally short wrappers for the package functionality, supporting command line arguments.

A single git repository may contain multiple MLHub packages, simply through multiple YAML files which can have any name:

$ ml install gjwgit/recommenders:mlhub/sar.yaml
$ ml readme sar
$ ...
$ ml install gjwgit/recommenders:mlhub/rbm.yaml
$ ml readme rbm
$ ...

Each configuration files identifies the other files required specifically for the package.

Your donation will support ongoing development and give you access to the PDF version of this book. Desktop Survival Guides include Data Science, GNU/Linux, and MLHub. Books available on Amazon include Data Mining with Rattle and Essentials of Data Science. Popular open source software includes rattle, wajig, and mlhub. Hosted by Togaware, a pioneer of free and open source software since 1984.
Copyright © 1995-2021 Graham.Williams@togaware.com Creative Commons Attribution-ShareAlike 4.0.