Glob Pattern
About #
A glob pattern is a string of characters that specifies a set of filenames or paths in a file system. The term “glob” is short for “global,” and refers to the fact that a glob pattern can match multiple filenames or paths at once. For HPE Machine Learning Data Management, you can use glob patterns to define the shape of your datums against your inputs, which are spread across HPE Machine Learning Data Management workers for distributing computing.
Examples #
Glob Pattern | Datum created |
---|---|
/ |
HPE Machine Learning Data Management denotes the whole repository as a single datum and sends all input data to a single worker node to be processed together. |
/* |
HPE Machine Learning Data Management defines each top-level files / directories in the input repo, as a separate datum. For example, if you have a repository with ten files and no directory structure, HPE Machine Learning Data Management identifies each file as a single datum and processes them independently. |
/*/* |
HPE Machine Learning Data Management processes each file / directory in each subdirectories as a separate datum. |
/** |
HPE Machine Learning Data Management processes each file in all directories and subdirectories as a separate datum. |
Glob patterns can also use other special characters, such as the question mark (?
) to match a single character, or brackets ([...]
) to match a set of characters.