Skip to content

Latest commit

 

History

History
149 lines (96 loc) · 10.2 KB

README.md

File metadata and controls

149 lines (96 loc) · 10.2 KB

colas-bash-lib

This is the repository of the miscellaneous small bits of code and info I use in my bash scripts. My personal bash developer toolkit.

Scripts (in bin/)

Standalone small bash scripts that do not deserve a separate repository (such as cgibashopts, rsync-incr, tewiba, irclogger, ...) can be found in bin/:

Library (in src/)

My collection of various small bash functions I reuse often in my bash scripts. they expect a modern bash version, I guess version 5 at least, although most should work with bash 4.3+. Fell free to copy and use in any of your projects or compilation of bash tools.

Bigger copy-pastable functions have their own repository, such as bashoptions.

I have tried to make them the fastest possible, by avoiding forking sub-shells or external commands, and benchmarking extensively to compare the possible way of coding them. Of course, I will gladly accept suggestions or code to make them faster. But it means that error checking is often terse and minimal, and readability of the code was not a priority.

I have a kind of «anti-npm» approach, in that I copy these functions into my scripts rather that using them as a true external library that I would load at runtime. It thus avoid installation issues, and the dependency problems that may arise from automated upgrades. Just keep the "from:" comment at the start to know where (url) to check for upgrades, and which version of the function this copy is.

Most of the files in the form src/foo.sh have many versions of the same functions. For a function named foo, you can find:

  • "functional forms" foo that prints its result on stdout. This is the traditional way used in most scripts, so that's why I always provide it. E.g: gee=$(foo bar)
  • "set forms" set_foo that sets the result into a variable that must be provided as first argument. This is my preferred way to use bash functions, as it is faster than the "normal" way above (often twice as fast), and it allows the function to access the main script variables, which is especially important with arrays as they cannot be passed as arguments. And it provides a simple way to return multiple values, by using more than one "set" variables. But the drawback is that they are cumbersome to use in recursive programs, as you must take care that the variable name is not the same as the name used for it inside the function. I prefix these internal variables with an underscore to try to minimize name clashes, but it is not foolproof. E.g: set_foo gee bar
  • "var forms" var_foo modifies in place the contents of the variable given as argument, if it is a common use case. For instance, trimming space is often performed in place, so I provide var_trim, but not var_regexp_quote, as using the same variable to hold a regexp or its quoted version could be error-prone. E.g: var_foo gee_is the same as set_foo gee "$gee"or gee=$(foo "$gee")
  • "samename forms" foo is a simplified, and a tiny bit faster, form of set forms, where the function sets its return value to a variable of the same name of the function. But I rarely use this, as I think that the very small speed gain is not worth the potential confusion for readers that are not familiar that the same name can refer to a function or a variable. E.g: after calling foo bar, the result can be found in the shell variable $foo

All these functions duplicate their code so that they are standalone and can be copied individually, unless clearly specified. It is an «anti-framework» approach. It is expected you will only copy the functions and the forms you are actually using into your scripts rather than loading the whole file as is (but it can be done).

The non-trivial functions start with a local -; set +xve line that removes the trace mode when in the function, with nearly no speed penalty. Thus when debugging a script by set -xv your log is not polluted by the internal tracing of these function that can be quite long when iterating on strings for instance. It also disables the set -e flag, just to be sure that these functions do not trigger spurious errors if you run your script with this flag.

The functions pass shellcheck, and can be used with set -u.

library files

Here is the user documentation for the library functions, in alphabetical order on the file names. More technical details can be found as comments in the files, and I only document here the "functional form", knowing that at least "set forms" are also provided for each functions. The list of available forms is given in a "Forms:" line. E.g. you can use variable=$(regexp_nocase expression) (functional form) or set_regexp_nocase variable expression (set form) even if only regexp_nocase is documented below. And if the "Form:" line was listing "var", you could use var_regexp_nocase variable.

src/ascii.sh

  • i2a {integer} Returns the character (string of length 1) of ascii code {integer}. E.g: i2a 65 returns A

    Forms: functional, set

  • a2i {character} Returns the decimal ascii code (integer) of the character (string of length 1) parameter. E.g: a2i "A" returns 65

    Forms: functional, set

  • x2a{hexadecimal} Returns the character (string of length 1) of ascii code {hexadecimal}. E.g:x2a 41returnsA

    Forms: functional, set

  • a2x {character} Returns the hexadecimal ascii code (integer) of the character (string of length 1) parameter. E.g: x2i "A" returns 41

    Forms: functional, set

src/regexp_nocase.sh

  • regexp_nocase {regular-expression} Takes a regular expression (as used in [[ =~ ]]) as only parameter, and returns a modified version that matches in a case-insensitive way. E.g: a becomes [aA], a.b becomes [aA].[bB], [a-c] becomes [a-cA-C], [[:lower:]] becomes [[:alpha:]], etc...

    This is very useful when you want to match a regular expression with some parts matching with case, and others not, because bash lacks inline regexp modifiers such as (?i).

    Forms: functional, set.

src/regexp_quote.sh

  • regexp_quote {regular-expression}

    Takes a regular expression (as used in [[ =~ ]]) as only parameter, and returns a modified version that quotes (or escape) special characters to make them match literally. A bit like fgrep. E.g: a.b becomes a[.]b, x*[a-c] becomes x[*][[]a-c[]], etc... This is very useful when you want to match a regular expression with some parts matching literally, and other parts matching as regexps.

    Forms: functional, set.

  • regexp_quote_nocase {regular-expression}

    As regexp_quote above, but the returned regular expression is made case-independent. E.g: a.b becomes[aA][.][bB], x*[a-c]becomes[xX][*][[][aA]-[xX][cC][]], etc...

    Forms: functional, set.

Only in "set" form:

  • add_regexp_quote {variable} {expression} As set_regexp_quote, but the quoted regular expression is appended to the string already in variable._Convenient when building a regular expression piece by piece.
  • add_regexp_quot_nocase{variable} {expression} Asadd_regexp_quote above, but the appended regular expression is made case-independent.

src/trim.sh

  • trim {string} Removes the spaces or tabs at the start and end of the {string} argument and returns it

    Forms: functional, set, var.

Library test suite (in tests/)

A test suite for the library function is available in test/, based on Tewiba (TEsting WIth BAsh), (provided). It can be useful to run it by cd tests; ./tewiba or tests/RUN_ALL_TESTSon non-standard linux installation to check that the functions can be used without problems.

Useful links

These are the web sites I have found useful for my bash programming needs:

Info:

Tools:

Libraries:

History

High-level history of changes

  • v1.0.0 (2022-02-04) First deployment, with modules: ascii, regexp_nocase, regexp_quote, trim