Balthazar

Just enough Makefile to be dangerous

2023-08-30T00:00:00+02:00

Table of Contents

Getting started with make
My best practices

Over the years, I have developed a bit of a love-hate relationship with make. On the plus side, it is ubiquitous, preinstalled on most UNIX systems, and widely used. On the other hand, its syntax can feel arcane and clunky, and it can prove hard to debug. In this article, I will go over the basic make concepts, and the set of best practices I've come to embrace as my own, to make make enjoyable to use.

Let's start at the beginning.

Getting started with `make`

The step structure

make is a build system: a piece of tooling allowing you to define steps to build your project. It should make sure to only rebuild what needs to be rebuilt, to keep build time as short as possible. All these steps are defined in a file named Makefile, usually located at the root of your project.

A make step has the following syntax:

target: [space separated dependencies]
        shell instructions
        ...

By default, make assumes that a target is a file, and will build it by executing the shell instructions associated with that target, after it has executed the shell instructions associated with the possible target dependencies (if any).

Let's have a look at a simple example in which we will build this hello.c file into a hello binary, using the gcc compiler.

#include <stdio.h>

int main() {
    printf("hello world\n");
    return 0;
}

We define the following Makefile:

hello: hello.c
    gcc hello.c -o hello

We can then run make hello to compile the hello binary, after which we run it:

$ make hello
gcc hello.c -o hello
$ ./hello
hello world

When we ran make hello, make detected that the hello file wasn't found on disk, and built it by running gcc hello.c -o hello.

What happens if we re-run the same command now?

$ make hello
make: `hello' is up to date.

make detected that hello.c hadn't changed since the last time hello was built, and thus did nothing. If we change hello.c to print hello bobbytables instead of hello world, make will see that the file has changed and will happily rebuild the binary:

#include <stdio.h>

int main() {
-    printf("hello world\n");
+    printf("hello bobbytables\n");
     return 0;
}

$ make hello
gcc hello.c -o hello
$ ./hello
hello bobbytables
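The rebuild decision described above boils down to a timestamp comparison. As a rough mental model (a Python sketch, not how make is actually implemented; needs_rebuild is a hypothetical helper name), a target is rebuilt when it is missing or older than one of its dependencies:

```python
import os
import tempfile


def needs_rebuild(target: str, dependencies: list[str]) -> bool:
    """Return True if `target` is missing or older than any of its dependencies."""
    if not os.path.exists(target):
        # nothing on disk yet: the target must be built
        return True
    target_mtime = os.path.getmtime(target)
    # rebuild as soon as one dependency was modified after the target was built
    return any(os.path.getmtime(dep) > target_mtime for dep in dependencies)
```

With hello freshly built, needs_rebuild("hello", ["hello.c"]) stays False until hello.c is saved again.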

Phony targets

Say now that you'd like to define a run step, that will simply run the binary:

hello: hello.c
    gcc -o hello hello.c

run: hello
    ./hello

$ make run
./hello
hello world

The issue here is that run does not represent a file on disk. If a file named run ever appeared in the directory, make would consider the target up to date and skip it. To avoid confusing make, we mark this step as being PHONY, meaning it is not a file make needs to build. This makes sure the associated shell instructions are always executed.

hello: hello.c
    gcc -o hello hello.c

.PHONY: run
run: hello
    ./hello

Default target

We can define which step should run when invoking make without any argument by using .DEFAULT_GOAL:

.DEFAULT_GOAL = run

hello: hello.c
    gcc -o hello hello.c

.PHONY: run
run: hello
    ./hello

$ make
./hello
hello world

We can hide the command being executed by prefixing it with @.

.DEFAULT_GOAL = run

hello: hello.c
    gcc -o hello hello.c

.PHONY: run
run: hello
    @./hello

$ make
hello world

And with that, we now know just enough to get started for real.

My best practices

The examples are taken from the 5esheets Makefile.

Makefile auto-documentation as the default step

Ever since I stumbled on this article, I have made sure to auto-document all my Makefiles, to help with discoverability. This works by adding a one-line explanation of the "public" targets (the ones a contributor might find themselves executing) after a ##. We then define a help target that parses the current Makefile, extracts all the target names and associated comments, and formats them nicely. The finishing touch is to make help the default target, to make it extra easy for a newcomer to understand what can be built with your Makefile.

.DEFAULT_GOAL = help
...

run: admin-statics build  ## Run the app
    ...

help:  ## Display help
    @grep -E '^[a-zA-Z_-]+:.*?## .*$$' $(MAKEFILE_LIST) | sort | awk 'BEGIN {FS = ":.*?## "}; {printf "\033[36m%-30s\033[0m %s\n", $$1, $$2}'
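The grep/awk pipeline can look opaque at first. Here is a rough Python equivalent of what it does, as a sketch only (extract_help and format_help are hypothetical names, and the regular expression is copied from the grep call): keep the lines made of a target name followed by a ## comment, sort them, and print the target in cyan on a 30-column field.

```python
import re

# Same pattern as the grep call: a target name, a colon, then a "## " comment
HELP_LINE = re.compile(r"^([a-zA-Z_-]+):.*?## (.*)$")


def extract_help(makefile_text: str) -> list[tuple[str, str]]:
    """Return sorted (target, description) pairs for every documented target."""
    entries = []
    for line in makefile_text.splitlines():
        match = HELP_LINE.match(line)
        if match:
            entries.append((match.group(1), match.group(2)))
    return sorted(entries)


def format_help(entries: list[tuple[str, str]]) -> str:
    """Render each entry like the awk printf: cyan target name padded to 30 columns."""
    return "\n".join(
        f"\033[36m{target:<30}\033[0m {description}"
        for target, description in entries
    )
```

Recipe lines start with whitespace and undocumented targets have no ##, so neither shows up in the help output.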

This is what the output looks like for the 5esheets project:

Tell what's happening, not how

I personally like to have each step include a short explanation of what it is doing, and hide the actual shell command, which I find of low value.

deps-python: poetry.lock
    @echo "\n[+] Installing python dependencies"
    @poetry install

In that example, when the target executes, I see [+] Installing python dependencies, as well as the command output, but not the poetry install command itself. I find that communicating the intent is clearer and more self-explanatory than taking up screen real estate with the nitty-gritty details.

Define commonalities in variables

When I find myself repeating things too much in various rules, this is when I start using variables. For example, instead of writing many rules that hardcode a given directory name in them, I define that directory name in a variable. This makes it easier to keep the Makefile valid when the project structure evolves.

app-root = dnd5esheets

black:
    @echo "\n[+] Reformatting python files"
    @poetry run black --check $(app-root)/

mypy:
    @echo "\n[+] Checking Python types"
    @poetry run mypy $(app-root)/

ruff:
    @echo "\n[+] Running linter"
    @poetry run ruff $(app-root)/

Keep all paths in the Makefile

Some of my targets are oftentimes generated via scripts (usually python), which process some input and dump their result to a target file. I find that passing the output file path to the script (instead of hardcoding the file path in the script) allows the Makefile to be more self-contained and makes it easier to rename files without having to update both the Makefile and the script.

$(data-dir)/translations-items-fr.json:
    @echo "\n[+] Fetching items french translations"
    @curl -s $(fr-translations-data-dir)/dnd5e.items.json > $(data-dir)/translations-items-fr.json

$(data-dir)/items-base.json: $(data-dir)/translations-items-fr.json
    @echo "\n[+] Fetching base equipment data"
    @curl -s $(5etools-data-dir)/items-base.json | ./scripts/preprocess_base_item_json.py $(data-dir)/items-base.json

We can then avoid repeating ourselves by leveraging the $@ symbol, which expands to the name of the target being generated.

$(data-dir)/translations-items-fr.json:
    @echo "\n[+] Fetching items french translations"
    @curl -s $(fr-translations-data-dir)/dnd5e.items.json > $@

$(data-dir)/items-base.json: $(data-dir)/translations-items-fr.json
    @echo "\n[+] Fetching base equipment data"
    @curl -s $(5etools-data-dir)/items-base.json | ./scripts/preprocess_base_item_json.py $@

Generate a visual representation of the Makefile

I like having a visual representation of the dependencies of each target. It allows me to debug why some targets are not being rebuilt when they should be, or are always being rebuilt when they shouldn't be. I find that it also helps when getting started with the project for the first time. I leverage the makefile2dot Python package for this:

doc/makefile.png: Makefile
    @echo "\n[+] Generating a visual graph representation of the Makefile"
    @poetry run makefile2dot -o $@

You'll notice that this target depends on the Makefile itself, as it needs to be re-generated as the Makefile evolves.
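To give an idea of what such a tool has to do, here is a deliberately naive sketch. It is not makefile2dot's implementation, and parse_rules and to_dot are hypothetical names. It assumes simple one-line rules of the form target: dependencies, and ignores recipes, variable assignments and special targets such as .PHONY.

```python
def parse_rules(makefile_text: str) -> dict[str, list[str]]:
    """Map each target to its list of dependencies."""
    graph: dict[str, list[str]] = {}
    for line in makefile_text.splitlines():
        # skip blank lines, recipes (indented), comments and lines without a rule
        if not line or line[0] in " \t#" or ":" not in line:
            continue
        left, right = line.split(":", 1)
        if right.startswith("=") or "=" in left:
            continue  # variable assignment such as `x := 1`
        dependencies = right.split("#", 1)[0].split()  # drop trailing comments
        for target in left.split():
            if target.startswith("."):
                continue  # special targets such as .PHONY
            graph.setdefault(target, []).extend(dependencies)
    return graph


def to_dot(graph: dict[str, list[str]]) -> str:
    """Render the graph in Graphviz DOT syntax, one edge per dependency."""
    lines = ["digraph makefile {"]
    for target, dependencies in graph.items():
        lines.append(f'    "{target}";')
        for dependency in dependencies:
            lines.append(f'    "{target}" -> "{dependency}";')
    lines.append("}")
    return "\n".join(lines)
```

The resulting text can be fed to Graphviz to produce an image. A real Makefile with pattern rules or variables in target names needs a proper parser, which is why I rely on an existing package.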

Keep things readable

This is probably my most fundamental best practice.

Over the years, I have realized that I'm not smart enough to maintain a cryptic-looking Makefile. In my view, articles such as this one steer the reader into producing "smart" Makefiles that are non-obvious to reason about (especially the last example). I need to be able to read a target's logic and understand what it does months after having written it. In the same way, I won't hesitate to repeat myself and avoid variables when I think the output looks clearer. I try not to use "magic variables" too much.

There's a delicate balance to be struck between expressibility and readability, and I think readability should always win. You'll thank yourself later.

Pinning your SQLite version across environments

2023-08-25T00:00:00+02:00

The project I'm currently working on only has a single external dependency: SQLite, with full text search enabled. As a result, the application is extremely easy to package and run. However, I found out that ensuring that you have the exact same SQLite version and feature set in all your environments (development machines running macOS and linux, CI and production) is trickier than I expected.

When you rely on a traditional database server (PostgreSQL, MySQL, mongoDB, etc), you can achieve this by running the same server version in all your environments.

Docker really shines there, as it allows you to do just that in a single command.

$ docker run postgres:15.4

Things are a bit different with SQLite, as it is not an SQL server. It is a library that you embed in your program (either by compiling it alongside your code, or by relying on a shared library and language bindings). Python does the latter: its sqlite3 module is written in C using the CPython API, and includes the sqlite3.h header file. Where does this header file come from though?

Inspecting the sqlite version on linux

If we have a look at a python3.11 installation directory on a random Ubuntu server, we see that it bundles an _sqlite3 shared object, which itself dynamically loads libsqlite3.so.0.

$ find /usr/lib/python3.11  -name "*sqlite3*.so"
/usr/lib/python3.11/lib-dynload/_sqlite3.cpython-311-x86_64-linux-gnu.so
$ ldd /usr/lib/python3.11/lib-dynload/_sqlite3.cpython-311-x86_64-linux-gnu.so
    linux-vdso.so.1 (0x00007ffcda976000)
    libsqlite3.so.0 => /lib/x86_64-linux-gnu/libsqlite3.so.0 (0x00007fab44d9c000) # <--
    libc.so.6 => /lib/x86_64-linux-gnu/libc.so.6 (0x00007fab44a00000)
    libm.so.6 => /lib/x86_64-linux-gnu/libm.so.6 (0x00007fab44cb3000)
    /lib64/ld-linux-x86-64.so.2 (0x00007fab44f17000)

Same question: where does /lib/x86_64-linux-gnu/libsqlite3.so.0 come from then?

$ apt-file search /lib/x86_64-linux-gnu/libsqlite3.so.0
libsqlite3-0: /usr/lib/x86_64-linux-gnu/libsqlite3.so.0
libsqlite3-0: /usr/lib/x86_64-linux-gnu/libsqlite3.so.0.8.6
$ apt-cache search libsqlite3-0
libsqlite3-0 - SQLite 3 shared library

This means that python relies on whatever libsqlite3 version is installed by the system package manager. We can double check this by having a look at the python3 package recursive dependencies: python3 -> libpython3-stdlib -> libpython3.11-stdlib -> libsqlite3-0.

To know what version is installed on that system, we can inspect the version of the libsqlite3-0 apt package:

$ apt-cache show libsqlite3-0 | grep Version
Version: 3.40.1-1

We can check that we're getting this exact version via python:

>>> import sqlite3
>>> conn = sqlite3.connect(":memory:")
>>> conn.execute("select sqlite_version()").fetchone()
('3.40.1',)

Inspecting the sqlite version on macOS

Assuming you are installing your packages via brew on macOS, you'll find that it does things a bit differently than apt. The python3 formula depends on sqlite, which itself downloads an archive pinned to a given version (3.43.0 at the time of writing), and then compiles libsqlite3.dylib.

Indeed, we see this library when inspecting the content of the sqlite brew package:

~ ❯ ls -alh /opt/homebrew/opt/sqlite/lib/libsqlite3.dylib
lrwxr-xr-x    18 br   16 May 15:45  /opt/homebrew/opt/sqlite/lib/libsqlite3.dylib -> libsqlite3.0.dylib

And sure enough, we see that we're running the expected version in python:

>>> import sqlite3
>>> conn = sqlite3.connect(":memory:")
>>> conn.execute("select sqlite_version()").fetchone()
('3.43.0',)
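Since the feature set matters as much as the version, the same connection can also list the options the library was compiled with, via pragma compile_options. The helper below is a small sketch using only the standard library (sqlite_info is a hypothetical name); SQLite reports the options without their SQLITE_ prefix.

```python
import sqlite3


def sqlite_info() -> dict:
    """Return the version and compile options of the SQLite library loaded by Python."""
    conn = sqlite3.connect(":memory:")
    try:
        version = conn.execute("select sqlite_version()").fetchone()[0]
        # one row per compile-time option, e.g. ENABLE_FTS5 when full text search is built in
        options = [row[0] for row in conn.execute("pragma compile_options")]
    finally:
        conn.close()
    return {"version": version, "compile_options": options}


info = sqlite_info()
print(info["version"])
print("ENABLE_FTS5" in info["compile_options"])
```

Running it on each environment is a quick way to spot a machine whose SQLite lacks full text search.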

Pinning the sqlite version by vendoring the compiled library

To pin the sqlite version across all environments and OSes, we can compile these shared/dynamically loaded libraries ourselves for all architectures we plan to support, vendor them in our codebase, and inject them into our application via LD_PRELOAD.

We'd need to cover all the ways we run the app:

running make run, which runs the app on the host, against the version of libsqlite3 installed by the package manager
running make docker-run, which runs the application in a docker container against the libsqlite3 version available through the image OS package manager
running make test in CI (GitHub Actions), which runs the tests against the libsqlite3 version made available by the runner OS package manager

Compiling the sqlite source code into a shared library was easy, as Simon Willison had already documented the process.

Compiling `libsqlite3` for linux

The following script compiles libsqlite3 for linux, with full text search enabled:

# scripts/compile-libsqlite-linux.sh
#!/usr/bin/env bash
set -e

apt-get install -y build-essential wget tcl

# link associated with sqlite 3.42.0, found on https://www.sqlite.org/src/timeline?t=version-3.42.0
# pointing to https://www.sqlite.org/src/info/831d0fb2836b71c9
sqlite_ref=831d0fb2
wget https://www.sqlite.org/src/tarball/${sqlite_ref}/SQLite-${sqlite_ref}.tar.gz
tar -xzvf SQLite-${sqlite_ref}.tar.gz
pushd SQLite-${sqlite_ref}

CPPFLAGS="-DSQLITE_ENABLE_FTS5" ./configure
make

popd
mv SQLite-${sqlite_ref}/.libs/libsqlite3.so ./lib/
rm -r SQLite-${sqlite_ref}.tar.gz SQLite-${sqlite_ref}

Compiling `libsqlite3` for macOS

The following script compiles libsqlite3 for macOS, with full text search enabled:

# scripts/compile-libsqlite-macos.sh
#!/usr/bin/env bash

set -eu

sqlite_version=3420000

wget https://www.sqlite.org/2023/sqlite-amalgamation-${sqlite_version}.zip
unzip sqlite-amalgamation-${sqlite_version}.zip
pushd sqlite-amalgamation-${sqlite_version}

gcc -dynamiclib sqlite3.c -o libsqlite3.0.dylib -lm -lpthread -DSQLITE_ENABLE_FTS5

popd
mv sqlite-amalgamation-${sqlite_version}/libsqlite3.0.dylib ./lib/
rm -r sqlite-amalgamation-${sqlite_version}.zip sqlite-amalgamation-${sqlite_version}

Compiling the right version on-demand

We then define a $(libsqlite) make target, either pointing to lib/libsqlite3.so if you run the app on linux, or lib/libsqlite3.0.dylib if you run it on macOS. We finally make sure to override the system shared library with the vendored one when running the app, via LD_PRELOAD on linux and DYLD_LIBRARY_PATH on macOS.

# Makefile

UNAME_S := $(shell uname -s)
PWD := $(shell pwd)
ifeq ($(UNAME_S),Linux)
    libsqlite = lib/libsqlite3.so
    ld_preload = LD_PRELOAD=$(PWD)/$(libsqlite)
else ifeq ($(UNAME_S),Darwin)
    libsqlite = lib/libsqlite3.0.dylib
    ld_preload = DYLD_LIBRARY_PATH=$(PWD)/lib
endif

app-root = dnd5esheets
poetry-run = $(ld_preload) poetry run

lib/libsqlite3.so:
    @./scripts/compile-libsqlite-linux.sh

lib/libsqlite3.0.dylib:
    @./scripts/compile-libsqlite-macos.sh

build: $(libsqlite) ...

test:
    @$(poetry-run) pytest

run: build ...
    @cd $(app-root) && $(poetry-run) uvicorn --factory $(app-root).app:create_app --reload
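The ifeq block at the top of that Makefile is the only platform-specific part. For readers more at ease with Python than with make conditionals, here is the same selection logic as a sketch (preload_env is a hypothetical helper; the library paths and variable names are the ones used in the Makefile):

```python
import os
import platform


def preload_env(system: str, project_root: str) -> dict[str, str]:
    """Return the environment variable that makes the vendored libsqlite3 win."""
    lib_dir = f"{project_root}/lib"
    if system == "Linux":
        # LD_PRELOAD takes the path of the shared object itself
        return {"LD_PRELOAD": f"{lib_dir}/libsqlite3.so"}
    if system == "Darwin":
        # DYLD_LIBRARY_PATH takes the directory containing libsqlite3.0.dylib
        return {"DYLD_LIBRARY_PATH": lib_dir}
    # any other platform: leave the environment untouched
    return {}


# platform.system() returns "Linux" or "Darwin", like `uname -s`
env = {**os.environ, **preload_env(platform.system(), os.getcwd())}
```

The env dictionary could then be passed to subprocess.run to start poetry run pytest with the vendored library.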

Compiling `libsqlite3` in docker

While the previous steps work, they also prove to be quite brittle, as they only work for a given CPU architecture. For example, the libsqlite3.0.dylib library will not load on an Intel Mac if it was compiled on an M1 or M2.

The most robust way to go remains building libsqlite3 in a build stage of the docker image process. This way, you know that you only need to build it for linux, whatever the host OS is, and you're guaranteed that it will be built for your CPU architecture, thanks to the multi-arch property of the python:3.11.4-slim base image.

# Dockerfile
...
# -- Build the libsqlite3.so shared object for the appropriate architecture
FROM python:3.11.4-slim AS sqlite-build

WORKDIR /app/src/build

COPY scripts/compile-libsqlite-linux.sh ./
RUN apt-get update && \
    apt-get install --no-install-recommends -y build-essential wget tcl && \
    ./compile-libsqlite-linux.sh && \
    apt-get remove -y build-essential wget tcl && \
    apt-get auto-clean


# -- Main build combining the FastAPI and compiled frontend apps
FROM python:3.11.4-slim
...
COPY --from=sqlite-build /app/src/build/libsqlite3.so ./lib/libsqlite3.so
CMD ["./start-app.sh"]

# start-app.sh
#!/bin/bash

set -e

# inject the LD_PRELOAD environment variable in the process
exec \
    env LD_PRELOAD=./lib/libsqlite3.so \
    uvicorn --factory dnd5esheets.app:create_app --host "0.0.0.0" --port 8000

Unit testing the SQLite version and feature set

With all of that said and done, we can now expose the sqlite version and compilation options through a debug API handler:

# dnd5esheets/api/debug.py
from fastapi import APIRouter, Depends
from sqlalchemy import text
from sqlalchemy.ext.asyncio import AsyncSession

from dnd5esheets.db import create_scoped_session

debug_api = APIRouter(prefix="/debug")


@debug_api.get("/sqlite")
async def sqlite_info(
    session: AsyncSession = Depends(create_scoped_session),
):
    """Return debug information about the sqlite database"""
    version = (await session.execute(text("select sqlite_version()"))).scalar_one()
    pragma_compile_options = (
        (await session.execute(text("pragma compile_options"))).scalars().all()
    )
    return {
        "version": version,
        "compile_options": pragma_compile_options,
    }

We can then query the sqlite version through the API:

❯ curl -s localhost:8000/api/debug/sqlite | jq .version
"3.42.0"

However, we can go even further! By unit-testing the version and compile options, we ensure that our CI uses the exact required sqlite version and feature set.

# dnd5esheets/tests/test_api_debug.py

def test_sqlite_version(client):
    sqlite_debug_info = client.get("/api/debug/sqlite").json()
    assert sqlite_debug_info["version"] == "3.42.0"
    assert "ENABLE_FTS5" in sqlite_debug_info["compile_options"]

See the effect of vendoring the compiled library in CI: before / after.

How to profile a FastAPI asynchronous request

2023-08-05T00:00:00+02:00

I have been experimenting with FastAPI recently, a Python API framework self-describing as "high performance, easy to learn, fast to code, ready for production".

One of the features I wanted my project to have is to be fully asynchronous, from the app server to the SQL requests. As the API is mostly I/O bound, this would allow it to handle many concurrent requests with a single server process, instead of starting a thread per request, as is commonly seen with Flask/Gunicorn.

However, this poses a challenge when it comes to profiling the code and interpreting the results.

The limitations of `cProfile` when profiling asynchronous code

For example, the following graph representation was generated from a cProfile profile recording 300 consecutive calls to a single API endpoint, with an associated get_character handler.

Zooming in, we notice 2 things about the get_character span:

its ncalls value is 9605, when we really called it 300 times
it is free-floating, completely unlinked from any other span

As an asynchronous function is "entered" and "exited" by the event loop at each await clause, cProfile sees every re-entry as an additional call, which inflates the ncalls numbers. Indeed, we await every time we perform an SQL request, commit or refresh the SQLAlchemy session, or do anything else inducing asynchronous I/O. Secondly, the get_character span appears to be free-floating because it is executed outside of the main thread, by the Python event loop.
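The inflated ncalls can be reproduced without FastAPI. In the sketch below, get_character is a stand-in coroutine (not the project's handler) in which each await asyncio.sleep(0) yields to the event loop, the way an SQL query would. It is called exactly once, yet cProfile should report more than one call for it.

```python
import asyncio
import cProfile
import pstats


async def get_character():
    # each await hands control back to the event loop, which later resumes us
    for _ in range(3):
        await asyncio.sleep(0)


def recorded_ncalls() -> int:
    """Run get_character() once under cProfile and return the ncalls it recorded."""
    profiler = cProfile.Profile()
    profiler.enable()
    asyncio.run(get_character())
    profiler.disable()
    stats = pstats.Stats(profiler).stats
    # keys are (filename, line, function name); the second value is the total call count
    return sum(
        values[1] for key, values in stats.items() if key[2] == "get_character"
    )


print(recorded_ncalls())
```

Every resumption after an await is counted as a new call, which is how 300 requests can turn into several thousand reported calls.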

This means that our good old faithful cProfile might not cut it for this inherently asynchronous server, and we need a more modern profiler with builtin asynchronous support if we want to really make sense of where time is spent during a request.

Enter pyinstrument!

pyinstrument is a statistical profiler, contrary to cProfile, which is deterministic.

Deterministic profiling is meant to reflect the fact that all function call, function return, and exception events are monitored, and precise timings are made for the intervals between these events (during which time the user’s code is executing). In contrast, statistical profiling [...] randomly samples the effective instruction pointer, and deduces where time is being spent. The latter technique traditionally involves less overhead (as the code does not need to be instrumented), but provides only relative indications of where time is being spent.

Source

Second, it advertises native support for profiling asynchronous python code:

pyinstrument can profile async programs that use async and await. This async support works by tracking the context of execution, as provided by the built-in contextvars module.

When you start a Profiler with the async_mode enabled or strict (not disabled), that Profiler is attached to the current async context.

When profiling, pyinstrument keeps an eye on the context. When execution exits the context, it captures the await stack that caused the context to exit. Any time spent outside the context is attributed to the that halted execution of the await.

Source

This should allow us to get a sensible picture of where time is spent during the lifespan of a FastAPI request, while also skipping the spans that are too fast to be profiled.

Integrating pyinstrument with FastAPI

We rely on the FastAPI.middleware decorator to register a profiling middleware (only enabled if the PROFILING_ENABLED setting is set to True) in charge of profiling a request if the profile=true query argument is passed by the client.

By default, this middleware will generate a JSON report compatible with Speedscope, an online interactive flamegraph visualizer. However, if the profile_format=html query argument is passed, then a simple HTML report will be dumped to disk instead.

from typing import Callable

from fastapi import FastAPI, Request
from pyinstrument import Profiler
from pyinstrument.renderers.html import HTMLRenderer
from pyinstrument.renderers.speedscope import SpeedscopeRenderer


def register_middlewares(app: FastAPI):
    ...
    if app.settings.PROFILING_ENABLED is True:

        @app.middleware("http")
        async def profile_request(request: Request, call_next: Callable):
            """Profile the current request

            Taken from https://pyinstrument.readthedocs.io/en/latest/guide.html#profile-a-web-request-in-fastapi
            with small improvements.

            """
            # we map a profile type to a file extension, as well as a pyinstrument profile renderer
            profile_type_to_ext = {"html": "html", "speedscope": "speedscope.json"}
            profile_type_to_renderer = {
                "html": HTMLRenderer,
                "speedscope": SpeedscopeRenderer,
            }

            # if the `profile=true` HTTP query argument is passed, we profile the request
            if request.query_params.get("profile", False):

                # The default profile format is speedscope
                profile_type = request.query_params.get("profile_format", "speedscope")

                # we profile the request along with all additional middlewares, by interrupting
                # the program every 1ms and recording the entire stack at that point
                with Profiler(interval=0.001, async_mode="enabled") as profiler:
                    response = await call_next(request)

                # we dump the profiling into a file
                extension = profile_type_to_ext[profile_type]
                renderer = profile_type_to_renderer[profile_type]()
                with open(f"profile.{extension}", "w") as out:
                    out.write(profiler.output(renderer=renderer))
                return response

            # Proceed without profiling
            return await call_next(request)

You can browse the project code to see how the middleware is wired into the application itself.

Let's see the results

HTML profile

Speedscope profile

We see pretty clearly the different SQL requests being performed (the execute spans), the different await clauses in the code causing the event loop to pause the execution, and that most of the request time is spent in SQL requests.

Finally, using this setup, I was able to observe the effects of replacing the json stdlib library with orjson when deserializing JSON data from the database, and speed up this endpoint by a couple of percent very easily.

Preventing a pull request from being merged until it's safe

2023-07-25T00:00:00+02:00

Sometimes, a pull request is ready to go, but shouldn't be merged before some other changes are merged first. While the patch is valid on its own, it might depend on other changes, and could even break the application if merged before the other.

I'll demonstrate a simple technique relying on GitHub Actions and pull request labels to fully block a pull request from being merged until deemed safe (at least without some admin privileges on the repository).

First, we introduce a GitHub Actions workflow executed when a pull request is opened, labeled or unlabeled. This workflow will fail if the pull request is labeled with do not merge.

name: Check do not merge

on:
  # Check label at every push in a feature branch
  push:
    branches-ignore:
      - main
  # Check label during the lifetime of a pull request
  pull_request:
    types:
    - opened
    - labeled
    - unlabeled

jobs:
  fail-for-do-not-merge:
    if: contains(github.event.pull_request.labels.*.name, 'do not merge')
    runs-on: ubuntu-latest
    steps:
      - name: Fail if PR is labeled with do not merge
        run: |
          echo "This PR can't be merged, due to the 'do not merge' label."
          exit 1
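The if: expression is the heart of this workflow. As a sketch of how I understand it to evaluate (should_fail is a hypothetical name, and the dictionaries mimic the shape of the event payload), labels.*.name collects the name of every label on the pull request, and contains checks whether do not merge is one of them, ignoring case. On push events the payload has no pull_request, so the job is simply skipped.

```python
def should_fail(event: dict, label: str = "do not merge") -> bool:
    """Mimic `contains(github.event.pull_request.labels.*.name, label)`."""
    # push events carry no pull_request object: nothing to check, the job is skipped
    pull_request = event.get("pull_request") or {}
    names = [entry["name"].lower() for entry in pull_request.get("labels", [])]
    # contains() is documented as case-insensitive, hence the lower() calls
    return label.lower() in names
```

For example, should_fail({"pull_request": {"labels": [{"name": "do not merge"}]}}) is True, while a payload without that label gives False.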

We then define a branch protection rule for our main branch, by going to the repository Settings, then Branches. We add a new rule if none exists, tick Require status checks to pass before merging, and add fail-for-do-not-merge to the list of required checks.

Finally, apply the do not merge label to your pull request.

At that point, the fail-for-do-not-merge check will run and fail, preventing the PR from being merged.

When the pull request is finally safe to merge, simply remove the do not merge label, and the checks will automagically pass, thus allowing you to merge.

Neapolitan pizza dough recipe

2023-06-09T00:00:00+02:00

Whenever I'm baking pizzas, the first step is always to remember the dough recipe, and adjust for the number of pies I'm making. I have probably written it on at least 10 pieces of paper, that all ended up in the trash.

I have created a small pizza dough recipe calculator so I don't have to go through these steps again.

Merging multiple mp3 files into an audiobook with chapters

2023-05-08T00:00:00+02:00

I recently found the 3 Lord of the Rings audiobooks I bought from Phil Dragash some years back, as I was digging through my NAS. Each book is split into about 20 mp3 files, which makes it a bit unwieldy for me. As I mostly listen to audiobooks when I'm going to sleep, I oftentimes have to find the last part I remember listening to and start again from there the next day.

Luckily, BookPlayer solves this for me, via its "Stop after this chapter" feature. However, to import these books into the app, I needed to merge the mp3 files into a full-fledged audiobook m4b file, with chapter metadata.

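After digging a little bit, this is what I found. The trailing comments are annotations only: a comment placed after a line-continuation backslash breaks the command, so strip them before running it.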

$ docker run \
    -it \  # to see the output of the containerized process in the terminal
    --rm \  # delete the container once the conversion ends
    -u $(id -u):$(id -g) \  # Use the same UID and GID as on the host to avoid permission issues
    -v "$(pwd)/audiobooks":/mnt \  # mount the ./audiobooks folder into /mnt in the container via a docker volume
    sandreas/m4b-tool:latest \  # cf https://hub.docker.com/r/sandreas/m4b-tool
        merge \  # subcommand in charge of merging the mp3 files into a single m4b file
            "/mnt/The Fellowship of the Ring" \  # directory in which the audio files are located
            --output-file "/mnt/The Fellowship of the Ring.m4b" \  # name of the generated m4b file
            --series "The Lord of the Rings" \  # name of the book series
            --name="The Fellowship of the Ring" \  # title of the book
            --series-part=1 \  # book number in the series
            --artist "J.R.R Tolkien" \  # writer's name
            --albumartist="Phil Dragash" \  # narrator's name
            --use-filenames-as-chapters \  # generate a chapter per mp3 file
            --cover "/mnt/The Fellowship of the Ring/cover.jpg" \  # path to the cover image
            --jobs=8 \  # I used number of CPUs - 2
            --audio-channels=2 \  # 1=mono, 2=stereo
            --audio-samplerate=44100  # I used the same as in the input files

Here's the result after I imported the resulting m4b file into BookPlayer.

Generating pretty maps ready to be gift-wrapped

2023-05-06T00:00:00+02:00

I have been toying with the idea of generating visually pleasing maps centered on a given address, to have them printed and framed. The way I see it, it would make an original and personalised gift for the person living there. So when Marcelo de Oliveira Rosa Prates' prettymaps blew up on Reddit, I decided to try it.

The library was great and the visuals looked incredible, yet I felt it was lacking a couple of features if I were to print the maps:

a CLI to make it easy to generate maps on the fly
easily changing the color scheme of buildings (and allowing black and white)
enabling the generation of rectangular maps, on top of circle and square
changing the output format of the figure to make it fit into a standard page (A3, A4, etc)
ensuring a 300dpi output
setting the CLI command used to generate the map as the map title, for autodocumentation purposes

My good friend Etienne solved the rectangular map generation in a beautifully laid out PR, which has sadly been sitting there for a while without attention. It seems that the repository owner had issues with NFT con "artists", and pretty much abandoned the project, which hasn't seen activity for the last 5 months.

Seeing this, I decided to fork the project, and work on the remaining ideas.

Here are a couple of examples of maps that I've generated and printed for people in my entourage.

The command used to generate each map is displayed as the map title

The color schemes are only applied to buildings, and are automatically generated from matplotlib colormaps. This was a quick and easy way to generate themes "for free". I also added a couple of Scottish tartan inspired themes, which I used to print a map as a wedding gift for a lovely Franco-Scottish couple.

My local printer bills me about 1.5€ for each print, which makes for an original and yet remarkably cheap gift. I recommend a thick and matte paper, without any texture, as a texture might clash with the map's dotted background.

If you'd like to give it a try, feel free to have a look at the repository!

Monitoring my solar panel power production

2023-05-03T00:00:00+02:00

I have recently acquired two solar panels from Sunology advertising a combined instantaneous production of up to 810W. The panels come with a smart plug emitting the data to Tuya, in order to retain and graph historical data. However, the only available granularity for that data is daily kWh production. In order to optimize the orientation and placement of the panels, as well as measure the production efficiency (power produced / 810 * 100), I needed a much finer granularity than that. I decided to query the data myself and send it to Datadog.
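The efficiency figure is a plain percentage of the advertised peak. As a small sketch (production_efficiency and PEAK_POWER_W are hypothetical names, and the instantaneous power is assumed to be expressed in watts):

```python
PEAK_POWER_W = 810  # advertised peak production of the two panels, in watts


def production_efficiency(power_w: float, peak_w: float = PEAK_POWER_W) -> float:
    """Return the instantaneous production as a percentage of the advertised peak."""
    if peak_w <= 0:
        raise ValueError("peak_w must be positive")
    return power_w / peak_w * 100
```

Panels producing 405W at a given instant would thus be running at 50% of their advertised peak.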

The first thing I needed to do was to find a working client that would be able to talk to the plug. It seemed that tinytuya would do the job. However, it didn't seem like I could simply fetch the data from the plug locally. Instead, I first needed to create a Tuya account, a cloud project, and add the plug to the project devices to get both an API key as well as a key for the plug. That proved to be quite tedious, as the Tuya IoT interface is very confusing and slow, but I managed thanks to these Home-Assistant instructions.

With that data now available, I was then able to set up the tinytuya client on a Raspberry Pi with network access to the plug IP.

$ python -m tinytuya wizard
TinyTuya Setup Wizard [1.12.4]


    Enter API Key from tuya.com: [REDACTED]
    Enter API Secret from tuya.com: [REDACTED]
    Enter any Device ID currently registered in Tuya App (used to pull full list) or 'scan' to scan for one: [REDACTED]
    Enter Your Region (Options: cn, us, us-e, eu, eu-w, or in): eu

>> Configuration Data Saved to tinytuya.json
>> Device Listing
>> Saving list to devices.json
    1 registered devices saved

>> Saving raw TuyaPlatform response to tuya-raw.json

Poll local devices? (Y/n): y

Scanning local network for Tuya devices...
    1 local devices discovered

Polling local devices...
    [Sunology                 ] 192.168.5.171      - [On]  - DPS: {'1': True, '9': 0, '17': 109, '18': 2704, '19': 6491, '20': 2379, '21': 1, '22': 529, '23': 26153, '24': 13705, '25': 3040, '26': 0}

>> Saving device snapshot data to snapshot.json


>> Saving IP addresses to devices.json
    1 device IP addresses found

Done.

At that point, the tinytuya wizard script had scanned the networks the Pi was connected to, found the plug, and was able to connect to it via the provided device key.

I then created a dedicated APP/API keypair on Datadog, and scheduled this Python script to run every minute via cron.

# Run every minute via this crontab
# * * * * * cd /home/br/tuya && /home/br/tuya/.env/bin/python exporter.py

import json
import time

import datadog
import tinytuya

datadog.initialize(
    api_key="[REDACTED]",
    app_key="[REDACTED]",
)

with open("devices.json") as device_file:
    device_data = json.load(device_file)

plug = tinytuya.OutletDevice(
    dev_id=device_data[0]["id"],
    address=device_data[0]["ip"],
    local_key=device_data[0]["key"],
    version=3.3,
)

plug_status = plug.updatedps()
data = plug_status["dps"]

now = time.time()
metrics = []
if "18" in data:
    current = data["18"]  # mA
    metrics.append(
        {
            "metric": "solarpanel.current",
            "type": "gauge",
            "points": [(now, current)],
            "tags": ["location:terrasse_1"],
        }
    )

if "19" in data:
    power = data["19"] / 10.0  # W
    metrics.append(
        {
            "metric": "solarpanel.power",
            "type": "gauge",
            "points": [(now, power)],
            "tags": ["location:terrasse_1"],
        }
    )

if "20" in data:
    voltage = data["20"] / 10.0  # V
    metrics.append(
        {
            "metric": "solarpanel.voltage",
            "type": "gauge",
            "points": [(now, voltage)],
            "tags": ["location:terrasse_1"],
        }
    )

datadog.api.Metric.send(metrics=metrics)
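
The three `if` blocks above all follow the same pattern, so one possible refactoring is to drive them from a small lookup table. This is only a sketch: `DPS_METRICS` and `build_metrics` are hypothetical names, and the sample values are the ones printed by the wizard earlier. Unlike the script above, the current comes out as a float here, because every value goes through a division.

```python
# DPS index -> (metric name, divisor), mirroring the script above
DPS_METRICS = {
    "18": ("solarpanel.current", 1),     # mA
    "19": ("solarpanel.power", 10.0),    # reported in tenths of a watt
    "20": ("solarpanel.voltage", 10.0),  # reported in tenths of a volt
}


def build_metrics(dps: dict, now: float, tags: list) -> list:
    """Turn the raw DPS dict into a list of gauge payloads, skipping missing indexes."""
    metrics = []
    for index, (name, divisor) in DPS_METRICS.items():
        if index in dps:
            metrics.append(
                {
                    "metric": name,
                    "type": "gauge",
                    "points": [(now, dps[index] / divisor)],
                    "tags": tags,
                }
            )
    return metrics


sample = {"1": True, "18": 2704, "19": 6491, "20": 2379}
for metric in build_metrics(sample, now=0.0, tags=["location:terrasse_1"]):
    print(metric["metric"], metric["points"][0][1])
```

The resulting list has the same shape as the `metrics` list built by hand in the script, so it could be passed to `datadog.api.Metric.send` in the same way.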

At that point, the measured current, voltage and power were sent out to Datadog every minute, and I was then able to create the following dashboard:

This dashboard makes it seem like the panels can only hit 75% efficiency, when I have seen them hit 95-99%. This is due to the Datadog point interpolation happening on large time windows. When we focus on a smaller window, we can see these high (albeit brief) peaks.

With that granularity, I realized that the panels only started to really kick in after midday, and that I should probably move them to a spot with more exposure if I wanted to produce more than 4kWh a day (measured on a hot and sunny day without any clouds). That day, I only hit 85% efficiency though, even though I had hit 99% at some point during the previous weeks. That makes me wonder if I need to wash the panels.

Edit: it rained that very night and I did hit 95% efficiency the next day.

Speeding up a 21h job to 8 minutes: a story of SQLAlchemy optimization

2023-01-08T00:00:00+01:00

I have recently published an article on the Alan tech blog walking the reader through how we have reduced the runtime of our longest nightly job from 21 hours to about 8 minutes, by using simple profiling and SQLAlchemy optimizations.

Enjoy the reading!

My DIY Dungeons and Dragons ambiance mixer

2022-09-24T00:00:00+02:00

Table of Contents

Getting started
Reacting to key presses
Sending structured data from the keypad
Playing sounds after a keypress
Let's rub some web on it
The finishing touch
Demo time
I have the hardware! How can I run it?
Closing words

I find that an immersive sound ambiance is key to helping tabletop RPG players engage. It can increase their stress and sense of urgency during a fight, galvanize them during a harrowing speech, or break their heart when they realize they've just lost something and there's no getting it back.

I have been thinking about using a Launchpad to control and mix the ambiance while we play, but the more I read about its design, the less it seemed to fit. The cheapest Launchpad starts at 110€, and it is a full-fledged MIDI controller. What I wanted was something simpler: a way to play different long sound ambiance tracks at the same time, and adjust their respective volume to create an immersive atmosphere.

The project started to take shape when I stumbled upon the Pimoroni RGB Keypad, a 4x4 rainbow-illuminated keypad that I could program using a Raspberry Pi Pico, for a budget of about 30€.

The color and brightness of the LEDs under the keys are programmable, meaning I could go for the look and feel of a Launchpad, while keeping my budget and the overall complexity in check.

The main idea would be to use 12 of the available 16 keys to start and stop audio tracks, and use the 4 remaining keys as controls (increase/decrease volume, pause all tracks).

Getting started

If you, like myself, want to program a Raspberry Pi Pico in Python, you have two options:

It took me a while to figure out that these are more or less the same. In the end, I went with the CircuitPython starting-up guide, and was ready to make these keys light up.

A CircuitPython main program lives in a code.py file, that is executed when the board is plugged in. Any dependency can be put under the lib/ directory, itself placed at the root of the board filesystem.

I downloaded the rgbkeypad.py library, placed it under lib/ and wrote the following program in code.py

from rgbkeypad import RGBKeypad

keypad = RGBKeypad()

# make all the keys red
keypad.color = (255, 0, 0)  # red
keypad.brightness = 0.5

# turn a key blue when pressed
while True:
    for key in keypad.keys:
        if key.is_pressed():
            key.color = (0, 0, 255)  # blue

I then copied code.py and lib/rgbkeypad.py under the CIRCUITPY volume that is mounted when the keypad gets plugged into the computer, and voilà.

Reacting to key presses

Now that I knew how to program the key colors and brightness, and how to detect which keys were being pressed, I still needed a way to map these key events to starting audio tracks, and I was facing an immediate problem: the Pico has no way to play sound, let alone on a Bluetooth-connected speaker. You know what can do all that really well though? My laptop.

So, if I could send messages from the Pico to my laptop (to which the Pico is connected for power anyway) and have a program running on my laptop receive them, I could then start thinking about how to play sounds.

It turns out that this was way easier than I thought, thanks to CircuitPython sending anything print-ed as binary data over the serial port. Using pyserial, I can write a program that connects to the same serial port the Pico is connected to, and receive the data.

# code.py, running on the Raspberry Pi Pico
from rgbkeypad import RGBKeypad

keypad = RGBKeypad()

# make all the keys red
keypad.color = (255, 0, 0)  # red
keypad.brightness = 0.5

# turn a key blue when pressed
while True:
    for key in keypad.keys:
        if key.is_pressed():
            key.color = (0, 0, 255)  # blue
            print(f"Key ({key.x}, {key.y}) was pressed")  # <-- that message will be sent over USB

# usb_listener.py, running on the laptop
from serial import Serial

# /dev/tty.usbmodem14201 is the name of the serial port the Pico was connected to
# on my mac. Your mileage may vary.
usb_device = Serial("/dev/tty.usbmodem14201")
for line in usb_device:
    print(line.decode("utf-8"))

I can now run usb_listener.py and press a key on the keypad to see the following:

$ python usb_listener.py
Key (1, 0) was pressed

Key (1, 0) was pressed

Key (1, 0) was pressed
...

Sending structured data from the keypad

Sending text data is fine, but we should probably send data that can be serialized on the keypad side and deserialized on the event listener side, as we will probably send the key ID and a state (pressed, stop, volume_up, etc). JSON is simple enough, and while the json package isn't available in CircuitPython, it's pretty easy to hand-encode JSON data:

# code.py, running on the Raspberry Pi Pico
from rgbkeypad import RGBKeypad

keypad = RGBKeypad()

# make all the keys red
keypad.color = (255, 0, 0)  # red
keypad.brightness = 0.5

# turn a key blue when pressed
while True:
    for key in keypad.keys:
        if key.is_pressed():
            key.color = (0, 0, 255)  # blue
            key_id = 4 * key.x + key.y
            # plain string (not an f-string), so that the braces are sent as-is
            print('{"key": %d, "state": "pressed"}' % key_id)

# usb_listener.py, running on the laptop
import json

from serial import Serial

# /dev/tty.usbmodem14201 is the name of the serial port the Pico was connected to
# on my mac. Your mileage may vary.
usb_device = Serial("/dev/tty.usbmodem14201")
for line in usb_device:
    print(json.loads(line.decode("utf-8").strip()))
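
To check the hand-encoded format without any hardware plugged in, the round trip can be simulated on the laptop alone. This is a minimal sketch: `encode_key_event` is a hypothetical helper, and the trailing `\r\n` is an assumption about what ends up on the serial line.

```python
import json


def encode_key_event(key_id: int, state: str) -> str:
    # Hand-written JSON: fine as long as `state` holds no quotes or backslashes
    return '{"key": %d, "state": "%s"}' % (key_id, state)


# What the listener would read from the serial port (assumed line ending)
line = (encode_key_event(5, "pressed") + "\r\n").encode("utf-8")

# Same decoding steps as in usb_listener.py
event = json.loads(line.decode("utf-8").strip())
print(event)  # {'key': 5, 'state': 'pressed'}
```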

Playing sounds after a keypress

Playing multiple sounds at the same time in Python isn't something many packages allow you to do simply. In the end, I could only make it reliably work with pygame, which was developed to ease the creation of video games in Python. The package provides us with 2 different APIs to work with sound tracks:

pygame.mixer.music, which allows an audio track to be played while streamed. This was intended to play some background music.
pygame.mixer.Sound, which allows you to play an audio track on a specific audio channel. Multiple Sounds can be played over different audio Channels.

Using pygame.mixer.Sound, we manage to react to a keypress and start the associated audio track:

import pygame
pygame.init()

import json

from serial import Serial
from pygame import mixer
from pygame.mixer import Sound, Channel

key_id_to_audio_tracks = {
    0: Sound("example0.ogg"),
    1: Sound("example1.ogg"),
    2: Sound("example2.ogg"),
}

# reserve one mixer channel per track (Channel takes the channel index)
mixer.set_num_channels(len(key_id_to_audio_tracks))
channels = [Channel(i) for i in range(len(key_id_to_audio_tracks))]

usb_device = Serial("/dev/tty.usbmodem14201")
for line in usb_device:
    key_event = json.loads(line.decode("utf-8").strip())
    key_id = key_event["key"]  # matches the "key" field sent by code.py
    sound = key_id_to_audio_tracks[key_id]
    channel = channels[key_id]
    channel.play(sound)  # will play in the background

(The actual code can be inspected here).

While it works rather well, this approach has a fundamental issue. Because mixer.Sound does not support streaming the sound, as mixer.music does, it requires that all sounds be fully loaded in memory at startup. As the ambiance tracks that I'm planning to use all last between 30 minutes and 2h, the actual load time takes a couple of minutes. Using pygame.mixer.music would solve that issue, except for the fact that it only supports streaming a single audio file at a time.
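
To get a feel for why loading is so slow, here is a back-of-the-envelope estimate of how much memory a fully decoded track would take. It assumes the audio is held as uncompressed 16-bit stereo PCM at 44.1kHz, which is an assumption about the mixer settings and not something I measured.

```python
def decoded_size_bytes(duration_s, sample_rate=44_100, channels=2, bytes_per_sample=2):
    """Size of an uncompressed PCM buffer for a track of the given duration."""
    return duration_s * sample_rate * channels * bytes_per_sample


for minutes in (30, 120):
    size = decoded_size_bytes(minutes * 60)
    print(f"{minutes} min -> {size / 1e9:.2f} GB")
# 30 min -> 0.32 GB
# 120 min -> 1.27 GB
```

Under these assumptions, a dozen long tracks would add up to several gigabytes of decoded audio, on top of the time spent decoding the compressed files.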

I'm only left with mixer.Sound and loading hours of audio files in memory at startup, which means that the whole ambiance would take a lot of time to restart in case of a crash, and the energy around the table might deflate like a soufflé.

Sigh

Back to the whiteboard.

Alright, so, what program do I already have on my laptop that is good at streaming sound? What about an Internet browser? YouTube videos don't have to fully load before they start, and the same goes for audio files, so that might just work! I'd need a way to propagate these key events to a web page, so that it can then start/stop the audio files, change their volume, etc. Enter websockets.

Let's rub some web on it

The mixer would be composed of 3 different systems:

the keypad, running the CircuitPython code
a webpage, listening for key events over a websocket, in charge of playing the audio files and adjusting their individual volume
an HTTP server in charge of receiving the events over USB and propagating them to the websocket (ergo, to the browser), and serving the local audio files to the webpage. I'll use Flask and Flask-sock for this.

So what happens now when I press a key:

a JSON-formatted message is sent from the Pico to the serial port
the message is received by the webserver process, and propagated to the browser on a websocket
the browser deserializes the message, and takes action, depending on the content of the event
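
The middle step, propagating each serial message to the connected browsers, boils down to a fan-out. Here is a minimal standard-library sketch of that idea, assuming one queue per open websocket. The `Broadcaster` class is hypothetical and is not taken from the actual project; in a real server, each websocket handler would forward whatever lands in its queue to its own browser tab.

```python
import json
import queue
import threading


class Broadcaster:
    """Fans out each published message to every subscribed client queue."""

    def __init__(self):
        self._clients = set()
        self._lock = threading.Lock()

    def subscribe(self):
        client_queue = queue.Queue()
        with self._lock:
            self._clients.add(client_queue)
        return client_queue

    def unsubscribe(self, client_queue):
        with self._lock:
            self._clients.discard(client_queue)

    def publish(self, message):
        # Copy the client set so that we don't hold the lock while enqueuing
        with self._lock:
            clients = list(self._clients)
        for client_queue in clients:
            client_queue.put(message)


broadcaster = Broadcaster()
browser_tab = broadcaster.subscribe()  # one queue per open websocket
broadcaster.publish('{"key": 3, "state": "on"}')  # a line read from the serial port
print(json.loads(browser_tab.get(timeout=1)))  # {'key': 3, 'state': 'on'}
```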

The browser-side message handler looks like this:

ws.addEventListener('message', event => {
  const keyEvent = JSON.parse(event.data);
  const usbStatus = document.getElementById("usb_status");

  if (keyEvent.state === "usb_disconnected") {
    usbStatus.textContent = "🔌 🚫";
  } else if (keyEvent.state === "usb_connected") {
    usbStatus.textContent = "🔌 ✅";
  } else if (keyEvent.state === "init") {
    colorizeTracksKbdElements(keyEvent.colors);
  } else if (keyEvent.state === "pause") {
    pauseAllPlayingTracks();
  } else if (keyEvent.state === "unpause") {
    unpauseAllPlayingTracks();
  } else {

    const trackProgressBar = document.getElementById(`progress_track_${keyEvent.key}`);
    const audioElement = document.getElementById(`audio_track_${keyEvent.key}`);

    if (audioElement === null) {
      return;
    }

    switch (keyEvent.state) {
      case "on":
        startTrack(keyEvent.key, audioElement, trackProgressBar);
        break;
      case "off":
        stopTrack(keyEvent.key, audioElement, trackProgressBar);
        break;
      case "vol_up":
        increaseTrackVolume(audioElement, trackProgressBar);
        break;
      case "vol_down":
        decreaseTrackVolume(audioElement, trackProgressBar);
        break;
    }
  }
});

The finishing touch

I have added a couple of features that will help me stay as focused on the storytelling as possible while I'm DMing, instead of thinking about the sound mixing process:

a configuration-based tagging system, allowing me to get reminded of the main features for each individual track (is that an ambiance or combat music? Is it dark, light, oppressive, eerie, etc?)
I'm also propagating the key colors to the associated volume bar, allowing me to quickly identify the key that I need to press to start/pause/adjust a given audio track.

The key colors were generated from iwanthue and are stored in the COLORS list, in code.py. Any changes to the colors will be reflected in the web UI, as they are advertised to the web-server and propagated to the UI when the keypad starts.

Demo time

I have the hardware! How can I run it?

Getting started instructions are available here for Windows users, and here for macOS and Linux users.

Don't hesitate to read the comments if you have any doubt, as a fair share of questions have already been answered there.

Once you have everything running, you can:

press one of the 12 track keys to start/stop each individual sound track
press the volume up/down key and a track key at the same time to increase/decrease the volume of the associated track
press the pause key and a track key at the same time to pause/restart the associated track
press the pauseAll key to pause/restart all tracks that were currently playing

Closing words

The final iteration of that project is available here (for the keypad code) and here (for the webserver and webapp code). The black casing was 3D-printed using the rgb_keypad_-_bottom.stl file from this Thingiverse model.

I am grateful to Tom's Hardware, Adafruit, all3dp, hackster, Game News 24, msn and weareteachers for having featured and shared this project with their audience.

Can't enough be enough?

2022-05-11T00:00:00+02:00

I left Datadog 2 weeks ago, after 5 intense and incredible years. When I joined, we were about 300 people strong, whereas the current headcount is now approaching 4000. If you have never experienced exponential growth, this is about as close as you can get to it! This means that we were close to doubling in size each year, whether in headcount, infrastructure size, number of teams, or complexity.

About 2.5 years after I was hired, Datadog became a publicly traded company.

In this article, I will explain the impact this had on me, both financially and psychologically, as transparently as I can. The intention is to examine how such an event can change one's life, positively and not, and give you some feedback on the choices that I made.

This is a weird and personal article. It is about the stock market, how stock options work, psychological paralysis, burn-out and life choices. I hope some of it can be useful to you, but really, this is also something I needed to write for my own catharsis.

How it started

When I joined, back in 2017, the first thing I had to do was choose between 3 compensation packages:

higher salary and lower equity (36000 stock options)
medium salary and medium equity (48000 stock options)
lower salary and higher equity (60000 stock options)

I chose the first one, as I didn't really understand what stock options were. I held the financial world pretty much in contempt, and chose what I could understand: actual money in my bank account at the end of the month. I felt that there was about a 0% chance these stock options would be worth anything anyway, so choosing the highest salary was the safest move I could make.

Wait. What is a stock option anyway?

Finance is fraught with lingo. Yes, possibly even more than technology. So before diving into how the stock market affected my psyche, let's try to define a couple of terms.

A stock is a financial instrument representing the ownership of a fraction of a corporation. These shares are bought and sold on stock exchanges (e.g. Nasdaq, Euronext, etc). For example, should I want to, I could currently buy a share of Amazon.com Inc. (referenced as AMZN on the market) for 2,107.44 USD on the Nasdaq, which would make me a (tiny) shareholder of Amazon. The price of said share varies according to supply and demand, basically.

There are multiple reasons why one would want to hold stocks: either the stocks they own give them voting rights at the company annual meetings, allowing them to influence how the company is managed, or maybe they hope to make a profit by selling at a higher price than the one they bought the stock at.

Now, onto stock options. A stock option is the opportunity to acquire a stock at a guaranteed reduced price. That reduced price is called the strike price, and should be part of your employment contract. In my case, that strike price was $0.85. That meant that should I want to acquire one of these 36000 stocks, I needed to give Datadog 85 US cents.

The word option really means that you can decide to purchase these stocks coming with a discount, but you don't have to. You have the option to do it.

Obviously, companies don't grant all stock options to their employees immediately after hiring, because new employees could decide to stay for a couple of days, pocket all their stocks and then move on. What happened in my case (which I hear is pretty common, really) is that I unlocked (the real term is vest) stock options according to a vesting calendar. I didn't get anything for a whole year, and then I unlocked (vested) 25% of my stocks in one go. It's called a one year cliff. After that, I vested 6.25% at the end of every quarter for the remaining 3 years.
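
The vesting calendar described above can be sketched in a few lines. This is a simplified model that assumes exactly 12 months before the cliff and a quarter every 3 months afterwards; the function name is made up for the example.

```python
def vested_options(total: int, months_elapsed: int) -> int:
    """Number of vested options under a 4 year calendar with a one year cliff."""
    if months_elapsed < 12:
        return 0  # nothing vests before the cliff
    quarters_after_cliff = min((months_elapsed - 12) // 3, 12)
    # 25% at the cliff is 4/16 of the grant, and each later quarter adds 6.25%, or 1/16
    return total * (4 + quarters_after_cliff) // 16


for months in (11, 12, 15, 48):
    print(months, vested_options(36_000, months))
# 11 0
# 12 9000
# 15 11250
# 48 36000
```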

Once a stock option was vested, I could then wire money to Datadog and acquire the stock at the reduced price. This is called exercising the stock option. At that point, the stock options were really converted into a stock, of which I was the owner.

Phew. Let's recap.

By staying at Datadog, I had the opportunity to regularly wire my employer money in order to acquire stocks (i.e. to become a shareholder) at a reduced price, according to a 4 year calendar, in the hope of making a profit later.

strike price = f(risk)

The central notion here is risk. If you join a startup in its infancy, the probability of you turning a profit on your stock options is infinitesimal. To counteract the odds, you will probably get a very low strike price and many stocks, whereas if you join a company on the verge of going through its IPO, you will probably be given fewer stocks at a much higher strike price. The reason is simple: companies want to reward employees who took the risk of buying in early.

In my case, I joined Datadog when the probability of an IPO was still very low, which was reflected in my strike price.

What will buy you bread vs what might buy you a house

Fast forward 2 years. There are now more and more internal rumors about a potential upcoming IPO. These rumors culminate into the subject being publicly discussed in an all-hands. We are told that we are indeed going through the IPO filing process, which could take many more months before it comes through, if it does. One point is hammered in: nothing is sure at that point. Everything could still fail.

Immediately after the announcement, a seemingly never-ending stream of questions is raised by employees. What strikes me is that every question asked by one of our American colleagues seems well-informed. Many of them seem to have gone through an IPO before, and even those who have not seem to understand how these things work. The same cannot be said for my French colleagues and myself. We are collectively clueless. At that point, I hadn't even exercised a single stock option, as I was still fearful of committing thousands of euros to what could be a pipe dream.

I decide to ask one of my American teammates for advice. When I tell him that I still haven't exercised anything, he pauses for a second, and then proceeds to tell me the following.

Look. I'm not going to tell you what to do, but here's what I do. Every time I vest, I exercise immediately after. Every time. My salary is what buys me bread. My stocks are what might buy me a house.

After that conversation, I started to dig into the relationship between the exercise date and taxes, and proceeded to exercise everything that I had vested until then once things became clearer.

Hey Mr Taxman

Everything I say here applies to my understanding of the French tax code. I am not a lawyer. Do not take this as financial advice.

To understand why my colleague would always exercise right after his vesting date, you first need to understand how stocks are taxed. The way this works in France is pretty similar to the way the IRS does it in the US. If you live in Cyprus, Paraguay or any other tax haven, you don't pay any tax on stocks, which is good for you and sad for your hospitals and roads.

There are 2 things to consider:

if you exercise a stock option, you acquire a stock at a reduced price. You virtually made money there, because you should have paid more for the stock, meaning you will pay taxes on this virtual gain. This is called the acquisition tax ("gain d'acquisition" in French).
if you make a profit selling your stock, you will pay taxes on said profit. This is called profit tax ("gain de cession" in French).

To understand how this works, let's take an example. Say my strike price is set to $1. I exercise a stock option when the value of the associated stock is $10, and I then sell it later on the market for $40.

I will pay acquisition taxes on the $9 difference between the regular market price and the strike price
I will pay profit taxes on the $30 difference between the sell price and the regular market price at the time of the purchase

This means that the sooner I exercise, the smaller the difference between the exercise and strike price should be, meaning the smaller my acquisition tax will be in the end (following the hypothesis that the stock price does nothing but grow, which was true for us for a while).
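
The example above, and the effect of exercising earlier, can be written down as a small sketch. It only computes the two taxable bases per share. Actual tax rates are deliberately left out, and the function name is made up for the example.

```python
def taxable_gains(strike, price_at_exercise, sale_price):
    """Return the (acquisition gain, sale profit) per share, before any tax rate is applied."""
    acquisition_gain = price_at_exercise - strike
    sale_profit = sale_price - price_at_exercise
    return acquisition_gain, sale_profit


# The example above: strike at $1, exercised at $10, sold at $40
print(taxable_gains(1, 10, 40))  # (9, 30)

# Exercising earlier, when the stock was worth $2: the acquisition gain shrinks
print(taxable_gains(1, 2, 40))  # (1, 38)
```

The total gain is $39 per share in both cases. Exercising earlier only shifts it from the acquisition gain to the sale profit.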

In the case of a stock option related to a stock that is not publicly traded yet (pre-IPO), the "regular market price" considered when calculating the acquisition tax is set to the stock FMV. The FMV is a theoretical price the stock would have, according to some independent third party appraiser, that is regularly updated.

In our case, the FMV was updated every quarter and did nothing but go up until the actual IPO. The initial reasoning stood: the earlier my coworker exercised, the less acquisition tax he ended up paying.

At that point, the FMV was at about $9 and I decided to follow his advice.

Liftoff

This is the point in the article where I stop boring you with financial minutiae and start getting into how the IPO process affected me psychologically.

The IPO went really well. DDOG went from $27 to about $42 in a single day, and everyone celebrated. The trouble started the next day, when I configured my mac to display the stock current value in a sidebar widget. If you're at all familiar with addictology, this is where you start wincing hard.

I can't overstate how much of a bad idea this is. Having the feeling of "winning" or "losing" multiple times a day is addictive. The whole thing felt like a game, and I started to check my "Potential Benefit Value" in etrade several times a month. The numbers were in the 7 digits, and felt unreal.

Let's pause for a second, and imagine yourself sitting at a casino table. You're off to a strong start, the odds are in your favor, and your chip pile grows pretty fast. Now, until you cash out, these chips are monkey money. They are worth nothing, and only become worth something if you decide to take them off the table. You've won most of your games, so every time you lose one, you convince yourself to stay at the table to try to at least win back your losses. But then you lose some more, but hey, you should not back out now when you were winning so much not too long ago. On and on, in a loop. And so you stay at the table.

And that, dear reader, is why I think the casino pretty much always wins.

Here are a couple of things I learned in the last years, that were paramount in fighting off that psychological paralysis:

The money you have invested on the market is not real money. It's worth nothing until you sell.
Never put money in the market you can't afford to lose.
Know when to check out. This means knowing what you would like to use that money for, how much your plan would require and selling when you reach it.

The money that could buy bought me a house

In 2020, we were collectively struck by The Great Plague, and everybody was stuck inside. At that point, I realized that I had the golden opportunity of being able to buy a house in the area that my fiancee and I dreamt of living in, instead of being boxed in a small flat.

And what happened then was... nothing. I was looking at other tech-company stocks that were benefiting from the lockdown, such as Zoom, Docusign, Shopify, etc, and they were miles ahead of where Datadog was. All I had to do was wait! (rubs hands).

This is when my fiancee kind of lost it with my shenanigans, and told me that we could be living our dream today instead of waiting for... what exactly? More money? To do what?

Can't enough be enough?

she told me.

At that point, I knew that however high the stock price was, I was going to be too paralyzed to do anything else than look at monkey money numbers anyway. And so I estimated how much cash I'd need to cover the house as well as the acquisition and profit taxes, kept a healthy margin in case my tax estimates were wrong (remember the thing about me not being a fiscal advisor?) and for the inescapable renovation work that would need to be done, and sold about 60% of my total stocks at $66.6 (hell yeah).

And just like that, I had enough to afford our dream, pay the taxes on it, and support my close family.

"Now, what about the remaining 40%?" an astute reader might ask? Well, that I could afford to lose, and didn't have any specific plan in mind for. They are still in the market, and are worth a pretty hefty sum of money. I didn't feel like I needed to convert them into cash for anything. If their value rises, good, if not, it might rise again, who knows?

And with that, I was done. Or so I thought.

Just when I thought I was out, they pull me back in.

Do you know what happened at the end of my 4 year vesting period? Here's what I thought was going to happen: nothing. However, what really happened was an impromptu conversation with my Director, telling me that Datadog was giving me a refresher, in the form of a 4 year vesting with a one-year cliff calendar for about 2000 RSU.

Aaand, back to financial minutiae just for a bit. RSU are "free stocks" the company gives you. You don't have to buy them (contrary to stock options). So you get new stocks just by staying around and doing your job. As they are way less risky than stock options, because the company has already IPO-ed, you also get less of them.

Where things started to get really psychologically tricky for me, is when 2 events coincided:

I started to feel the symptoms of a burn-out, as I was working pretty hard, and had to deal with renovation work in the house, organizing our wedding (which was lovely, thank you very much), and various other fun things
For various reasons that I won't go into, I was given 2 more RSU grants, on overlapping vesting calendars

After 5 years, I was at a crossroads. The more I went on, the more I felt I needed to slow down. Years of exponential growth and on-call were taxing on my mental health. I was constantly stressed out and on edge. I did not sleep well, was putting on weight and was overall losing interest in my work.

As I saw it, my two options were:

I could stay and get more stocks, make more money, and continue working (with great and talented people!) in an ever-exponentially growing company that was promising me a promotion to Engineering Manager (which itself probably meant more stocks, less personal time and more stress), or
I could decide to quit, rest, slow down and do something else.

I just want to be clear here. There were other options, such as going back to an IC role, that I discussed with my manager. I don't want to come across as passively dissing him. He truly was an incredible manager. But in the end, these were the 2 extreme options.

As I was slowly coming to the realization that option 2 was the one for me, came an extremely toxic thought. Was there a point in the near future where I'd vest a substantial amount of RSUs, after which I could then quit? The answer was yes, about 10 months from then. And thus I tried to stick around, feeling more and more depressed and disengaged, all that in the prospect of vesting stocks amounting to about $150,000.

Don't get me wrong, this is a substantial amount of money, that most people aren't privileged enough to dream about. Except that I didn't need it really. I was already living where I wanted, with my wife that I loved with all my heart. This was the endgame. I quickly realized that I was putting my mental health in harm's way just because I didn't want to feel like I was checking out of the table and leaving money on it. Money that I didn't really need in the first place, thanks to my remaining 40%.

Realizing how unhealthy that line of thinking was, I settled on option #2, negotiated a 2-month leaving period (the legal one in France is 3 months), after which I said good-bye to all the wonderful people I had been lucky to work with for years.

On my last day, my "Potential Benefit Value" in etrade was at about $1.2M. I left it all on the table.

And you know what? I'm happy. Enough was enough.

Measuring the coverage of a Rust program in GitHub Actions

2022-04-26T00:00:00+02:00

After having faced a couple of regressions in bo (my personal text editor written in Rust) in the past couple of days, I have tried to increase the number of unit tests related to the codebase sections handling navigation. I already had some unit tests, but I needed to know what lines of code were not tested, to know what area of the codebase I needed to focus on.

To do this, I used Mozilla's excellent grcov project. I followed their instructions and ran the following commands locally, in my work directory.

$ export RUSTFLAGS="-Cinstrument-coverage"
$ cargo build
$ export LLVM_PROFILE_FILE="bo-%p-%m.profraw"
$ cargo test
$ grcov . -s . --binary-path ./target/debug/ -t html --branch --ignore-not-existing -o ./target/debug/coverage/
$ open ./target/debug/coverage/index.html

This way, I got a beautiful HTML report in which I could see my code coverage, either global, file by file, or line by line.

grcov even generates nice SVG badges displaying the coverage score, that I could display on the project homepage!

What I ultimately wanted, though, was for every commit touching my main branch to trigger the generation of a new coverage report, that I could host somewhere public and read at leisure when I needed to.

To do so, I set up a publicly accessible S3 bucket, configured to host a static website, which turns out to be remarkably easy to do in terraform:

resource "aws_s3_bucket" "github-brouberol-coverage" {
  bucket        = "my-bucket-name"
  provider      = aws.euwest
  acl           = "public-read"
  force_destroy = false
  versioning {
    enabled    = false
    mfa_delete = false
  }
  website {
    index_document = "index.html"
  }
}

There are other ways to host the HTML files than S3 (such as Github Pages), and you do not have to use terraform to do it, but I so happen to have a terraform codebase for my personal infrastructure, which made it a no-brainer. If you decide to host the files another way, feel free to jump ahead.

I then created an AWS user, associated with an AWS access_key/secret_key pair and the following IAM policy, granting that user read/write permissions on that S3 bucket, and nothing else.

{
    "Version": "2012-10-17",
    "Statement": [
        {
            "Sid": "VisualEditor0",
            "Effect": "Allow",
            "Action": [
                "s3:PutObject",
                "s3:GetObjectAcl",
                "s3:GetObject",
                "s3:ListBucket",
                "s3:DeleteObject",
                "s3:PutObjectAcl"
            ],
            "Resource": [
                "arn:aws:s3:::<my-bucket-name>",
                "arn:aws:s3:::<my-bucket-name>/*"
            ]
        }
    ]
}

I then had to store the bucket name, keypair and AWS region name as encrypted secrets in the bo repository, by going to Settings > Secrets > Actions > New repository secret.

Once that was all set up, the project CI (Github Actions) needed to perform the following actions:

Checking out the project and setting up a nightly rust toolchain

- uses: actions/checkout@v2
- name: Setup toolchain
  uses: actions-rs/toolchain@v1
  with:
    toolchain: nightly
    override: true
    profile: minimal

running the unit tests with profiling and coverage collection enabled

- name: Run tests
  uses: actions-rs/cargo@v1
  with:
    command: test
    args: --all-features --no-fail-fast  # Customize args for your own needs
  env:
    CARGO_INCREMENTAL: '0'
    RUSTFLAGS: |
      -Zprofile -Ccodegen-units=1 -Cinline-threshold=0 -Clink-dead-code
      -Coverflow-checks=off -Cpanic=abort -Zpanic_abort_tests -Cinstrument-coverage
    RUSTDOCFLAGS: |
      -Zprofile -Ccodegen-units=1 -Cinline-threshold=0 -Clink-dead-code
      -Coverflow-checks=off -Cpanic=abort -Zpanic_abort_tests -Cinstrument-coverage

generating the coverage report using grcov, using the actions-rs/grcov action.

- name: Gather coverage data
  id: coverage
  uses: actions-rs/grcov@v0.1

measuring the total coverage score, and reporting it in a check, if the job is associated with a pull request

- name: Report coverage in PR status for the current commit
  if: github.ref_name != 'main'
  run: |
    set -eu
    total=$(cat ${COV_REPORT_DIR}/badges/flat.svg | egrep '<title>coverage: ' | cut -d: -f 2 | cut -d% -f 1 | sed 's/ //g')
    curl -s "https://brouberol:${GITHUB_TOKEN}@api.github.com/repos/brouberol/bo/statuses/${COMMIT_SHA}" -d "{\"state\": \"success\",\"target_url\": \"https://github.com/brouberol/bo/pull/${PULL_NUMBER}/checks?check_run_id=${RUN_ID}\",\"description\": \"${total}%\",\"context\": \"Measured coverage\"}"
  env:
    GITHUB_TOKEN: ${{ secrets.GITHUB_TOKEN }}
    COMMIT_SHA: ${{ github.event.pull_request.head.sha }}
    RUN_ID: ${{ github.run_id }}
    PULL_NUMBER: ${{ github.event.pull_request.number }}
    COV_REPORT_DIR: ${{ steps.coverage.outputs.report }}

uploading the whole HTML coverage report to S3, using the jakejarvis/s3-sync-action action. We only do this for commits belonging to the main branch (i.e. direct pushes or after a pull request was merged).

- name: "Upload the HTML coverage report to S3"
  if: github.ref_name == 'main'
  uses: jakejarvis/s3-sync-action@master
  with:
    args: --acl public-read --follow-symlinks --delete
  env:
    AWS_S3_BUCKET: ${{ secrets.AWS_BUCKET }}
    AWS_ACCESS_KEY_ID: ${{ secrets.AWS_ACCESS_KEY_ID }}
    AWS_SECRET_ACCESS_KEY: ${{ secrets.AWS_SECRET_ACCESS_KEY }}
    AWS_REGION: ${{ secrets.AWS_REGION }}
    SOURCE_DIR: ${{ steps.coverage.outputs.report }}
    DEST_DIR: 'bo'
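
The status-reporting step above pulls the coverage percentage out of the badge with a shell pipeline (egrep, cut, sed). For readers who would rather do that extraction in a script, here is a minimal Python sketch of the same idea. The sample SVG string is a made-up stand-in for the badge file, and only the `<title>coverage: N%</title>` shape is taken from the pipeline itself.

```python
import re


def extract_coverage(svg_text: str) -> str:
    """Return the coverage percentage found in the badge title, without the % sign."""
    match = re.search(r"<title>coverage:\s*([^%<]+)%", svg_text)
    if match is None:
        raise ValueError("no coverage title found in badge")
    return match.group(1).strip()


# Hypothetical badge content, for illustration only.
sample = "<svg><title>coverage: 87%</title></svg>"
print(extract_coverage(sample))
```

Raising an error when the title is missing makes the step fail loudly instead of reporting an empty percentage.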

With all of that set up, the coverage report is now publicly available, refreshed every time a new commit hits main, and I even get a coverage shield for free!

Tools I'm thankful for

2022-02-22T00:00:00+01:00

Software engineers sometimes have a reputation of being overly critical when it comes to tools and programming languages. The web is full of rants, heated debates and articles about what technology is "better" and which is "crap". It was thus refreshing to read a post titled Software I'm thankful for, that showed some pieces of software in a positive light. In honor of this article, I've decided to go through the same exercise.

Python

Python was my gateway to becoming a software engineer. It was the first programming language I loved, and I still do to this day. I wrote Python code professionally for an AI startup, an e-ticketing startup, the Scottish government, a global hosting provider, a huge observability SaaS. I've written large Python webapps and quick Python scripts. I've written large asynchronous task workflows processing payments, trained machine learning models, written self-documented REST APIs, found my house listing by scraping the web, monitored the level of the river close by, all of that in Python.

I also write Python code to maintain my own infrastructure, that I deploy via ansible, itself written in Python. This blog is generated via Pelican, which is written in Python. I've started to play with a Raspberry Pi Pico, that I program in ... CircuitPython. It's ubiquitous, and I've heard it be called "The second best tool for every job", meaning that it probably won't be the most performant tool for what you're working on, but you'll make progress really fast.

Learning and programming Python has taught me many programming concepts, such as object-oriented programming, functional programming, unit testing, dataclasses, metaprogramming, REST APIs, HTTP, JSON, etc.

I however now realize that it also allowed me to get introduced to lower-level concepts, such as ioctl, sockets, system calls, file descriptors, etc, through the reassuring lens of the Python standard library, instead of having to interact with these concepts in C, which was much more intimidating (and still is today).

Docker

The first time I was introduced to Docker was at a Python meetup in Lyon, circa 2013. After the 30 minute long presentation, I still had no clue as to what any of it meant and why I'd ever need it and pretty much shrugged it off. As the Docker ecosystem flourished and the dust settled, I started to understand the appeal.

Do you need to run redis to prototype against? Just run docker run redis and voila. Do you want to run calibre-web on your local VPS without having to install its dependencies in your system libraries? Sure.

Docker allowed me to self-host a collection of tools that I use every day, package and run applications in extremely large production environments, spin up development environments without having to pollute my system libraries. It boosted my productivity and became part of my day-to-day workflow. None of these are the real reason why I'm thankful for Docker.

I've seen many companies break down their monolith into dockerized microservices. The commonly invoked reasons are allowing teams to choose their own language for each project, and helping the horizontal scaling of some load-critical apps. As useful as Docker was to start a single container, it didn't solve the issue of starting several containers that could communicate with each other on a single host. Enter docker-compose, which in turn didn't solve the issue of orchestrating containers on a fleet of nodes. Enter Mesos/Marathon, Docker Swarm, Kubernetes, Amazon ECS, etc.

The beef I have with Docker is that the hype around its ecosystem caused small companies to onboard an immense amount of complexity from the absolute get-go, to help with recruiting. Because engineers want to build experience with Kubernetes, these companies find themselves dividing their attention between grappling with its inherent complexity, distributed tracing, image recycling policies, RBAC, etc, and building their actual core value.

This is why I'm thankful for Docker and its ecosystem. I believe I've seen situations in which it truly was critically useful, and I'll now be able to differentiate between situations in which we need it, and situations in which we only wished we did.

Raspberry Pi

Before I joined OVH, the only sysadmin experience I had was tinkering with my Raspberry Pi. Thanks to that $35 matchbox-sized computer, I got to learn iptables and systemd, port forwarding, ssh hardening, file system checks and repairs. But really, the crucial point is that I was able to learn all that by making mistakes. I'd rather learn about why you need to be careful with iptables -j DROP in the comfort of my own home than in a high-pressure production environment. I can't stress enough the impact that learning without the fear of public failure had on me.

I'm now getting into electronics through the Raspberry Pi Pico, which opens a whole new exploration and tinkering domain for me!

The terminal

The terminal is a truly important part of my day as a software engineer. It's really what allows me to feel in control. Like Python, it became a familiar tool in which I could discover entirely new domains, interact with new systems and concepts. I learned so much from it that I decided to help people out getting familiarized with the terminal and the shell.

Sending a webhook from Synology DSM to Discord

2022-01-17T00:00:00+01:00

Given the fact that running a Datadog agent on a Synology Play NAS is not obvious, I wanted to enable Discord webhooks push notifications (as this is where my Datadog alerts are already being sent). This way, I'd get plenty of alerts "for free" without having to configure new Datadog monitors.

While sending webhooks notifications from a Synology NAS to Discord is technically possible, the DSM UI somehow seems to prevent us from doing so, as documented in this forum thread. Somehow, we have to include a hello world message in the notification, as part of the message content, without which, the UI won't allow us to save the webhook configuration.

You can however circumvent the issue by ssh-ing into the NAS and editing /usr/syno/etc/synowebhook.conf into this:

{
    "Discord": {
        "needssl": false,
        "port": 8090,
        "prefix": "A new system event occurred on your %HOSTNAME%",
        "req_header": "",
        "req_method": "post",
        "req_param": "{\"username\":\"Synology\", \"avatar_url\": \"https://play-lh.googleusercontent.com/HjbYWbXJ-6e6Cia-mBbHDSdontW1yE6MHMaXqlHW80CQegDOEPQ1HGACxvEpnqCUHgo\", \"embeds\": [{\"description\": \"@@TEXT@@\", \"title\": \"@@PREFIX@@\"}]}",
        "sepchar": " ",
        "template": "$webhook_url",
        "type": "custom",
        "url": "$webhook_url"
    }
}

Note: replace $webhook_url with your Discord webhook URL.
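
Before relying on DSM, it can be handy to check that the webhook URL accepts a payload of that shape. The following is a minimal sketch using only the Python standard library. It mirrors the username/embeds structure of the req_param template above; the function names, the sample text and the User-Agent value are illustrative assumptions, not anything DSM itself uses.

```python
import json
import urllib.request


def build_payload(prefix: str, text: str) -> dict:
    """Build a Discord webhook body shaped like the req_param template."""
    return {
        "username": "Synology",
        "embeds": [{"title": prefix, "description": text}],
    }


def send(webhook_url: str, payload: dict) -> int:
    """POST the payload as JSON and return the HTTP status code."""
    request = urllib.request.Request(
        webhook_url,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            # Assumption: an explicit User-Agent, as some endpoints reject urllib's default.
            "User-Agent": "dsm-webhook-sketch/0.1",
        },
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return response.status


payload = build_payload("A new system event occurred on your nas", "Test message")
print(json.dumps(payload, indent=2))
# send("<your Discord webhook URL>", payload)  # uncomment to actually post
```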

When this is done, you should see a Discord webhook in your Webhook Push Services, and you should now be able to send a test message to Discord!

Now, any warning or alert generated from DSM will automatically be sent to Discord as well!

River monitoring with Datadog

2021-11-02T00:00:00+01:00

water level (m) over time

Last month, Ardèche experienced very heavy precipitation in the span of a couple of hours. As a result, the dam located upriver from me opened the floodgates (literally), which caused the Chassezac level to rise by about 6.5m in about 1.5h. My basement was completely flooded, and the water level stabilized just about 1m from the house ground floor. We had just enough time to move our belongings to the first floor. The riverside was unrecognizable, to the point where we found fish in the trees.

3 weeks later, the same thing happened, but this time the dam managers did their job. They only let out enough water to make sure the dam wasn't overrun, while keeping everyone safe downriver.

What really bothered me though, is that at no point were we alerted of anything by EDF (the company managing the grid). No text, no alert, nothing.

Datadog to the rescue.

Using custom scripts, I now measure the river level at the station before and after my house. I also keep tabs on the amount of rain measured at these stations, as well as the general alert level.

By "chance", the first flood stopped right before the house level, and the second one stopped right before the basement. By extrapolating just a bit, I'm now able to have a good idea of the impact of a flood by looking at the river level at the station upriver.

I thus created Datadog monitors over the river level and the alert level, and I hooked them to a personal Pagerduty account, using their free tier.
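
The reasoning behind those monitors boils down to comparing the upriver level against two reference levels. Here is a rough sketch of that comparison. The threshold values are purely hypothetical placeholders (the real ones would come from the levels observed during the two floods), and the function is an illustration, not the script I actually run.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Thresholds:
    # Hypothetical upriver levels, in metres. Replace with observed values.
    basement_m: float = 4.0
    house_m: float = 6.0


def expected_impact(upriver_level_m: float, thresholds: Thresholds = Thresholds()) -> str:
    """Map an upriver level to a coarse alert state."""
    if upriver_level_m >= thresholds.house_m:
        return "critical"
    if upriver_level_m >= thresholds.basement_m:
        return "warning"
    return "ok"


for level in (1.2, 4.5, 6.5):
    print(level, expected_impact(level))
```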

I made sure to enable Critical Alerts for High Urgency in the app settings, which enables Pagerduty to override my phone volume preference, to wake me up even if it is in silent mode.

Now, if the dam managers decide to open the gates during the night (it has happened), I'll know.

To the Underdark and back

2021-10-21T00:00:00+02:00

I've recently designed a 2 session long (6h) detour into the Underdark, that would feed into one of my player's character's backstory. The goal was to allow her to meet her long-disappeared father, while introducing both the players and the characters to the strange and dangerous land that is the Underdark.

The way I prepared these sessions was an interesting process. I wanted these sessions to be mostly focused on exploration and roleplay, with a single (intense) fight, as well as a puzzle. I tried to design a sandboxed environment with enough lore and backstory to make sure the players enjoy themselves and have a reason to interact with the NPCs. I wanted them to care and have the necessary space and freedom to express themselves.

Following are my session design notes, that lasted me 2 whole sessions. These were as much a way to create the world as reminders about key elements or creature capabilities that I should remember mid-fight. They ended up being quite short, because I tried really hard to paint a picture, and prepare some colorful moments, but not to anticipate my players' reactions. They mostly filled the gaps and brought life to that setting.


Metaprocrastinating on writing a book by writing a text editor

2021-09-04T00:00:00+02:00

If you have been following my Essential Tools and Practices for the Aspiring Software Developer posts and were anxious to read more, you might have noticed that they stopped coming after a while. I have a draft for the last chapter, and I regularly think about getting back to it, at least to get some closure. Alas, procrastination being what it is, I never did.

My procrastination level became really interesting when I convinced myself that one of the reasons that I didn't want to write that final chapter was that my text editor was standing in the way. I was either using a full-fledged code editor (Sublime Text/VSCode) riddled with complex features I didn't need (autocompletion, linting, etc) or getting lost in configuring vim into the perfect markdown editor. Either way, these were the wrong tools for the job, and my only way to get back to writing was to... write my own?

And thus, bo was born.

The idea was to create a simple text editor, with powerful vim-like navigation. It should allow me to write in a very simple interface, while being able to navigate through the text in a couple of keystrokes, leveraging the muscle memory I built over the years using vim (or the vim mode in various editors).

I wanted it to be written in Rust, as it would be a good opportunity for me to write non-trivial code in a safe language, and also because, well, it just sounded fun.

I've been working on it on and off in the last month, and I've implemented enough features so that it's starting to feel comfortable.

There's still a lot to do! I'd be delighted if you wanted to test it and give it a go!

Written with bo.

Cleaning up the Dungeondraft tag list

2021-08-31T00:00:00+02:00

I have spent quite a lot of time using Dungeondraft recently, as I've designed many homebrewed places and encounters. The more maps I created, the more asset packs I bought from CartographyAssets, to further enrich and improve them. I quickly started to realize that some of these asset packs caused the tag list to be filled with entries that weren't linked to any assets at all. This made the asset discovery process quite frustrating.

Luckily, I found out about Dungeondraft-GoPackager, a tool that allows anyone to unpack a Dungeondraft asset pack, and inspect its metadata. I discovered that some asset packs would ship with tag entries linked to an empty asset list:

$ dungeondraft-unpack 2M\ Forest\ Floor\ Assets.dungeondraft_pack .
$ cd 2M\ Forest\ Floor\ Assets
$ cat data/default.dungeondraft_tags | jq .
# ... snip
    "Magic": [],
    "Mattresses": [],
    "Mill": [],
    "Mine": [],
    "Mirror": [],
    "Molds and Stains": [],
    "Mushroom": [],
    "Obstacle": [
      "textures/objects/forestfloor_cliff_1.webp",
      "textures/objects/forestfloor_cliff_2.webp",
      "textures/objects/forestfloor_cliff_3.webp"
    ],
    "Ocean": [],
    "Ottomans": [],
    "Paddles": [],
    "Paper and Books": [],
    "Pillar": [],
    "Pillows": [],
    "Pine Trees": [],
    "Planks and Debris": [],
# ...

I suspect the author does that because they export the same list of tags for each asset pack they release. However, having only bought a couple, that caused my tag list to be pretty spotty.

I decided to create a script that would automate the process of unpacking asset packs, removing these empty metadata entries, and then repacking everything up. While the process is pretty simple conceptually, it can become tedious when the number of packs grows.
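
The heart of that cleanup is dropping every tag whose asset list is empty. The snippet below is a minimal sketch of that single step, not the actual implementation of the tool. It assumes the tags are available as a mapping of tag name to asset list, as in the excerpt above; where that mapping sits inside the tags file is not shown here and is left out on purpose.

```python
import json


def drop_empty_tags(tags: dict) -> dict:
    """Return a copy of the tag mapping without tags linked to an empty asset list."""
    return {tag: assets for tag, assets in tags.items() if assets}


# Small sample modelled on the excerpt above.
sample = json.loads("""
{
  "Mushroom": [],
  "Obstacle": ["textures/objects/forestfloor_cliff_1.webp"],
  "Ocean": []
}
""")
print(json.dumps(drop_empty_tags(sample), indent=2))
```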

The result of that work is cleanup-dungeondraft-asset-packs, that you can install by running the following command:

$ pip3 install --user cleanup-dungeondraft-asset-packs
Collecting cleanup-dungeondraft-asset-packs
  Downloading cleanup_dungeondraft_asset_packs-0.1.0-py3-none-any.whl (4.2 kB)
Installing collected packages: cleanup-dungeondraft-asset-packs
Successfully installed cleanup-dungeondraft-asset-packs-0.1.0

Once installed, you just have to point it to your assets directory, and voilà:

$ cleanup-dungeondraft-asset-packs --assets-dir ~/Documents/DnD/DungeonDraft/Assets
INFO:root:Unpacking /Users/br/Documents/DnD/DungeonDraft/Assets/CH-Forest-Demo.dungeondraft_pack
INFO:root:Repacking tmp/CH-Forest-Demo
WARN[0000] overwriting file                              id=kt201FMq name="CH - Forest Demo" outPackagePath="/Users/br/Documents/DnD/DungeonDraft/Assets/cleaned/CH - Forest Demo.dungeondraft_pack" path=/Users/br/Documents/DnD/DungeonDraft/Assets/tmp/CH-Forest-Demo
INFO:root:Unpacking /Users/br/Documents/DnD/DungeonDraft/Assets/AS-Forest-apmh1i.dungeondraft_pack
INFO:root:Skipping, as no dungeondraft_tags file is found
INFO:root:Repacking tmp/AS-Forest-apmh1i
WARN[0000] overwriting file                              id=qxhzAkxg name="AS Forest" outPackagePath="/Users/br/Documents/DnD/DungeonDraft/Assets/cleaned/AS Forest.dungeondraft_pack" path=/Users/br/Documents/DnD/DungeonDraft/Assets/tmp/AS-Forest-apmh1i
INFO:root:Unpacking /Users/br/Documents/DnD/DungeonDraft/Assets/2M Forest Floor Assets.dungeondraft_pack
INFO:root:Skipping empty tag Administration
INFO:root:Skipping empty tag Animals
INFO:root:Skipping empty tag Armchairs
INFO:root:Skipping empty tag Armor
...

Now, re-open Dungeondraft, and point it to the cleaned directory that cleanup-dungeondraft-asset-packs created, in which it placed all cleaned assets.

At that point, your tag list should only contain entries linked to actual assets!

There you go, I hope that helps! Happy Dungeondrafting!

Running the Port Nyanzaru Dinosaur Race

2021-04-10T00:00:00+02:00

When I was preparing for Port Nyanzaru, in Tomb of Annihilation, I started reading what other Dungeon Masters had to say about the city. A lot of them would mention that the dinosaur race was a must-do, and that if done properly, it could really be a high point in the start of the adventure. The problem was, I felt that the official rules regarding this race were, well, underwhelming, to say the least. Each player rolls a die, gets some points or not, repeatedly until the end of the race. If that race was going to be something to remember, I felt that I needed to spice it up a bit.

The way I designed the race was as a mix between the official rules, the Game of the Goose and Mario Kart. You win if you are the first to complete 2 full laps around the city. Each lap is made of 48 squares, and starts/finishes at the Coliseum, marked with an X.

The players roll initiative to determine the order in which they'll play. We however consider that they all move at the same time, meaning that if 2 dinosaurs cross the finish line during the same round, they'll be considered ex aequo.

A player's turn goes as follows:

the jockey rolls an animal handling check against the dinosaur's DC to see if they can control it
- if successful, the dinosaur moves using its high speed dice
- if unsuccessful, the dinosaur moves using its low speed dice. The jockey could however choose to hit their mount with a whip to coerce it into running faster (using its high speed dice).
  - The dinosaur needs to make a successful DC 10 CON check for that.
  - If unsuccessful, it moves at half speed for the rest of the turn.
  - For the more aggressive dinosaurs (the ones marked with an asterisk), if the CON check failed by more than 5 points, it stops moving for 2 rounds, in protest.
If a dinosaur moves through or stops on a red square (on a bridge), it could attempt to trip another dinosaur located on the same square.
- If the other dinosaur fails a DC 10 DEX check, it's considered prone for a turn.
If a dinosaur stops (or chooses to stop) on a blue square, the jockey can decide to pick up some loot box. These boxes can have positive or negative effects, either instantaneous or to possibly be used later, anytime during the player's turn (think Mario Kart loot boxes).
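
If you would like to playtest the pacing before the session, the basic movement part of a turn is easy to simulate. The sketch below only covers the animal handling check and the choice between low and high speed dice. It leaves out the whip, bridges and loot boxes, and it assumes that the jockey's WIS modifier is added to the d20 roll and that meeting the DC counts as a success.

```python
import random
import re


def roll(dice: str, rng: random.Random) -> int:
    """Roll a dice expression such as '1d6' or '1d4+6'."""
    match = re.fullmatch(r"(\d+)d(\d+)(?:\+(\d+))?", dice)
    if match is None:
        raise ValueError(f"unsupported dice expression: {dice}")
    count, sides, bonus = int(match[1]), int(match[2]), int(match[3] or 0)
    return sum(rng.randint(1, sides) for _ in range(count)) + bonus


def move(wis_mod: int, dc: int, low: str, high: str, rng: random.Random) -> int:
    """Resolve the animal handling check, then roll the matching speed dice."""
    check = rng.randint(1, 20) + wis_mod  # assumption: WIS modifier applies
    return roll(high if check >= dc else low, rng)  # assumption: ties succeed


rng = random.Random(42)
# Stats for "Un Tej et l'Addition": WIS +2, DC 14, low 1d6, high 1d4+6
squares = move(wis_mod=2, dc=14, low="1d6", high="1d4+6", rng=rng)
print(f"moved {squares} squares")
```

Running `move` in a loop until a dinosaur has covered 96 squares (2 laps of 48) gives a rough idea of how many rounds a race lasts.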

I've kept the same dinosaur stats as given in the book, and used the official stats block for the Velociraptor.

Dinosaur	Race	Jockey WIS	Low speed dice	High speed dice	Animal Handling DC	CON	DEX
Un Tej et l'Addition	Triceratops	14(+2)	1d6	1d4+6	14	15(+2)	9(-1)
Aubrion du Gers	Hadrosaurus	12(+1)	1d6	1d2+6	10	13(+1)	10(0)
Mambo Mambo King of Tango	Tyrannosaurus	17(+3)	1d6	1d6+6	18*	17(+3)	10(0)
Brigadier Gérard	Dimetrodon	13(+1)	1d4	1d4+4	8	15(+2)	12(+1)
Fanfreluche	Allosaurus	16(+2)	1d6	1d4+6	16*	15(+2)	13(+1)
Pourquoi il pleure?	Deinonychus	17(+3)	1d6	1d2+6	12*	14(+2)	15(+2)
Excelsior VII	Ankylosaurus	16(+2)	1d4	1d6+4	13	16(+2)	11(+0)
Irène	Velociraptor	15(+2)	1d8	1d2+8	12	13(+1)	14(+2)

Here are the loot boxes that I came up with.

Effect	Instantaneous?
An insect swarm scares off your mount. Your next move will use your low speed dice.	Yes
You find a juicy spider. When given to your mount, it will run using its high speed dice.	No
You injure yourself on a hallucinogenic vine. Your next animal handling check will be performed at a disadvantage.	Yes
A reflex potion, when consumed, will give you an advantage at the next DEX check.	No
This net will allow you to immobilize an adversary located on the same square as you during a whole turn if they fail a DEX check DC 12.	No
This blessing potion will allow you to add 1d4 to your next skill check or saving throw.	No
These beads allow you to trip all dinosaurs located on the same square as you or the square before you. A dinosaur trips if it fails a DEX check DC 13. In case of failure, its speed is divided by 2 during its next turn.	No
A blinding bomb explodes in your face. If you fail a WIS saving throw DC 13, your next 2 animal handling checks will be performed at a disadvantage.	Yes
An appetizing chicken heart will allow you to reroll your speed dice, after consumption.	No
You get teleported to the same square as the penultimate dinosaur.	Yes

Each player had to pay 20 gold to enter the race. The first finisher gets 100 gold, the second one gets 50 gold and the third one gets 20 gold. The players are obviously free to bet on anything they like, and the DM is responsible for giving them appropriate odds.

I hope these rules will help you run a fun race, or at least give you ideas to create your own set of rules! Feel free to tell me what worked and what didn't if you ran with these!

For those of you using Foundry, Steve Vlaminck has created a plugin implementing those very rules!

Shell productivity tips and tricks

2020-04-24T00:00:00+02:00

This article is part of a self-published book project by Balthazar Rouberol and Etienne Brodu, ex-roommates, friends and colleagues, aiming at empowering the up and coming generation of developers. We currently are hard at work on it!

If you are interested in the project, we invite you to join the mailing list!

Tab completion
Keyboard shortcuts
Navigating through history
Shell expansions
Real-life examples
Summary
Going further

Shell productivity tips

I estimate that I spend around 50% of my day working in my text editor and my terminal. Any way I can get more productive in these environments has a direct and measurable impact on my daily productivity as a whole.

If you spend a good chunk of your day repeatedly hitting the left and right arrow keys to navigate in long commands or correct typos, or hitting the up or down arrow keys to navigate your command history, this chapter should help you get more done quicker. We will cover some shell features you can leverage to make your shell do more of the work for you.

On a personal level, I probably use some of these up to 30 times a day, sometimes even without thinking about it, and it gives me a real sense of ownership of my tool.

In the immortal words of Kimberly “Sweet Brown” Wilkins:

Ain't nobody got time for that.

Tab completion

When you are typing in your shell, I suggest you treat the Tab key as a superpower. Indeed, the same way your phone keyboard can autocomplete words for you, so can your shell. It can suggest completions of command names and even command arguments or options! This works by pressing Tab (twice for bash and once for zsh).

One of the reasons zsh might be favored over bash is its more powerful auto-completion system, giving more results out-of-the-box and allowing you to navigate through the auto-completion options.

Here is an example of bash auto-completing a command name:

$ mkd<Tab>
mkdep  mkdir

Here is an example of bash auto-completing a command argument:

$ man mkd<Tab>
mkdir         mkdirat       mkdtemp       mkdtempat_np

And finally, an example of bash auto-completing a command option:

$ python -<Tab>
-    -3   -B   -E   -O   -OO  -Q   -R   -S   -V   -W
-b   -c   -d   -h   -i   -m   -s   -t   -u   -v   -x

I suggest you get used to using auto-completion as much as possible. It can save you keystrokes, as well as make you discover command options you didn't know about.

Pro-tip: if you are using bash, you can install the bash-completion¹ package (using your system package-manager) in order to enable auto-completion for a wide variety of commands that do not support it out-of-the-box.

Keyboard shortcuts

The shell uses a library called readline² to provide you with many keyboard shortcuts to navigate, edit, cut, paste, search, etc, in the command line. Mastering these will help to dramatically increase your efficiency, instead of copying and pasting with your mouse, and navigating the command with the ↑ and ↓ arrow keys.

The default shortcuts are inspired by the emacs³ terminal-based text editor. If you are already familiar with it, a lot of the default readline shortcuts might feel familiar. emacs isn't the only famous text editor in the history of computers though: another one, dating back to 1976, is vi.⁴ vi and emacs are designed in two very different ways, and have two very different logics. It is possible that one might “click” more than the other for you. If you happen to be familiar with the vi editor and are accustomed to its navigation system, you can replicate it in your shell as well by adding set -o vi in your shell configuration file. If you are using zsh with the Oh My Zsh framework that we introduced in the previous chapter, you can also use the vi-mode plugin to do this.

The advantage of using the same navigation logic and shortcuts in your text editor and your terminal is that it blurs the line between both, and brings consistency to your terminal environment. If you have no clue how emacs or vi work though, I would probably suggest you don't worry about all this for now and experiment with the default terminal shortcuts.

Navigating the current line

The following navigation shortcuts allow you to quickly move your cursor in the current command, saving you from relying solely on the → and ← arrows.

Navigation	Shortcut
Go to beginning of line	`Ctrl` - `A`
Go to end of line	`Ctrl` - `E`
Go to next word	`Alt` - `F`
Go to previous word	`Alt` - `B`
Toggle your cursor between its current position and the beginning of line	`Ctrl` - `X` - `X`

If you however prefer using the vi navigation system, you will first need to type Esc to switch from the Insertion mode to an emulation of vi's normal mode, in which you can navigate in your text using the following shortcuts:

Navigation	Shortcut
Go to beginning of line	`^`
Go to end of line	`$`
Go to next word	`w`
Go to previous word	`b`
Go to end of word	`e`

You can go back to editing your command line by hitting the i key.

Deleting and editing text

These shortcuts allow you to quickly edit the current command more efficiently than by just using the Delete key.

Edition	Shortcut
Delete current character	`Ctrl` - `D`
Delete previous word	`Ctrl` - `W`
Delete next word	`Alt` - `D`
Edit the current command in your text editor	`Ctrl` - `X` `Ctrl` - `E`
Undo previous action(s)	`Ctrl` - `-`

The equivalent vi-style shortcuts are:

Edition	Shortcut
Replace current character by another (ex: e)	`r` - `e`
Delete current character	`x`
Delete previous word	`d` - `b`
Delete next word	`d` - `w`
Edit the current command in your text editor	`v`
Undo previous action(s)	`u`

Cutting and pasting

The shell provides you with shortcuts to cut and paste commands quickly without using your mouse.

Action	Shortcut
Cut current word before the cursor	`Ctrl` - `W`
Cut from cursor to end of line	`Ctrl` - `K`
Cut from cursor to start of line	`Ctrl` - `U`
Paste the cut buffer at current position	`Ctrl` - `Y`

The equivalent vi-style shortcuts are:

Action	Shortcut
Cut current word before the cursor	`d` - `b`
Cut from cursor to end of line	`d` - `$`
Cut from cursor to start of line	`d` - `^`
Paste the cut buffer at current position	`p`

Controlling the terminal

Finally, these shortcuts will let you interact with the terminal itself.

Action	Shortcut	Equivalent command
Clear the terminal screen	`Ctrl` - `L`	`clear`
Close the terminal screen	`Ctrl` - `D`	`exit`
Suspend the current command (resume it in the background with `bg`)	`Ctrl` - `Z`

Even mastering some of these shortcuts should make you immensely more productive at typing commands and navigating command-line interfaces. I suggest you take time to experiment until you feel more accustomed to them. I can guarantee that you will feel the productivity boost!

A unified command-line editing experience

These shortcuts do not just work in your shell, but in any application using the readline library to allow the user to type and edit commands. Learning these shortcuts will thus make you productive in all types of command lines that you might encounter in your career, such as python, irb, sqlite3, etc.

To make sure you get a smooth and homogeneous editing experience in all command lines you use in your system, you can set your preferred mode in the readline configuration file itself.

$ cat ~/.inputrc
# use "emacs" instead of "vi" if you prefer the emacs-style shortcuts
set editing-mode vi

Navigating through history

If you find yourself typing a certain command time and time again, you should probably be aware of how to navigate and search your shell history, in order to save time and keystrokes.

While the obvious way to re-execute a previous command might seem to just bash on the ↑ key until you find the command you want, there are faster and smarter ways to accomplish this.

Searching the history

A very useful and time-saving trick is searching for a command in your shell history instead of re-typing it from scratch. You can search your command history by typing Ctrl - R, which opens a reverse-i-search (backwards search) prompt, in which you can search for previously executed commands containing a given search pattern.

Type Ctrl - R again to cycle through older matches until you find the one you were looking for, then type the Enter key to execute it.

$ <Ctrl-R>
(reverse-i-search): echo <Ctrl-R> <Enter>
$ echo "hello world"
hello world

If you want to stop the search, either hit Ctrl - C or Ctrl - G to be sent back into the regular shell prompt.

History search works by looking into your shell history (saved in ~/.bash_history for bash and ~/.zsh_history for zsh by default). Every command you execute ends up in that history: HISTSIZE caps the number of commands the shell keeps in memory, while the size of the history file itself is capped by HISTFILESIZE in bash and SAVEHIST in zsh.

The location of your shell history file can be configured by setting the HISTFILE environment variable.
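
As an illustration, a bash user might relocate and resize their history with a few lines in ~/.bashrc. This is only a sketch: the file path and the sizes below are arbitrary values picked for the example, not recommendations.

```shell
# Hypothetical ~/.bashrc fragment: the path and the sizes are arbitrary examples.
HISTFILE="$HOME/.my_bash_history"  # file in which bash saves the history
HISTSIZE=10000                     # commands kept in memory for the current session
HISTFILESIZE=20000                 # lines kept in the history file itself
```

zsh reads HISTFILE and HISTSIZE as well, and uses SAVEHIST where bash uses HISTFILESIZE.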

Rewriting history

If you want to remove a sensitive command from your history, you can simply edit your $HISTFILE history file and remove it.

$ secret-command --password 1234qwerty  # oh no! that should not be in my history!
$ grep secret-command $HISTFILE
secret-command --password 1234qwerty
$ sed -i '/secret-command/d' $HISTFILE  # deletion of history line containing 'secret-command'
$ grep secret-command $HISTFILE
$ # it's not in history anymore
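
If you would rather rehearse that cleanup before touching your real history, the sketch below runs it against a throwaway file. The fake_histfile and cleaned names are made up for the example. It also swaps sed -i for grep -v, as the -i flag does not take the same arguments in GNU sed and in the sed shipped with macOS.

```shell
# Rehearse the cleanup on a throwaway file standing in for $HISTFILE.
fake_histfile=$(mktemp)
printf '%s\n' 'mkdir test' 'secret-command --password 1234qwerty' 'cd' > "$fake_histfile"

# Keep every line that does NOT mention secret-command, then swap the file in place.
# grep exits with status 1 when it selects no line at all, hence the "|| true".
cleaned=$(mktemp)
grep -v 'secret-command' "$fake_histfile" > "$cleaned" || true
mv "$cleaned" "$fake_histfile"

cat "$fake_histfile"   # only "mkdir test" and "cd" remain
```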

You can also use the history built-in command to display your whole history:

$ history | tail -n 5
  496  mkdir test
  497  secret-command --password 1234qwerty
  498  cd
  499  man history
  500  history | tail -n 5

Each history line is prefixed by its index in the history. You can then use history -d <index> to remove the associated line from history.

$ history -d 497
$ history | tail -n 7
  496  mkdir test
  497  cd
  498  man history
  499  history | tail -n 5
  500  history -d 497
  501  history | tail -n 7

This only works with bash, not zsh.

Avoiding history

There is a trick you can use if you want to fly under the radar and never have a command recorded in history in the first place. Simply prefix your command with a space.

In bash, this relies on the HISTCONTROL variable containing ignorespace or ignoreboth, which many distributions configure by default. If you are using zsh, you need to add setopt HIST_IGNORE_SPACE to your ~/.zshrc to make sure that behavior is enabled.

$  secret-command --password 1234qwerty  # notice the space at the start of the command!
$ history | tail -n 2
  502  history | tail -n 7
  503  history | tail -n 2

Shell expansions

The shell can perform expansions, meaning it can replace portions of the command before executing it. Relying on expansions allows you to type less and rely on the shell itself to do the heavy lifting. While there are multiple types of expansions, we will only cover 5:

history expansion: quickly access previous commands and arguments from history
tilde expansion: replace the ~ path prefix
pathname expansion: expand a path pattern into a list of files
brace expansion: expand a pattern between braces into a longer sequence
command expansion: replace a sub-command by its output

Expansions are extremely powerful. When used right, an expansion can literally save you from writing a script.

As we only cover what we think are the most useful expansions and shortcuts, feel free to refer to the bash manual, section EXPANSION, if you want to see the full list.

History expansion

Your shell has multiple tricks up its sleeve to allow you to quickly reference previous commands or arguments in history with a minimum of keystrokes. While this section only provides you with what we feel are the most useful of them, feel free to go to the HISTORY EXPANSION section of the bash manual.

Event designators

An Event designator is a reference to a command line entry in the history list. It allows you to quickly refer to a previous command without having to re-type it.

`!-n`

!-n refers to the nth latest command: !-1 refers to the latest command, !-2 to the command before that, etc.

$ echo "hello world"
hello world
$ cd
$ !-2  # !-1 is "cd" and !-2 is 'echo "hello world"'
$ echo "hello world"
hello world

!! is a shortcut for !-1, aka the latest command.

$ echo "hello world"
hello world
$ !!
$ echo "hello world"
hello world

!! is oftentimes used in conjunction with sudo, to re-execute the previous command with superuser privileges when it failed, due to a lack of permission.

$ vim /etc/myfile
vim: /etc/myfile: Permission denied
$ sudo !!
$ sudo vim /etc/myfile

`^string1^string2`

^string1^string2 is used to repeat the previous command in which string1 is replaced by string2.

$ cat ./myfile
Just a file full of junk
$ ^cat^rm
$ rm ./myfile

I personally use and abuse this technique when I'm about to irreversibly delete some resources (files, folders, containers, etc), and I want to make sure I'm about to delete the right things by listing these resources first. If you are familiar with SQL queries, it is the equivalent of executing a SELECT query before changing the SELECT to DELETE, to make sure you're not going to delete more than you wanted to.

Word designators

Word designators are used to select desired words from a previous command (by default, the latest). They can be very useful when you want to type a new command that reuses arguments from a previous one.

`!^`

!^ maps to the first argument of your latest command.

$ touch first.txt second.txt last.txt
$ vim !^
$ vim first.txt

`!$`

!$ maps to the last argument of your latest command.

$ touch first.txt second.txt last.txt
$ vim !$
$ vim last.txt

Combining event and word designators

You can even combine event and word designators in more complex shapes by using the following syntax:

[EVENT DESIGNATOR]:[WORD DESIGNATOR]

For example, you could use the !! event designator to select the last command, and the 2 word designator to select the second argument.

$ touch first.txt second.txt last.txt
$ vim !!:2
$ vim second.txt

Tilde expansion

For each unquoted word starting with ~ in the command, all characters up to the first forward slash (/) are considered a tilde prefix. Depending on its actual value, the tilde prefix can be expanded in several ways, although the simple ~ is probably the most common.

Tilde prefix	Expansion
`~`	Your home directory
`~+`	Your current working directory
`~-`	Your previous working directory

Example

$ ls ~
Android                code       Downloads              Music
AndroidStudioProjects  Desktop    Dropbox                Pictures
bin                    Documents  Firefox_wallpaper.png  Videos

This lists the content of your home directory, and is equivalent to ls $HOME. You can combine the tilde with a suffix to compose an absolute path to some file or folder in your home directory.

$ cd ~/code
$ pwd
/home/br/code
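
The three prefixes from the table can be compared with the variables they mirror. The sketch below assumes bash or zsh and a /tmp directory; the variable names are only there for the demonstration.

```shell
# Move around a little so that "previous directory" has a well-defined value.
cd /tmp
cd /

home_dir=~        # same value as $HOME
current_dir=~+    # same value as $PWD
previous_dir=~-   # same value as $OLDPWD

echo "home: $home_dir, current: $current_dir, previous: $previous_dir"
```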

Pathname expansion

Pathname expansions allow you to write a short path pattern and have it expanded into a list of files and directories, saving you from tedious copy-pasting or from writing a long (and error-prone) command.

`*`

The glob, or wildcard * character matches any string. It allows you to give a pattern to the shell, that it will then expand to all files and directories matching the pattern. The wildcard can be prefixed or suffixed, which will further specify our pattern. For example, *.jpg matches all files ending with the .jpg extension, and README.* matches all files named README whatever their extension.

Let us consider the following file and directory structure.

$ tree
.
|-- pic1.jpg
|-- pic2.jpg
|-- pic3.jpg
|-- pic4.jpg
\__ pics
|   |-- pic5.jpg
|   |-- pic6.jpg
|   \__ pic7.jpg
\__ sounds
    \__sound1.mp3

2 directories, 8 files

We want to move all jpg files into our pics directory. Instead of running 4 different mv commands or manually typing a long mv command, we can run just one using a pathname expansion.

$ mv *.jpg pics
$ tree
.
\__ pics
|   |-- pic1.jpg
|   |-- pic2.jpg
|   |-- pic3.jpg
|   |-- pic4.jpg
|   |-- pic5.jpg
|   |-- pic6.jpg
|   \__ pic7.jpg
\__ sounds
    \__sound1.mp3

2 directories, 8 files

*.jpg was expanded to all files ending with .jpg, causing the shell to actually run mv pic1.jpg pic2.jpg pic3.jpg pic4.jpg pics, thus moving all 4 jpg files to the pics directory in a single command.

We could have executed the following command for the same result: mv pic*.jpg pics. This would have moved all files whose name starts with pic and ends with .jpg to the pics directory.

You can use * several times within the same pattern. For example ls */* will list all files and directories located in a subdirectory.

$ ls */*
sounds/sound1.mp3   pics/pic2.jpg       pics/pic4.jpg       pics/pic6.jpg
pics/pic1.jpg       pics/pic3.jpg       pics/pic5.jpg       pics/pic7.jpg

Like in our second example, we can also use */*.jpg to list all jpg files located in a subdirectory.

$ ls */*.jpg
pics/pic1.jpg   pics/pic3.jpg   pics/pic5.jpg  pics/pic7.jpg
pics/pic2.jpg   pics/pic4.jpg   pics/pic6.jpg
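
If you want to try the * pattern without risking your own files, here is a minimal sketch that replays the move in a temporary directory. The scratch directory and the notes.txt file are assumptions added for the demonstration.

```shell
# Work in a throwaway directory so nothing of yours gets moved.
scratch=$(mktemp -d)
cd "$scratch"
touch pic1.jpg pic2.jpg pic3.jpg pic4.jpg notes.txt
mkdir pics

# Preview what the pattern expands to before acting on it.
echo *.jpg            # the shell replaces *.jpg with the four matching names

mv *.jpg pics
moved=$(ls pics)      # the four pictures
left_behind=$(ls)     # notes.txt and the pics directory stay put
echo "$moved"
echo "$left_behind"
```

Previewing a pattern with echo before handing it to mv or rm is a cheap way to check that it matches what you expect.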

`**`

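On its own, ** behaves like * in both zsh and bash's default configuration: it is expanded to the files and directories of the current directory, without going any deeper. (With bash's globstar option enabled, a lone ** recurses through all subdirectories instead.)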

$ touch README.txt
$ mkdir sounds/lyrics
$ touch sounds/lyrics/sound1.txt
$ tree
.
|-- README.txt
\__ pics
|   |-- pic1.jpg
|   |-- pic2.jpg
|   |-- pic3.jpg
|   |-- pic4.jpg
|   |-- pic5.jpg
|   |-- pic6.jpg
|   \__ pic7.jpg
\__ sounds
    \__ lyrics
    |   \__sound1.txt
    \__sound1.mp3

3 directories, 10 files
$ ls **
README.txt

pics:
pic1.jpg pic2.jpg pic3.jpg pic4.jpg pic5.jpg pic6.jpg pic7.jpg

sounds:
lyrics     sound1.mp3

ls ** was expanded into ls README.txt pics sounds. ls listed the content of pics and sounds, but not the content of sounds/lyrics, as the expansion did not go deeper than the current directory.

`**/`

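**/ is expanded into all directories and subdirectories, at any depth, starting from the current directory. zsh does this by default; bash only does it when its globstar option is enabled (shopt -s globstar), and otherwise treats **/ like */.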

$ tree
.
|-- README.txt
\__ pics
|   |-- pic1.jpg
|   |-- pic2.jpg
|   |-- pic3.jpg
|   |-- pic4.jpg
|   |-- pic5.jpg
|   |-- pic6.jpg
|   \__ pic7.jpg
\__ sounds
    \__ lyrics
    |   \__sound1.txt
    \__sound1.mp3


3 directories, 10 files
$ ls **/
pics/:
pic1.jpg pic2.jpg pic3.jpg pic4.jpg pic5.jpg pic6.jpg pic7.jpg

sounds/:
lyrics     sound1.mp3

sounds/lyrics/:
sound1.txt

ls **/ was expanded into ls pics/ sounds/ sounds/lyrics/. It thus listed all files located in our subdirectories, whatever their depth.
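
How far ** recurses depends on the shell and its options. The sketch below assumes bash 4 or later, where the globstar option has to be switched on first, and rebuilds a small part of the tree above in a temporary directory. The scratch and directories names are made up for the example.

```shell
# bash only: enable recursive matching for ** (zsh recurses through **/ out of the box).
shopt -s globstar

scratch=$(mktemp -d)
cd "$scratch"
mkdir -p pics sounds/lyrics
touch README.txt pics/pic1.jpg sounds/sound1.mp3 sounds/lyrics/sound1.txt

# The trailing slash restricts the matches to directories, at any depth.
directories=(**/)
printf '%s\n' "${directories[@]}"
```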

Brace expansion

A brace expansion is a mechanism by which the shell can generate multiple strings based on a sequence of tokens defined within curly braces. The brace expansion pattern can be preceded by an optional preamble and followed by an optional postscript.

$ mkdir ~/test/{pics,sounds,sprites}
$ ls ~/test
pics  sounds  sprites

~/test/{pics,sounds,sprites} was expanded into ~/test/pics ~/test/sounds ~/test/sprites causing the shell to execute mkdir ~/test/pics ~/test/sounds ~/test/sprites (which will be expanded further into mkdir /home/br/test/pics /home/br/test/sounds /home/br/test/sprites by a tilde expansion).

We could have done the same thing by factoring the final s of each token into a postscript.

$ mkdir ~/test/{pic,sound,sprite}s

A brace expansion can also have a sequence pattern {x..y[..incr]} where x and y are either integers or single characters, and incr is an optional increment value.

$ touch ~/test/sounds/noise-{1..5}.mp3
$ ls ~/test/sounds
noise-1.mp3 noise-2.mp3 noise-3.mp3 noise-4.mp3 noise-5.mp3

The default increment is 1 if the sequence end is greater than its start, and -1 otherwise. However, we could specify a custom increment value if we want.

$ touch ~/test/pics/pic{1..10..2}.jpg
$ ls ~/test/pics
pic1.jpg pic3.jpg pic5.jpg pic7.jpg pic9.jpg
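
A brace expansion can be previewed with echo before any file gets created. The following sketch assumes bash 4 or later (for the increment syntax); the variable names are only there to hold the results.

```shell
# echo prints the words produced by the expansion, separated by spaces.
countdown=$(echo {5..1})            # end smaller than start: counts down by 1
odds=$(echo {1..10..2})             # explicit increment of 2
letters=$(echo {a..e})              # single characters work too
names=$(echo {pic,sound,sprite}s)   # comma list followed by a postscript

printf '%s\n' "$countdown" "$odds" "$letters" "$names"
```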

Command expansion

Your shell can replace a command surrounded by $() with its output.

I personally like to use command expansions to iterate over a command's result, or to combine them with a heredoc redirection:

$ cat <<EOF > aboutme
My name is $(whoami)
and I live in $HOME
EOF
$ cat aboutme
My name is br
and I live in /home/br
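
Iterating over a command's result could look like the following sketch, which assumes the seq utility is available (it is on most Linux and macOS systems). The total and summary names are made up for the example.

```shell
# Sum the numbers printed by seq: one loop iteration per word of its output.
total=0
for number in $(seq 1 4); do
    total=$((total + number))
done
echo "total: $total"

# A command expansion can also be embedded in a larger string.
# tr removes the padding spaces that some wc implementations add.
summary="counted $(seq 1 4 | wc -l | tr -d ' ') numbers"
echo "$summary"
```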

Real-life examples

Moving a pattern of files contained in directories and subdirectories

What is really powerful with these expansions is that, like almost everything in the shell, they can be combined. The following example combines a pathname expansion, a brace expansion and a tilde expansion.

$ tree
.
|-- README.txt
\__ pics
|   |-- pic1.jpg
|   |-- pic2.jpg
|   |-- pic3.jpg
|   |-- pic4.jpg
|   |-- pic5.jpg
|   |-- pic6.jpg
|   \__ pic7.jpg
\__ sounds
    \__ lyrics
    |   \__sound1.txt
    \__sound1.mp3
$ mv **/*.{jpg,mp3} ~/assets/
$ tree
|-- README.txt
\__ pics
\__ sounds
    \__ lyrics
        \__sound1.txt
$ ls ~/assets
pic1.jpg   pic2.jpg   pic3.jpg   pic4.jpg   pic5.jpg   pic6.jpg   pic7.jpg   sound1.mp3

Using these expansions, we were able to move all jpg and mp3 files located in directories and subdirectories to the assets directory located in your home directory, in exactly 27 characters!
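
Before running such a wide-reaching command, it can be reassuring to print what the pattern expands to. The sketch below assumes bash 4 or later with globstar enabled, and uses a local assets directory inside a temporary folder as a stand-in for ~/assets.

```shell
shopt -s globstar   # bash 4+ only; zsh recurses through **/ by default

scratch=$(mktemp -d)
cd "$scratch"
mkdir -p pics sounds/lyrics assets
touch README.txt pics/pic1.jpg pics/pic2.jpg sounds/sound1.mp3 sounds/lyrics/sound1.txt

# Dry run: print what the pattern expands to before moving anything.
printf 'would move: %s\n' **/*.{jpg,mp3}

mv **/*.{jpg,mp3} assets/
moved=$(ls assets)
echo "$moved"
```

The brace expansion happens first and produces the **/*.jpg and **/*.mp3 patterns, each of which is then expanded into the matching paths.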

Renaming multiple directories

We could use a for loop, pathname expansion and a command expansion to rename all directories contained in the current directory to their uppercase equivalent.

$ for dir in */; do
    mv "$dir" "$(echo "$dir" | tr '[:lower:]' '[:upper:]')"
  done

Let's decompose that command into its different steps:

the */ glob pattern is expanded over the list of directories, on which we iterate via a for loop
we execute echo "$dir" | tr '[:lower:]' '[:upper:]', which converts the current directory name to uppercase
the $(echo "$dir" | tr '[:lower:]' '[:upper:]') command is expanded into the uppercase directory name
the directory is renamed into its uppercase name
the for loop moves on to the next directory and repeats the previous steps

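Iterating over paths with a for loop can be brittle: it breaks on paths containing a space as soon as the list of paths comes from the output of a command, or a variable is left unquoted. Iterating over a glob and quoting "$dir", as we did here, avoids the issue. We will later see how to iterate over paths more robustly using the find command.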
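
The loop can be rehearsed in a temporary directory before being run on real data. This sketch assumes a case-sensitive file system (the default on Linux) and strips the trailing slash produced by the */ pattern before renaming. The scratch, name and upper variables are made up for the example.

```shell
scratch=$(mktemp -d)
cd "$scratch"
mkdir pics sounds

for dir in */; do
    name=${dir%/}   # strip the trailing slash added by the */ pattern
    upper=$(echo "$name" | tr '[:lower:]' '[:upper:]')
    mv "$name" "$upper"
done

renamed=$(ls)
echo "$renamed"
```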

Summary

Your shell has so many productivity tricks and shortcuts up its sleeve it can be a little bit daunting. I suggest you don't try to learn them all at once, but really just experiment with them and see what feels natural. Even mastering some of them will make you more productive!

What if there is an action you find useful but you just don't like the keyboard shortcut? Luckily for you, the next chapter will dive into how to personalize and customize your shell.

Going further

5.1: Create a directory. Use a bash expansion to move into that directory without typing its name a second time.

5.2: Print your 4th last command typed into your terminal without re-typing it.

5.3: Create the following empty files README.txt, requirements.txt and TODO.txt in a single command, without typing .txt more than once.

5.4: Delete all the files created in the last question without typing .txt more than once.

5.5: Create the following directory tree in a single command.

files
|-- 1
|   |-- 1a
|   |-- 1b
|   |-- 1c
|   |-- 2a
|   |-- 2b
|   |-- 2c
|   |-- 3a
|   |-- 3b
|   \-- 3c
|-- 2
|   |-- 1a
|   |-- 1b
|   |-- 1c
|   |-- 2a
|   |-- 2b
|   |-- 2c
|   |-- 3a
|   |-- 3b
|   \-- 3c
\-- 3
    |-- 1a
    |-- 1b
    |-- 1c
    |-- 2a
    |-- 2b
    |-- 2c
    |-- 3a
    |-- 3b
    \-- 3c

5.6: Remove all subdirectories starting with 3 created in the previous command, while keeping the top 3 directory.

5.7: Re-execute the command from exercise 5.3 by looking backwards into your shell history.

Customizing your shell

2020-04-17T00:00:00+02:00

Which terminal should I use?
What font should I use?
What shell should I use?
Configuring your shell
Configuring your prompt
Shell configuration frameworks
Summary
Going further

Customizing your shell

It is very common for programmers to tweak and customize their terminal and shell for hours, add or write new plug-ins, all in pursuit of the “perfect environment” and an increase of productivity. Others, on the contrary, avoid tweaking their shell altogether in order to always get the same experience on every machine.

On a personal note, I tend to favor having a personalized shell as much as possible. I feel that sharing files between different computers is now a solved issue, and the benefits I get from having personalized my work environments are so great that I gladly pay the small price of synchronizing that configuration between my computers.

In this chapter, we will learn more about the shell and how to configure your terminal environment to make it work for you. Please note that some of the recommendations come from personal taste, and might not work for you nor suit you. We encourage you to explore and find what feels right, but we hope to at least nudge you in the right direction.

Which terminal should I use?

First off, if you are new to using the terminal, you might not have realized that multiple terminal applications exist. macOS comes with Terminal pre-installed, most Linux distributions come with either xterm, Gnome-terminal or Konsole pre-installed, and there is a vast number of available alternatives.

I don't think there is a good, absolute and definitive answer when it comes to picking the “right” terminal application. You might get various answers depending on who you ask. That being said, I can at least mention my own personal recommendations and preferences.

Whatever terminal you end up using, I think that it is really important you configure it to your liking and preferences. As a programmer, you will probably spend a great deal of time in your terminal, and for you to feel productive and empowered, it needs to work for you.

Terminator

If you are running Linux, I personally favor Terminator¹ over the default choices. It has several features I find useful:

a tab system, allowing you to have multiple tabs of terminals within the same window
a grid system, allowing you to have multiple terminals in the same tab

I can work in multiple panes within the same tab, and have one tab per project

Here are the terminator keyboard shortcuts I find the most useful:

Shortcut	Action
`Ctrl` - `Shift` - `E`	split the screen vertically
`Ctrl` - `Shift` - `O`	split the screen horizontally
`Ctrl` - `Shift` - `T`	open a new tab
`Ctrl` - `PageDown`	switch to the next tab
`Ctrl` - `PageUp`	switch to the previous tab
`Ctrl` - `N`	open a new window
`Ctrl` - `Shift` - `+`	zoom in
`Ctrl` - `Shift` - `-`	zoom out
`Ctrl` - `D`	close the current terminal

iTerm2

As far as macOS is concerned, I find the default terminal (plainly named Terminal) to be hard to use. The terminal that seems to be widely accepted by the macOS programming community is iTerm2². It has all of the features cited above, and many (many) more!

iTerm2 looks similar to Terminator but can do much, much more

The iTerm2 keyboard shortcuts I find the most useful are:

Shortcut	Action
`Cmd` - `D`	split the screen vertically
`Cmd` - `Shift` - `D`	split the screen horizontally
`Cmd` - `T`	open a new tab
`Cmd` - `Shift` - `+`	zoom in
`Cmd` - `Shift` - `-`	zoom out
`Cmd` - `N`	open a new window
`Ctrl` - `D`	close the current terminal

The following sections go over some non-default iTerm2 settings that I find convenient. Again, these are my preference and are in no way prescriptive. Feel free to discard them if you want.

Open file shortcut

One of the iTerm2 features I enjoy is the ability to use Cmd + mouse click on a file path or a URL, to open the resource with the default associated program. For example, it will open a URL in your browser, a path to a local PDF file with Preview, a text file with your preferred text editor, etc.

By enabling this feature, you will be able to open a file using a graphical application from your terminal

Intuitive location for new terminals

Another tweak I've made to iTerm2 was changing the working directory new terminals open into by default. What I wanted was to:

open a new terminal window in my home directory
open a new terminal tab in my home directory
open a new terminal split pane in the previous session's directory

I did this because I oftentimes found myself splitting the current tab when I want to run multiple commands within the same project, and I had to cd into the project directory every time I did a pane split.

I reduced the time I spent cd-ing into project directories with these settings. Preferences > Profiles > General > Working Directory > Advanced Configuration > Edit

What font should I use?

Using a font you enjoy is paramount. If you spend a lot of time reading and writing in your terminal, you might as well do it using a font that feels right to you.

I personally really enjoy the Fira Code³ font, both in my text editor and my terminal. Not only is it easy on the eye, but it also contains a set of ligatures for multi-character combinations, such as != being rendered as a single glyph, allowing you to read code and decode symbols more easily.

Example of rendered character ligatures

Note that not all terminals support fonts with ligatures. For example, iTerm2 does but Terminator does not.

While Fira Code has my preference, there are other well-designed fonts including ligatures, such as JetBrains Mono.⁴

What shell should I use?

We have hinted at it until now: bash is not the only shell out there. You are free to use other shells if you want, such as zsh, fish, nushell, … As was the case with terminals, the “good” shell really depends on your definition of “good”. If you deeply care about using the same shell on every machine you work on, then bash is possibly for you. It has been around since 1989, is stable, mature and is the default shell on almost⁵ every UNIX system out there.

When researching this book, I was surprised to learn that zsh (or the Z-shell) wasn't really the new “kid on the block” either, as it was first released in 1990, just a year after the first stable bash release! You can expect the same level of stability, maturity and even syntax (to a large extent, except when it comes to configuration) as bash.

I personally think zsh really shines by providing a powerful default auto-completion experience, as well as more configuration options. As zsh is largely compatible with bash's own syntax, I encourage you to try both until you feel comfortable with one or the other.

The fish⁶ shell takes a radical turn from bash or zsh by providing an incompatible but “simple and clean” syntax, an extremely powerful command suggestion system, and an interactive configuration wizard.

If you are getting started with using the shell, my personal recommendation is to stick to bash or zsh and experiment with other shells to see what value they bring once you feel more confident.

Changing your default shell

The chsh (standing for change shell) command allows you to change your default shell.

Examples:

# Switching to bash by default
$ chsh -s /bin/bash

# Switching to zsh by default
$ chsh -s /bin/zsh

Once you have run chsh, any new terminal window you open will run your new default shell.
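
Before switching, it can help to check which shell is currently your default, and whether the new one is registered on your system. On many systems, chsh expects the new shell to be listed in /etc/shells; treat that as an assumption to verify on your own machine. The is_registered_shell helper and the stand-in list file are made up for this sketch.

```shell
# Succeeds when the shell path given as $1 appears, as a whole line, in the
# list file given as $2 (/etc/shells when omitted).
is_registered_shell() {
    grep -qxF -- "$1" "${2:-/etc/shells}"
}

echo "current default shell: ${SHELL:-unknown}"

# Demonstration against a stand-in list, so the sketch behaves the same everywhere.
shell_list=$(mktemp)
printf '%s\n' '# valid login shells' /bin/sh /bin/bash /bin/zsh > "$shell_list"

if is_registered_shell /bin/zsh "$shell_list"; then
    echo "/bin/zsh is registered: chsh -s /bin/zsh should be accepted"
fi
```

On a real machine, calling is_registered_shell /bin/zsh without a second argument checks /etc/shells itself.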

Configuring your shell

Up until now, every example we have seen has defined environment variables, aliases and functions directly in the shell. However, if we closed that shell, all of these changes would be undone and we would have to start again the next time we open a new one. Fortunately, all of these settings can be persisted in a configuration file. Adding aliases, environment variables and functions to that file will make sure they get imported every time you open a new shell.

These files usually reside in your home directory, and are named .bashrc for bash, and .zshrc for zsh.

$ cat ~/.zshrc
export EDITOR=vim
export PATH=$HOME/bin:$PATH

alias ls='ls -G'
alias ..='cd ..'
alias ...='cd ../..'

function mkcd {
    local target=$1
    mkdir -p "$target"
    cd "$target"
}

After adding anything to your shell configuration file, you need to run source ~/.zshrc (or source ~/.bashrc, depending on your shell). The source built-in command reads and executes commands from the argument file name in the current shell environment. Said in another way, running source ~/.<file> will cause the shell to reload its configuration.

rc stands for run commands. Indeed, when you source your configuration file, you will run the commands it contains. The subtlety with source is that it executes the argument script within your current shell, meaning any sourced commands will have a side-effect on your running shell.
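
The difference between running a file and sourcing it can be observed with a temporary file standing in for your configuration file. The fake_rc file and the greet function below are made up for this sketch, which also assumes bash is installed.

```shell
# A stand-in for ~/.bashrc or ~/.zshrc, written to a temporary file.
fake_rc=$(mktemp)
cat > "$fake_rc" <<'EOF'
export EDITOR=vim
greet() {
    echo "hello from $1"
}
EOF

# Running the file as a separate program leaves the current shell untouched...
bash "$fake_rc"
type greet >/dev/null 2>&1 || echo "greet is not defined yet"

# ...whereas sourcing it executes its commands in the current shell.
source "$fake_rc"
greeting=$(greet "the sourced file")
echo "$greeting"
```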

If you can never remember a given command's options, or if you always find yourself typing a group of commands, I encourage you to define aliases and functions in your shell configuration file. They will allow you to feel more productive day after day, especially so if the aliases and functions are abstracting complex commands.

The previous chapter ended with some real-life examples of aliases and functions. Feel free to add them to your shell configuration file.

Configuring your prompt

Configuring your prompt is a very good way to make the shell work for you as much as possible, by providing you with useful context, such as the time of day, whether the last command was successful, your current working directory… Keep in mind that whatever context your prompt displays will also be visible to anyone you share a copy-pasted command and its output with.

Configuring your prompt is done by changing the value of the PS1 environment variable.

$ export PS1="MY COOL PROMPT $"
MY COOL PROMPT $

I think we can agree that MY COOL PROMPT is not as informative as it could be, so let's change it to put our prompt to work. As prompt configuration works slightly differently between bash and zsh, we will address both cases in two different sections.

Configuring your bash prompt

The PS1 environment variable can be defined by using a mix and match of both regular and special characters. The regular characters are just displayed as-is, whereas the backslash-escaped special characters are interpreted by bash at the time PS1 is displayed and replaced by the associated value. The most useful special characters are defined as follows.

Character	Meaning
`\h`	The hostname up to the first dot
`\t`	The current time, in 24-hour HH:MM:SS format
`\u`	The current user's username
`\w`	The full current working directory (`$HOME` rendered as `~`)
`\W`	The basename of the current working directory (`$HOME` rendered as `~`)
`\n`	A new line

These special characters are evaluated every time the prompt is displayed to make sure you always get the most up-to-date context.

The PROMPTING section of the bash manual contains the full list of backslash-escaped special characters.

Examples

$ export PS1='\u@\h \W $'
br@morenika ~ $

$ export PS1='[\t] \u@\h \W $'
[13:33:55] br@morenika ~ $

$ export PS1='[\t \u@\h:\w]\n>>> '
[13:57:55 br@morenika:~/code]
>>>

You can use online tools such as ezprompt⁷ to try different configurations until you find something you like.

Whatever PS1 value you settle on should be persisted and exported in your .bashrc configuration file.
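
To preview a candidate prompt without touching your live PS1, bash 4.4 and later can expand a string the way the prompt would, through the ${variable@P} expansion. The sketch below assumes such a bash version; candidate and rendered are names picked for the example.

```shell
candidate='\u@\h \W $ '

# ${candidate@P} interprets the backslash-escaped characters as bash does for PS1.
rendered=${candidate@P}
echo "$rendered"
```

Once the rendered text looks right, the candidate string can be assigned to PS1 in your .bashrc.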

Configuring your zsh prompt

zsh exposes a few more options than bash when it comes to prompt configuration. Both the PS1 and PROMPT environment variables can be set to the same effect, if you find PROMPT more explicit.

Instead of being backslash-escaped, zsh's special characters are prefixed by %, and are called prompt sequences. The most useful are detailed here.

Sequence	Meaning
`%m`	The hostname up to the first dot
`%*`	The current time, in 24-hour HH:MM:SS format
`%n`	The current user's username
`%~`	The full current working directory (`$HOME` rendered as `~`)
`%1~`	The basename of the current working directory (`$HOME` rendered as `~`)
`%?`	The exit status of the last command executed
`%%`	A %
`$'\n'`	A new line
`%B (%b)`	Start (stop) bold font mode
`%F (%f)`	Start (stop) using a given foreground color, if supported by the terminal

You will find the full list of prompt sequences in the zsh documentation⁸.

Examples

$ export PROMPT='%n@%m %~ $ '
br@morenika ~/code $

$ export PROMPT='[%*] %n@%m %~ $ '
[23:38:12] br@morenika ~/code $

$ export PROMPT="[%* %n@%m %~]"$'\n'">>> "
[23:41:07 br@morenika ~/code/izk]
>>>

$ export PROMPT="[%* %n@%m %1~]"$'\n'"%% "
[23:41:52 br@morenika izk]
%

zsh goes even further by letting you define the content of a right-sided prompt, through the RPROMPT environment variable, which uses the same syntax as PROMPT.

Example

$ export PROMPT='%~ $ '; export RPROMPT='%*'
~/code $                                              21:04:00

To make sure your changes are persisted, PROMPT and RPROMPT should be exported in your .zshrc configuration file.
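
As an illustration, a .zshrc fragment combining several of the sequences listed above could look like the following sketch. It additionally relies on zsh's %(?.A.B) conditional, which prints A when the last command succeeded and B otherwise, and on %#, which prints # for a privileged shell and % otherwise. The layout itself is only a suggestion.

```shell
# Hypothetical ~/.zshrc fragment: the layout is only an example.
# %(?.A.B) prints A when the last command succeeded, and B otherwise.
PROMPT='%n@%m %1~ %(?.%F{green}ok%f.%F{red}exit %?%f) %# '
RPROMPT='%*'   # current time, displayed on the right-hand side
```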

Adding Colors

Adding color is a good way to spice up your prompt as well as providing some visual context. You can use color to indicate whether you are running with super-user privileges, if the last command succeeded or failed, or simply colorize each individual part of your prompt (username, hostname, etc) in a different way to make it even simpler to parse.

Adding color to your bash prompt

Bash allows you to style elements of your prompt by using 3-bit ANSI⁹ codes defining a zone associated with a potential effect, foreground color and background color.

Each effect, background or foreground color has an associated code, described in the following tables. The combination of these parameters is called Select Graphic Rendition, which is defined as a semicolon (;) separated list of codes.

Effect	ANSI Code
Normal	`0`
Bold	`1`
Faint	`2`
Italic	`3`
Underline	`4`
Strike through	`9`

Background Color	ANSI Code
Black	`40`
Red	`41`
Green	`42`
Brown	`43`
Blue	`44`
Purple	`45`
Cyan	`46`
White	`47`
Bright black	`100`
Bright red	`101`
Bright green	`102`
Bright brown	`103`
Bright blue	`104`
Bright purple	`105`
Bright cyan	`106`
Bright white	`107`

Foreground Color	ANSI Code
Black	`30`
Red	`31`
Green	`32`
Brown	`33`
Blue	`34`
Purple	`35`
Cyan	`36`
White	`37`
Bright black	`90`
Bright red	`91`
Bright green	`92`
Bright brown	`93`
Bright blue	`94`
Bright purple	`95`
Bright cyan	`96`
Bright white	`97`

Examples of SGRs

blue text: 34
bold green text: 1;32
purple text on a white background: 35;47
bold red text on a bright cyan background: 1;31;106
bold and struck-through brown text on a green background: 1;9;33;42

To define colorized zones in your bash prompt, use the following (granted, ugly) syntax:

\e[<SGR>mTEXT\e[m

Examples

$ export PS1='[\t] \u@\h \W \e[32m$\e[m '

The $ sign is now displayed in green

$ export PS1='\e[31m\u\e[m@\e[32m\h\e[m \e[36m\W\e[m $ '

The username is in red, the hostname in green and the path is in cyan.
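Typing these escape sequences by hand gets old quickly. The following is a minimal sketch of a helper that wraps a piece of text in a given SGR; the colorize name is my own invention, not something bash provides. The last line also shows the \[ and \] markers, which bash documents as delimiting non-printing characters in a prompt, and which help it compute the visible prompt length correctly.

```shell
# Hypothetical helper: wrap TEXT ($2) in the given SGR codes ($1), then reset styling.
# \033 is the octal form of the escape character that bash writes as \e.
colorize() {
    printf '\033[%sm%s\033[m' "$1" "$2"
}

colorize '1;32' 'bold green'; echo
colorize '35;47' 'purple on white'; echo

# In PS1, \[ and \] mark the color codes as non-printing, so that bash
# measures the visible length of the prompt correctly.
export PS1='\[\e[31m\]\u\[\e[m\]@\[\e[32m\]\h\[\e[m\] \[\e[36m\]\W\[\e[m\] $ '
```

The PS1 value is the same red/green/cyan prompt as in the previous example, only with the markers added.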

Color palettes

Notice how an ANSI code only maps to a color name? That's because it is up to your terminal to interpret and render that color name into an actual color, meaning that the same prompt configuration could be rendered differently on two different terminals.

Mapping ANSI color names to actual RGB colors is done through what is called color palettes.

Following are two different color schemes, as well as the associated rendered prompt, both using the same PS1 value, used in the previous example.

The popular Solarized Dark color scheme

The Pastel (Dark Background) color scheme

As you can see, these both look quite different from the prompt displayed in the previous screenshot, even though the underlying prompt configuration is exactly the same. This means that, even if using 16 colors can feel limiting, you actually can map these colors to any color you like. The ANSI color system just prevents you from having more than 16 different colors in your prompt.

I recommend you have a look at the mbadolato/iTerm2-Color-Schemes¹⁰ project, showcasing popular color palettes and providing you with the configuration files allowing you to use them in many terminal applications (and not just iTerm2, contrary to what its name suggests).

Up to 256 colors

As computers eventually started to ship with 256-color graphics cards, an 8-bit ANSI code scheme was introduced, allowing the user to render 256 colors in their terminal, instead of 16.

The 8-bit ANSI code syntax for a foreground color is \e[38;5;nm (and \e[48;5;nm for a background color), where the colors associated with each value of n between 0 and 255 are represented in the following table¹¹.

The 8-bit ANSI code allows you to render more than the initial 16 available colors

Examples

# Using 8-bit (256 colors) ANSI codes
$ TIME="[\e[38;5;33m\t\e[m]"  # blue
$ USERNAME="\e[38;5;200m\u\e[m"  # pink
$ HOSTNAME="\e[38;5;139m\h\e[m"  # purple
$ WORKDIR="\W"  # no color
$ DOLLAR="\e[38;5;41m$\e[m"  # green
$ export PS1="${TIME} ${USERNAME}@${HOSTNAME} ${WORKDIR} ${DOLLAR} "

These ANSI codes sure are awful to read but they make for pretty colors

Not all terminals support 256 colors, but most of the modern ones should. To this day, GNOME Terminal, Konsole, Terminator, XFCE4 Terminal, iTerm2, Terminal (macOS) and tmux all support 256 colors.
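If you are not sure about your own terminal, a quick check is to print every color index in its own color and see what comes out. This is only a sketch, and show_256_colors is a name I made up for the occasion.

```shell
# Hypothetical helper: print the 256 color indices, each one rendered in its
# own foreground color, 16 per line.
show_256_colors() {
    local n=0
    while [ "$n" -le 255 ]; do
        printf '\033[38;5;%dm%4d\033[m' "$n" "$n"
        n=$((n + 1))
        # Break the line after every 16th color
        if [ $((n % 16)) -eq 0 ]; then
            printf '\n'
        fi
    done
}

show_256_colors
```

A terminal without 256 color support will typically print the numbers uncolored, or with many of them sharing the same color.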

Contrary to the 3-bit ANSI codes, the 8-bit codes are insensitive to color scheme changes, as shown in the following examples, both re-using the same PS1 configuration as in the previous screenshot.

The colors remain unchanged

Adding color to your zsh prompt

Everything we've explained in the previous section is still valid for zsh: you can use 3 or 8 bit ANSI color codes just fine. However, zsh also provides you with a much easier and more readable color system:

each color can be represented as either black, red, green, yellow, blue, magenta, cyan or white, or a number between 0 and 255
%F{color}Text%f: changes the Text foreground color to color
%K{color}Text%k: changes the Text background color to color
%BText%b: displays Text in boldface
%UText%u: underlines Text

Example

The current working directory in blue and the dollar sign in bold pink
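A prompt matching that description could look like the following. Take it as a sketch: the exact value is my assumption, and 200 is simply the 8-bit pink used earlier for the bash username.

```shell
# zsh only: working directory in blue, dollar sign in bold pink.
# 200 is an 8-bit color index; any value between 0 and 255 works.
export PROMPT='%F{blue}%~%f %B%F{200}$ %f%b'
```

Each opening sequence (%F, %B) is closed by its lowercase counterpart (%f, %b), which keeps the styling from leaking into the commands you type.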

Displaying dynamic data in the prompt

We can make our prompt display dynamic context to make it even more informative. To do this, we can execute a function as part of our PS1 environment variable. The shell will call that function every time it renders the prompt.

The idea is to be able to have as much information as possible in your prompt at the ready, but only when necessary.

Displaying dynamic data in bash

Let's say that we want to colorize the $ of our prompt in green if the last command was successful, and in red if it failed. We can wrap that logic into the following colorized_prompt bash function, and have it called every time PS1 is rendered by including $(colorized_prompt) in the environment variable.

The $(colorized_prompt) syntax means "call the colorized_prompt function", and will be expanded into the output of the function (what it prints), which will contain ANSI color codes colorizing the prompt.

function colorized_prompt {
    # Check if last command exit code equals 0
    if (($? == 0)); then
        printf "\e[32m$\e[m"
    else
        printf "\e[31m$\e[m"
    fi
}
export PS1='[\t] \W $(colorized_prompt) '

$? is a special bash parameter that expands to the exit status of the previously executed command. The norm is to have an exit status of 0 if the command executed successfully, and any other exit status indicates an error.

$ pwd
/home/br
$ echo $?
0
$ cmdnotfound
bash: cmdnotfound: command not found
$ echo $?
127

The syntax if (($? == 0)); then thus translates to “if the last command executed successfully, then…”.

The prompt is green after a successful command and red after a failed one
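The same mechanism can surface the exit status itself, in the spirit of only showing context when necessary. Here is a minimal sketch, where failed_status is a hypothetical name of mine. It is written in portable sh syntax, and accepts an explicit status as its first argument purely so it can be tried out by hand.

```shell
# Hypothetical helper: print the exit status in red, but only when it is
# non-zero. An explicit status can be passed as $1, otherwise $? is used.
failed_status() {
    # $? must be read before anything else runs, or it gets overwritten.
    local code="${1:-$?}"
    if [ "$code" -ne 0 ]; then
        printf '\033[31m[%d]\033[m ' "$code"
    fi
}

export PS1='[\t] \W $(failed_status)$ '
```

With this prompt, nothing extra is displayed after a successful command, and a red [127] appears right before the $ after a command that was not found.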

Displaying dynamic data in zsh

Dynamic data can be injected in your prompt the same way as in bash, by executing functions at rendering time. zsh however provides you with ternary conditionals, that is to say expressions that either evaluate to one value or the other depending on a condition, to reach the same goal. A ternary conditional has the following syntax:

%(<condition>.<success value>.<failure value>)

If the condition is true, then the expression is evaluated to the success value. On the other hand, if the condition is false, the expression will be evaluated to the failure value.

You can read a ternary conditional as if condition, then, else. It's actually a common pattern called ternary expression you might encounter in many programming languages.

Here is a list of useful built-in conditions provided by zsh.

Condition	Meaning
`n?`	True if the previous command exited with the exit status n
`nd`	True if the day of the month is equal to n
`nw`	True if the day of the week is equal to n (Sunday = 0).
`!`	True if the shell is running with super-user privileges (as the `root` user)

Examples

$ export PROMPT='%F{%(0?.green.red)}$ %f'

Displays a dollar prompt in green if the last command was successful, or red if it failed

$ export PROMPT='%* %1~ %(!.#.$) '

Displays a dollar sign if you are running as your regular user, and a hash if you are running in super-user mode

The full list of ternary conditionals is available in the zsh documentation¹².

Adding emoji to your prompt

Modern terminals support non-ASCII characters, such as emoji. Like colors, they can be convenient to convey information in a very space-efficient fashion.

For example, during the process of writing this book, I displayed the associated total word count in my prompt to keep me motivated. That word count would however only be displayed when I was located in the root directory of the project, in the spirit of only displaying context when necessary.

Shell configuration frameworks

Up until now, we have seen how to tailor your prompt by adding colors, context and dynamic information computed on-the-fly. While you can certainly spend hours customizing it up to “perfection” (trust me, I have been there…), you can also take another route and benefit from other people's work, using a shell configuration framework.

These frameworks provide you with a large choice of prompt themes, helpers, options, additional command auto-completions, plug-ins, and are regularly updated by a community of developers around the world.

To this day, the most famous zsh configuration frameworks are Oh My Zsh¹³ and Prezto.¹⁴ While we can't fully attribute zsh's success to them (Oh My Zsh was first released around 2010, 20 years after zsh's first release), they certainly have helped in driving community adoption in the last couple of years¹⁵.

Comparison of Google Trends associated with zsh and Oh My Zsh

We will introduce you to the concepts behind Oh My Zsh, but it will then be up to you to explore, and select a theme as well as plug-ins you like (or even not use them at all!). After all, it is your development environment, and hence, your choice.

bash has a similar framework, inspired by Oh My Zsh, called bash-it.¹⁶ We won't cover it in detail but we encourage you to look at it if you don't feel like using zsh but still want to use a configuration framework.

Oh My Zsh

Quoting the official website,

Oh My Zsh is a delightful, open source, community-driven framework for managing your Zsh configuration. It comes bundled with thousands of helpful functions, helpers, plug-ins, themes, and a few things that make you shout…

To install it, run the following command in a shell, which will download an installation script, and run it on your computer.

$ sh -c "$(curl -fsSL https://raw.github.com/ohmyzsh/ohmyzsh/master/tools/install.sh)"

Once the script has finished running, you should see a message stating that Oh My Zsh has been installed, and that plug-ins, themes and options should be enabled by changing the configuration living under ~/.zshrc.

Before we do, let's inspect our environment variables, to see how Oh My Zsh configures itself.

$ printenv | grep ZSH
ZSH=/home/br/.oh-my-zsh

That ZSH environment variable points to the Oh My Zsh installation directory. The framework also injected a couple of other variables defining specific configuration values.

$ set | grep ZSH
ZSH=/home/br/.oh-my-zsh
ZSH_ARGZERO=zsh
ZSH_CACHE_DIR=/home/br/.oh-my-zsh/cache
ZSH_COMPDUMP=/home/br/.zcompdump-morenika-5.7.1
ZSH_CUSTOM=/home/br/.oh-my-zsh/custom
ZSH_EVAL_CONTEXT=toplevel
ZSH_NAME=zsh
ZSH_PATCHLEVEL=zsh-5.7.1-0-g8b89d0d
ZSH_SPECTRUM_TEXT='Arma virumque cano Troiae qui primus ab oris'
ZSH_SUBSHELL=1
ZSH_THEME=robbyrussell
ZSH_VERSION=5.7.1

Picking a theme

We can see that the default theme is robbyrussell (Robby Russell¹⁷ is the creator of Oh My Zsh). The full list of available themes is available online¹⁸, along with screenshots.

You can also get the list by running the following command, as all themes are defined in $ZSH/themes.

$ ls -1 $ZSH/themes | sed 's/\.zsh-theme$//'
3den
adben
af-magic
afowler
...

I suggest you scroll through the themes wiki, or simply pick a theme at random from the previous command output, edit your ~/.zshrc configuration file by updating the value of the ZSH_THEME variable, and run source ~/.zshrc to reload it. That will get you a whole new shell theme!

Feel free to rinse and repeat until you find a theme that suits you. If none of the built-in themes appeals to you, you can also explore the external theme wiki¹⁹. If you find an external theme you like, download its associated .zsh-theme file, place it under $ZSH/themes, then edit ~/.zshrc and update ZSH_THEME accordingly.

If you want to further personalize a theme using some of the techniques we covered in this chapter, I'd advise you to copy it and maintain a separate version, as your tweaks might get overridden at the next theme update.

Export the ZSH_CUSTOM environment variable to $ZSH/custom, then run the following commands.

$ mkdir -p $ZSH/custom/themes
$ cp $ZSH/themes/$ZSH_THEME.zsh-theme $ZSH/custom/themes/$ZSH_THEME-custom.zsh-theme

Then add ZSH_THEME=<old zsh theme>-custom to your ~/.zshrc.

Useful configuration options

Oh My Zsh has a couple of options you can enable or disable by editing ~/.zshrc. I suggest you take a look at them and choose what to activate. Here are some personal recommendations.

Automatic command correction

zsh can suggest a command correction if it detects a mistyped command. To enable the automatic command correction, add ENABLE_CORRECTION='true' to ~/.zshrc.

$ sl
zsh: correct 'sl' to 'ls' [nyae]? y
Android                bin   Desktop    Downloads  Firefox_wallpaper.png  Pictures
AndroidStudioProjects  code  Documents  Dropbox    Music                  Videos

The 4 options are:

n (no): run the mistyped command
y (yes): run the suggested command
a (abort): stop and do nothing
e (edit): edit your command before re-running it

zsh's auto-correction feature can sometimes be over-zealous and is not to everyone's liking²⁰. If you end up repeatedly fighting it for a given command (e.g. git status wrongly autocorrected to git stats), you can define an alias for that command, prefixing it with nocorrect.

alias git='nocorrect git'

Automatic Oh My Zsh updates

To make sure you regularly get new plug-ins and bug fixes, Oh My Zsh can automatically and regularly update itself. To do so, set the following options in ~/.zshrc:

DISABLE_UPDATE_PROMPT=true: update Oh My Zsh without asking for confirmation
UPDATE_ZSH_DAYS=30: update Oh My Zsh every 30 days

Add plug-ins

Oh My Zsh comes with more than 250 plug-ins, each of them either defining aliases or improved auto-completion for a given set of commands. Refer to the Oh My Zsh wiki page²¹ to see the full list of available plug-ins. To enable a given plug-in, add its name to the plugins list in ~/.zshrc, then run source ~/.zshrc.

Example:

- plugins=(git)
+ plugins=(git python)

If you regularly use a command listed in the plug-in wiki page, you should probably try to enable the associated plug-in! I however suggest enabling the following general-purpose plug-ins.

common-aliases²²: Collection of useful aliases, not enabled by default since they may change some user defined aliases
colored-man-pages: colorize man pages

Colorized man pages are much easier to read!

extract: define an extract alias that can extract any type of archive (.zip, .tar.gz, .bzip, etc)²³

The following plug-ins are not provided by default, but I find them so useful that I suggest you install them and give them a try.

zsh-autosuggestions²⁴: emulate the fish autosuggestion by suggesting commands as you type them, saving you from using Ctrl-R to look into your shell history. Any suggestion can be accepted by hitting → or ignored by just continuing to type.

I just typed ls and I immediately get a completion suggestion

Suggestion accepted!

zsh-syntax-highlighting²⁵: provide syntax highlighting within the zsh command line. It also colorizes the name of the command you type in green if it is found, and in red if not.

ls is a valid command

cmdnotfound is not

Uninstalling Oh My Zsh

If you find that Oh My Zsh isn't for you, you can uninstall it by running the uninstall_oh_my_zsh function. Your previous configuration will be restored.

Summary

I strongly believe that learning how to configure and personalize your own shell is an important part of becoming a developer. I'd even go as far as calling it a ritual. On a personal level, it helped me overcome the almost mystic reputation of the terminal by making it my own.

Configuring your shell might never really be fully completed. Do you find yourself executing a long command repeatedly? Make it an alias. If an alias does not cut it, or if it should take arguments, write a shell function instead. Are you oftentimes wondering on which branch, project or profile you are currently running? Add it to your prompt. If your prompt starts to feel a little crowded, you might be able to condense it by using colors and emoji.

Making your own tools and customizing your shell is an investment, but it is also an inherent part of being a software developer, which will allow you to do more, faster, and will help you feel more at home in your shell. It's also quite a bit of fun!

Going further

4.1: Look into your terminal's preferences and try to change the color scheme, or remap ANSI colors to different RGB colors.

4.2: Try different fonts, such as Source Code Pro, Fira Code, Inconsolata or JetBrains Mono, and pick the one you like most.

4.3: Explore your terminal preferences, and experiment with different settings.

4.4: Try to change the colors of the different sections of your prompt

The shell's building blocks

2020-04-04T00:00:00+02:00

Environment variables
Aliases
Functions
Real life examples
Summary
Going further

The shell's building blocks

As we have seen in the previous chapters, the shell is a program allowing you to run other programs. It is an invaluable tool in the life of a software engineer, as it provides you with a simple text-based interface to control your computer and any program you might install or write.

Something I still find striking after years of using a shell almost daily is how simple yet powerful its building blocks are.

Chapter 1 covered commands, I/O streams and pipes. This chapter will cover environment variables, aliases and functions.

Environment variables

Environment variables are key/value pairs that affect how running programs behave. Another way to say that would be that environment variables can allow you to tweak and personalize how certain programs, amongst which your shell, work. They can also define what programs will be called to perform a certain task.

Here are a few examples:

SHELL defines what shell your terminal runs (/bin/bash, /bin/zsh, /bin/fish, etc)
HOME defines where your home directory is located
EDITOR defines what text editor program should be used to edit text within your terminal (eg nano, vim, emacs, etc)

Displaying an environment variable's value

To display the value of a given environment variable, you can use the echo command, followed by a dollar sign and the name of the variable:

$ echo $SHELL
/bin/zsh

You can use the printenv command to list all environment variables along with their value.

$ printenv
USER=br
HOME=/home/br
LC_TERMINAL=terminator
SHELL=/bin/zsh
EDITOR=vim
PWD=/home/br/
PAGER=less

For the sake of brevity, I've only displayed a subset of the environment variables defined on my computer. These variables tell the following story:

my username is br
all my personal data is stored in my home directory, located at /home/br
my default terminal is called terminator
and whenever I open terminator, it runs the commands via the zsh shell
my default text editor is vim
I am currently located in my home directory
my default pager program is less

Changing an environment variable

What is interesting about these environment variables is that they can be changed, and with them, the behavior of other programs.

For example, let's change the value of our HOME environment variable, defining where our home directory is.

$ HOME=/tmp
$ cd
$ pwd
/tmp

In the first line, I redefined the value of my HOME environment variable from /home/br to /tmp. Remember when you learned that running cd without arguments would take you back to your home directory? Well, it's actually using the HOME environment variable to figure out where your home directory is. Now that HOME has changed, so has cd's behavior.

Another example is PAGER. We saw that my environment had PAGER=less defined by default, which explains why you find yourself reading text within less when you open a man page. man fetches the actual documentation and displays it in a pager, which itself is specified by the PAGER environment variable. If you were to change that variable to something else, like more or bat,¹ it would then change man's behavior.

There is a difference between SHELL and $SHELL. The former is the name of an environment variable, and the latter represents its value. Consequently, when we executed echo $SHELL, we told our shell to look up what value was associated with the SHELL environment variable, and then display it on the screen via the echo command. $ is what we call a dereference operator in that context.

Defining new variables

Not only can you change an existing environment variable, but you can also define a new one. If a non-existing variable is echo-ed, it will simply be replaced by an empty string.

$ echo $NEW_VAR

$ NEW_VAR=my-new-env-var
$ echo $NEW_VAR
my-new-env-var

If you define an environment variable this way, it will only be visible to the shell itself, but not to any command executed by your shell (also called subprocesses). To make an environment variable visible to a subprocess, you need to define it with the export keyword.

To illustrate that, we will create our first shell script: a program executing shell commands one after the others.

$ cat <<'EOF' > echo_var.sh
echo $NEW_VAR
EOF
$ cat echo_var.sh
echo $NEW_VAR

As you can see, the echo_var.sh script only contains one shell command: echo $NEW_VAR.

To execute that bash script, we can run bash echo_var.sh, and all instructions within that script will be executed by bash. Let's have a look at what executing that script displays on the screen with and without export-ing that variable.

$ NEW_VAR=my-new-var
$ echo $NEW_VAR
my-new-var
$ bash echo_var.sh

$ export NEW_VAR=my-new-var
$ echo $NEW_VAR
my-new-var
$ bash echo_var.sh
my-new-var

As you can see, the echo_var.sh subprocess can see the NEW_VAR environment variable after it has been export-ed by its parent shell.

This can be very useful if you write programs: some parameters can have a sane default value but can also be overridden by specifying an environment variable. grep does this for example: reading the grep man page, we see:

GREP_OPTIONS May be used to specify default options that will be placed at the beginning of the argument list.
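You can apply the same pattern in your own scripts with the ${VAR:-default} syntax, which expands to the value of VAR if it is set and non-empty, and to the default otherwise. The following is a minimal sketch: greet and GREET_NAME are made-up names used for illustration only.

```shell
# Hypothetical example: GREET_NAME is a made-up variable name.
greet() {
    # Use $GREET_NAME if it is set and non-empty, otherwise fall back to "world"
    echo "hello ${GREET_NAME:-world}"
}

unset GREET_NAME
greet                  # prints: hello world

export GREET_NAME=br
greet                  # prints: hello br
```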

Removing environment variables

You can remove an environment variable by using the unset keyword:

$ unset NEW_VAR
$ bash echo_var.sh

$ echo $NEW_VAR

$

The case of `PATH`

Up to this point, we've executed commands in the shell, and things happened. It was a simple world and it was nice. You might wonder what would happen if I gave the shell a non-existent command though? Well, I'm glad you asked. Ten points for Gryffindor.

$ cmdnotfound
zsh: command not found: cmdnotfound

The cmdnotfound command, like its name implies, is not found. But what makes a command be found then? What makes the shell happily comply when we type ls, and makes it complain when we type cmdnotfound? It turns out that this is due to an environment variable called PATH, listing all directories in which executable programs can be found.

$ echo $PATH
/home/br/bin:/home/br/.local/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/usr/local/games

This means that for any command passed to the shell, it will look into these directories (separated by a colon) in search of the program I'm trying to run.

For example, if I type

$ ls

into my shell, it will look into /home/br/bin, /home/br/.local/bin, /usr/local/sbin, etc, until it finds it in /bin.

$ ls /bin
...
chmod      dd         ed        ls         ps         sh         tcsh       zsh

If the command is not found in any of the directories listed in PATH, then it is not found.

This means that you can also redefine PATH to force your shell to look into new directories. In fact, this is exactly what I've done to make it look into /home/br/bin, where I store tools of my making.
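To see this in action without touching your real setup, here is a small sketch that creates a throwaway directory, drops an executable script in it, and prepends that directory to PATH. The hello-tool name and the temporary directory are assumptions made for the sake of the example.

```shell
# Create a throwaway directory holding one executable script.
# hello-tool is a made-up name used for illustration only.
tools_dir="$(mktemp -d)"
printf '#!/bin/sh\necho "hello from my tool"\n' > "$tools_dir/hello-tool"
chmod +x "$tools_dir/hello-tool"

# Prepend the directory to PATH: the shell now searches it first.
export PATH="$tools_dir:$PATH"

hello-tool             # prints: hello from my tool
command -v hello-tool  # prints the full path of the script
```

Directories are searched from left to right, so a directory placed at the front of PATH takes precedence over the system ones, while one appended at the end only serves as a fallback.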

You can mess up your shell by running unset PATH.

$ unset PATH
$ ls
zsh: command not found: ls

However, calling a command by using its absolute or relative path still works, as PATH is only used to look for commands that have been invoked by name only.

$ unset PATH
$ /bin/ls
bin    code    Documents  ..
...

There is a useful command you can use to know what a program is and where it is found: which.

$ which less
/usr/bin/less
$ which bash
/usr/local/bin/bash
$ which ls
ls: aliased to ls -G

Wait. What? What's an alias?

Aliases

An alias allows you to define custom commands. In the previous example, running ls would actually run ls -G, which enables colorized output.

You can define an alias by using the alias keyword.

$ alias ls='ls -G'

There are a couple of reasons you might want to define aliases:

redefining a command's behavior (ex: always using ls with the -G option)
shortening a command's name to make it quicker to type (ex: alias ..='cd ..')
creating new commands altogether (ex: alias filesize='ls --size --human-readable -1')

To see the underlying command that will be executed by an alias, you can type alias <name>.

$ alias filesize='ls --size --human-readable -1'
$ alias filesize
alias filesize='ls --size --human-readable -1'

Aliases are very simple yet powerful. They allow you to customize your shell to your liking, create new commands without having to remember a lot of options, and decrease the time you spend typing, all of which should make you feel more productive.

Aliases can be “nested”. If you define ls as an alias of ls -G and filesize as an alias of ls --size --human-readable -1, your shell will unwrap both aliases and execute ls -G --size --human-readable -1 when you type filesize.

When we're executing filesize bin, the shell will see that filesize is an alias for ls --size --human-readable -1 and will actually execute the command ls --size --human-readable -1 bin behind the scenes. This is simply done by replacing the alias with its definition in the command itself. Aliases can however fall short if we want to do something more complex than this.

For example, one of my favorite productivity tools is mkcd, which creates a directory and steps into it right after. It saves you from typing

$ mkdir new-dir
$ cd new-dir

when you could just type

$ mkcd new-dir

An alias can't really help here, because we are talking about aliasing two commands with a single alias, which does not work. Enter functions.

Functions

According to the bash man page:

A shell function is an object that is called like a simple command and executes a compound command with a new set of positional parameters.

Let's see what that looks like in practice. A function is declared this way.

function name {
    # ...
}

If your function is expecting arguments, these can be accessed by using $n where n is a number. For example, $1 is the first function argument, $2 its second argument, etc. With that in mind, we can now declare our mkcd function.

function mkcd {
    mkdir -p "$1"
    cd "$1"
}

Let's now see mkcd in action!

$ function mkcd {
    mkdir -p "$1"
    cd "$1"
}
$ pwd
/home/br
$ mkcd test
$ pwd
/home/br/test

You can use the typeset -f command to see how a function was defined (or which <function-name>, although that only works in zsh).

$ typeset -f mkcd
mkcd () {
    mkdir -p "$1"
    cd "$1"
}
}
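Functions can also contain a bit of logic, which aliases cannot. As an illustration, here is a slightly more defensive variant of mkcd. It is a sketch of mine rather than the version used in the rest of this chapter, and it is written in portable sh syntax.

```shell
# A more defensive variant of mkcd (a sketch, not the version defined above).
mkcd() {
    # Refuse to run without an argument, and explain how to call the function
    if [ -z "$1" ]; then
        echo "usage: mkcd <directory>" >&2
        return 1
    fi
    # Only step into the directory if it could be created
    mkdir -p "$1" && cd "$1"
}
```

Quoting "$1" means that mkcd "my project/src" works even though the directory name contains a space, and the && makes sure we don't cd anywhere if mkdir failed.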

Real life examples

These are some of the environment variables, aliases and functions I have defined for myself.

alias ..='cd ..'
alias ...='cd ../..'

Colorize commands output

alias ls='ls --color=auto'
alias grep='grep --color=auto'
alias ip='ip --color'

Alias commands I never remember

# https://xkcd.com/1168/
alias untar='tar -zxvf'

Have `$HOME/bin` be part of `PATH`

export PATH=$PATH:$HOME/bin

By extending my PATH this way, I can then put every single tool I create into $HOME/bin and have it be usable right away.

A backup function

function bak {
    cp -r "$1" "$1.bak"
}

This function can be used to back up a file or directory. I regularly use this when I'm about to edit a critical file and I want to make sure I can revert my changes if needed.

Password generation function

This function generates a password composed of alphanumeric characters, with a default length of 32.

$ function genpass {
    local passlen=${1:-32}
    # Note: LC_ALL=C is needed for macos compatibility
    LC_ALL=C tr -cd '[:alnum:]' < /dev/urandom | fold -w $passlen | head -n1
}
$ genpass
GQROc0tnABqfYH0qpMMwSPYFgcY7OANB
$ genpass 50
WkeQ14E8FIQZN7XlN7yPkYK4yhMOvpAuNzZivKwODNkskh0uq0

The weather in your terminal

function weather {
    curl "wttr.in/${1:-lyon}?m"
}

This function uses curl to send an HTTP request to the http://wttr.in website, which displays weather forecasts in a terminal-friendly way. So I can just type weather mycity and voilà:

$ weather lyon
Weather report: lyon

     \   /     Sunny
      .-.      17 °C
   ― (   ) ―   ↖ 6 km/h
      `-'      10 km
     /   \     0.0 mm
                                                       ┌─────────────┐
┌──────────────────────────────┬───────────────────────┤  Sat 04 Apr ├───────────────────────┬──────────────────────────────┐
│            Morning           │             Noon      └──────┬──────┘     Evening           │             Night            │
├──────────────────────────────┼──────────────────────────────┼──────────────────────────────┼──────────────────────────────┤
│    \  /       Partly cloudy  │    \  /       Partly cloudy  │    \  /       Partly cloudy  │    \  /       Partly cloudy  │
│  _ /"".-.     7..8 °C        │  _ /"".-.     13 °C          │  _ /"".-.     13 °C          │  _ /"".-.     10..11 °C      │
│    \_(   ).   ← 5-6 km/h     │    \_(   ).   ↙ 5 km/h       │    \_(   ).   ← 5-10 km/h    │    \_(   ).   ↖ 8-17 km/h    │
│    /(___(__)  10 km          │    /(___(__)  10 km          │    /(___(__)  10 km          │    /(___(__)  10 km          │
│               0.0 mm | 0%    │               0.0 mm | 0%    │               0.0 mm | 0%    │               0.0 mm | 0%    │
└──────────────────────────────┴──────────────────────────────┴──────────────────────────────┴──────────────────────────────┘
                                                       ┌─────────────┐
┌──────────────────────────────┬───────────────────────┤  Sun 05 Apr ├───────────────────────┬──────────────────────────────┐
│            Morning           │             Noon      └──────┬──────┘     Evening           │             Night            │
├──────────────────────────────┼──────────────────────────────┼──────────────────────────────┼──────────────────────────────┤
│     \   /     Sunny          │     \   /     Sunny          │     \   /     Sunny          │    \  /       Partly cloudy  │
│      .-.      10..12 °C      │      .-.      16 °C          │      .-.      14..15 °C      │  _ /"".-.     10..12 °C      │
│   ― (   ) ―   ↖ 14-18 km/h   │   ― (   ) ―   ↑ 23-27 km/h   │   ― (   ) ―   ↑ 15-25 km/h   │    \_(   ).   ↑ 13-26 km/h   │
│      `-'      10 km          │      `-'      10 km          │      `-'      10 km          │    /(___(__)  10 km          │
│     /   \     0.0 mm | 0%    │     /   \     0.0 mm | 0%    │     /   \     0.0 mm | 0%    │               0.0 mm | 0%    │
└──────────────────────────────┴──────────────────────────────┴──────────────────────────────┴──────────────────────────────┘
                                                       ┌─────────────┐
┌──────────────────────────────┬───────────────────────┤  Mon 06 Apr ├───────────────────────┬──────────────────────────────┐
│            Morning           │             Noon      └──────┬──────┘     Evening           │             Night            │
├──────────────────────────────┼──────────────────────────────┼──────────────────────────────┼──────────────────────────────┤
│     \   /     Sunny          │     \   /     Sunny          │     \   /     Sunny          │     \   /     Clear          │
│      .-.      12..13 °C      │      .-.      16 °C          │      .-.      14..15 °C      │      .-.      11 °C          │
│   ― (   ) ―   ↖ 18-22 km/h   │   ― (   ) ―   ↑ 22-28 km/h   │   ― (   ) ―   ↑ 14-24 km/h   │   ― (   ) ―   ↑ 8-16 km/h    │
│      `-'      10 km          │      `-'      10 km          │      `-'      10 km          │      `-'      10 km          │
│     /   \     0.0 mm | 0%    │     /   \     0.0 mm | 0%    │     /   \     0.0 mm | 0%    │     /   \     0.0 mm | 0%    │
└──────────────────────────────┴──────────────────────────────┴──────────────────────────────┴──────────────────────────────┘
Location: Lyon, Métropole de Lyon, Circonscription départementale du Rhône, Auvergne-Rhône-Alpes, France [45.7578137,4.8320114]

Summary

Environment variables, aliases and functions are simple yet powerful tools to change the shell's behavior into something that feels more intuitive. You feel like nano is not shiny enough and prefer using vim instead? Sure. Define EDITOR=vim. Any command interacting with an editor would then use vim instead of nano.

Aliases are a great way to reduce mental friction in the shell by hiding away complex commands, or just reducing the amount of typing you have to do. When aliases start being not powerful enough because you want to execute multiple commands, you can then have a look at functions instead.

Everything we have seen so far, however, had an ephemeral effect, as the changes you made would disappear when you closed your shell session. In the next chapter, we will dive into how to persistently configure your shell to improve your day-to-day experience and productivity.

Going further

3.1: Write a cat alias that displays meow on screen.

3.2: Write a restorebak function that takes a filename as only argument and renames $1.bak into $1.

3.3: Unset the PATH environment variable and then export it back so that you can use ls again.

https://github.com/sharkdp/bat ↩

My pizza recipe

2020-03-18T00:00:00+01:00

If you're reading this, do yourself a favor. Stop right there, and go to this post instead. I was young and foolish, and didn't know any better.

I've always enjoyed a good looking Neapolitan pizza. You know, the ones with the puffy, slightly burned crust. I have probably baked dozens of them during the last couple of years, but only recently did I become satisfied enough with my recipe to feel comfortable sharing it.

What you'll need

I don't think you need a professional oven, a dough mixer or a pizza stone to get nice results. Hear me out: I'm not saying you don't need any of these, but you certainly can do without and still enjoy a delicious slice.

This is what I use for this recipe:

a pizza tray
a tupperware
a scraper
a bowl
a tall glass

Ingredients

For 2 pizzas, you'll need:

400g of flour
6g of salt (I personally use kosher salt, although more out of habit than anything else)
220mL of water
6g of fresh baker's yeast (dried yeast works as well. I prefer it fresh.)
time

When it comes to the flour, the traditional flour is called Double zero (00) flour. This means that it's ground extremely fine. It's usually found in specialized Italian shops.

What's important to note here is that a flour is characterized by several things:

how fine it was ground
its protein content

Pizza dough needs a high amount of gluten in the flour (14% is recommended) to develop its elasticity, and you can have 00 flour with a very low amount of protein (think cake flour), which will probably give you disappointing results.

I've never managed to find 00 pizza flour close to my place, but I found a good organic T65 flour (French T65 flours are commonly used in bread making) with 12% of protein which gives good results.

Recipe

Petit levain

The first thing we're going to make is a petit levain: a mix of water, flour and yeast aiming at gently activating the yeast coming out of your fridge or freezer.

Mix 50g of flour, 100mL of water and the yeast. Stir with a wooden spoon, as the yeast isn't too fond of metal.

Autolysis

Mix the rest of the flour and water with the salt, and let it rest for half an hour, for an autolysis phase. During that phase, the water will get fully absorbed by the flour, which will make it easier to work with. The water will also start to degrade the flour into simpler sugars that will be more easily digested by the yeast when we add it, increasing the yeast activity.

In theory, we should add the salt after the autolysis process, but as I use kosher salt, I find it easier to add it at this step rather than working it into the dough afterwards.

Kneading

After having waited a good half-hour, incorporate the petit levain into the mix of flour, salt and water, using your dough scraper. It's ok if there is a bit of flour left in the bowl, it will get absorbed during the kneading. Wait for 10 to 15 minutes.

Pop the dough on your table (add a little bit of flour on the table if there isn't any left in the bowl), and knead it for about 10 minutes. Feel free to stop when the dough is pretty smooth and supple.

Put the dough in a closed container, and put it to rest in the fridge for 48 to 72h. Yep, you read that right. If you're thinking "I am my own person, I do what I want, and I want to eat that pizza tonight", sure, let the dough rest for about 2h on your kitchen counter instead, under a damp towel.

Letting your dough rest for a long time will give it time to ferment and develop flavors and aromas. Trust me, it will smell fantastic when you open the container.

Shaping

Take the container out of the fridge about 6h (if you can) before cooking time, and divide the dough into 2 even parts.

Letting the dough out early will give it time to "relax" and will make it easier to stretch.

Shape each half into a nice stretched ball, and put them in a closed container until you start cooking (or put the second one in the fridge if you're only planning to make one).

Stretching

When stretching your dough, you want to make sure to never use a rolling pin, and never touch the outside rim, otherwise you'll chase the air out and the crust won't expand as much as we'd want. I learned a ton from that super short video, which really helped me get that Neapolitan look I wanted.

Sprinkle flour on your pizza tray, and pop the dough on it.

Toppings

First off, add some tomato sauce. One important thing is to avoid putting too much, which would dampen the dough and make the pizza quite watery. I usually use 2 big spoons of sauce, and spread it in circles, starting from the center. Make sure to avoid touching the outside rim.

Pre-heat your oven to 250°C (480°F) with convection, if possible.

At that point, you're going to have to make choices. I won't tell you what you should put on your pizza, but I can describe my favorite one though, which has (put on the pizza in that order)

small onion slices, previously cooked in olive oil
shreds of mozzarella di bufala
some genovese pesto here and there
black olives
oregano

Drizzle the rim with olive oil, which will give it a nice golden color at cooking time.

Cooking

When your oven is hot, cook your pizza for about 8 minutes.

The rest is on you.

Text processing in the shell

2020-03-14T00:00:00+01:00

If you are interested in the project, we invite you to join the mailing list!

cat
head
tail
wc
grep
cut
paste
sort
uniq
awk
tr
fold
sed
Real-life examples
Going further: for loops and xargs
Summary
Going further

Text processing in the shell

One of the things that makes the shell an invaluable tool is the amount of available text processing commands, and the ability to easily pipe them into each other to build complex text processing workflows. These commands can make it trivial to perform text and data analysis, convert data between different formats, filter lines, etc.

When working with text data, the philosophy is to break any complex problem you have into a set of smaller ones, and to solve each of them with a specialized tool.

Make each program do one thing well.¹

The examples in this chapter might seem a little contrived at first, but this is also by design. Each one of these tools was designed to solve one small problem. They however become extremely powerful when combined.

We will go over some of the most common and useful text processing commands the shell has to offer, and will demonstrate real-life workflows piping them together. I suggest you take a look at the man pages of these commands to see the full breadth of options at your disposal.

The example CSV (comma-separated values) file is available online.² Feel free to download it yourself to test these commands.

`cat`

As seen in the previous chapter, cat is used to concatenate a list of one or more files and display their content on screen.

$ cat Documents/readme
Thanks again for reading this book!
I hope you are following so far!

$ cat Documents/computers
Computers are not intelligent
They're just fast at making dumb things.
$ cat Documents/readme Documents/computers
Thanks again for reading this book!
I hope you are following so far!

Computers are not intelligent
They're just fast at making dumb things.

`head`

head prints the first n lines in a file. It can be very useful to peek into a file of unknown structure and format without burying your shell under a wall of text.

$ head -n 2 metadata.csv
metric_name,metric_type,interval,unit_name,per_unit_name,description,orientation,integration,short_name
mysql.galera.wsrep_cluster_size,gauge,,node,,The current number of nodes in the Galera cluster.,0,mysql,galera cluster size

If -n is unspecified, head will print the first 10 lines in its argument file or input stream.

`tail`

tail is head’s counterpart. It prints the last n lines in a file.

$ tail -n 1 metadata.csv
mysql.performance.queries,gauge,,query,second,The rate of queries.,0,mysql,queries

If you want to print all lines in a file located after the nth line (included), you can use the -n +n argument.

$ tail -n +42 metadata.csv
mysql.replication.slaves_connected,gauge,,,,Number of slaves connected to a replication master.,0,mysql,slaves connected
mysql.performance.queries,gauge,,query,second,The rate of queries.,0,mysql,queries

Our file has 43 lines, so tail -n +42 only prints the 42nd and 43rd lines of our file.

If -n is unspecified, tail will print the last 10 lines in its argument file or input stream.
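
head and tail can also be combined to extract a range of lines from the middle of a file. Here is a minimal sketch, using a made-up five-line sample file created on the fly (the file and its content are assumptions, not part of the example CSV):

```shell
# Create a small sample file (hypothetical data, for illustration only).
sample=$(mktemp)
printf 'line1\nline2\nline3\nline4\nline5\n' > "$sample"

# Skip everything before line 2, then keep the first 3 lines: lines 2 to 4.
middle=$(tail -n +2 "$sample" | head -n 3)
echo "$middle"

rm -f "$sample"
```

tail -n +2 starts printing at the 2nd line, and head -n 3 stops after 3 lines, so only lines 2, 3 and 4 are displayed.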

tail -f or tail --follow displays the last lines in a file and displays each new line as the file is being written to. It is very useful to see real time activity that is written to a log file, for example a web server log file, etc.

`wc`

wc (for word count) prints either the number of bytes (when using -c), words (when using -w) or lines (when using -l) in its argument files or input stream.

$ wc -l metadata.csv
43  metadata.csv
$ wc -w metadata.csv
405 metadata.csv
$ wc -c metadata.csv
5094 metadata.csv

By default, wc prints all of the above.

$ wc metadata.csv
43     405    5094 metadata.csv

Only the count will be printed out if the text data is piped in or redirected into stdin.

$ cat metadata.csv | wc
43     405    5094
$ cat metadata.csv | wc -l
43
$ wc -w < metadata.csv
405

`grep`

grep is the Swiss Army knife of line filtering. It allows you to filter lines matching a given pattern.

For example, we can use grep to find all occurrences of the word mutex in our metadata.csv file.

$ grep mutex metadata.csv
mysql.innodb.mutex_os_waits,gauge,,event,second,The rate of mutex OS waits.,0,mysql,mutex os waits
mysql.innodb.mutex_spin_rounds,gauge,,event,second,The rate of mutex spin rounds.,0,mysql,mutex spin rounds
mysql.innodb.mutex_spin_waits,gauge,,event,second,The rate of mutex spin waits.,0,mysql,mutex spin waits

grep can either filter files passed as arguments, or a stream of text passed to its stdin. We can thus chain multiple grep commands to further filter our text. In the next example, we filter lines in our metadata.csv file that contain both the mutex and OS words.

$ grep mutex metadata.csv | grep OS
mysql.innodb.mutex_os_waits,gauge,,event,second,The rate of mutex OS waits.,0,mysql,mutex os waits

Let’s go over some of the options you can pass to grep and their associated behavior.

grep -v performs inverted matching: it only keeps the lines that do not match the argument pattern.

$ grep -v gauge metadata.csv
metric_name,metric_type,interval,unit_name,per_unit_name,description,orientation,integration,short_name

grep -i performs case-insensitive matching. In the next example grep -i os matches both OS and os.

$ grep -i os metadata.csv
mysql.innodb.mutex_os_waits,gauge,,event,second,The rate of mutex OS waits.,0,mysql,mutex os waits
mysql.innodb.os_log_fsyncs,gauge,,write,second,The rate of fsync writes to the log file.,0,mysql,log fsyncs

grep -l only lists files containing a match.

$ grep -l mysql metadata.csv
metadata.csv

grep -c counts the number of lines in which a pattern was found.

$ grep -c select metadata.csv
3
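
One thing to keep in mind is that grep -c counts matching lines, not individual matches. The sketch below illustrates the difference on a made-up three-line input, using grep -o (supported by both GNU and BSD grep, although it is not part of POSIX) to print each match on its own line:

```shell
# Hypothetical three-line input: the first line contains the pattern twice.
text=$(printf 'mutex then mutex\nno match here\nmutex\n')

# grep -c counts the lines that match.
matching_lines=$(echo "$text" | grep -c mutex)

# grep -o prints each match on its own line; wc -l then counts them.
# tr strips the padding that some wc implementations add.
occurrences=$(echo "$text" | grep -o mutex | wc -l | tr -d ' ')

echo "lines: $matching_lines, occurrences: $occurrences"
```

Two lines contain mutex, but the word itself appears three times.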

grep -r recursively searches files in the given directory and all subdirectories below it.

$ grep -r are ~/Documents
/home/br/Documents/computers:Computers are not intelligent
/home/br/Documents/readme:I hope you are following so far!

grep -w only matches whole words.

$ grep follow ~/Documents/readme
I hope you are following so far!
$ grep -w follow ~/Documents/readme
$

`cut`

cut cuts out a portion of a file (or, as always, its input stream). cut works by defining a field delimiter (what separates two columns) with the -d option, and what column(s) should be extracted, with the -f option.

For example, the following command extracts the first column of the last 5 lines of our CSV file.

$ tail -n 5 metadata.csv | cut -d , -f 1
mysql.performance.user_time
mysql.replication.seconds_behind_master
mysql.replication.slave_running
mysql.replication.slaves_connected
mysql.performance.queries

As we are dealing with a CSV file, we can extract each column by cutting over the , character, and extract the first column with -f 1.

We could also select both the first and second columns by using the -f 1,2 option.

$ tail -n 5 metadata.csv | cut -d , -f 1,2
mysql.performance.user_time,gauge
mysql.replication.seconds_behind_master,gauge
mysql.replication.slave_running,gauge
mysql.replication.slaves_connected,gauge
mysql.performance.queries,gauge
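
cut also accepts ranges. The following sketch runs on a single made-up line shaped like the rows of metadata.csv (the row variable is only there for illustration), and shows field ranges as well as the -c option, which selects character positions instead of fields:

```shell
# Hypothetical CSV line, shaped like the rows of metadata.csv.
row='mysql.performance.queries,gauge,,query,second'

# -f 1-2 selects a range of fields; -f 4- selects field 4 up to the last one.
first_two=$(echo "$row" | cut -d , -f 1-2)
from_fourth=$(echo "$row" | cut -d , -f 4-)

# -c selects character positions instead of delimited fields.
prefix=$(echo "$row" | cut -c 1-5)

echo "$first_two"
echo "$from_fourth"
echo "$prefix"
```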

`paste`

paste can merge together two different files into one multi-column file.

$ cat ingredients
eggs
milk
butter
tomatoes
$ cat prices
1$
1.99$
1.50$
2$/kg
$ paste ingredients prices
eggs    1$
milk    1.99$
butter  1.50$
tomatoes    2$/kg

By default, paste uses a tab delimiter, but you can change that using the -d option.

$ paste -d: ingredients prices
eggs:1$
milk:1.99$
butter:1.50$
tomatoes:2$/kg

Another common use of paste is to join all lines within a stream or a file using a given delimiter, using a combination of the -s and -d arguments.

$ paste -s -d, ingredients
eggs,milk,butter,tomatoes

If - is specified as an input file, stdin will be read instead.

$ cat ingredients | paste -s -d, -
eggs,milk,butter,tomatoes

`sort`

sort, well, sorts argument files or input.

$ cat ingredients
eggs
milk
butter
tomatoes
salt
$ sort ingredients
butter
eggs
milk
salt
tomatoes

sort -r performs a reverse sort.

$ sort -r ingredients
tomatoes
salt
milk
eggs
butter

sort -n performs a numerical sort, by sorting fields by their arithmetic value.

$ cat numbers
0
2
1
10
3
$ sort numbers
0
1
10
2
3
$ sort -n numbers
0
1
2
3
10
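
sort can also sort on a given column instead of the whole line. This is a small sketch under assumed data (the name,price pairs below are made up), using the -t option to set the field separator and -k to pick the sort key:

```shell
# Hypothetical name,price pairs.
prices=$(printf 'milk,1.99\neggs,1\ntomatoes,2\nbutter,1.50\n')

# -t sets the field separator, -k 2,2 restricts the sort key to the 2nd field,
# and the n modifier compares that key numerically.
# LC_ALL=C makes sure the dot is treated as the decimal separator.
by_price=$(echo "$prices" | LC_ALL=C sort -t , -k 2,2n)
echo "$by_price"
```

The lines come out ordered from the cheapest item to the most expensive one.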

`uniq`

uniq detects or filters out adjacent identical lines in its argument file or input stream.

$ cat duplicates
and one
and one
and two
and one
and two
and one, two, three
$ uniq duplicates
and one
and two
and one
and two
and one, two, three

As uniq only filters out adjacent identical lines, we can still see duplicate lines in its output. To filter out all identical lines from our duplicates file, we need to sort its content first.

$ sort duplicates | uniq
and one
and one, two, three
and two

uniq -c prefixes each line with its number of occurrences.

$ sort duplicates | uniq -c
   3 and one
   1 and one, two, three
   2 and two

uniq -u only displays the unique lines within its input.

$ sort duplicates | uniq -u
and one, two, three

uniq is particularly useful when used in conjunction with sort, as | sort | uniq allows you to remove any duplicate line in a file or a stream.
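
uniq -d is the counterpart of uniq -u: it only displays the lines that appear more than once. Here is a short sketch, recreating the content of the duplicates file in a temporary file:

```shell
# Same content as the duplicates file used above, recreated in a temp file.
dup=$(mktemp)
printf 'and one\nand one\nand two\nand one\nand two\nand one, two, three\n' > "$dup"

# Sort first so that identical lines are adjacent, then keep one copy
# of each line that was repeated.
repeated=$(sort "$dup" | uniq -d)
echo "$repeated"

rm -f "$dup"
```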

`awk`

awk is more than a text processing tool: it’s actually a whole programming language of its own³. One thing awk is really good at is splitting files into columns, and it especially shines when these files contain a mix and match of spaces and tabs.

$ cat -t multi-columns
John Smith    Doctor^ITardis
Sarah-James Smith^I    Companion^ILondon
Rose Tyler   Companion^ILondon

cat -t displays tabs as ^I.

We can see that these columns are either separated by spaces or tabs, and that they are not always separated by the same number of spaces. cut would be of no use there, because it only works on a single character delimiter. awk, however, can easily make sense of that file.

awk '{ print $n }' prints the nth column in the text.

$ cat multi-columns | awk '{ print $1 }'
John
Sarah-James
Rose
$ cat multi-columns | awk '{ print $3 }'
Doctor
Companion
Companion
$ cat multi-columns | awk '{ print $1,$2 }'
John Smith
Sarah-James Smith
Rose Tyler

There is so much more we can do with awk. However, printing columns probably accounts for 99% of my personal usage.

{ print $NF } prints the last column in the line.
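
To give a taste of what awk can do beyond printing columns, here is a minimal sketch that sums the values of a column. The data is made up for the occasion:

```shell
# Hypothetical whitespace-separated data: a name and a count per line.
data=$(printf 'eggs 12\nmilk 2\nbutter 1\n')

# The first block runs once per line and accumulates the 2nd column;
# the END block runs once, after the last line has been read.
total=$(echo "$data" | awk '{ sum += $2 } END { print sum }')
echo "$total"
```

The sum variable does not need to be declared: awk treats an unset variable as 0 when it is used in an addition.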

`tr`

tr stands for translate, and it replaces characters with others. It either works on characters or character classes, such as lowercase, printable, spaces, alphanumeric, etc.

tr <char1> <char2> translates all occurrences of <char1> from its standard input into <char2>.

$ echo "Computers are fast" | tr a A
Computers Are fAst

tr can also translate character classes by using the [:class:] notation. The full list of available classes is described in the tr man page, but we’ll demonstrate some of them here.

[:space:] represents all types of spaces, from a simple space, to a tab or a newline.

$ echo "computers are fast" | tr '[:space:]' ','
computers,are,fast,%

All space-like characters were translated into a comma. Note that the % character at the end of the output represents the lack of a trailing newline. Indeed, that newline was translated to a comma as well.

[:lower:] represents all lowercase characters, and [:upper:] represents all uppercase characters. Converting between cases is thus made very easy.

$ echo "computers are fast" | tr '[:lower:]' '[:upper:]'
COMPUTERS ARE FAST
$ echo "COMPUTERS ARE FAST" | tr '[:upper:]' '[:lower:]'
computers are fast

tr SET1 SET2 will transform any character in SET1 into the characters in SET2. The following example replaces all vowels by spaces.

$ echo "computers are fast" | tr '[aeiouy]' ' '
c mp t rs  r  f st

tr -c SET1 SET2 does the opposite: it transforms any character not in SET1 into the characters in SET2. The following example replaces all non vowels by spaces.

$ echo "computers are fast" | tr -c '[aeiouy]' ' '
 o  u e   a e  a

tr -d deletes the matched characters, instead of replacing them. It’s the equivalent of tr <char> ''.

$ echo "Computers Are Fast" | tr -d '[:lower:]'
C A F

tr can also replace character ranges, for example all letters between a and e, or all numbers between 1 and 8, by using the notation s-e, where s is the start character and e is the end one.

$ echo "computers are fast" | tr 'a-e' 'x'
xomputxrs xrx fxst
$ echo "5uch l337 5p34k" | tr '1-4' 'x'
5uch lxx7 5pxxk

tr -s string1 compresses any multiple occurrences of the characters in string1 into a single one. One of the most useful uses of tr -s is to replace multiple consecutive spaces by a single one.

$ echo "Computers         are       fast" | tr -s ' '
Computers are fast
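
These options can be combined. As a small sketch, tr -cd deletes every character that is not in the given set, which is handy to only keep the digits of a string (the phone number below is made up):

```shell
# -c complements the set and -d deletes, so everything that is not
# a digit is removed, including spaces, punctuation and the newline.
digits=$(echo "+33 (0)6 12 34 56 78" | tr -cd '[:digit:]')
echo "$digits"
```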

`fold`

fold wraps each input line to fit in a specified width. It can be useful to make sure an argument text fits in a small display size for example. fold -w n folds the lines at n characters.

$ cat ~/Documents/readme | fold -w 16
Thanks again for
 reading this bo
ok!
I hope you're fo
llowing so far!

fold -s will only break lines on a space character, and can be combined with -w to fold up to a given number of characters.

$ cat ~/Documents/readme | fold -s -w 16
Thanks again
for reading
this book!
I hope you're
following so
far!

`sed`

sed is a non-interactive stream editor, used to perform text transformations on its input stream, on a line-by-line basis. It can take its input from a file or its stdin and will output its result either to a file or to its stdout.

It works by taking one or many optional addresses, a function and parameters. A sed command thus looks like this:

[address[,address]]function[arguments]

While sed can perform many functions, we will cover only substitution, as it is probably sed’s most common use.

Substituting text

A sed substitution command looks like this:

s/PATTERN/REPLACEMENT/[options]

Example: replacing the first instance of a word for each line in a file

$ cat hello
hello hello
hello world
hi
$ cat hello | sed 's/hello/Hey I just met you/'
Hey I just met you hello
Hey I just met you world
hi

We can see that only the first occurrence of hello was replaced in the first line. To replace all occurrences of hello in each line, we can use the g (for global) option.

$ cat hello | sed 's/hello/Hey I just met you/g'
Hey I just met you Hey I just met you
Hey I just met you world
hi

sed allows you to specify any other separator than /, which is especially useful to keep the command readable if the search or replacement pattern contains forward slashes.

$ cat hello | sed 's@hello@Hey I just met you@g'
Hey I just met you Hey I just met you
Hey I just met you world
hi

By specifying an address, we can tell sed on which line or line-range to actually perform the substitution.

$ cat hello | sed '1s/hello/Hey I just met you/g'
Hey I just met you Hey I just met you
hello world
hi
$ cat hello | sed '2s/hello/Hey I just met you/g'
hello hello
Hey I just met you world
hi

The address 1 tells sed to only replace hello by Hey I just met you on line 1. We can specify an address range with the notation <start>,<end> where <end> can either be a line number or $, meaning the last line in the file.

$ cat hello | sed '1,2s/hello/Hey I just met you/g'
Hey I just met you Hey I just met you
Hey I just met you world
hi
$ cat hello | sed '2,3s/hello/Hey I just met you/g'
hello hello
Hey I just met you world
hi
$ cat hello | sed '2,$s/hello/Hey I just met you/g'
hello hello
Hey I just met you world
hi
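
The replacement part of a substitution can also reuse what was matched. The sketch below shows two standard features: capture groups, referenced with \1 and \2, and &, which stands for the whole match. The input strings are only there for illustration.

```shell
# \( and \) capture a group; \1 and \2 refer to the captured text.
swapped=$(echo "hello world" | sed 's/\(hello\) \(world\)/\2 \1/')

# & stands for the whole matched text.
wrapped=$(echo "hello world" | sed 's/hello/[&]/')

echo "$swapped"
echo "$wrapped"
```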

By default, sed displays its result in its stdout, but it can also edit the initial file in-place, with the use of the -i option.

$ sed -i '' 's/hello/Bonjour/' sed-data
$ cat sed-data
Bonjour hello
Bonjour world
hi

On Linux, only -i needs to be specified. However, due to the fact that sed’s behavior on macOS is slightly different, the '' needs to be added right after -i.
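
If you need a command that behaves the same on both systems, one option is to attach a backup extension directly to -i, which both the GNU and BSD versions of sed accept. The following is a sketch working on a throwaway copy of the example data; the temporary directory is an assumption made to keep the example self-contained.

```shell
# Work on a throwaway copy of the example data.
workdir=$(mktemp -d)
printf 'hello hello\nhello world\nhi\n' > "$workdir/sed-data"

# -i.bak edits the file in place and keeps the original as sed-data.bak.
sed -i.bak 's/hello/Bonjour/' "$workdir/sed-data"

edited=$(cat "$workdir/sed-data")
backup=$(cat "$workdir/sed-data.bak")
echo "$edited"

rm -rf "$workdir"
```

The price to pay is a .bak file next to each edited file, which you can delete once you are happy with the result.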

Real-life examples

Filtering a CSV using `grep` and `awk`

$ grep -w gauge metadata.csv | awk -F, '{ if ($4 == "query") { print $1, "per", $5 } }'
mysql.performance.com_delete per second
mysql.performance.com_delete_multi per second
mysql.performance.com_insert per second
mysql.performance.com_insert_select per second
mysql.performance.com_replace_select per second
mysql.performance.com_select per second
mysql.performance.com_update per second
mysql.performance.com_update_multi per second
mysql.performance.questions per second
mysql.performance.slow_queries per second
mysql.performance.queries per second

This example filters the lines containing the word gauge in our metadata.csv file using grep, then uses awk to filter the lines with the string query as their 4th column, and displays the metric name (1st column) with its associated per_unit_name value (5th column).

Printing the IPv4 address associated with a network interface

$ ifconfig en0 | grep inet | grep -v inet6 | awk '{ print $2 }'
192.168.0.38

ifconfig <interface name> prints details associated with the argument network interface name. For example:

en0: flags=8863<UP,BROADCAST,SMART,RUNNING,SIMPLEX,MULTICAST> mtu 1500
    ether 19:64:92:de:20:ba
    inet6 fe80::8a3:a1cb:56ae:7c7c%en0 prefixlen 64 secured scopeid 0x7
    inet 192.168.0.38 netmask 0xffffff00 broadcast 192.168.0.255
    nd6 options=201<PERFORMNUD,DAD>
    media: autoselect
    status: active

We then grep for inet, which will match 2 lines.

$ ifconfig en0 | grep inet
    inet6 fe80::8a3:a1cb:56ae:7c7c%en0 prefixlen 64 secured scopeid 0x7
    inet 192.168.0.38 netmask 0xffffff00 broadcast 192.168.0.255

We then exclude the IPv6 line by using grep -v.

$ ifconfig en0 | grep inet | grep -v inet6
inet 192.168.0.38 netmask 0xffffff00 broadcast 192.168.0.255

We finally use awk to get the 2nd column in that line: the IPv4 address associated with our en0 network interface.

$ ifconfig en0 | grep inet | grep -v inet6 | awk '{ print $2 }'
192.168.0.38

It has been suggested to me that grep inet | grep -v inet6 could be replaced by the following future-proof awk command:

$ ifconfig en0 | awk ' $1 == "inet" { print $2 }'
192.168.0.38

It is shorter and specifically targets IPv4 using the $1 == "inet" condition.

Extracting a value from a config file

$ grep 'editor =' ~/.gitconfig  | cut -d = -f2 | sed 's/ //g'
/usr/bin/vim

We look for the editor = value in the current user’s git configuration file, then cut over the = sign, get the second column and remove any space around that column.

$ grep 'editor =' ~/.gitconfig
     editor = /usr/bin/vim
$ grep 'editor =' ~/.gitconfig  | cut -d'=' -f2
 /usr/bin/vim
$ grep 'editor =' ~/.gitconfig  | cut -d'=' -f2 | sed 's/ //g'
/usr/bin/vim

Extracting IP addresses from a log file

The following real life example looks for the message Too many connections from in a database log file (which is followed by an IP address) and displays the 10 biggest offenders.

$ grep 'Too many connections from' db.log | \
  awk '{ print $12 }' | \
  sed 's@/@@' | \
  sort | \
  uniq -c | \
  sort -rn | \
  head -n 10 | \
  awk '{ print $2 }'
   10.11.112.108
   10.11.111.70
   10.11.97.57
   10.11.109.72
   10.11.116.156
   10.11.100.221
   10.11.96.242
   10.11.81.68
   10.11.99.112
   10.11.107.120

Let’s break down what this pipeline of commands does. First, let’s look at what a log line looks like.

$ grep "Too many connections from" db.log | head -n 1
2020-01-01 08:02:37,617 [myid:1] - WARN  [NIOServerCxn.Factory:1.2.3.4/1.2.3.4:2181:NIOServerCnxnFactory@193] - Too many connections from /10.11.112.108 - max is 60

awk '{ print $12 }' then extracts the IP from the line.

$ grep "Too many connections from" db.log | awk '{ print $12 }'
/10.11.112.108
...

sed 's@/@@' removes the leading slash from the IPs.

$ grep "Too many connections from" db.log | awk '{ print $12 }' | sed 's@/@@'
10.11.112.108
...

As we have previously seen, we can use whatever separator we want for sed. While / is commonly used as a separator, we are currently replacing that very character, so it would have to be escaped, which would make the substitution expression slightly less readable.

sed 's/\///'

sort | uniq -c sorts the IPs lexicographically, and then removes duplicates while prefixing IPs by their associated number of occurrences.

$ grep 'Too many connections from' db.log | \
  awk '{ print $12 }' | \
  sed 's@/@@' | \
  sort | \
  uniq -c
   1379 10.11.100.221
   1213 10.11.103.168
   1138 10.11.105.177
    946 10.11.106.213
   1211 10.11.106.4
   1326 10.11.107.120
   ...

sort -rn | head -n 10 sorts the lines by the number of occurrences, numerically and in reverse order, which displays the biggest offenders first, 10 of which are displayed. The final awk '{ print $2 }' extracts the IPs themselves.

$ grep 'Too many connections from' db.log | \
  awk '{ print $12 }' | \
  sed 's@/@@' | \
  sort | \
  uniq -c | \
  sort -rn | \
  head -n 10 | \
  awk '{ print $2 }'
  10.11.112.108
  10.11.111.70
  10.11.97.57
  10.11.109.72
  10.11.116.156
  10.11.100.221
  10.11.96.242
  10.11.81.68
  10.11.99.112
  10.11.107.120
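
The sort | uniq -c | sort -rn | head part of this pipeline comes up often enough that it can be worth wrapping in a function. Here is a sketch of what that could look like. The top_n name and the list of IPs are made up for the example.

```shell
# Print the n most frequent lines read from stdin, most frequent first.
# Only the first word of each line is kept, which is fine for IPs.
top_n() {
    sort | uniq -c | sort -rn | head -n "$1" | awk '{ print $2 }'
}

# Hypothetical list of already extracted IPs.
ips=$(printf '10.0.0.1\n10.0.0.2\n10.0.0.1\n10.0.0.3\n10.0.0.1\n10.0.0.2\n')

top=$(echo "$ips" | top_n 2)
echo "$top"
```

10.0.0.1 appears 3 times and 10.0.0.2 appears twice, so they are displayed in that order.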

Renaming a function in a source file

Let’s imagine that we are working on a code project, and we would like to rename a poorly named function (or class, variable, etc.) in a code file. We can do this by using sed -i, which performs an in-place replacement in a file.

$ cat izk/utils.py
def bool_from_str(s):
    if s.isdigit():
        return int(s) == 1
    return s.lower() in ['yes', 'true', 'y']

$ sed -i 's/def bool_from_str/def is_affirmative/' izk/utils.py
$ cat izk/utils.py
def is_affirmative(s):
    if s.isdigit():
        return int(s) == 1
    return s.lower() in ['yes', 'true', 'y']

Use sed -i '' instead of sed -i on macOS, as the sed version behaves slightly differently.

We’ve however only renamed this function in the file it was defined in. Any other file importing bool_from_str will now be broken, as this function is not defined anymore. We’d need a way to rename bool_from_str everywhere it is found in our project. We can achieve just that by using grep, sed, and either for loops or xargs.

Going further: `for` loops and `xargs`

To replace all occurrences of bool_from_str in our project, we first need to recursively find them using grep -r.

$ grep -r bool_from_str .
./tests/test_utils.py:from izk.utils import bool_from_str
./tests/test_utils.py:def test_bool_from_str(s, expected):
./tests/test_utils.py:    assert bool_from_str(s) == expected
./izk/utils.py:def bool_from_str(s):
./izk/prompt.py:from .utils import bool_from_str
./izk/prompt.py:                    default = bool_from_str(os.environ[envvar])

As we are only interested in the matching files, we also need to use the -l/--files-with-matches option:

-l, --files-with-matches
        Only the names of files containing selected lines are written to standard out-
        put.  grep will only search a file until a match has been found, making
        searches potentially less expensive.  Pathnames are listed once per file
        searched.  If the standard input is searched, the string ``(standard input)''
        is written.

$ grep -r --files-with-matches bool_from_str .
./tests/test_utils.py
./izk/utils.py
./izk/prompt.py

We can then use the xargs command to perform an action on each line in the output (each file containing the bool_from_str string).

$ grep -r --files-with-matches bool_from_str . | \
  xargs -n 1 sed -i 's/bool_from_str/is_affirmative/'

-n 1 tells xargs that each line in the output should cause a separate sed command to be executed.

The following commands are then executed:

$ sed -i 's/bool_from_str/is_affirmative/' ./tests/test_utils.py
$ sed -i 's/bool_from_str/is_affirmative/' ./izk/utils.py
$ sed -i 's/bool_from_str/is_affirmative/' ./izk/prompt.py

If the command you call with xargs (sed, in our case) supports multiple arguments, you can (and should, as a single command will execute faster) drop the -n 1 argument and run

$ grep -r --files-with-matches bool_from_str . | xargs sed -i 's/bool_from_str/is_affirmative/'

which will then execute

$ sed -i 's/bool_from_str/is_affirmative/' ./tests/test_utils.py ./izk/utils.py ./izk/prompt.py

We can see that sed can take multiple arguments by looking at its synopsis, in its man page.

SYNOPSIS
     sed [-Ealn] command [file ...]
     sed [-Ealn] [-e command] [-f command_file] [-i extension] [file ...]

Indeed, as we’ve seen in the previous chapter, file ... means that multiple arguments representing file names are accepted.

We can see that all bool_from_str occurrences have been replaced.

$ grep -r is_affirmative .
./tests/test_utils.py:from izk.utils import is_affirmative
./tests/test_utils.py:def test_is_affirmative(s, expected):
./tests/test_utils.py:    assert is_affirmative(s) == expected
./izk/utils.py:def is_affirmative(s):
./izk/prompt.py:from .utils import is_affirmative
./izk/prompt.py:                    default = is_affirmative(os.environ[envvar])

As is often the case, there are multiple ways of achieving the same result. Instead of using xargs, we could have used for loops, which allow you to iterate over a list of lines and perform an action on each element. These for loops have the following syntax:

for item in list; do
    command $item
done

Wrapping our grep command in $() causes the shell to execute it in a subshell, the result of which is then iterated on by the for loop.

$ for file in $(grep -r --files-with-matches bool_from_str .); do
  sed -i 's/bool_from_str/is_affirmative/' $file
done

which will execute

$ sed -i 's/bool_from_str/is_affirmative/' ./tests/test_utils.py
$ sed -i 's/bool_from_str/is_affirmative/' ./izk/utils.py
$ sed -i 's/bool_from_str/is_affirmative/' ./izk/prompt.py

I tend to find the for loop syntax clearer than xargs’s. xargs can however execute the commands in parallel using its -P n option, where n is the maximum number of parallel commands to be executed at a time, which can be a performance win if your command takes time to run.
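
One caveat applies to both approaches as written: file names containing spaces get split into several arguments. A possible workaround, sketched below on a throwaway project, is to separate file names with NUL bytes instead of newlines. It assumes your grep supports the --null option and your xargs supports -0, which is the case for the GNU and BSD versions, although neither is part of POSIX.

```shell
# Throwaway project containing a file name with a space in it.
proj=$(mktemp -d)
printf 'bool_from_str(x)\n' > "$proj/my utils.py"
printf 'bool_from_str(y)\n' > "$proj/prompt.py"

# --null ends each file name with a NUL byte instead of a newline,
# and xargs -0 splits its input on NUL bytes.
# -i.bak is used so that the same command runs with GNU and BSD sed.
grep -r --files-with-matches --null bool_from_str "$proj" | \
  xargs -0 sed -i.bak 's/bool_from_str/is_affirmative/'

renamed=$(cat "$proj/my utils.py" "$proj/prompt.py")
echo "$renamed"

rm -rf "$proj"
```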

Summary

All these tools open up a world of possibilities, as they allow you to extract data and transform its format, making it possible to build entire workflows of commands that were possibly never intended to work together. Each of these commands has a relatively small function (sort sorts, cat concatenates, grep filters, sed edits, cut cuts, etc).

Any given task involving text can then be reduced to a pipeline of smaller tasks, each of them performing a simple action and piping its output into the next task.

For example, if we wanted to know how many unique IPs could be found in a log file, assuming that these IPs always appear in the same column, we could:

grep lines on a pattern specific to lines containing an IP address
locate the column in which the IPs appear, and extract them with awk
sort the list of IPs with sort
compute the list of unique IPs with uniq
count the number of lines (aka, of unique IPs) with wc -l
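The steps above can be sketched as a single pipeline. The log format below is invented for the example (the IP is assumed to be the 4th whitespace-separated column, on lines containing the word "from"); a real log file would need its own pattern and column number.

```shell
# Build a small, made-up log file to experiment on.
log=$(mktemp)
cat > "$log" <<'EOF'
GET /index.html from 10.0.0.1 status 200
GET /about.html from 10.0.0.2 status 200
startup complete
GET /index.html from 10.0.0.1 status 404
EOF

# grep keeps the lines containing an IP, awk extracts the 4th column,
# sort groups identical IPs together, uniq drops the duplicates,
# and wc -l counts the remaining lines.
unique_ips=$(grep ' from ' "$log" | awk '{print $4}' | sort | uniq | wc -l)
echo "$unique_ips"
```

Note that uniq only removes adjacent duplicates, which is why the list is sorted first.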

As there is a plethora of text processing tools, either available by default or installable, there are bound to be many ways to solve any given task.

The examples in this article were contrived, but I suggest you read the amazing article “Command-line Tools can be 235x Faster than your Hadoop Cluster”⁴ to get a sense of how useful and powerful these text processing commands really are, and what real-life problems they can solve.

Going further

2.1: Count the number of files and directories located in your home directory.

2.2: Display the content of a file in all caps.

2.3: Count how many times each word was found in a file.

2.4: Count the number of vowels present in a file. Display the result from the most common to the least.

Discovering the terminal

2020-03-05T00:00:00+01:00

If you are interested in following the project, we invite you to join the mailing list!

What is a terminal?
Your first steps
Managing files
Learning new options
Command Input/Output streams
Composing commands
Escaping from bad situations
Summary
Going further

When people picture a programmer, it’s not uncommon for them to imagine someone sitting in front of a computer screen displaying undecipherable streams of text going by really fast, like in The Matrix. Let’s set the record straight. This is not true, at least for the most part. The Matrix however got some things right. A programmer works with code, which, as its name indicates, has to be learned before it can be understood. Anyone not versed in the trade of reading and writing code would only see gibberish. Another thing these movies usually get right is the fact that a programmer types commands in a terminal.

What is a terminal?

Most of the applications people use every day have a Graphical User Interface (GUI). Think about Photoshop, Firefox, or your smartphone apps. These applications have immense capabilities, but the user is mostly bound by the features implemented in them in the first place. What if you suddenly wanted to have a new feature in Photoshop that just wasn’t available? You would possibly end up either waiting for the newest version to be released, or having to install another application altogether.

One of the most important tools in a programmer’s toolbox is of a different kind though. It’s called the terminal, which is a command-line application. That is to say that you enter a command, your computer executes that command, and displays the output in the terminal.

In other words, this is an application in which you give your computer orders. If you know how to ask, your computer will be happy to comply. However, if you order it to do something stupid, it will obey just the same.

— You: “Computer, create that folder.”

— Computer: “Sure.”

— You: “Now put all the files on my Desktop in that new folder.”

— Computer: “No problem.”

— You: “Now delete that folder forever with everything inside.”

— Computer: “Done.”

— You: “Wait, no, my mistake, I want it back.”

— Computer: “Sorry, it’s all gone, as you requested.”

— You: “…”

— Computer: “I’m not even sorry.”

Never has this famous quote been more true:

With great power comes great responsibility

Learning your way around a terminal really is a fundamental shift in how you usually interact with computers. Instead of working inside the boundaries of an application, a terminal gives you free and unlimited access to every part of the computer. The training wheels are off, and you are only limited by the number of commands you know. Consequently, learning how to use the terminal will give you insights about how your computer works. Let’s see what we can do. We’ll start small, but trust me, it gets better.

Your first steps

First off, let’s define a couple of words.

A terminal is an application you can open on your computer, in which you’ll be able to type commands in a command line interface (CLI). When you hit the Enter key, the command will be executed by a program called a shell, and the result is displayed back in the terminal.

In the early days of computing, video terminals were actual physical devices, used to execute commands on a remote computer that could take up a whole room.

The DEC VT100, a physical video terminal dating back to 1978

Nowadays, terminals are programs running in a graphical window, emulating the behavior of the video terminals of old.

This is what a terminal looks like nowadays.

Different operating systems come with different terminals and different shells pre-installed, but the most common shell out there is certainly bash.

Before we go any deeper, let’s open a terminal! The way you do this however depends on your operating system.

Opening a terminal

On MacOS

Open the Finder app, click on Applications in the left pane, then enter the Utilities directory, then open the Terminal app. You can also use the Spotlight search by clicking on the magnifying glass icon in the top right corner of your screen (or use the Cmd Space keyboard shortcut), and type Terminal.

On Linux

Depending on the Linux distribution you use, it might come with XTerm, Gnome-Terminal or Konsole pre-installed. Look for any of these in your applications menu. A lot of Linux installations use the Ctrl - Alt - T keyboard shortcut to open a terminal window.

On Windows

Windows is a special case: Linux and MacOS come with bash pre-installed, whereas Windows does not. It comes with 2 built-in shells: cmd and Powershell. The rest of this tutorial and its following chapters however assume you are running bash. The reason for that is that bash is pretty much ubiquitous, whether on personal workstations or on servers. On top of that, bash comes with a myriad of tools and commands that will be detailed in the next chapter.

Fortunately, since 2019, Windows 10 can natively run bash by using the Windows Subsystem for Linux (WSL). We suggest you follow the instructions from this tutorial.

Running bash on Windows is now possible

Running our first command

When you open your terminal, the first thing you will see is a prompt. It is what is displayed every time the shell is ready for its next order. It is common for the prompt to display information useful for the user. In my case, br is my username, and morenika is my computer’s name (its hostname).

br@morenika:~$ is my prompt

The black rectangle is called a cursor. It represents your current typing position.

What your prompt actually looks like depends on your operating system and your shell. Don’t worry if it does not look exactly the same as the one in the following examples.

The first command we will run is ls (which stands for list directory). By default, that command lists all directories and files present in the directory in which we are currently located. To run that command, we need to type ls after the prompt, and then hit Enter.

The text that is displayed after our command and before the next prompt is the command’s output.

br@morenika:~$ ls
Android                code       Downloads              Music
AndroidStudioProjects  Desktop    Dropbox                Pictures
bin                    Documents  Firefox_wallpaper.png  Videos

These are all the files and directories located in my personal directory (also called home directory). Let’s open a graphical file explorer and check, just to be sure.

As expected, we weren’t lied to

The shell is sensitive to casing: a lower-case command is not the same thing as its upper-case equivalent.

br@morenika:~$ LS
bash: LS: command not found

From now on, we will ignore the br@morenika:~$ prompt prefix and will only use $, to keep our examples short.

Command arguments

In our last example, we listed all files and directories located in my home directory. What if I wanted to list all files located in the bin directory that we can see in the output? In that case, I could pass bin as an argument to the ls command.

$ ls bin
bat            fix-vlc-size  lf          terraform  vpnconnect
clean-desktop  itresize      nightlight  tv-mode

By passing the bin argument to the ls command, we told it where to look, and we thus changed its behavior. Note that it is possible to pass more than one argument to a command.

$ ls Android bin
Android:
Sdk

bin:
bat  clean-desktop  fix-vlc-size  itresize  lf  nightlight  terraform  tv-mode  vpnconnect

In that example, we passed two arguments to ls: Android and bin. ls then proceeded to list the content of each of these 2 directories.

Think about how you would have done that in a File explorer GUI. You probably would have gone into the first directory, then gone back to the parent directory and finally proceeded with the next directory. The terminal allows you to be more efficient.

Command options

Now, let’s say I’d also like to see how big files located under bin are. No problem! The ls command has options we can use to adjust its behavior. The -s option causes ls to display each file size, in kilobytes.

$ ls -s bin
total 52336
 4772 bat                4 itresize    44296 terraform
    4 clean-desktop   3244 lf              4 tv-mode
    4 fix-vlc-size       4 nightlight      4 vpnconnect

While this is nice, I’d prefer to see the file size in a human-readable unit. I can add the -h option to further specify what ls has to do.

$ ls -s -h bin
total 52M
4.7M bat            4.0K itresize     44M terraform
4.0K clean-desktop  3.2M lf          4.0K tv-mode
4.0K fix-vlc-size   4.0K nightlight  4.0K vpnconnect

I can separate both options with a space, or also group them as one option.

$ ls -sh bin
total 52M
4.7M bat            4.0K itresize     44M terraform
4.0K clean-desktop  3.2M lf          4.0K tv-mode
4.0K fix-vlc-size   4.0K nightlight  4.0K vpnconnect

I’d finally like each file and its associated size to be displayed on its own line. Enter the -1 option!

$ ls -s -h -1 bin
total 52M
4.7M bat
4.0K clean-desktop
4.0K fix-vlc-size
4.0K itresize
3.2M lf
4.0K nightlight
 44M terraform
4.0K tv-mode
4.0K vpnconnect
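If you want to convince yourself that grouping short options changes nothing, here is a small sketch comparing both spellings on a throwaway directory. The directory and the file name (small) are made up for the example.

```shell
# Create a throwaway directory containing a single small file.
dir=$(mktemp -d)
printf 'hello\n' > "$dir/small"

# The same three options, first passed separately, then grouped.
separate=$(ls -s -h -1 "$dir")
grouped=$(ls -sh1 "$dir")

echo "$separate"
echo "$grouped"
```

Both commands should print exactly the same listing.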

Short options make it easy to type a command quickly, but the result can be hard to decipher after a certain number of options, and you might find yourself wondering what the command is doing in the first place. Luckily, options can have a long form and a short form. For example, -s can be replaced by its long form --size, and -h by --human-readable.

$ ls --size --human-readable -1 bin
total 52M
4.7M bat
4.0K clean-desktop
4.0K fix-vlc-size
4.0K itresize
3.2M lf
4.0K nightlight
 44M terraform
4.0K tv-mode
4.0K vpnconnect

The command feels way more self-explanatory this way! You’ll notice that we still used the short form for the -1 option. The reason for that is that this option simply does not have a long form.

Takeaways

A terminal is an application through which you interact with a shell
You can execute commands by typing them in the shell’s command-line and hitting Enter
A command can take 0, 1 or more arguments
A command’s behavior can be changed by passing options
By convention, options can have multiple forms: a short and/or a long one.

Here is a summary of the different parts of a command

Managing files

So far, we’ve seen how to run a command, changing its behavior by passing command-line arguments and options, and that ls is used to list the content of a directory. It’s now time to learn how to manage your files, by creating files and directories, copying and moving them around, creating links, etc. The goal of this section is to teach you how to do everything you usually do in your file explorer, but in your terminal.

`pwd`, `cd`: navigating between directories

Up to now, every command we’ve run was executed from our home directory (the directory in which you have all your documents, downloads, etc). The same way you can navigate directories in a graphical file explorer, you can do it in a terminal as well.

Before going anywhere, we first need to figure out where we are. Enter pwd, standing for print working directory. This command displays your current working directory (a.k.a. where you are).

$ pwd
/home/br

Now that we found our bearings, we can finally move around. We can do that with the cd command, standing for (you might have guessed it) change directory.

$ cd Documents
$ pwd
/home/br/Documents
$ cd ./invoices
$ pwd
/home/br/Documents/invoices
$ cd 2020
$ pwd
/home/br/Documents/invoices/2020

As 2020 is empty, we can’t go any further. However, we can also go back to the parent directory (the directory containing the one we are currently in) using the cd .. command.

$ pwd
/home/br/Documents/invoices/2020
$ cd ..
$ pwd
/home/br/Documents/invoices

We don’t always have to change directory one level at a time. We can go up multiple directories at once.

$ pwd
/home/br/Documents/invoices
$ cd ../..
$ pwd
/home/br

We can also go several directories down at the same time.

$ pwd
/home/br
$ cd Documents/invoices/2020

Running cd without arguments takes you back to your home directory.

$ pwd
/home/br/Documents/invoices/2020
$ cd
$ pwd
/home/br

Running cd - takes you back to your previous location.

$ pwd
/home/br/Documents/invoices/2020
$ cd /home/br
$ cd -
$ pwd
/home/br/Documents/invoices/2020
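The navigation commands above can be replayed safely in a scratch directory. This is only a sketch: the base directory comes from mktemp, and the Documents/invoices/2020 layout is recreated for the occasion.

```shell
# Recreate a small directory tree in a throwaway location.
base=$(mktemp -d)
mkdir -p "$base/Documents/invoices/2020"

cd "$base/Documents/invoices/2020"
cd ../..                 # go up two levels at once
up_two=$(pwd)            # we are now in Documents

cd - > /dev/null         # go back to where we were (cd - prints that location)
back=$(pwd)              # we are in 2020 again

echo "$up_two"
echo "$back"
```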

You might wonder why cd .. takes you back to the parent directory? What does .. mean? To understand this, we need to explore how paths work.

Paths: root, absolute and relative

If you have never used a terminal before, and have only navigated between directories using a graphical file explorer, the notion of path might be a bit foreign. A path describes the unique location of a file or a folder on your file system. The easiest way to explain it is by describing how files and directories are organized on your disk.

The base directory (also called root directory, and referred to as /) is the highest directory in the hierarchy: it contains every single file and directory in your system, each of these directories possibly containing others, to form a structure looking like a tree.

Your disk is organized like a tree

Let’s look at what that / root directory contains.

$ ls /
bin  boot  dev  etc  home  lib  lib64  lost+found  media
mnt opt  proc  root  run  sbin  srv  sys  tmp  usr  var

Ok so, there are a couple of things in there. We have talked about home directories before, remember? It turns out that all the users’ home directories are located under the home directory. As home is located under /, we can refer to it via its absolute path, that is to say the full path to a given directory, starting from the root directory. In the case of home, its absolute path is /home, as it is directly located under /.

Any path starting with / is an absolute path.

We can then use that path to inspect the content of the home directory with the ls command.

$ ls /home
br

The absolute path of br is /home/br. Each directory is separated from its parent by a /. This is why the root directory is called /: it is the only directory without a parent.

Any path that does not start with / will be a relative path, meaning that it will be relative to the current directory. When we executed the ls bin command, bin was actually a relative path. Indeed, we executed that command while we were located in /home/br, meaning that the absolute path of bin was /home/br/bin.

Each folder on disk has a link to itself called . and a link to its parent folder called .. (two dots).

The . link points to the folder itself and the .. link points to the folder’s parent.

We can use these . and .. links when constructing relative paths. For example, if you were located in /home/br, you could refer to the Android folder as ./Android, meaning “the Android folder located under . (the current directory)”.

$ ls ./Android
Sdk

Were you located under /home/br/Android, you could also refer to /home/br/Downloads as ../Downloads.

Following Android’s .. link takes you back to the home directory

ls -a allows you to see hidden files, a.k.a all files starting with a dot. We can indeed see the . and .. links!

$ ls -a
.  ..  Sdk
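Here is a small sketch showing that a relative path and an absolute path can designate the same directory, and that .. leads to the parent. The Android and Downloads directories are recreated under a throwaway base directory standing in for the home directory.

```shell
# A stand-in for the home directory, created in a throwaway location.
base=$(mktemp -d)
mkdir -p "$base/Android/Sdk" "$base/Downloads"

cd "$base"
via_relative=$(ls ./Android)          # relative to the current directory
via_absolute=$(ls "$base/Android")    # full path, starting from /

cd Android
sibling=$(cd ../Downloads && pwd)     # .. goes up to the parent, then down into Downloads

echo "$via_relative"
echo "$via_absolute"
echo "$sibling"
```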

`mkdir`: creating directories

In order to make sure that we don’t mess with your personal files when testing out the commands from this chapter, we will start by creating a new directory to experiment in, called experiments.

You can create a new directory using the mkdir command, which stands for make directories. By executing the command mkdir experiments, you will create the experiments directory in your current directory. Let’s test this out.

$ ls
Android                code       Downloads              Music
AndroidStudioProjects  Desktop    Dropbox                Pictures
bin                    Documents  Firefox_wallpaper.png  Videos

$ mkdir experiments
$

Notice that the mkdir command did not display anything. It might feel unusual at first, but this is the philosophy of these commands: only display something if something went wrong. In other words, no news is good news.

We can now check that the directory has been created.

$ ls
Android          bin   Desktop  Downloads  experiments      Music     Videos
AndroidStudioProjects  code  Documents  Dropbox    Firefox_wallpaper.png  Pictures

We can also see that directory by opening our file explorer.

The directory we have just created in the terminal can be seen in our file explorer. The terminal displays the information as text, and the file explorer displays it in a graphical form.

Running mkdir on a pre-existing directory causes it to fail and display an error message.

$ mkdir experiments
mkdir: experiments: File exists

What if we wanted to create a directory in experiments called art, and another directory called paintings itself located in art?

$ mkdir experiments/art/paintings
mkdir: experiments/art: No such file or directory

Something clearly went wrong here. mkdir is complaining that it cannot create paintings within experiments/art as it does not exist. We could create art and then paintings, in two separate commands, but fortunately, mkdir provides us with a -p option that causes mkdir to succeed even if directories already exist, and that will create each parent directory.

-p, --parents: no error if existing, make parent directories as needed

This looks like exactly what we need in that case! Let’s see if it works as expected.

$ mkdir -p experiments/art/paintings
$ ls experiments
art
$ ls experiments/art
paintings
$ ls experiments/art/paintings
$
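The difference between mkdir with and without -p can be checked with the following sketch, which runs in a throwaway directory so that it does not depend on what already exists in your home directory. The variable names (first_try, second_status) are only there for the illustration.

```shell
base=$(mktemp -d)
cd "$base"

# Without -p, mkdir refuses to create paintings because experiments/art is missing.
if mkdir experiments/art/paintings 2>/dev/null; then
    first_try="created"
else
    first_try="failed"
fi

# With -p, every missing parent directory is created along the way.
mkdir -p experiments/art/paintings

# Running the same command again is not an error when -p is used.
mkdir -p experiments/art/paintings
second_status=$?

echo "$first_try"
echo "$second_status"
```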

`cp`, `mv`: moving files around

cp (standing for copy) allows you to copy a file or a directory to another location.

$ cp Documents/readme experiments/art
$ ls experiments/art
paintings   readme
$ ls Documents
readme

You can also move the file from a location to another by using mv.

$ mv experiments/art/readme experiments
$ ls experiments
art   readme
$ ls experiments/art
paintings

cp does not seem to work on directories, however.

$ cp experiments/art experiments/art-copy
cp: experiments/art is a directory (not copied).

By default, cp only works on files, and not on directories. We need to use the -r option to tell cp to recursively copy experiments/art to experiments/art-copy, meaning cp will copy the directory and every file and directory it contains.

$ cp -r experiments/art experiments/art-copy
$ ls experiments
art-copy art  readme
$ ls experiments/art
paintings
$ ls experiments/art-copy
paintings

Finally, you can use mv to rename a file or a directory. It might sound surprising that there is no rn or rename command, but renaming a file is actually just moving it to another location in the same directory.

$ mv experiments/readme experiments/README
$ ls experiments
README    art-copy art
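The whole cp and mv sequence can be replayed in a scratch directory with the sketch below. The readme file is created on the spot, as an assumption standing in for Documents/readme.

```shell
base=$(mktemp -d)
cd "$base"
mkdir -p experiments/art/paintings
echo "hello" > readme

cp readme experiments/art                   # copy: the original file stays in place
mv experiments/art/readme experiments       # move: the file leaves experiments/art
cp -r experiments/art experiments/art-copy  # -r is required to copy a directory
mv experiments/readme experiments/README    # renaming is just moving within the same directory

ls experiments
```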

`rm`: removing files and directories

The rm command allows you to delete files and directories.

Be careful with rm: when a file is deleted, it is not moved to the trash, it is gone.

$ rm experiments/README
$ ls experiments
art-copy art

rm behaves like cp: it only allows you to remove directories by using the -r option.

$ rm experiments/art
rm: experiments/art: is a directory
$ rm -r experiments/art
$ ls experiments
art-copy
$ rm -r experiments/art-copy
$ ls experiments
$
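Since rm is unforgiving, it is a good idea to try it out on files you do not care about. The following sketch does so in a throwaway directory; the plain_rm variable is only used to record whether rm accepted to remove a directory without -r.

```shell
base=$(mktemp -d)
mkdir -p "$base/experiments/art"
touch "$base/experiments/README"

# Removing a regular file works without any option.
rm "$base/experiments/README"

# Removing a directory without -r is refused.
if rm "$base/experiments/art" 2>/dev/null; then
    plain_rm="removed"
else
    plain_rm="refused"
fi

# With -r, the directory and everything it contains are removed.
rm -r "$base/experiments/art"

echo "$plain_rm"
ls "$base/experiments"
```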

`ln`: creating links

Have you ever created a shortcut to a file on your desktop? Behind the scenes, this works using a symbolic link. A link points to the original file, and allows you to access that file from multiple places, without actually having to store multiple copies on disk.

We can create such a link by using the ln -s command (-s stands for symbolic).

$ pwd
/home/br
$ ln -s /home/br/Documents/readme Desktop/my-readme

Using the -l option of ls, we can see where a link points to.

$ ls -l Desktop
total 0
lrwxr-xr-x  1 br  br  21 Jan 17 16:48 my-readme -> /home/br/Documents/readme

My personal mnemonic to remember the order of arguments is s for source: the source file goes right after the -s option. ln -s <source> <destination>
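One detail worth knowing when creating symbolic links: a relative source is resolved from the directory that contains the link, not from the directory you ran ln in. The following sketch illustrates both an absolute and a relative source, using a throwaway base directory as a stand-in for the home directory; the relative-readme name is made up for the example.

```shell
base=$(mktemp -d)
mkdir -p "$base/Documents" "$base/Desktop"
echo "read me" > "$base/Documents/readme"

# Absolute source: the link works regardless of where it is located.
ln -s "$base/Documents/readme" "$base/Desktop/my-readme"

# Relative source: resolved from Desktop, the directory containing the link,
# hence the leading .. to go back up before entering Documents.
ln -s ../Documents/readme "$base/Desktop/relative-readme"

cat "$base/Desktop/my-readme"
cat "$base/Desktop/relative-readme"
```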

`tree`: visualizing files and subfolders

tree displays the content of the current directory (or argument directory) and its subfolders in a tree-like representation. It is very useful to have a quick look at the current content of a directory.

$ tree experiments
experiments
|__ art
    |__ paintings

2 directories, 0 files

tree might not be installed by default, depending on your system. We mention it here as we will re-use it throughout the chapters.

Learning new options

Getting help

If you are wondering how you will be able to remember all these options, don’t worry. Nobody expects you to know all of the options of all the commands by heart. You can rely on the commands’ documentation instead of having to memorize them all.

Most of the commands out there take a -h (or --help) option that will display the list of options the command itself can take, and what they do.

$ ls --help
Usage: ls [OPTION]... [FILE]...
List information about the FILEs (the current directory by default).
Sort entries alphabetically if none of -cftuvSUX nor --sort is specified.

Mandatory arguments to long options are mandatory for short options too.
  -a, --all                  do not ignore entries starting with .
  -A, --almost-all           do not list implied . and ..
      --author               with -l, print the author of each file
  -b, --escape               print C-style escapes for nongraphic characters
      --block-size=SIZE      with -l, scale sizes by SIZE when printing them;
                               e.g., '--block-size=M'; see SIZE format below
  -B, --ignore-backups       do not list implied entries ending with ~
  -c                         with -lt: sort by, and show, ctime (time of last
                               modification of file status information);
                               with -l: show ctime and sort by name;
                               otherwise: sort by ctime, newest first
[cut for brevity]

It’s interesting to note that some options accept both short and long forms, like -a/--all, while some others only accept a short form (-c) or a long form (--author). There’s no real rule there, only conventions. A command might not even accept a --help option, but most if not all the common ones do.

-h is not always the short option for --help. Indeed, we’ve seen that ls --help prints an overview of all available options, whereas ls -h displays units in a human-readable format!

Reading the manual

Sometimes, there’s no --help option available, or its output isn’t clear or verbose enough for your taste, or the output is too long to navigate easily. It’s often a good idea to read the command’s man page (man stands for manual).

Let’s give it a go, by typing the following command.

$ man ls

man ls displays the manual of the ls command: everything you need to know about what ls can be used for.

Reading the synopsis

man provides you with a synopsis, describing a specific usage of the command on each line, along with the associated options and arguments.

The ls synopsis is

SYNOPSIS
       ls [OPTION]... [FILE]...

The square brackets around [OPTION] and [FILE] mean that both options and files are optional. As we’ve seen at the beginning of this chapter, just running ls on its own prints the content of the current working directory.

The ... following [OPTION] and [FILE] means that several options and several file arguments can be passed to ls, as illustrated by the following example.

$ ls -sh Android bin
Android:
total 4.0K
4.0K Sdk

bin:
total 52M
4.7M bat            4.0K fix-vlc-size  3.2M lf           44M terraform  4.0K vpnconnect
4.0K clean-desktop  4.0K itresize      4.0K nightlight  4.0K tv-mode

If we look at the mkdir synopsis, we see that options are, well, optional, but we must provide it with one or more directories to create, because DIRECTORY is not between square brackets.

SYNOPSIS
       mkdir [OPTION]... DIRECTORY...

The DESCRIPTION section will list all possible options (short and long forms), along with their effect.

Navigating the manual

When you run man, the manual of the command will be displayed in a pager, a piece of software that helps the user read the output one page at a time. One of the most common pager commands is less (which is incidentally the more featureful successor of more, because less is more). Being dropped into a pager for the first time is confusing, as you might not know how to navigate.

The most useful commands you can type within less are:

h: display the less help
q: exit less
/pattern: search forward for pattern, starting from the current position
n: go to the next occurrence of the pattern
?pattern: search backward for pattern, starting from the current position
N: go to the previous occurrence of the pattern
up or down arrow: navigate up or down a line
PageUp and PageDown: navigate up or down a page
g: go to the beginning of the file
G: go to the end of the file

For example, if you’re not sure what the -s option of ls does, you can type man ls and then /-s when you are in less. Type n until you find the documentation for -s, --size (or N to go back if you went too far). Once you’re done, you can exit less by typing q.

While man uses less under the hood to help you read documentation, you can simply use less to page through any file on your disk. For example, I can use this command on my computer.

$ less Documents/readme

You can look into the less help itself, by typing h when reading a man page, by typing less --help in a terminal, or even man less!

Exactly like ls, man itself is a command, and like most commands, it has a manual! You can read more about man itself by typing

$ man man

Lo and behold, the manual’s manual.

Command Input/Output streams

Before we can fully explain what makes the shell so powerful, we need to explain what an Input/Output stream is. Every time we run a command, the shell starts a process, which is then in charge of running the command and communicating its output back to the terminal. Input/Output streams are the way the shell sends input to a process and dispatches output from it.

Each process has 3 streams by default:

stdin (or standard input): provides input to the command
stdout (or standard output): displays the command’s output
stderr (or standard error): displays the command’s errors

Each one of these streams has an associated file descriptor, a number used by the shell to reference that stream. stdin has the file descriptor 0, stdout has 1, and stderr has 2.

stdin (file descriptor 0) is the process input stream, stdout (file descriptor 1) is the process output stream and stderr (file descriptor 2) is the process error stream.

Redirecting output to a file

It can be convenient to “save” the output of a command to a file, to further process it at a later time, or to send it to someone else. You can use the > operator to redirect the stdout of a command to a file.

$ ls /home/br > ls-home.txt

We can then display the content of the ls-home.txt file using the cat command.

$ cat ls-home.txt
Android                code       Downloads              Music
AndroidStudioProjects  Desktop    Dropbox                Pictures
bin                    Documents  Firefox_wallpaper.png  Videos

If the file doesn’t already exist, it will be created by the shell at the moment of the redirection. If the file however does exist at redirection time, it will be overwritten, meaning that anything that file used to contain will be replaced by the output of the redirected command.

In the following example, we use the echo command, which simply sends its argument text to its stdout.

$ cat ls-home.txt
Android                code       Downloads              Music
AndroidStudioProjects  Desktop    Dropbox                Pictures
bin                    Documents  Firefox_wallpaper.png  Videos
$ echo "Hello world!" > ls-home.txt
$ cat ls-home.txt
Hello world!

If you want to append the output of a command to a file without overwriting its content, you can use the >> operator instead of >.

$ cat echoes
cat: echoes: No such file or directory
$ echo "Hey, I just met you, and this is crazy" >> echoes
$ echo "so here's my echo, so cat it maybe" >> echoes
$ cat echoes
Hey, I just met you, and this is crazy
so here's my echo, so cat it maybe

Redirecting a file to a command’s input

The same way you can redirect a command’s stdout to a file, you can redirect a file to a command’s stdin.

In the following example, we’ll redirect the content of the echoes file to the input of the wc -l command, which counts the number of lines of its input stream or of the file(s) passed as arguments.

$ cat echoes
Hey, I just met you, and this is crazy
so here's my echo, so cat it maybe
$ wc -l < echoes
2

You can of course combine the <, > and >> operators in a single command. In the following example, we will redirect the content of the echoes file to the wc -l command, and redirect the output of that command to the echoes-lines file.

$ wc -l < echoes > echoes-lines
$ cat echoes-lines
2
$ cat echoes
Hey, I just met you, and this is crazy
so here's my echo, so cat it maybe

Redirecting multiple lines to a command’s input

You might find yourself in a situation where you want to pass multiple lines of input to a command, and the < operator fails you in that case, as it only deals with files. Luckily, your shell provides you with the heredoc (here document) << operator to accomplish this.

A heredoc redirection has the following syntax:

command <<DELIMITER
a multi-line
string
DELIMITER

The DELIMITER can be any string of your choosing, although EOF (“end of file”) is pretty commonly used.

Let’s consider the following example:

$ cat <<EOF
My username is br
I'm living at /home/br
EOF

This command will output the following block of text:

My username is br
I'm living at /home/br

You can redirect that block into a file by combining both the << and > operators.

$ cat <<EOF > aboutme
My username is br
I'm living at /home/br
EOF
$ cat aboutme
My username is br
I'm living at /home/br
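A heredoc can feed any command reading from its stdin, not only cat. As a small sketch, here is the same block of text sent to wc -l, with the result redirected to a file (count_file is a temporary file created for the example).

```shell
count_file=$(mktemp)

# The two lines between the EOF delimiters become the stdin of wc -l,
# whose stdout is in turn redirected into the file.
wc -l <<EOF > "$count_file"
My username is br
I'm living at /home/br
EOF

cat "$count_file"
```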

Redirecting `stderr`

Let’s consider the following example.

$ cat -n notthere > notthere-with-line-numbers
cat: notthere: No such file or directory
$ cat notthere-with-line-numbers

How come the notthere-with-line-numbers file is empty even after we redirected the cat -n notthere command’s output to it? The reason for that is, we didn’t really redirect the command’s output to that file, we redirected the command’s stdout. As the file notthere does not exist, the cat command fails, and displays an error message on its stderr stream, which wasn’t redirected.

You can redirect a process stream by using its file descriptor. Remember? 0 for stdin, 1 for stdout and 2 for stderr.

$ cat -n notthere 2>errors.txt
$ cat errors.txt
cat: notthere: No such file or directory

This stderr redirection can be illustrated by the following diagram.

Any errors displayed by cat will be redirected into the errors.txt file

You can also redirect the command’s stdout to a file, and its stderr to another file.

$ cat -n notthere >output.txt 2>errors.txt
$ cat output.txt
$ cat errors.txt
cat: notthere: No such file or directory

Normal output will be redirected into output.txt whereas errors are redirected into errors.txt

It is also possible to redirect the command’s stderr into its stdout using 2>&1. This will effectively merge both streams into a single one.

$ cat notthere > output.txt 2>&1
$ cat output.txt
cat: notthere: No such file or directory

cat’s stdout and stderr are merged together into a single stream

The order of redirections has always felt a little bit weird to me. You’d expect the following syntax to work, as it feels (at least to me) more logical, by saying “redirect all errors to stdout, and redirect the whole thing to a file”. It does not work though.

$ cat notthere 2>&1 > output.txt
cat: notthere: No such file or directory
$ cat output.txt
$
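
The behaviour makes more sense once you know that the shell processes redirections from left to right, and that 2>&1 means “point stderr at wherever stdout is pointing right now”. Here is a small sketch of both orders. The noisy function is a stand-in of mine for a command that writes to both streams, and the file names are arbitrary.

```shell
# Work in a scratch directory so that we don't leave files behind
cd "$(mktemp -d)"

# A stand-in command writing one line to stdout and one line to stderr
noisy() {
    echo "some output"
    echo "some error" >&2
}

# stdout is pointed at the file first, then stderr is pointed at the same place:
# both lines end up in the file
noisy > right-order.txt 2>&1

# stderr is pointed at the current stdout (the terminal) first, and only then is
# stdout pointed at the file: the error is displayed, and never reaches the file
noisy 2>&1 > wrong-order.txt

cat right-order.txt
cat wrong-order.txt
```

In the second case, stderr keeps pointing at the terminal because it was duplicated before stdout was changed.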

Composing commands

Being able to use a myriad of commands, each one with its own purpose, is powerful. However, the true power of the shell comes from the fact that these commands can be combined. This is where the terminal takes a radical shift from the philosophy of graphical applications. Where a GUI allows you to use a set of predefined tools, the shell allows you to assemble commands into your own specialized tools.

This is done via the pipe: |, allowing the redirection of a command’s output stream to another command’s input stream.

$ command1 | command2

A pipe simply works by connecting the stdout stream of a command to the stdin stream of the next command. Simply said, the output of a command becomes the input of the next.

ls is piped into wc by redirecting its output into wc’s input. A pipe allows you to compose and assemble commands into pipelines, which is what makes the terminal so powerful.

You can of course chain as many commands as you want to create command pipelines.

$ command1 | command2 | command3 | ... | commandN

When you execute command1 | command2, your shell starts all commands at the same time, and a command’s output is streamed into the next one as the commands run.

For example, let’s imagine I’d like to count the number of files in my Downloads folder. To that effect, I can combine the ls and wc (for word count) commands. wc, when used with the -l option, counts the number of lines in its input.

$ ls -1 ~/Downloads | wc -l
34

Now, let’s say I only want to count the number of pdf files in my Downloads folder, instead of all of them. No problem, grep to the rescue! grep filters its input on a given pattern (more on grep in the next chapter). By using grep pdf, we filter the output of ls -1 to only the filenames containing “pdf”, and then count how many filenames remain using wc -l.

$ ls -1 ~/Downloads | grep pdf | wc -l
22
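
Note that grep pdf keeps any filename containing “pdf”, wherever it appears in the name. The sketch below reproduces the pipeline on a scratch directory with made-up filenames, and compares it with a stricter pattern that only keeps names ending in .pdf.

```shell
# Create a scratch directory with a few made-up files
dir=$(mktemp -d)
touch "$dir/report.pdf" "$dir/invoice.pdf" "$dir/pdf-notes.txt" "$dir/photo.jpg"

# Keeps every filename containing "pdf", including pdf-notes.txt
ls -1 "$dir" | grep pdf | wc -l

# Only keeps the filenames ending in ".pdf"
ls -1 "$dir" | grep '\.pdf$' | wc -l
```

The first pipeline counts 3 files and the second one counts 2, as pdf-notes.txt no longer matches.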

Going further: redirecting output to both the console and a file

The tee command allows you to write a command’s stdout to a file while still displaying it in the console. This can be very useful if you want to store the output of a command in a file, but still be able to see what it’s doing in real-time.

$ ls -1 | tee output.txt
Android
code
...
$ cat output.txt
Android
code
...

tee is named after the T-splitter used in plumbing.
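
Just like >, tee overwrites the file it writes to by default. Its -a option makes it append instead, the same way >> does. Here is a minimal sketch, using a log.txt file name of my choosing in a scratch directory.

```shell
# Work in a scratch directory so that we don't leave files behind
cd "$(mktemp -d)"

# Displays "first run" and writes it to log.txt, overwriting any previous content
echo "first run" | tee log.txt

# Displays "second run" and appends it to log.txt
echo "second run" | tee -a log.txt

cat log.txt
```

After both commands have run, log.txt contains the two lines, in order.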

Escaping from bad situations

Mistyped command, missing arguments

If you mistype a command, or forget to add arguments, you can find yourself in a situation where your shell hangs, and nothing happens. For example, type any of the following commands.

$ cat

$ echo 'hello world

The first command hangs because it is waiting for input on its stdin stream, as no argument file was provided. In the case of the second command, it is missing a matching single quote. In both cases, you can get out of this situation by hitting Ctrl-C, which kills the command by sending it an interrupt signal.

If your shell is stuck on receiving input (like in the cat example), you can also cleanly exit it by hitting Ctrl-D, which will send a special EOF (“end of file”) character, indicating to the command that its input is now closed.

$ cat
hello
hello
world
world
# Ctrl-D
$

Escaping characters

Imagine for a second that you had a file on disk named my file, and you wanted to display its content using cat.

$ cat my file
cat: my: No such file or directory
cat: file: No such file or directory

In the previous example, the cat command was given 2 arguments my and file, none of which corresponded to any existing file. We have 2 solutions to make this work: quoting the file name, or using an escape character.

$ cat 'my file'
That file has spaces in it...
$ cat "my file"
That file has spaces in it...

By putting quotes around the file name, you are telling your shell that whatever is between the quotes is a single argument.

As previously mentioned, we could also use the backslash escape character, which indicates that the following character doesn’t have any special meaning.

$ cat my\ file
That file has spaces in it...

By using \ (a backslash character followed by a space), we indicate to the shell that the space is simply a space, and should not be interpreted as a separator between 2 arguments.

Summary

In this chapter, we’ve discovered what a terminal is: an application in which you can type text commands to have them executed by a program called a shell.

Facing the terminal can be intimidating at first because you might not always know what command to type. Learning your way around the terminal is however part of the journey of becoming a software engineer. Like any other powerful tool, it can be hard to learn but will also make you immensely more productive once you get more accustomed to it.

The fundamental philosophy of working in a terminal is being free to compose different tools in a way that might not have been initially foreseen by the tools’ developers, by using pipes and stream redirections. Instead of using a single tool that was only designed to perform a finite set of tasks, you are free to assemble a patchwork of unrelated commands, that can all work together by joining their input and output streams.

In the next chapter, we will dig into text processing commands, which can be immensely powerful when chained together with pipes.

Going further

1.1: Look into the ls manual and research what the -a option is doing. Run ls -a ~/. What are the . and .. directories? What are the files starting with a . ?

1.2: Run a command and redirect its output into a file, but display any errors in the terminal.

1.3: Run a command and redirect its output into a file, and any errors into a different file.

1.4: Run a command and redirect both its output and errors into the same file, while also displaying them all on screen at the same time.

1.5: Use a heredoc redirection to create a new file with text in it.

1.6: Given an echoes file, what is the difference between wc -l echoes, cat echoes | wc -l and wc -l < echoes ?

How to setup a personal wireguard VPN

2019-12-11T00:00:00+01:00

My work takes me to the United States multiple times a year, and I've never been comfortable using the hotel Wi-Fi, or even my company VPN for that matter, when I'm there. I want to be assured that what I do online is my business and my business alone.

I had heard about Wireguard multiple times, how performant and simple it was compared to OpenVPN (I'd like to have a talk with whoever came up with the OpenVPN config file...). I decided to jump in and give it a try. The idea was to set up a VPN access point on my VPS, hosted in Paris, to which I could connect when I travel.

Installing wireguard

I followed Wireguard's official install instructions. However, I also needed to install the header files for the kernel I was running so that dkms could compile the wireguard kernel module.

% apt-get install linux-headers-$(uname -r)
% add-apt-repository ppa:wireguard/wireguard
% apt-get update
% apt-get install wireguard

If everything is going according to plan, you should see the wireguard kernel module being compiled by dkms at install time:

...
DKMS: build completed.

wireguard.ko:
Running module version sanity check.
 - Original module
   - No original module exists within this kernel
 - Installation
   - Installing to /lib/modules/X.Y.Z-ABC-generic/updates/dkms/
...

At that point, you should be able to see the module in the lsmod output and load it.

% lsmod | grep wireguard
wireguard             204800  0
ip6_udp_tunnel         16384  1 wireguard
udp_tunnel             16384  1 wireguard
% modprobe wireguard

Configuring the server peer

First off, we create the server wireguard peer's public and private keys.

% cd /etc/wireguard
% umask 077  # disable public access
% wg genkey | tee privatekey | wg pubkey > publickey
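
The umask 077 call matters here, as it makes sure that any file created afterwards is only readable and writable by its owner. The sketch below illustrates its effect on a stand-in file in a scratch directory, instead of a real key generated by wg genkey.

```shell
# Work in a scratch directory so that we don't touch /etc/wireguard
cd "$(mktemp -d)"

# Remove all group and other permissions from the files created from now on
umask 077

# Stand-in for the private key file
echo "not a real key" > privatekey

# The permissions column should read -rw-------
ls -l privatekey
```

Without the umask call, the file would typically be created with -rw-r--r-- permissions, which would let any user on the server read the private key.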

We now configure the server peer, assuming that the VPS public network interface is ens2. We'll use the 192.168.2.0/24 subnet for all wireguard-related addresses, and assign the 192.168.2.1 IP to the server peer.

% cat <<EOF > /etc/wireguard/wg0.conf
[Interface]
# The IP assigned to the wg0 interface
Address = 192.168.2.1/24

# The port wireguard will listen on
ListenPort = <public port>

# The private key used by the local peer
PrivateKey = $(cat /etc/wireguard/privatekey)

# Accept traffic to the wg0 interface and allow NATing traffic from ens2 to wg0
PostUp = iptables -A FORWARD -i %i -j ACCEPT; iptables -A FORWARD -o %i -j ACCEPT; iptables -t nat -A POSTROUTING -o ens2 -j MASQUERADE
PostDown = iptables -D FORWARD -i %i -j ACCEPT; iptables -D FORWARD -o %i -j ACCEPT; iptables -t nat -D POSTROUTING -o ens2 -j MASQUERADE

EOF
% rm /etc/wireguard/privatekey

We also need to authorize UDP traffic on the <public port> port.

% iptables -A INPUT -i ens2 -p udp --dport <public port> -j ACCEPT

Once that's done, we're able to use wg-quick to set up the wg0 network interface, as well as the MASQUERADE iptables rules that will NAT the traffic between the public ens2 interface and wg0. We can actually use systemd for that, so that we're assured that the wg0 interface is re-created in case of a reboot.

% systemctl start wg-quick@wg0
[#] ip link add wg0 type wireguard
[#] wg setconf wg0 /dev/fd/63
[#] ip -4 address add 192.168.2.1/24 dev wg0
[#] ip link set mtu 1420 up dev wg0
[#] iptables -A FORWARD -i wg0 -j ACCEPT; iptables -A FORWARD -o wg0 -j ACCEPT; iptables -t nat -A POSTROUTING -o ens2 -j MASQUERADE

% systemctl enable wg-quick@wg0
Created symlink from /etc/systemd/system/multi-user.target.wants/wg-quick@wg0.service to /lib/systemd/system/wg-quick@.service.

Configuring the phone peer

I use the Wireguard Android app, and assign the 192.168.2.2/32 address to my phone, as well as add the server peer details (as Wireguard is a point-to-point VPN without a client/server architecture).

The server peer public key is set to the content of the remote /etc/wireguard/publickey file, on my VPS. As I want to route all my phone traffic through wireguard, I set the Allowed IPs field to 0.0.0.0/0, and the peer endpoint to <server public ens2 IP>:<public port>.

Authorizing the phone peer

After having generated a public key for the phone peer, we also need to authorize it on the server peer and restart wireguard.

% cat <<EOF >> /etc/wireguard/wg0.conf

[Peer]
# Phone peer
PublicKey = <phone peer public key generated in app>
AllowedIPs = 192.168.2.2/32
EOF
% systemctl restart wg-quick@wg0

Testing the whole thing

With my phone disconnected from the server wireguard peer, I'm now able to inspect the state of the wg0 server network interface:

% ifconfig wg0
wg0       Link encap:UNSPEC  HWaddr 00-00-00-00-00-00-00-00-00-00-00-00-00-00-00-00
          inet addr:192.168.2.1  P-t-P:192.168.2.1  Mask:255.255.255.0
          UP POINTOPOINT RUNNING NOARP  MTU:1420  Metric:1
          RX packets:0 errors:0 dropped:0 overruns:0 frame:0
          TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1
          RX bytes:0 (0.0 B)  TX bytes:0 (0.0 B)

I then connect my phone to the server peer, open a random webpage, and voila, we can see traffic going through the server wg0 interface.

$ ifconfig wg0
wg0       Link encap:UNSPEC  HWaddr 00-00-00-00-00-00-00-00-00-00-00-00-00-00-00-00
          inet addr:192.168.2.1  P-t-P:192.168.2.1  Mask:255.255.255.0
          UP POINTOPOINT RUNNING NOARP  MTU:1420  Metric:1
          RX packets:4084 errors:0 dropped:132 overruns:0 frame:0
          TX packets:4895 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1
          RX bytes:452436 (452.4 KB)  TX bytes:2954188 (2.9 MB)

A quick tcpdump shows that the data flowing to wg0 is indeed encrypted.

$ tcpdump -i wg0 -vv -c 100 -X
tcpdump: listening on wg0, link-type RAW (Raw IP), capture size 262144 bytes
15:14:05.096356 IP (tos 0x0, ttl 105, id 47301, offset 0, flags [none], proto TCP (6), length 332)
    wq-in-f188.1e100.net.5228 > 192.168.2.2.46641: Flags [P.], cksum 0x8308 (correct), seq 1867855144:1867855424, ack 229885280, win 253, options [nop,nop,TS val 426814177 ecr 2017832], length 280
    0x0000:  4500 014c b8c5 0000 6906 fe02 4a7d 8cbc  E..L....i...J}..
    0x0010:  c0a8 0202 146c b631 6f55 3528 0db3 c560  .....l.1oU5(...`
    0x0020:  8018 00fd 8308 0000 0101 080a 1970 aae1  .............p..
    0x0030:  001e ca28 1703 0301 13e7 c1f4 5089 ed04  ...(........P...
    0x0040:  aba6 ef67 2cbe a7b3 f0cc 02d0 caaa d675  ...g,..........u
...

I now have a personal VPN I can use whenever I travel abroad.

Thanks to Thomas for being patient with me while answering networking questions at 11pm, and for proof-reading this article. Any remaining mistake is my own.

My DIY proposal

2019-11-30T00:00:00+01:00

You know how everyone wants their proposal to be special, thoughtful, original, and above all, wants to avoid being cliché? Well, I wanted all that. I also wanted it to be DIY and geeky. With all that in mind, and because my SO is such a Zelda fan, I decided to propose to her by having her open an Ocarina of Time themed treasure chest, which would light up from the inside and play the famous music when it opens.

I started googling for instructions for a DIY Zelda treasure chest and found this perfect tutorial. Now, being perfectly honest, I have to admit that I'm not a very gifted craftsman. The idea of making a chest myself from wood got me a little worried, as I felt that I really needed to be precise and prepared, whereas I tend to lean to the more yolo side of things.

To that end, I started by creating a 3D model of the chest itself, using FreeCAD, which I had to learn from scratch, by using the dimensions (in imperial freedom units) from the instructables tutorial.

I finally ended up with something looking pretty good, that I could export to a plan with precise dimensions reported in mm.

The easy part was now done, and I needed to get to the actual building phase. I found a fablab in my area that looked pretty nice, but I couldn't find enough time to sneak around there and actually get to it. Time passed, and I one day noticed a big stack of cardboard lying around in our flat. I finally decided to make the chest out of cardboard instead, as it'd be easier to work and iterate with.

I followed the plan as best as I could (and improvised a fair amount), and ended up with something looking quite good!

At that point, I had a nice looking chest, but I also wanted music and light to beam out of it when it was being opened. I investigated a setup based on an Arduino with external speakers, a lever switch and an external LED for a while, but I soon realised that my phone was all I needed to make it work! All I needed to do was write an app that would play an mp3 and light up the phone's LED when it detected an ambient brightness increase. As the brightness sensor needed to face upwards, that'd mean that the LED would face downwards and beam the light towards the bottom of the chest. I decided to put a small mirror in the chest, and go with that.

I taught myself Kotlin and Android development on Udemy (thanks to Datadog for providing employees with an account!), by following Kotlin for Android: Beginner to Advanced, by Devslopes, which I can't recommend enough. I ended up with that small application installed on my phone.

One issue that I encountered was that Android does not give you any API to ask the brightness sensor for the current brightness value. All you can do is be notified when that value changes.

// Event handler executed when the light sensor detects a change
override fun onSensorChanged(event: SensorEvent) {
    val lux = event.values[0]
    println("Lux: ${lux}")
    if (lux >= activationLux && !mPlayer.isPlaying) {
        turnLightsOn()
        playSound()
    }
}

This is unwieldy as all the app can do is detect if the ambient brightness crosses an absolute threshold, which itself depends on the time of the day, the ambient light of the room, etc. I investigated whether I could light the LED up for a couple of seconds, then turn it off, to force the sensor to pick up some changes, to approximate the current brightness inside the chest. In the end, it was easier to just cover the inside with black foam to make sure the phone was in pitch black darkness.

I was finally all set.

The ring was a ruby, obviously.

Oh and she said yes!

On letting go

2019-11-11T00:00:00+01:00

I have been feeling more and more burdened in the recent months. It first wasn't clear to me why I was feeling that way: I'm in the most happy and fulfilling relationship I've ever been in, we just moved to a beautiful apartment we both fell in love with, I'm honoured and lucky to have great friends, and a fantastic job I'm enjoying myself in (which also turns out to pay greatly), alongside some of the greatest engineers out there. Why on Earth was I therefore sometimes feeling like all I felt like doing was sit on the couch, turn on Netflix, and wait for the day to end?

I started to get a better understanding of what was happening when my girlfriend confronted me on the subject of money, which turned out to be my greatest insecurity. At the time, we were planning for a trip to Canada mid-October, during which we were going camping in one of the national parks. Having almost no camping gear, we borrowed some and went out to buy the rest. What should have been an uneventful event turned into a fight, due to the anxiety that the increasing pile of items in our cart was somehow causing me. Almost all I could think about was how much that gear was going to cost, and the space it would take at home when we got back. When we finally got to the point of choosing sleeping bags (something that you don't want to be cheap about when sleeping in the cold Canadian wilderness), I'd become so anxious that I was almost incapable of adding them to the cart and snapped back when my partner asked me why I was hesitating, given that I could afford them.

That whole conversation felt like déjà vu, as we went through the exact same process when planning for the renovation of our kitchen and bathroom, in the new apartment. During the process of choosing what we liked and imagining what we'd want, I couldn't mute that little voice in my head keeping track of what it would cost.

Why was I anxious at the idea of buying something that I needed or wanted, that would bring me comfort, when I could afford it in the first place?

As a side note, I don't think that I'm a cheap person. I have no problem buying someone else a present, lending or giving money, but when it comes to me, I can behave like Balthazar Scrooge (how appropriate).

It finally clicked as I was looking for old pictures and un-used apps to delete from my phone. I was doing that to keep it as clutter-free as possible, to free myself from the weight of the things I wasn't using. It felt good to get rid of that dead digital weight, to make space for myself. And as I was doing so, I finally realised that the anguish didn't come from spending the money, but from the fact that buying new things for home would decrease my living space. These objects would then add to the clutter I was trying to clear from my phone, and weigh me down.

I needed to apply to the physical life what my digital self instinctively knew: that letting things go is okay.

By choosing to only surround myself with what I love and find beautiful, I can create an environment that lifts me up and "recharges my batteries", so to speak. Anything not actively contributing to a shared sense of aesthetics or not making me feel good can be given away (as long as my partner and I agree). I now realise that less is more is key to my happiness and peace of mind. Keep what you love, enjoy what you have, give away the rest.

I should probably avoid starting a new project and practice what I already know I love but haven't worked on for a while.

In Digital Minimalism, Cal Newport points out that we oftentimes find ourselves installing a new digital tool (whether it's a new app, a new social media account, ...) without actively considering the benefit we're getting out of it. By doing so, we're inviting new notifications and distractions into our lives, which will eat away at our personal time and attention. As time and space are related under the Einsteinian theory of relativity, maybe our personal time and space are related as well? They both should be protected as much as possible from whatever is unaesthetic, distracting and that which weighs us down.

I've thus cleared out my Reddit and Twitter accounts from everything work-related (and almost abandoned Twitter altogether), have deactivated almost all phone notifications (except for my partner's messages), and am in the process of applying the same principles in the physical world.

Keep what you love, enjoy what you have, give away the rest.

It feels fantastic.

Managing my infra like it's 2019

2019-07-22T00:00:00+02:00

I recently realized that I was routinely managing thousands of servers and petabytes of data in my daily job, but was still managing my own personal infrastructure like I was living in 1999.

With the advent of configuration management tools such as Ansible, Chef, and the like, it became easier to configure instances in a reproducible manner by defining said configuration as code. Terraform made it easier to codify and provision cloud resources: instances, but also security groups, permissions, storage, load balancers, etc.

It's easy to simply think of a cloud infrastructure as a pool of compute resources. It is however often so much more than that. When executed right, The Cloud is a set of meshed services, interacting and communicating with each other (possibly with compute resources sitting in the middle). That applies to vast and complex infrastructures such as the one I work on at Datadog, but it also applies to my ridiculously tiny personal one. Realizing this got me thinking. Why wasn't I using the same tools and techniques to manage my small infrastructure as the ones I'm using daily?

My infrastructure

My personal infrastructure consists of (drumrolls...) 3 servers:

a VPS running in Scaleway, hosting my personal services (personal website, blog, git repositories, CalDAV server, traffic analytics, IRC client, Read-it-later service, etc)
a VPS running in OVH, hosting my mother's website
a Raspberry Pi, running in my living room, hosting private services (Kresus)

Until now, each of these servers was managed in an ad-hoc fashion, sometimes with scripts, sometimes without. All the cloud resources my services rely on (S3 buckets, DNS zones, etc) were managed manually, using the cloud provider web console.

I manage my DNS zones with OVH, I use the AWS S3 bucket free tier for the blog images, and Datadog for monitoring.

Improving the setup

I had several objectives in mind to improve the current setup:

define all instances configuration and state in ansible playbooks
re-use and share instances configuration by leveraging ansible roles
define and manage all cloud resources using terraform to never have to log into a cloud web console again
secure all web-services with an automatically renewed SSL certificate provided by Let's Encrypt
run all services behind a reverse-proxy, using a docker container or a userland systemd service with minimal permissions and privileges
monitor the hosts and services using Datadog (free for 5 hosts or less), with monitors defined in terraform
secure the SSH connections of the internet-facing hosts via Duo (free for 10 users or less)
be able to SSH into all hosts from my personal and work laptop, as well as from my phone
monitor my daily backups

Show me the code

You can have a look at the code here. I've purposefully omitted the terraform/global_vars/main.tf file, credentials are obviously encrypted, API keys are defined in my home directory, but everything else is readable openly. My hope is that readers might either learn something or point out where I'm doing something silly or insecure.

What now?

I'm now confident that I can open some of these services to friends, if they want to. I measure and monitor my own SLIs, the expiry of the SSL certificates, and can intervene from anywhere if something breaks.

My infrastructure is now more secure, and has been audited by fellow peers ¹. I'm now confident I can restore the services in the face of an instance loss (which is very important for my mother, as her website has a fair amount of traffic and brings her regular new customers).

I'm also dogfooding Datadog features, which got me to suggest a couple of improvements to the Datadog terraform provider which will be worked on next quarter.

Thanks to Mehdi and Thomas for the thorough playbook review. Any remaining mistake or silliness is my own.

My no-knead bread recipe

2019-05-18T00:00:00+02:00

I've started baking again, and I think I'm really getting nice results. On top of that, they are reproducible! In this post, I'll walk you through my favorite recipe step-by-step.

I'm baking a 65% hydrated loaf using a no-knead technique, cooked in a cooking pot.

Ingredients

As the FWSY book said, a regular bread contains 4 ingredients: flour, water, salt and yeast.

My recipe contains:

500g T65 flour
325mL of lukewarm water (the hydration percentage is calculated based on the flour mass, so 500 * 0.65 = 325g)
9g of coarse sea salt (the french yeast syndicate has settled on 18g of salt per kg of flour, so who am I to say otherwise?)
8g of fresh baker's yeast

Initial mix

First, mix the flour, the salt, and half of the water in a bowl, and the fresh baker's yeast with the other half of the water. Wait for a good half-hour.

Once the yeast is active and is well mixed into the water, incorporate it into the bowl and mix.

Stretch and folds

As this is a no-knead recipe, we'll use the stretch and fold technique to stretch and reinforce the gluten strands, which will then make sure the dough is elastic and can expand and rise nicely at cooking time.

Note: the dough will stick to the fingers during the first 2 stretch-and-folds. It's ok. Try to refrain from adding too much flour.

First, flour the table a little, and get the dough out of the bowl (preferably using a scraper, to avoid tearing the dough while you get it out). Stretch it slowly and as much as possible. The first times, it's possible some holes will form. If so, try to patch them and don't overstretch.

Then, fold the dough 4 times on itself and shape the dough into a boule. Let it rest in a bowl, under a slightly damp towel for between a half-hour and an hour. In my experience, the more you wait, the more the yeast will activate and the more bubbles you'll get in the end.

After that waiting period, the dough should have expanded a bit, and feel more elastic, as well as less sticky.

Repeat these steps 4 to 5 times until the dough passes the finger dent test. The more you wait, the better.

The dough after 3 stretch and folds

The dough after 4 stretch and folds. Look at that puffy boi!

The dough after 5 stretch and folds

Proofing

Shape the dough into a boule, and let it proof for 2 hours under a slightly damp towel. Go watch Netflix or something. After the 2 hours, shape it into a boule again, to re-tighten the dough.

The dough after 2 hours of proofing and a tightening

Pre-heating and scoring

Pre-heat your oven at 250°C (482°F) with the cooking pot (lid included) inside. Once the oven is hot enough, place the proofed dough on cooking paper, flour it, then score it.

The floured dough

A deep scoring pattern allows the gas to dissipate. Shallow ones are just for show

Cooking

Get the cooking pot from the oven, and place the loaf inside, still on the cooking paper. Let it cook lid closed for 30 minutes, to make sure the water contained in the bread evaporates in the pot, which will help the crust develop. Remove the lid and lower the oven temperature to 235°C (455°F). Get the bread out of the oven after 15 to 20 minutes, when you feel it's cooked enough and you like the color.

Resting

Place your hot loaf on a grille, and let it rest and cool down for a couple of hours. Enjoy the cracking sounds.

All done!

Eating

You know what to do.

Final crumb shot

Allocating unbounded resources to a kubernetes pod

2018-09-29T00:00:00+02:00

Note: this article assumes that the reader is familiar with Kubernetes and Linux cgroups.

When deploying a pod in a Kubernetes cluster, you normally have 2 choices when it comes to resources allotment:

defining CPU/memory resource requests and limits at the pod level
defining default CPU/memory requests and limits at the namespace level using a LimitRange

However, what if circumstances allowed you to allocate unbounded resources to your pod? While that would go against the idea of bin-packing pods by using resource bounded cgroups, it could still be useful if you ran no other pods than the unbounded one on your node. In that case, you wouldn't be interested in protecting your pod against any noisy neighbour, and you'd want it to be able to use all the available node resources.

This (while not strictly documented) can be accomplished by using the following resource limits and requests:

resources:
  limits:
    cpu: 0
    memory: 0
  requests:
    cpu: 0
    memory: 0

In our case, we also have a defined LimitRange in our namespace, so we want to make sure that our request for unbounded resources does not get overridden by the default values.

$ kubectl describe limitrange my-limit-range
Name:       my-limit-range
Namespace:  default
Type        Resource  Min  Max  Default Request  Default Limit
----        --------  ---  ---  ---------------  -------------
Container   memory    -    -    512Mi            1Gi
Container   cpu       -    -    500m             1

$ kubectl get pod my-pod -o jsonpath='{.spec.containers[0].resources}'
map[limits:map[cpu:0 memory:0] requests:map[cpu:0 memory:0]]

It seems that the LimitRange has not overridden our request. However, we see a different picture when we inspect the node running our pod:

$ kubectl get pod my-pod -o jsonpath='{.spec.nodeName}'
my-node
$ kubectl describe node my-node
...
Non-terminated Pods:         (6 in total)
  Namespace    Name     CPU Requests  CPU Limits  Memory Requests  Memory Limits
  ---------    ----     ------------  ----------  ---------------  -------------
  datadog      my-pod   500m (13%)    1 (26%)     512Mi (2%)       1Gi (4%)
...

Who should we believe? When different parts of the control plane disagree on the resource allotment, there's really one place to get the truth from: the container cgroup itself.

To do so, we need to exec into the pod, and inspect the CPU quota and memory limit values.

$ kubectl exec -it my-pod -- bash
user@my-pod:/$ cat /sys/fs/cgroup/cpu/cpu.cfs_quota_us
-1

As detailed in the Linux kernel documentation and on the Red Hat documentation portal:

A value of -1 for cpu.cfs_quota_us indicates that the group does not have any bandwidth restriction in place, such a group is described as an unconstrained bandwidth group. This represents the traditional work-conserving behavior for CFS.

Now, the memory.

user@my-pod:/$ cat /sys/fs/cgroup/memory/memory.limit_in_bytes
9223372036854771712

That looks odd. Taken at face value, this would indicate that the process has a limit of ... just under 8 EiB of memory!

Digging a bit further, we learn that 9223372036854771712 is a kind of "magic" number in the memory management layer of the kernel, meaning that the process gets unbounded memory.
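To make that number a bit less magic, here is a small sketch in Python. It assumes a 64-bit kernel with 4 KiB memory pages (an assumption, not something checked on the node above), in which case the cgroup v1 "no limit" value is the largest signed 64-bit integer, rounded down to a whole number of pages. The is_unbounded helper is a hypothetical name, only there to interpret the two values we read from the cgroup files.

```python
# Assumption: 64-bit kernel with 4 KiB memory pages.
PAGE_SIZE = 4096
LONG_MAX = 2**63 - 1

# Largest signed 64-bit value, rounded down to a whole number of pages.
UNLIMITED = (LONG_MAX // PAGE_SIZE) * PAGE_SIZE


def is_unbounded(cpu_quota_us: str, memory_limit_in_bytes: str) -> bool:
    """Interpret the raw content of cpu.cfs_quota_us and memory.limit_in_bytes (cgroup v1)."""
    return int(cpu_quota_us) == -1 and int(memory_limit_in_bytes) >= UNLIMITED


print(UNLIMITED)
print(is_unbounded("-1", "9223372036854771712"))
```

Under these assumptions, the computed value matches the one reported by memory.limit_in_bytes in our pod.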

Conclusion

Looking at the cgroup itself showed that a value of 0 for cpu/memory requests/limits is not intercepted by the LimitRange in place, and is translated to an unbounded cgroup in the end. It also showed that the pod resource requests and limits reported at the node level are inaccurate.

On meritocracy, identity and context

2018-09-21T00:00:00+02:00

Before reading

This is a deeply personal article that hasn't been easy to write, especially with all the tension currently occurring in the tech industry around inclusiveness, gender, codes of conduct, etc. I've done my best to explain my thoughts on the matter, while being as respectful as possible. If you feel that you disagree with me, I'd be happy to debate with you, as long as the discussion stays civil and respectful.

The lay of the land

I have been working as a professional engineer in the tech industry for the last 7 years or so. My first contact with the subject of under-representation of minorities in the industry came during EuroPython 2012, when a tasteless tweet was posted the very night after Lynn Root talked about "Increasing women engagement in the Python community" (these events are best summarized by Lynn herself). And these events kept happening, and happening, and happening. My personal view has since then been that each of these highly publicized events was caused by an appalling lack of tact, thoughtfulness, empathy and respect, and that conference attendees should make sure to behave appropriately or suffer the consequences.

Codes of Conduct started to be defined for conferences and online projects, thanks to the initiative of groups and individuals pushing for more respectful and inclusive communities. I have always thought that these were essential and useful, because they seemed to make some conference attendees or project members feel safer (without making me feel any less so), and would probably help keep jerk-like behavior at bay.

Safe spaces were organized in conferences (I even helped on a few myself, as a tutor), and I thought it was a wonderful idea. It did not take anything away from the most represented types of conference attendees, and allowed less represented people to find their bearings in a safe environment.

However, I feel something changed for me when I read the proposal to replace the master/slave terminology with leader/follower in the Django framework. The PR starts with the following stance:

The docs and some tests contain references to a master/slave db configuration. While this terminology has been used for a long time, those terms may carry racially charged meanings to users.

My view at the time was "I mean, it does not really change anything for me, and if it can help people feel better...". Looking back, I'm pretty sure I felt a bit of unease reading the PR, but I (subconsciously or not) shrugged it off.

That debate recently resurfaced when the same thing happened in both the redis and the CPython codebases. Reading what antirez (the redis creator) had to say on the subject was a real moment of clarity for me.

Today it happened again. A developer, that we’ll call Mark to avoid exposing his real name, read the Redis 5.0 RC5 change log, and was disappointed to see that Redis still uses the “master” and “slave” terminology in order to identify different roles in Redis replication.

I said that I was sorry he was disappointed about that, but at the same time, I don’t believe that terminology out of context is offensive, so if I use master-slave in the context of databases, and I’m not referring in any way to slavery. I originally copied the terms from MySQL, and now they are the way we call things in Redis, and since I do not believe in this battle (I’ll tell you later why), to change the documentation, deprecate the API and add a new one, change the INFO fields, just to make a subset of people that care about those things more happy, do not make sense to me.

After it was clear that I was not interested in his argument, Mark accused me of being fascist.

At this point, I realized the landscape had dramatically changed, and that the inclusiveness debate had morphed into a more politicized and (in my view) confused and sterile version of itself.

Case in point, someone suggested the Zen of Python should be modified because the sentence Beautiful is better than ugly could be interpreted as support for body-shaming behaviors. Words cannot express how wrong this feels to me. That suggestion shows both a profound lack of contextual thinking, and a will to advance a pro-political-correctness agenda.

People have been talking about Beautiful Code and Ugly Code for a long time. Long enough to write books about it. Long enough so that I could have late night discussions about it with my father (who's also a computer scientist). To me, suggesting that Beautiful is better than ugly encourages body shaming feels alien, because it's completely out of context. Words have certain meanings in certain contexts. That's how we get away with synonyms. In the context of the Zen of Python, the word Beautiful clearly characterizes code, not people. The Dwarf Star term defines a certain type of star, with given astrophysical properties. Should the entire astrophysics community rename it just because some people feel it's an offensive way of calling Peter Dinklage? Similar humorous (or not?) counter-arguments were offered during the CPython master/slave debate.

It seems all we read about now (especially after Linus Torvalds' temporary stepdown) is either written by strong meritocracy partisans, conspiracy theorists or by strong inclusiveness defenders (I've decided not to use the term SJW, as I understand it's a mocking and pejorative term).

It was even debated whether this suggestion was made by a troll. The fact that, troll or not, that discussion lingered for several days is a very serious issue to me. It shows how polarized the debate now is, and how easily a strong community can be derailed.

About inclusivity, diversity and context

The core of the debate is focused on inclusivity and diversity (see this example), which got me thinking. It's clear to me why we want to push for diversity:

a body of similar minds will likely produce similar solutions to a problem, causing the final adopted solution to be narrower
a person could (subconsciously or not) avoid a given career path / community because she/he might not feel represented enough, and thus feel excluded or as though he/she does not belong

I want to focus on the second point, because I'm of the opinion that this is where the heated debates stem from.

If you read the Contributor Covenant, a code of conduct that most current conferences and communities use or build upon, the text starts with:

In the interest of fostering an open and welcoming environment, we as contributors and maintainers pledge to making participation in our project and our community a harassment-free experience for everyone, regardless of age, body size, disability, ethnicity, sex characteristics, gender identity and expression, level of experience, education, socio-economic status, nationality, personal appearance, race, religion, or sexual identity and orientation.

I naturally tend to agree with this. We should all strive for inclusiveness and diversity, and should make sure everyone is treated gently and is given a friendly, open hand, whoever they are. However, if I were fostering malicious intent, I could point out that this list does not cover diets. I myself am a flexitarian (I've cut out all fish and meat from my daily diet, but will eat some without issue if there's no other option). I could somehow feel unrepresented or even excluded from a given community if its CoC does not state that my personal diet should be respected.

Although that example could seem frivolous or ridiculous, it points out something I feel is interesting. That whole paragraph attempts to list all the ways people could differ, to make sure everyone is explicitly included. I would personally have phrased it in a more open-ended way:

In the interest of fostering an open and welcoming environment, we as contributors and maintainers pledge to making participation in our project and our community a harassment-free experience for everyone, regardless of who they are and how they identify.

as I think some issues stem from the fact we have attempted to list what constitutes "diversity". If a tech conference decides to impose quotas on speakers, these quotas will focus on certain attributes (eg sex and skin color) while missing others (eg age, education), which might help some people to better identify with the speakers, but might not help others. This inventaire à la Prévert certainly looks like inclusiveness, but I think it misses the point.

How we identify is both subjective and subject to context. I might identify as an SRE, an engineer or a Python developer in the context of work or a tech-related event, a social extrovert in the context of a party, a leftist heterosexual male in the context of my personal and private life, etc.

How we identify depends on context, and yet, we seem intent on mixing personal identities and non-personal contexts, the same way accusing Beautiful is better than ugly of promoting body shaming mixes human and technological contexts. I recognize that some situations are trickier than others (eg conferences, workplaces), because they can mix personal and professional contexts, thus blurring the lines.

If diversity is defined as having multiple identities present, then diversity must be subjective and subject to context too. To follow up on that tech conference example, I feel diversity in the technological content should reside in the education background, level of experience and field of interest of the speaker, while diversity in the social events tied to the conference could have a totally different definition.

These criteria are my pick, but I suggest you clearly and openly define which ones matter to you if you're ever in the position of selecting speakers or employees.

Closing words

In my view, the tech industry as a whole has been guilty of resistance to change by kicking around the old meritocracy horse for too long. We need to talk about the lack of women, the rampant misogynist attitudes, the gender pay gap. We need to fix these issues by acknowledging them first, and debating them transparently, in a less polarized way. Not just as an industry, but as a society.

However, just as I don't buy into the "show me the code or GTFO" attitude, I don't believe in political correctness before everything else. If some people lack the ability to recognize that Beautiful is better than ugly in the Zen of Python does not body shame people, then maybe we shouldn't let them define what our core values are.

Solution to Advent of Code "Day 3: Spiral Memory"

2017-12-31T00:00:00+01:00

After an unsuccessful attempt at learning Rust earlier this year (I mainly read through the documentation without applying it in any project), I recently started to tackle the 2017 edition of Advent of Code, in order to practice Rust for real.

The 3rd challenge, Spiral Memory, is interesting because you can bruteforce it, or solve it with math. I ended up doing the latter, even though math is really not my strong suit.

We're asked to calculate the Manhattan distance between a given point and the center, in a spiral reference. The problem amounts to finding the coordinates of any point $P$ in this spiral reference, as once we have the point coordinates, calculating the Manhattan distance is easy:

\begin{align} D_P &= |X_P - X_0| + |Y_P - Y_0| \\ &= |X_P| + |Y_P| \end{align}

Nested shells

My approach was the following: a spiral has nested "shells", all centered around the center. In this image, the first shell is outlined in grey, and the second one in purple. Each of these shells has a first value, called $S_i$, where $i$ is the index of the shell.

For any point $(X_P, Y_P)$ of value $V$, we know that it is located on the shell right before the first shell with a start value $S$ such that $S > V$. For example, if the input value was 23, we know that it's located on the second shell as $S_2 ≤ 23 < S_3$.

We need to know the number of elements a shell of index $i$ is composed of, noted $Δ_i$. On this representation, the first shell is a square with sides of length 3, and the second shell is a square with sides of length 5. We can generalize this to $L = 2i + 1$, where $i$ is the index of the shell. For any index $i ≥ 1$, the shell is composed of the following number of elements:

\begin{align} Δ_i &= (2i + 1)^2 - (2(i - 1) + 1)^2 \\ &= 4i^2 + 4i + 1 - 4i^2 + 4i - 1 \\ &= 8i \end{align}

Coordinates of the first element of a shell

Once we know on which shell a given point $P$ is located, we need to know the coordinates of the first point $S_i$ of this shell, so we can infer $P$'s coordinates. This first point will always be located after the center point, and all points composing the previous shells. We can thus infer

\begin{equation} V_{S_i} = 2 + \sum_{k=1}^{i-1} Δ_k \end{equation}
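As a sanity check on the two formulas above, here is a short Python sketch (Python rather than Rust, to keep it compact). It counts the points of each shell directly on the grid instead of trusting the algebra, then derives the first value of each shell from these counts. The function names are mine, not part of the solution below. The last comparison relies on the closed form of the sum, $2 + 4i(i - 1)$, which follows from $Δ_k = 8k$.

```python
def shell_size(i):
    """Count the grid points lying exactly on the square shell of index i."""
    side = range(-i, i + 1)
    return sum(1 for x in side for y in side if max(abs(x), abs(y)) == i)


def start_value(i):
    """Value of the first element of shell i: center + previous shells + 1."""
    return 2 + sum(shell_size(k) for k in range(1, i))


for i in range(1, 6):
    # Each line shows: shell index, shell size, first value of the shell,
    # and whether that first value matches the closed form 2 + 4i(i - 1)
    print(i, shell_size(i), start_value(i), start_value(i) == 2 + 4 * i * (i - 1))
```

The shell sizes come out as multiples of 8, and the second and third shells start at 10 and 26, which is consistent with the example given earlier for the value 23.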

We now need to get the coordinates of any given first shell point. By simply looking at the spiral itself, we can deduce that

\begin{equation} (X_{S_i}, Y_{S_i}) = (i, -i + 1) \end{equation}

Navigating the spiral

The final piece of the puzzle is to infer the coordinates of the point $P$ given the coordinates of the start point $S_i$ of the shell it belongs to. To do that, we need to look at how the coordinates evolve along a shell.

We can see that:

on the first quarter of the shell, $Y$ coordinates increase by 1 for each increasing value
on the second quarter of the shell, $X$ coordinates decrease by 1 for each increasing value
on the third quarter of the shell, $Y$ coordinates decrease by 1 for each increasing value
on the fourth quarter of the shell, $X$ coordinates increase by 1 for each increasing value

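To calculate the coordinates of the point $P$, we just need to locate it on the shell, start from $(X_{S_i}, Y_{S_i})$ and increase/decrease the $X$ and $Y$ coordinates until we reach the target value. Note that $S_i$ sits one step above the bottom-right corner of the shell, so counting from $S_i$, the first side is only $2i - 1$ steps long, while the three others are $2i$ steps long.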
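The observations above can be cross-checked with a brute-force walk of the spiral. The following Python sketch does not use any of the shell formulas: it simply moves right, up, left and down, with straight runs of length 1, 1, 2, 2, 3, 3, etc. The function names are hypothetical, and this is only meant as a slow reference to compare against, not as the solution itself.

```python
from itertools import islice


def walk_spiral():
    """Yield the (x, y) position of the values 1, 2, 3, ... along the spiral."""
    x = y = 0
    dx, dy = 1, 0  # the first move goes right
    yield x, y
    run = 1
    while True:
        # Straight runs have lengths 1, 1, 2, 2, 3, 3, ...
        for _ in range(2):
            for _ in range(run):
                x, y = x + dx, y + dy
                yield x, y
            dx, dy = -dy, dx  # turn 90 degrees counter-clockwise
        run += 1


def coordinates(value):
    """Brute-force lookup of the position of a given value."""
    return next(islice(walk_spiral(), value - 1, None))


def manhattan_distance(value):
    x, y = coordinates(value)
    return abs(x) + abs(y)


for value in (2, 10, 26, 12, 23, 1024):
    print(value, coordinates(value), manhattan_distance(value))
```

The values 2, 10 and 26 are the first elements of shells 1, 2 and 3, and the walk places them at (1, 0), (2, -1) and (3, -2), in line with the $(i, -i + 1)$ formula. The values 12, 23 and 1024 are the examples from the puzzle statement.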

The implementation

The strategy is:

calculate the values of the first shell points until we find a value greater than our target point
backtrack to the previous shell
compute the coordinates of the first point of the shell we backtracked to
increase/decrease the $X$ and $Y$ coordinates until we reach the target value
calculate the Manhattan distance using these coordinates

// advent_day03.rs

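// Number of elements in the shell of the given level
fn nb_elements_in_outer_level(level: i32) -> i32 {
    8 * level
}

// Value of the first element of the shell of the given level
fn start_element(level: i32) -> i32 {
    if level == 0 {
        1
    } else {
        // The center, plus all elements of the previous shells, plus one
        let mut out = 0;
        for i in 1..level {
            out += nb_elements_in_outer_level(i);
        }
        out + 2
    }
}

// Coordinates of the first element of the shell of the given level (level >= 1)
fn first_element_coordinates(level: i32) -> (i32, i32) {
    (level, -level + 1)
}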


fn number_coordinates(number: i32) -> (i32, i32) {
    // The center of the spiral is the only element of level 0.
    if number == 1 {
        return (0, 0);
    }

    let mut level = 0;
    let mut start: i32;

    // Increase level until we find a starting value strictly greater
    // than the input value. When such a value is found, backtrack a step.
    loop {
        start = start_element(level);
        if start > number {
            level -= 1;
            println!("{:?} is found on level {:?} of the spiral", number, level);
            start = start_element(level);
            break
        } else {
            level += 1;
        }
    }

    // At this point, we've found the starting point of the spiral
    // level the number belongs to. We count the steps from the
    // bottom-right corner of the level, located one step below its first
    // element, so that each side is exactly 2 * level steps long.
    let delta = number - start + 1;
    let (mut x, mut y) = first_element_coordinates(level);
    y -= 1;

    // Right side, going up
    if delta > 2 * level {
        y += 2 * level;
    } else {
        y += delta;
        return (x, y)
    }

    // Top side, going left
    if delta > 4 * level {
        x -= 2 * level;
    } else {
        x -= delta - (2 * level);
        return (x, y)
    }

    // Left side, going down
    if delta > 6 * level {
        y -= 2 * level;
    } else {
        y -= delta - (4 * level);
        return (x, y)
    }

    // Bottom side, going right
    x += delta - (6 * level);
    (x, y)
}

fn manhattan_distance(x: i32, y: i32) -> i32 {
    x.abs() + y.abs()
}

fn main() {
    let number = 312051;
    let (x, y) = number_coordinates(number);
    println!("{:?} has coordinates {:?}", number, (x, y));
    let distance = manhattan_distance(x, y);
    println!("{:?} is at a distance of {:?} from the center", number, distance);
}

The solution

312051 is found on level 279 of the spiral
312051 has coordinates (-151, -279)
312051 is at a distance of 430 from the center

On working from home while remaining sane

2017-10-29T00:00:00+02:00

Since I started working at Datadog, I've had the opportunity of working from home full-time (for the second time in my career). Although I consider this to be a real privilege, it comes with its own set of challenges that I'd like to pinpoint and address in light of my personal experiences.

I hope this article will be useful for anyone willing to try out (or struggling with) remote work.

Productivity VS isolation

First, why would you even want to work from home in the first place? To me, it's both about flexibility and productivity. I can focus on complex tasks for long periods of time without being interrupted, while still being able to keep a flexible timetable. I can also work from anywhere, as long as I can have a good enough internet connection.

However, this flexibility and freedom come at the price of isolation, which can then lead to demotivation or burn-out down the road. Remote work is, by definition, solitary, which can quickly become an issue, because humans are social animals and (for most of us) crave social and physical interaction. This makes me believe that remote workers are more exposed to burn-out.

The burn-out cycle

In my experience, the easiest path to demotivation or burn-out (whether you're working remotely or not) is being over-enthusiastic and working long hours. When doing so, it's easy to develop some kind of hero complex, a belief that you're indispensable and that things will break down if you take a break, or leave on holidays. The more hours you pull, the less sleep you get, the more stressed and tired you become. Because you're stressed, you then feel you need to work harder, until you just can't take it anymore, and you burn out.

Ideally, this cycle can be prevented or broken with proper management and supervision. If your manager realises you've started down this slippery slope, she/he should take action, and encourage/force you to take a break. This can be supported by regular 1-1 meetings, to keep track of how remote workers are doing.

This brings me to an important point: remote work can be dangerous if it's not part of the company culture, in which case you should stay away.

Remote as a culture

To enable sane remote work, a company must include remote workers in all events, when physically possible. All brown-bags, talks, all-hands, etc, should be streamed live, or at least recorded. If being out of the office means you have access to less information, it means that remote workers are seen as second-rate employees.

All communication must be asynchronous, to include remote workers, especially if teams are working across timezones. Whether it's Slack, email, Google Docs or something else, anyone should be able to catch up with any conversation or topic. Any significant direct discussion should be made available one way or another to remote workers.

Finally, it should be easy to go meet your team in person. I'd go even further and recommend you do it on a regular basis. I personally chose to go to our Paris office a week every month.

Work hygiene

Now, if your company has remote in its blood and culture, good! All that is left to figure out is your organisation and work hygiene. The following pieces of advice come from my personal experience and should not be considered as absolute truths backed by science. Take them if they make sense to you.

Containerisation of private and professional life

The first thing I find absolutely essential is containerisation (no, not Docker) of your private and professional life. You need to have a dedicated office room, with a door, that is not your living room. The idea is that, when you open that door, you're at work, and when you close it, you're out. I find it to be especially important during the first weeks of remote work. I now find myself working more and more from my living room, but I know that if I need isolation for some reason, I still have this room I can go to.

For the same reason (along with a bazillion security reasons), never work from your personal machine. You want to make it a conscious effort to switch from watching Netflix to reading your work email.

Routine

To me, routine is key to avoid getting tired. Try to wake up, start working, eat, stop working and go to sleep at regular hours. Ban any night work, especially when you're not on-call.

Exercise is also very important. It's easy to keep extremely static during your remote work-day, which can take a toll on your health. Also, one of the things I miss the most is my daily bike commute. I replaced it with 45 minutes of gym in the morning, 3 times a week. This has the nice advantage of making me feel like I accomplished something very early in the day, and gives me energy to keep it going.

Also, take regular breaks and take a 15 minute walk at some point in the day.

Talk with other remote workers about how they make it. Share tips, stories, do-s and don't-s, to build collective wisdom.

Conclusion

Working from home can be liberating and an amazing productivity booster, but you need to stay alert and conscious of the challenges and constraints it entails. I'd urge you to show a fair amount of self-discipline and organisation in order to avoid falling into the burn-out spiral.

Have fun!

Edit

I found this very interesting resource from Trello, called How to embrace remote work.

The main takeaways I get from it are:

pace yourself: work isn't going anywhere. Do not forget to take breaks.
use the right tool to convey the right information (do not rely on instant messaging for crucial information!)
don't forget to use passive communication (eg: status messages)
intent can be lost over text communication. Assume positive intent.

The story of the 20°C cronjob

2017-05-25T00:00:00+02:00

For the last month or so, the battery life of my beloved Thinkpad X1 Carbon had been going down the drain, from 5-6 hours to less than 3. Following @padenot's advice, I installed powertop and started investigating what was draining this good ol' battery of mine.

Looking at the powertop output, I immediately realized that something fishy was happening on this laptop:

The battery reports a discharge rate of 4.95 W
The estimated remaining time is 2 hours, 6 minutes

Summary: 1111.7 wakeups/second,  7.9 GPU ops/seconds, 0.0 VFS ops/sec and 23.0% CPU use

                Usage       Events/s    Category       Description
            264.4 ms/s     3656.7       Process        /bin/bash /usr/sbin/sendmail -FCronDaemon -i -odi -oem -oi -t -f br
            114.3 ms/s     626.2        Process        /usr/lib64/firefox/firefox
             20.7 ms/s      95.5        Process        /opt/sublime_text_3/plugin_host 3272
             ...

Why was sendmail so busy, and why in the hell was it running anyway? strace showed me that the process was indeed very busy, and mailq showed that I had more than 15000 outgoing emails in the system mail queue!

$ mailq
...
mail in dir /home/br/.esmtp_queue/TSRueRJD:
    From: "(Cron Daemon)" <br>  To: br
mail in dir /home/br/.esmtp_queue/ZI1LtzhT:
    From: "(Cron Daemon)" <br>  To: br
15653 mails to deliver

Ok, so all these mails were being sent by cron. My user crontab only had one job, and it was * * * * * rm $HOME/crash_dump.erl. Indeed, I had been experimenting with Elixir recently, and when I crashed the Erlang VM, this file would pop up in my home directory. At some point, I added this cronjob to make it go away and forgot about it. As the job's output was not redirected to /dev/null, each time the file was not found, rm would print an error, and cron would add a mail containing that error to the queue.

After removing this job, purging the mail queue, and adding MAILTO="" at the beginning of my crontab (to avoid repeating this investigation down the road), sendmail went quiet, my battery life went back to ~6 hours, and the laptop average temperature went down 20°C.
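To illustrate why that job was so chatty, here is a small Python sketch comparing rm and rm -f on a file that does not exist. It assumes a POSIX rm is available on the machine, and the file path is a throwaway one created for the example. cron mails whatever a job prints, so the first form produces a mail every minute, while the second one stays silent.

```python
import os
import subprocess
import tempfile

# Throwaway path pointing to a file that does not exist.
missing = os.path.join(tempfile.mkdtemp(), "crash_dump.erl")

# Plain rm complains on stderr and exits with a non-zero status.
noisy = subprocess.run(["rm", missing], capture_output=True, text=True)

# rm -f ignores missing files: no output, exit status 0.
quiet = subprocess.run(["rm", "-f", missing], capture_output=True, text=True)

print(noisy.returncode, repr(noisy.stderr))
print(quiet.returncode, repr(quiet.stderr))
```

In other words, using rm -f in the crontab entry would most likely have been enough to avoid the whole mail avalanche.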

Preparing the SRE interview

2017-04-20T00:00:00+02:00

I recently interviewed for an SRE position. I spent a full week learning (or refreshing my memory) on the subjects and topics that could be covered in such an interview. I'll try and lay down the list of topics I covered and resources I used.

What is an SRE?

Having spent the last 2 years employed as a DevOps engineer, I've often felt that DevOps and SRE were two slightly differing implementations of the same ideas. The first one felt like a set of general principles, whereas the second one is a clear and detailed model (pre-dating DevOps), with a set of rules and guidelines. Google developed the SRE model and explained it in the SRE book. The underlying ideas are simple, but powerful:

Develop tools and systems reducing toil and repetitive work for engineers
Automate everything, or as much as possible (deployments, maintenance, tests, scaling, mitigation)
Monitor everything
Think scalable from the start
Build resilient-enough architectures
Handle change and risk through SLAs, SLOs and SLIs
Learn from outages

If you haven't yet read the SRE book, I strongly urge you to do so. There's even a free online version available. If you do not have the time, then maybe have a look at this Ben Treynor (Google VP Engineering) What is 'Site Reliability Engineering'? interview, for a general introduction.

According to the SRE book, an SRE should spend half of their time on "ops" work, and the other half doing development.

Google places a 50% cap on the aggregate "ops" work for all SREs—tickets, on-call, manual tasks, etc. [...] An SRE team must spend the remaining 50% of its time actually doing development. Source

Some skills are thus paramount to an SRE:

coding / software development
system administration and automation
scalable system design
system troubleshooting

Consequently, each of these areas of expertise can be (and often is) the subject of an interview.

Coding / Software development interview

I've found that the reference resource to prepare for a coding interview, especially when targeting companies like Amazon, Google, Microsoft, Yahoo, etc, is Cracking the Coding Interview, by Gayle Laakmann McDowell. This book is a real trove of advice (technical or not) and example exercises (with the associated solutions).

Even though it is targeted at software developer interviews, I still covered the following topics listed in the Must Know section of the book:

Data structures:

Linked list
Stack
Queue
Heap
Hash table
Binary tree
associated Big-O time and memory complexity for common operations (Search, insert, delete, etc).

I found Data structures and Algorithms using Python and C++ to be useful (albeit a bit lengthy) when dealing with these data structures for the first time. This presentation gives a short but to-the-point, no-nonsense introduction of these data structures.

Algorithms

Mergesort
Quicksort
Binary search
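To give an idea of the level of detail expected for these topics, here is a minimal binary search sketch in Python. It is my own illustration, not an excerpt from the book: it runs in O(log n) time and O(1) memory, and only works on a list that is already sorted.

```python
def binary_search(items, target):
    """Return the index of target in the sorted list items, or -1 if absent."""
    low, high = 0, len(items) - 1
    while low <= high:
        mid = (low + high) // 2
        if items[mid] == target:
            return mid
        if items[mid] < target:
            # The target can only be in the upper half
            low = mid + 1
        else:
            # The target can only be in the lower half
            high = mid - 1
    return -1


print(binary_search([1, 3, 5, 7, 9], 7))
print(binary_search([1, 3, 5, 7, 9], 4))
```

Each iteration halves the search interval, which is where the logarithmic complexity comes from.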

I also had a look at https://github.com/adicu/interview_help to practice on some real-life interview questions, and at https://github.com/nryoung/algorithms to read Python implementations of common data structures and algorithms.

Scalable system design interview

This was my favorite subject to work on, as an apparently simple question such as "Design the bit.ly service" hides unexpected depths of complexity. Being able to design a scalable system implies knowing about:

DNS
load balancing
micro-service architecture
CAP theorem
consistency patterns
availability patterns
databases
caching
asynchronism patterns
etc

The main idea is to be able to identify the architecture bottlenecks, and to size the architecture with an appropriate number of machines, using some "back-of-the-envelope" calculations, while keeping it robust and fault tolerant.
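Here is what such a back-of-the-envelope calculation could look like for a bit.ly-like service, sketched in Python. Every input number is a made-up assumption, chosen only to show the reasoning: in an interview, you would agree on these figures with the interviewer first.

```python
# All inputs below are made-up assumptions for a bit.ly-like service.
NEW_LINKS_PER_MONTH = 100_000_000
READS_PER_WRITE = 10
BYTES_PER_LINK = 500
RETENTION_YEARS = 5

SECONDS_PER_MONTH = 30 * 24 * 3600

# Traffic estimates
writes_per_second = NEW_LINKS_PER_MONTH / SECONDS_PER_MONTH
reads_per_second = writes_per_second * READS_PER_WRITE

# Storage estimate over the whole retention period
storage_bytes = NEW_LINKS_PER_MONTH * 12 * RETENTION_YEARS * BYTES_PER_LINK

print(f"{writes_per_second:.0f} writes/s")
print(f"{reads_per_second:.0f} reads/s")
print(f"{storage_bytes / 1e12:.1f} TB over {RETENTION_YEARS} years")
```

With these assumptions, the write traffic is tiny and the dataset fits on a handful of machines, which hints that the interesting part of the design is the read path (caching, replication) rather than raw storage.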

The most useful resources I found to prepare were:

Scalability lecture given at Harvard
Latency Numbers Every Programmer Should Know
The System Design Primer (I suggest you follow the links after each section for an in-depth follow-up)
this great step-by-step walkthrough on design questions, by HiredInTech
Scaling up to your first 10 million users, talk given by Joel Williams of AWS
Crack the design interview
When to use NoSQL vs SQL

System troubleshooting interview

To be able to automate the administration of a system, one should first know the said system in depth, which, in a lot of cases, will be GNU/Linux. If you have time, I strongly suggest reading The Linux Programming Interface. Note that this is a large book (my version has 1556 pages) focusing on an old version of the Linux kernel (2.6.x). Fear not! You'll still gain a vast knowledge about how a GNU/Linux system operates. For a quicker tour, you could have a look at the Linux Kernel Internals blog. You'll also find interesting SRE interview questions/answers in this SRE interview questions blogpost.

Julia Evans, also known as b0rk, has written some absolutely fantastic beginner-friendly resources about troubleshooting and networking. I strongly recommend having a look at:

Mastering the mentioned tools (strace, tcpdump, netstat, lsof, ngrep, etc) gave me some good debugging chops I have applied in production many times.

Netflix has also written a very nice and thorough blogpost on performance troubleshooting: Linux Performance Analysis in 60,000 Milliseconds, detailing what to check in case of a performance issue.

Wait, there's more

Technical knowledge is one thing, but SRE being a relatively new activity, I also wanted to get real-life feedback from real-life SREs. To that end, I watched the following (great) talks:

Case Study: Adopting SRE Principles at StackOverflow, by Tom Limoncelli of Stack Exchange
Love DevOps? Wait until you meet SRE, by Nick Wright, from Atlassian
Panel: training new SREs, with Katie Ballinger (CircleCI), Saravanan Loganathan (Yahoo), Rita Lu (Google), Craig Sebenik (Matterport), Andrew Widdowson (Google)

Oh and one last thing...

I'm super excited to announce I'm joining @datadoghq as an SRE ! pic.twitter.com/Ji1JJQLJ4x
— Balthazar Rouberol (@brouberol) April 19, 2017

Discovering the Docker command line

2016-10-13T00:00:00+02:00

Log into the virtual machine installed during the first part of the day.

1 - Install docker

$ sudo su
$ apt-get install docker.io

2 - Make sure docker is properly installed. If it is, you should get an output similar to this one:

$ docker info
Containers: 20
 Running: 2
 Paused: 0
 Stopped: 18
Images: 70
Server Version: 1.12.1
Storage Driver: aufs
 Root Dir: /var/lib/docker/aufs
 Backing Filesystem: extfs
 Dirs: 153
 Dirperm1 Supported: true
Logging Driver: json-file
Cgroup Driver: cgroupfs
Plugins:
 Volume: local
 Network: bridge host null overlay
Swarm: inactive
Runtimes: runc
Default Runtime: runc
Security Options:
Kernel Version: 3.16.0-4-amd64
Operating System: Debian GNU/Linux 8 (jessie)
OSType: linux
Architecture: x86_64
CPUs: 1
Total Memory: 3.779 GiB
Name: gallifrey
ID: RYUC:5OT6:3JFQ:APQG:QEW7:V7KP:ZDYY:WBBF:LZL5:PSFY:WFEH:MF6V
Docker Root Dir: /var/lib/docker
Debug Mode (client): false
Debug Mode (server): false
Registry: https://index.docker.io/v1/
WARNING: No memory limit support
WARNING: No swap limit support
WARNING: No kernel memory limit support
WARNING: No oom kill disable support
WARNING: No cpu cfs quota support
WARNING: No cpu cfs period support
Insecure Registries:
 127.0.0.0/8

3 - List the available commands:

$ docker

4 - List the images available locally

$ docker images

5 - Pull the ubuntu image

$ docker pull ubuntu

6 - List the images available locally once more

$ docker images

7 - Run an echo "hello world" in your ubuntu image

$ docker run ubuntu echo "hello world"

8 - List all the containers. Why is the container in the Exited status?

$ docker ps -a

9 - Pull the brouberol/nginx image

$ docker pull brouberol/nginx

10 - Run the brouberol/nginx image with the following command

$ docker run -it --rm --name=nginx-1 -p 80:80 brouberol/nginx

11 - Run docker run --help and try to understand the options passed to the previous command

12 - Open a second terminal and open an ssh session into your VM

13 - Inspect the containers:

$ docker ps

You should see an output such as

$ docker ps
CONTAINER ID    IMAGE             COMMAND              CREATED         STATUS          PORTS                NAMES
76ee1a3f4ce2    brouberol/nginx   "./entrypoint.sh"    7 seconds ago   Up 6 seconds    0.0.0.0:80->80/tcp   nginx-1

What was the -p 80:80 option of the docker run command for?

14 - List the host/container ports bound to your container via docker port <id> (in the previous example, <id> was 76ee1a3f4ce2).

15 - Query your nginx container via curl localhost

16 - Get the public IP of your VM via ip -o -4 addr show eth0 | awk '{print $4}' | cut -d'/' -f 1

17 - Open your browser and visit http://<public_ip>

18 - Inspect your container via the docker inspect command.

19 - Run the docker history --no-trunc brouberol/nginx command and try to understand the difference between the base nginx image and the brouberol/nginx image.

20 - Inspect the Dockerfile and the entrypoint in use via the following commands:

$ docker exec nginx-1 cat Dockerfile
$ docker exec nginx-1 cat entrypoint.sh

21 - What is docker exec for?

22 - In your opinion, why do we use an exec in the entrypoint? Hint: the docker stop command sends a SIGTERM to the PID 1 process of the container.

23 - Visit https://hub.docker.com/explore/, and try to deploy another application officially supported by Docker!

Celery best practices

2015-12-29T00:00:00+01:00

I've been programming with celery for the last three years, and Deni Bertović's article about Celery best practices has truly been invaluable to me. In time, I've also come up with my set of best practices, and I guess this blog is as good a place as any to write them down.

Write short tasks

I think that a task should be as concise as possible, in order to be able to understand what it does and how it handles corner cases as quickly as possible. I personally try to follow these rules:

wrap the main task logic in an object method or a function
make this method/function raise identified exceptions for identified corner cases and decide what is the logic for each of them
implement a retry mechanism only where appropriate

Let's illustrate these rules with a simple example: sending an email using a 3rd party API (e.g. Mailgun, Mailjet, etc.). Anyone who has spent enough time using third party infrastructure and systems knows they should never totally rely on them: the network can fail, they can be unavailable, etc. We thus need to handle the error cases we can foresee, and have a fallback strategy in case of an unexpected error.

Let's say that we have a function api_send_mail that does the actual API call, raising a myproject.exceptions.InvalidUserInput exception in case of an HTTP client error. This exception constitutes our set of expected exceptions, which we need to plan for. Any other exception (connection error, server HTTP error, etc.) will be sent to some crash report backend, like Sentry, and trigger a retry.

My task implementation would look something like this:

from myproject.tasks import app  # app is your celery application
from myproject.exceptions import InvalidUserInput

from utils.mail import api_send_mail

# sentrycli is assumed to be an already configured Sentry client

@app.task(bind=True, max_retries=3)
def send_mail(self, recipients, sender_email, subject, body):
    """Send a plaintext email with argument subject, sender and body to a list of recipients."""
    try:
        data = api_send_mail(recipients, sender_email, subject, body)
    except InvalidUserInput:
        # No need to retry as the user provided an invalid input
        raise
    except Exception as exc:
        # Any other exception. Log the exception to sentry and retry in 10s.
        sentrycli.captureException()
        self.retry(countdown=10, exc=exc)
    return data

What the task actually does is abstracted one layer down, and almost all the rest of the task body is handling errors. I feel that it's easier to grasp the bigger picture, and that the task is easier to maintain this way.
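To make the split between expected and unexpected errors more concrete, here is a minimal sketch of what the lower layer could do with the HTTP status it gets back. Only InvalidUserInput comes from the example above: the check_status helper, the ServiceUnavailable exception and the status ranges are assumptions made for this illustration.

```python
class InvalidUserInput(Exception):
    """Expected error: the caller sent something the API rejects."""


class ServiceUnavailable(Exception):
    """Hypothetical catch-all for server-side failures worth retrying."""


def check_status(status_code):
    """Raise the exception matching an HTTP status code (hypothetical helper)."""
    if 400 <= status_code < 500:
        # Client error: retrying the same payload would fail again
        raise InvalidUserInput(status_code)
    if status_code >= 500:
        # Server error: the task is expected to log it and retry
        raise ServiceUnavailable(status_code)
    return status_code


def outcome(status_code):
    """Describe what the task would do for a given status code."""
    try:
        check_status(status_code)
    except InvalidUserInput:
        return "fail"
    except Exception:
        return "retry"
    return "ok"
```

The outcome function mirrors the try/except structure of the task: one branch per identified corner case, and a generic branch for everything else.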

Retry gracefully

Setting fixed countdowns for retries may not be what you want. I tend to prefer using a backoff increasing with the number of retries. This means the more a task fails, the more we have to wait until the next retry. I think this has a couple of interesting consequences:

we don't hammer the external service in case of an outage,
it gives more time to the service to go back to normal,
and thus increases our overall chance of success

A simple yet effective implementation could look something like this:

def backoff(attempts):
    """Return a backoff delay, in seconds, given a number of attempts.

    The delay increases very rapidly with the number of attempts:
    1, 2, 4, 8, 16, 32, ...

    """
    return 2 ** attempts

@app.task(bind=True, max_retries=3)
def send_mail(self, recipients, sender_email, subject, body):
    """Send a plaintext email with argument subject, sender and body to a list of recipients."""
    try:
        data = api_send_mail(recipients, sender_email, subject, body)
    except InvalidUserInput:
        raise
    except Exception as exc:
        sentrycli.captureException()
        self.retry(countdown=backoff(self.request.retries), exc=exc)
        ...
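To get a feel for the numbers, here is a small sketch listing the delays such a task would go through. The cap argument and the retry_schedule helper are additions of this sketch, not part of the function above: I'm assuming you might want to bound the delay if max_retries grows.

```python
def backoff(attempts, cap=300):
    """Exponential backoff in seconds, bounded by a hypothetical cap."""
    return min(2 ** attempts, cap)


def retry_schedule(max_retries, cap=300):
    """Delays applied before each retry, the retry counter starting at 0."""
    return [backoff(attempt, cap) for attempt in range(max_retries)]


schedule = retry_schedule(3)
total_wait = sum(schedule)
```

With max_retries=3, the task waits 1, 2 and then 4 seconds, which adds up to 7 seconds of backoff before it gives up.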

Fail fast and don't block forever

One thing to remember is to always specify a timeout on I/O operations, or at least on the celery task itself. If you don't, all your tasks could end up blocking indefinitely, which would then prevent any additional task from starting. In the context of the send_mail task, I could probably do something like this, as an API call should probably not take more than 5 seconds:

@app.task(
    bind=True,
    max_retries=3,
    soft_time_limit=5,  # time limit is in seconds
)
def send_mail(self, recipients, sender_email, subject, body):
    ...

If the task takes more than 5 seconds to complete, the celery.exceptions.SoftTimeLimitExceeded exception would get raised and logged to Sentry.

I also tend to set the CELERYD_TASK_SOFT_TIME_LIMIT configuration option with a default value of 300 (5 minutes). This will act as a failsafe if I forget to set an appropriate soft_time_limit option on a task.
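The soft time limit protects the task as a whole, but the I/O calls inside it can carry their own timeout too. Here is a minimal sketch using only the standard library: a local server accepts the connection and never answers, and the client gives up after 0.2 seconds instead of blocking forever. The read_with_timeout and demo names, as well as the 0.2 second value, are made up for this example.

```python
import socket


def read_with_timeout(host, port, timeout=0.2):
    """Connect, then try to read one byte, giving up after `timeout` seconds."""
    with socket.create_connection((host, port), timeout=timeout) as conn:
        try:
            return conn.recv(1)
        except socket.timeout:
            # The peer stayed silent: fail fast instead of blocking forever
            return None


def demo():
    """Talk to a local server that accepts connections but never answers."""
    server = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    server.bind(("127.0.0.1", 0))  # port 0: let the OS pick a free port
    server.listen(1)
    try:
        host, port = server.getsockname()
        return read_with_timeout(host, port)
    finally:
        server.close()


result = demo()
```

Most HTTP clients expose a similar timeout parameter, and it is worth setting it to a value lower than the task's soft time limit.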

All that is pretty dandy, but I don't want to re-implement the exception catching for every task. I should be able to specify a basic behavior shared between all my tasks. Turns out you can, using an abstract task class.

from myproject.tasks import app

class BaseTask(app.Task):
    """Abstract base class for all tasks in my app."""

    abstract = True

    def on_retry(self, exc, task_id, args, kwargs, einfo):
        """Log the exceptions to sentry at retry."""
        sentrycli.captureException(exc)
        super(BaseTask, self).on_retry(exc, task_id, args, kwargs, einfo)

    def on_failure(self, exc, task_id, args, kwargs, einfo):
        """Log the exceptions to sentry."""
        sentrycli.captureException(exc)
        super(BaseTask, self).on_failure(exc, task_id, args, kwargs, einfo)


@app.task(
    bind=True,
    max_retries=3,
    soft_time_limit=5,
    base=BaseTask)
def send_mail(self, recipients, sender_email, subject, body):
    """Send a plaintext email with argument subject, sender and body to a list of recipients."""
    try:
        data = api_send_mail(recipients, sender_email, subject, body)
    except InvalidUserInput:
        raise
    except Exception as exc:
        self.retry(countdown=backoff(self.request.retries), exc=exc)
    return data

You can see that the send_mail task implementation only deals with email sending and expected error handling. Everything else is handled by the abstract base class. If the common behavior is more complex, this trick can drastically reduce the size of each task body and the amount of duplicated code in your tasks.

Note: this example is only here to demonstrate how to share behavior between tasks. To properly integrate Sentry with Celery, have a look at this page.

Tip: have a look at the list of available handlers, to get an idea of what behavior can be shared between tasks.

Write large tasks as classes

So far, I've only implemented tasks as functions. However, it's also possible to define class tasks.

I think one of the scenarios where class tasks really shine is when you'd like to split a large task function into several well-defined and testable methods. As you can see here, the celery.task decorator will generate a task class and inject the decorated function as the class run method. Defining a class task amounts to defining a class inheriting from app.Task with a run method.

class handle_event(BaseTask):   # BaseTask inherits from app.Task

    def validate_input(self, event):
        ...

    def get_or_create_model(self, event):
        ...

    def stream_event(self, event):
        ...

    def run(self, event):
        if not self.validate_input(event):
            raise InvalidInput(event)
        try:
            model = self.get_or_create_model(event)
            self.call_hooks(event)
            self.persist_model(event)
        except Exception as exc:
            self.retry(countdown=backoff(self.request.retries), exc=exc)
        else:
            self.stream_event(event)

By doing this, the task logic is clear and easy to follow (the run method stays concise even if the method bodies are large), and each of these methods can then be unit-tested independently.

Another advantage of using class tasks is using multiple inheritance to specialize a task with multiple abstract base classes. For example, I'd like to use the celery_once QueueOnce abstract class to introduce some locking mechanism, while still using the BaseTask for sentry logging. This way, each abstract task class is used as a mixin, adding some behaviour to the task.
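Here is a rough sketch of that mixin idea in plain Python, so that it runs without celery. The Task class is a stand-in for app.Task, and LoggingMixin and OnceMixin are hypothetical simplifications of BaseTask and QueueOnce: they only illustrate how cooperative super() calls stack behaviors, not how these libraries are actually implemented.

```python
class Task:
    """Stand-in for app.Task, so that the sketch runs without celery."""

    def __call__(self, *args, **kwargs):
        return self.run(*args, **kwargs)


class LoggingMixin(Task):
    """Hypothetical mixin recording failures (think BaseTask and Sentry)."""

    def __init__(self):
        super().__init__()
        self.errors = []

    def __call__(self, *args, **kwargs):
        try:
            return super().__call__(*args, **kwargs)
        except Exception as exc:
            self.errors.append(repr(exc))
            raise


class OnceMixin(Task):
    """Hypothetical mixin refusing concurrent runs (think QueueOnce)."""

    running = False

    def __call__(self, *args, **kwargs):
        if self.running:
            raise RuntimeError("task is already running")
        self.running = True
        try:
            return super().__call__(*args, **kwargs)
        finally:
            self.running = False


class HandleEvent(LoggingMixin, OnceMixin):
    """Concrete task: both mixins wrap the call to run."""

    def run(self, event):
        return event.upper()


class Broken(LoggingMixin, OnceMixin):
    """Concrete task that always fails, to exercise the logging mixin."""

    def run(self):
        raise ValueError("boom")
```

The method resolution order is HandleEvent, LoggingMixin, OnceMixin, Task: the logging layer wraps the locking layer, which wraps the actual run method.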

Unit-test your tasks

Unit testing a project involving celery has always been a pickle for me. I tried to deploy a broker and a test celery worker in the CI environment, but it felt like killing a fly with a bazooka. The answer turns out to be quite simple, thanks to Nicolas Le Manchet for figuring this one out! When the CELERY_ALWAYS_EAGER option is activated, all tasks called using their apply_async or delay method are called directly, without requiring any broker or celery worker. Easy as pie.
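As a sketch, a test-only settings module could look like the following. The settings_test.py file name is hypothetical, and I'm assuming the old-style uppercase setting names, consistent with the CELERY_ALWAYS_EAGER name used above. The second setting makes exceptions raised by eagerly executed tasks bubble up into the test instead of being swallowed.

```python
# settings_test.py (hypothetical test-only settings module)

# Run tasks synchronously, in the calling process, without broker or worker
CELERY_ALWAYS_EAGER = True

# Re-raise task exceptions in the caller, so that failing tasks fail the test
CELERY_EAGER_PROPAGATES_EXCEPTIONS = True
```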

Installing Guitar Pro 6 on Fedora 22+

2015-12-06T00:00:00+01:00

I've been playing guitar for the last 10 years now, but I spent the last 4 years only playing and singing alone. I decided to improve my technique, and to treat myself to Guitar Pro 6. I was happy to see they even supported Linux natively! Sadly, they only provide a deb file, and no rpm. I'll thus describe here how I managed to install it on Fedora 22, ♬ with a little help from my friends ♫.

Installing the dependencies

First, download the Guitar Pro deb file. Mine was called gp6-full-linux-r11686.deb. Extract the archive called data.tar.gz from the deb, and then unpack it:

$ cd /tmp
$ mv ~/Downloads/gp6-full-linux-r11686.deb .
$ ar vx gp6-full-linux-r11686.deb
$ tar -xvf data.tar.gz

Create the installation directory for Guitar Pro.

$ sudo mkdir -p /opt/GuitarPro6

Move the GuitarPro files to the installation directory.

$ sudo mv ./opt/GuitarPro6/* /opt/GuitarPro6/

We now need to install Guitar Pro's dependencies, and of course, they're 32 bit...

$ sudo dnf install libICE.i686 \
    libSM.i686 \
    libssh.i686 \
    libxml2.i686 \
    libxslt.i686 \
    libpng12.i686 \
    libvorbis.i686 \
    alsa-lib.i686 \
    portaudio.i686 \
    pulseaudio-libs.i686 \
    qt-x11.i686 \
    qtwebkit.i686

You might have to install other packages as well, as Guitar Pro was not the first 32 bit program I had to install.

Note: The required packages will be listed when you execute the /opt/GuitarPro6/launcher.sh script, and you can use the dnf whatprovides command to find the package that provides each required library.

Sadly, that's not it yet. GuitarPro also depends on both libcrypto and libssl 0.9.8, and they're not packaged anymore in Fedora 22. We'll use a very cool trick: by a great stroke of luck, the libraries contained in the openssl Ubuntu deb package are usable as a drop-in replacement!

$ wget -q http://security.ubuntu.com/ubuntu/pool/universe/o/openssl098/libssl0.9.8_0.9.8o-7ubuntu3.2.14.04.1_i386.deb 1>/dev/null
$ ar x libssl0.9.8_0.9.8o-7ubuntu3.2.14.04.1_i386.deb data.tar.xz
$ tar --strip-components 3 \
    -xf data.tar.xz \
    ./lib/i386-linux-gnu/libcrypto.so.0.9.8 \
    ./lib/i386-linux-gnu/libssl.so.0.9.8
$ chmod 755 libssl.so.0.9.8 libcrypto.so.0.9.8
$ sudo mv libssl.so.0.9.8 libcrypto.so.0.9.8 /opt/GuitarPro6

Seriously, how cool is that ¹?

Note: I moved both shared libraries to the /opt/GuitarPro6 directory, because it already contained numerous other ones. My guess, which turned out to be correct, was that the executable would look for shared objects in its own directory. This way, I didn't have to fiddle with LD_LIBRARY_PATH and ldconfig, or install these libraries into my /usr/lib folder.

Installing the sound banks

We now need to install the sound banks. First, download them from the official website. Then, install them via the /opt/GuitarPro6/GPBankInstaller script:

$ sudo mv Banks-r370.gpbank /opt/GuitarPro6
$ sudo /opt/GuitarPro6/GPBankInstaller /opt/GuitarPro6/Banks-r370.gpbank /opt/GuitarPro6/Data/Soundbanks/

Make Guitar Pro the default program for tab files

We then install the desktop and icon files that were packaged in the Guitar Pro deb, so that it can be executed from the app launcher.

$ sudo cp ./usr/share/applications/GuitarPro6.desktop /usr/share/applications/GuitarPro6.desktop
$ sudo cp ./usr/share/pixmaps/guitarpro6.png /usr/share/pixmaps/

The final step is the cherry on the cake: we're going to make Guitar Pro open automatically when opening tab files. We first define the application/x-guitar-pro MIME type and then associate it with the gp3, gp4, gp5 and gp6 extensions.

$ cat <<EOF > guitarpro-mime.xml
<?xml version="1.0"?>
<mime-info xmlns="http://www.freedesktop.org/standards/shared-mime-info">
    <mime-type type="application/x-guitar-pro">
        <glob pattern="*.gp3"/>
        <glob pattern="*.gp4"/>
        <glob pattern="*.gp5"/>
        <glob pattern="*.gp6"/>
    </mime-type>
</mime-info>
EOF
$ sudo xdg-mime install guitarpro-mime.xml
$ xdg-mime default GuitarPro6.desktop application/x-guitar-pro

Done. You should now be able to click on a tab file, and enjoy!

Conclusion

I managed to make everything work, with both some help and some luck. I would however have preferred the Guitar Pro binary to be compiled statically, to ease the installation process.

Also, when you advertise Linux compatibility, please, PLEASE, at least mention the package format (deb, rpm, other), and also mention the distributions you support natively.

Finally, when you want to support Linux, do not ever redirect Linux users to the Windows installation guide, by stating that both processes are "substantially similar". They are not.

On that note (get it?), keep on rocking in a free world!

Sources

¹ We could of course try to pull them from other rpm-based distributions, but I have to admit I found it cooler this way.

Sourdough bread, second attempt

2015-11-14T00:00:00+01:00

New attempt, new success! This time, I tried two new techniques (autolyse and stretch and fold kneading) and I increased the proportion of whole-wheat flour.

Ingredients

200g of sourdough starter
360ml of water
600g of flours (430g of T55, 50g of T65 and 120g of T110)
10g of salt

Nothing revolutionary here. I slightly lowered the amount of water compared to the first attempt, and increased the weight of semi-whole-wheat flour.

Recipe

Autolyse

This is where I innovated. Instead of simply mixing the ingredients and kneading, I went through two autolyse phases. I first mixed the water and the flour, and waited 20 minutes. I then added the starter, and waited another 20 minutes. This technique favors the formation of gluten chains (which give the dough its elasticity) without oxidizing the dough too much, which avoids ending up with a bland loaf. Proponents of autolyse insist that it has a huge influence on the final taste.

Stretch and fold

Ever since I started having fun making bread, my kneading technique has always consisted of working the dough with the palm of my hand while folding it onto itself. I often found that my loaves were not airy enough. I recently heard about a radically different technique, developed by Richard Bertinet, which makes it possible to work doughs with a high hydration rate (more than 80%!) while preserving their airiness.

The idea is simple (as shown in this video): you stretch the dough, then fold it in three, then in two. You then shape it into a ball and wait 50 minutes before starting over. During these 50 minutes, the dough works for us by forming the famous gluten chains. The result is fairly miraculous: the dough is much firmer after the first round, and perfectly workable after the second one, without any kneading phase!

The operation is to be repeated 2 or 3 times, depending on the source. I did it twice (stretch and fold, 50 minutes of rest, stretch and fold, 50 minutes of rest).

Proofing

Give the dough the desired shape, and let it rest for 2 hours. Once that time has passed, flour the loaf and score it with a sharp knife, which will help the dough rise during baking.

Baking

Same technique as last time: preheat the oven to 240°C, bake the loaf for 22 minutes once the oven is hot, then for another 25 minutes at 220°C. Regularly pour water onto the bottom of the oven to create steam, which will help the crust form.

Resting

Let the bread cool down on a rack so that the internal moisture can escape.

Result

The result is not as airy as I had hoped, but it's already much better! And it's delicious!

Sourdough bread, first attempt

2015-11-06T00:00:00+01:00

My first attempt at sourdough bread having been, in all modesty, particularly successful, I'm writing the recipe down here.

The starter

First of all, the starter. I followed this recipe, as well as this one. After almost 2 weeks, my starter (Gérard) is ready to go into bread: it is lively, squeaks under the finger and has a good (and mild) smell of fermentation. Note that I made my starter with T85 flour (a good coarse, partly sifted flour), with a drop of organic honey from time to time.

The bread

Here, I followed the advice of Clément (who knows a thing or two on the subject): one part starter, two parts water and three parts flours. Note the "s" in "flours": I used 5/6 of T55 flour and the remaining 1/6 of semi-whole-wheat T110 flour.

Recipe and ingredients

200g of sourdough starter
400g of water (use 350-360g of water if you knead by hand. My neighbors heard me yell because the dough was too sticky, which forced me to add a lot of flour while kneading).
600g of flours (500g of T55 and 100g of T110)
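For the curious, the 1-2-3 rule is easy to script. The sketch below recomputes the quantities from the weight of starter. The one_two_three and hydration function names are made up, and the hydration figure is a simplification that ignores the water and flour already present in the starter.

```python
def one_two_three(starter_g, white_share=5 / 6):
    """Scale a 1-2-3 recipe (starter, water, flours) from the starter weight."""
    water = 2 * starter_g
    flour = 3 * starter_g
    white = round(flour * white_share)
    return {
        "starter": starter_g,
        "water": water,
        "flour": flour,
        "T55": white,
        "T110": flour - white,
    }


def hydration(water_g, flour_g):
    """Water to flour ratio, as a rounded percentage (starter ignored)."""
    return round(100 * water_g / flour_g)


recipe = one_two_three(200)
```

With 200g of starter, this gives 400g of water and 600g of flours, for a hydration of about 67%. Kneading by hand with 360g of water brings it down to 60%.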

Knead for a good fifteen minutes. To incorporate as much air as possible, fold the dough onto itself, and avoid degassing it too much by pressing on it like a brute. Let the dough rest for an hour or two. Once rested, the dough will have risen. Some recipes will tell you that the dough will have doubled or even tripled in size, but that depends on the amount of whole-wheat flour you used. Indeed, it is much heavier than white flour (because it contains more bran, the husk of the wheat), so it rises less.

Most recipes advise waiting between 6 and 8 hours at this stage. For my first attempt, as it was already 10pm, I decided not to care and to go for it.

I preheat my oven to 240°C, with a cup of water at the bottom of the oven. Once the dough has risen, I shape it to give it its general final form. I first flour my baking tray. I really like round country loaves, so I tuck the dough a little under itself to get a nice ball on my tray. I flour the top of the ball, and score a cross in the dough with a knife, about one cm deep. Into the oven!

After 10-15 minutes, I pour the contents of the cup onto the bottom of the oven to release steam, which will help the crust form. After 20-25 minutes of baking at 240°C, I lower the oven to 220°C for another 20 minutes. Once the bread is out of the oven, I let it rest for several hours on a rack, so that the moisture can escape. I'm a big fan of eating bread straight out of the oven, but with such quantities, a rubbery texture is guaranteed...

The result the next day

The bread is very good, with a slightly sour taste coming from the starter. I'd like to get bigger bubbles next time, but I think I need to rework my kneading technique, which doesn't incorporate enough air, and above all let the dough rise for much longer (even overnight).

Maybe also use a quarter or a third of semi-whole-wheat flour, to give it a more rustic taste, in which case I'll have to make sure the dough still manages to rise (increase the hydration?).

My n-step plan to become a better programmer

2015-05-24T00:00:00+02:00

One of the main selling points of Python is its multi-paradigm philosophy. You can code in imperative, object-oriented or aspect-oriented style, use meta-programming techniques, etc. It also has an immense number of libraries available. Finally, it's both a simple language to pick up for beginners, and a powerful language for more experienced programmers.

I've been programming for the last six or seven years, and I feel that my main strength is also my main weakness: I've been mainly coding in Python since the beginning. It means that I can now use Python's features and standard library pretty well, but it also means that I tend to think of every problem in terms of Python features and libraries (standard or not).

A proverb programmers are taught quite early is

If all you have is a hammer, everything looks like a nail. (Source)

It means that if you're only comfortable with a single tool, then you'll try to use it in every situation, even in one where it's not appropriate. I strongly feel that to become a better programmer, I now need to learn other programming languages and even other paradigms. I was initially thinking of functional languages, like Haskell or OCaml, but then I remembered something Fredrik told me a while ago, at a EuroPython after-party: reading "Structure and Interpretation of Computer Programs" immediately made him a better programmer. I remember being curious as to why.

It so happens that the book is released under a Creative Commons license, can be downloaded here, AND uses Scheme as a teaching language. It thus combines three things I strive for: a new language, a new programming paradigm and more insight into the art of programming itself.

I'm thus laying out my n-step plan to become a better programmer:

Read the book thoroughly
Solve the exercises
Stop conceiving every solution in Python

Behold, one of my first Scheme programs, a paving stone on the road to my improvement.

; Implementation of cubic root Newton approximation technique in Scheme

(define (square x) (* x x))

(define (cubic-root x)
    (define (improve guess)
        (/ (+ (/ x (square guess)) (* guess 2)) 3))

    (define (good-enough? new-guess old-guess)
        (< (abs (/ (- new-guess old-guess) old-guess)) 0.001))

    (define (try new-guess old-guess)
        (if (good-enough? new-guess old-guess)
            new-guess
            (try (improve new-guess) new-guess)))

    (try 1.0 x)
)

(cubic-root 9)
; => 2.0800838232385224
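As a cross-check (and at the risk of conceiving yet another solution in Python), here is a rough transcription of the same algorithm. The improve step is Newton's method applied to f(g) = g³ - x: the next guess is g - f(g)/f'(g), which simplifies to (x/g² + 2g)/3. The tolerance parameter name is mine; the 0.001 value and the starting guess of 1.0 come from the Scheme version.

```python
def cubic_root(x, tolerance=0.001):
    """Approximate the cubic root of x with Newton's method."""

    def improve(guess):
        # Newton step for f(g) = g**3 - x, i.e. g - f(g) / f'(g)
        return (x / guess ** 2 + 2 * guess) / 3

    # Same starting point as the Scheme version: (try 1.0 x)
    new_guess, old_guess = 1.0, x
    # Stop once the relative change between two guesses is below the tolerance
    while abs((new_guess - old_guess) / old_guess) >= tolerance:
        new_guess, old_guess = improve(new_guess), new_guess
    return new_guess


approximation = cubic_root(9)
```

Cubing the approximation should give back a value very close to 9, which is a cheap way of checking that the Scheme result above is plausible.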

Note: If you want to experiment with various languages (Scheme included) without having to install them on your machine, have a look at repl.it.

So long and thank you for all the (deep fried) fish!

2015-01-24T00:00:00+01:00

I've neglected this blog for a long time. I could tell you I had plenty of other things to do, but truth be told, I did have the time to write a line or two. My stay is coming to an end, earlier than planned, for two main reasons: my job has become deadly boring, and I miss my life in Lyon.

Felix and Thais came to see me last week, and I spent the weekend in the streets, showing them the city and all the nooks that tourists never see. I realized that despite my hasty departure, I'm starting to know this city really well! If I had to describe it in one line, I think I'd say it's a city with bus drivers, a castle and people walking their dogs.

In many ways, it's a real change of scenery: people come from all four corners of the city to shop at its only (tiny) market, whereas in Lyon, you can find huge markets in every neighborhood. Admitting you can't stand dogs is punishable by death, and a large majority of locals say they have never tasted haggis (which disgusts them), and don't dare order a deep fried mars bar. They are really wrong about the haggis, but not about the mars bar. It's the kind of thing that would give your heart attack a cardiac arrest! Note that the deep fried model also comes as half-pizza, garlic bread and other light and delicate treats. What's quite incredible is that the number of obese people is roughly equal to 0. It must be the cold, there's no other explanation...

I can also confirm that Scotland is still a great country for whisky. I've learned to really love this drink, which is truly incredible in terms of variety of flavors, smells and colors. Much more so than wine, I find. I finally understand the pleasure people take in talking for hours about what they have (or had) in their glass. I was simply ignorant, actually: to talk, you first need the vocabulary. I really discovered that with whisky.

I really felt at home everywhere, except at home. That's mainly because my flat is a corridor, and my upstairs neighbor (my landlady) thinks she can do whatever she wants. So I spent my time being as quiet as possible, so that she'd leave me alone, or going out (so that she'd leave me alone). I can tell you that between her and the team I had to lead (made up of a clinically insane woman, a hyperactive guy who was probably on drugs and a useless lump), I'm coming back with a pretty thick skin. General announcement to anyone planning to mess with me: "don't".

I'll add photos to the gallery, so that you can put yourselves in my shoes for a bit. Careful, not for too long, you might catch a cold.

Love to you all, and see you on Gaulish soil.

D-Day and butt-hugging trousers

2014-12-08T00:00:00+01:00

Great excitement at work: D-Day has finally arrived. Nobody died, and we got about 500% more traffic than expected. So we're rather happy. It also means that between now and the Christmas holidays, we'll make one last push, and then Fredrik and I will be on support duty for 3 months. Needless to say, the mood won't be the same: DONDE ESTA LA PLAYA? At the office.

I've often looked down on people who had super high-end gear for an everyday activity. Like running, for example. The reasoning went like this: "Pff, we've been running around for 20,000 years, but suddenly you absolutely need butt-hugging trousers and an electric blue long-sleeved T-shirt". Such disdain may be justifiable in our mild latitudes, but go try running for 45 minutes at 9pm in December in Scotland, and tell me how it went. All of a sudden, people with gear look less ridiculous to me. Sensible, even! So there you go, travel does change you. You buy butt-hugging trousers, electric blue long-sleeved T-shirts, shoes with toes, and you go running in the forest at night, hoping not to run into a wild boar.

To all those who think I spend my days out drinking: love to you all!

Julie

2014-11-30T00:00:00+01:00

Julie came to see me this weekend. She did.

On that barely disguised emotional blackmail, love to you all, and see you at Christmas!

Rock, Roll and mint tea

2014-11-23T00:00:00+01:00

Sorry for the long silence, I've been pretty busy lately. I did try to hire ghostwriters, like Alexandre Dumas, but would you believe it, those idiots don't speak the language. It's almost getting frustrating.

First, I've put a few photos of the area here. It's a bit of a jumble, but it's sorted from oldest to most recent. I confirm my first impressions: the city is both welcoming and very lively. And I had the good idea not to live in it. Instead, I found a studio lost near a forest, fields and a river. And you know what? It's really nice. I've been able to get back into running, day and night (flashlight required), and I'm left relatively in peace (except when my landlady tries to squeeze money out of me). In fact, I've set up a quiet little life for myself. When I'm not working, we go out for a bite, we go for walks, we watch Doctor Who while drinking tea, or I go running. It's not as rock and roll as I had first imagined, but it turns out to be very nice.

Well, I won't lie to you, I haven't become a monk either: the beers are very good, and so is the haggis! Let's not get carried away.

I'll be back for Christmas after all, so until then, love to you all.

Ovfbhf Cncn

2014-11-03T00:00:00+01:00

I think one of the best surprises of this trip, apart from the camembert and caramelized onion hamburger, has been realizing that I'm good at what I do. Far be it from me to show off in front of the family, but it's still a nice realization! It made me wonder how I ended up making computing my job.

A little trip back through space-time.

Computing is first of all a family thing. Technical books lying around on the piano of the house in Béthisy, PC carcasses piled up in the attic, my first PC that I built with Dad, games I cracked with Félix... Computing is a childhood curiosity that never really left me. However much I swore, at the peak of my delightful adolescence, that I would never do what Dad did, nor what Mum did, I find myself with the job of one, doing yoga and tai chi, all while keeping a worm composter in the basement. The irony of fate.

Anyway, I think computing first came to me from wanting to understand obscure things, grown-up things. For a start, mastering a jargon, while it doesn't help with girls, gives you the impression of knowing more than others. And it also gives you the impression of knowing more than you did the day before. If you don't have that urge to constantly realize that you don't know much, and to do something about it, I don't think it's worth going down this path.

Sooner or later, you also realize that you can be useful, to yourself and to others. You can install a system for one person, advise another on buying a PC, write a program that hunts for an apartment for you, self-host your blog, organize a conference, speak at another, and plenty of other fun things. I insist on the word fun. My days generally go by quite fast, because I'm having fun. Every computing enthusiast I've met has fun at some point in their day. No later than Sunday evening, I decided to try out a new piece of tech, only to suddenly realize that it was 4 a.m. and that it would probably be a good idea to get some sleep. Without the fun, we'd just be pale people in front of a screen. But since we find all this pretty great, because it apparently helps a lot of people, we build things like Youtube, OpenStreetMap (and Mapado :), and we lend a hand, for the simple pleasure of helping. A bit like in the real world, in fact.

I'd been having fun on my own or with friends for several years, and then all of a sudden, someone offered to pay me to keep going. HELLO. A few years after that, I was told that since I was an expert in such-and-such, it would be lovely if I moved within the week, to another country, so that the country in question could collect its property taxes without too much trouble. Now, being an expert is only a matter of perspective, but when I think back to the Balthazar of 15 years ago, I tell myself that the road travelled probably wasn't for nothing.

So there you go, I left on a whim, to go and do something I enjoy, for people who seem to think it's useful to them, and who are even willing to help me pay off my apartment for it. I don't know about you, but I think there are worse passions to have.

Right, it's almost midnight, I'll wish myself a slightly early happy birthday and turn off my PC. Apparently it keeps you from sleeping. What nonsense.

Camembert and caramelized onions

2014-11-01T00:00:00+01:00

It's been a little while since I last wrote to you. Rest assured that alcohol has nothing to do with it. You could even say that work and the clock change are the main cause. You see, I work. A lot. On weekends too, from time to time. So when I leave work, it's dark, and any photos I could take of Edinburgh would be rather monochrome.

Rest assured, too, that everything is going really well. It's crazy how good I feel in this country and in this city. Since my last post, I've moved here, into a rather pleasant apartment, a bit far from everything. The calm is quite regal. I have a small river running in front of my place, which I can follow all the way to the sea, through a park and a golf course. It doesn't quite match the charm of Villeurbanne, but it's perfectly decent all the same, I assure you. Walking along the beach, you even come across a pub that serves absolutely outrageous camembert and caramelized onion burgers!

On the work side, I can't tell you much, simply because I'm not allowed to. But that's going very well too. It's a project for the Scottish government, of rather insane complexity, in which we're trying to bring a bit of order. We have a deadline in December, which puts everyone under a fair amount of pressure! It's quite common to run into colleagues at the office on a Sunday.

Love to you all, mates.

Pizzas and a bank account

2014-10-23T00:00:00+02:00

If you think about it, England is another country. Meaning that it's not France, in fact. As a result, quite a few things are different, enough to disorient the first Gaul stepping off the boat.

When I wanted to open a local bank account, I was asked for my passport, and that's it. My charming French banker asks us for 3 payslips, a proof of address (and very nearly a urine test) so that we can open a joint account with Julie. The account costs nothing (at all), neither does the card, and only the overdraft charges hurt a little. It's as if English banks were grateful that we put money with them. Hard to imagine, isn't it?

Another example. The English love the gutter press. You find it everywhere (except in the gutters, as it happens, since everything is very clean). Today's headlines: "My kidney infection was actually a baby", "Rapist footballer asks for a second chance".

The police officers smile and say hello. People are polite on the bus, and say hello and goodbye to the driver.

So obviously, not everything is rosy (it's even rather grey, sky-wise), but really, it's a change of scenery.

By the way.

That's all.

Deep fried mars bars

2014-10-22T00:00:00+02:00

I hadn't planned on keeping a blog when I left for Scotland. Let's say I was more planning on drinking pints instead. That said, since my mum asked me to write one, I'm complying. I'm very well brought up, you see.

So, let's recap. I was contacted by Fredrik, a Swedish friend, who offered me a job in Edinburgh for a few months. I had been thinking of changing jobs, and since the plan of leaving Lyon at 7 a.m., taking two planes, a bus and a taxi to go straight to the office was not lacking in rock'n'roll, I decided to accept. I'll be there until March 2015, in theory.

I left with a few preconceptions. For example, I had the idea that all Scots looked like this. Well, would you believe it, that's wrong. I'd also been told that I was going to freeze my socks off. Well, would you believe it, that's well on its way. I'd also been told that the Scottish accent is completely incomprehensible. I suggest you check for yourselves with an easy little video.

As I've already told some of you, the city is magnificent. You'll have to take my word for it, because I don't have any photos at hand (yet), but between the castles, the medieval towers, the keeps and the parks, it's a real pleasure to wander around. I still haven't had a single day of rain since I arrived, which must be something of a miracle. As for the local food, I still haven't tried the deep fried mars bars (because it would stop my heart), but against all odds, I've discovered that you eat very well here. The haggis is of course excellent, but you can find good pub food pretty much everywhere, for next to nothing.

Love to you all, lads.

Crawl a website with scrapy

2012-04-23T00:00:00+02:00

In this article, we are going to see how to scrape information from a website, in particular, from all pages with a common URL pattern. We will see how to do that with Scrapy, a very powerful, and yet simple, scraping and web-crawling framework.

For example, you might want to scrape information about each article of a blog and store it in a database. To achieve this, we will implement a simple spider using Scrapy, which will crawl the blog and store the extracted data in a MongoDB database.

We will consider that you have a working MongoDB server, and that you have installed the pymongo and scrapy python packages, both installable with pip.

If you have never toyed around with Scrapy, you should first read this short tutorial.

First step, identify the URL pattern(s)

In this example, we’ll see how to extract the following information from each isbullsh.it blogpost :

title
author
tag
release date
url

We’re lucky, all posts have the same URL pattern: http://isbullsh.it/YYYY/MM/title. These links can be found in the different pages of the site homepage.

What we need is a spider which will follow all links following this pattern, scrape the required information from the target webpage, validate the data integrity, and populate a MongoDB collection.

Building the spider

We create a Scrapy project, following the instructions from their tutorial. We obtain the following project structure:

isbullshit_scraping/
├── isbullshit
│   ├── __init__.py
│   ├── items.py
│   ├── pipelines.py
│   ├── settings.py
│   └── spiders
│       ├── __init__.py
│       └── isbullshit_spiders.py
└── scrapy.cfg

We begin by defining, in items.py, the item structure which will contain the extracted information:

from scrapy.item import Item, Field

class IsBullshitItem(Item):
    title = Field()
    author = Field()
    tag = Field()
    date = Field()
    link = Field()

Now, let’s implement our spider, in isbullshit_spiders.py:

from scrapy.contrib.spiders import CrawlSpider, Rule
from scrapy.contrib.linkextractors.sgml import SgmlLinkExtractor
from scrapy.selector import HtmlXPathSelector
from isbullshit.items import IsBullshitItem

class IsBullshitSpider(CrawlSpider):
    name = 'isbullshit'
    start_urls = ['http://isbullsh.it'] # urls from which the spider will start crawling
    rules = [Rule(SgmlLinkExtractor(allow=[r'page/\d+']), follow=True),
        # r'page/\d+' : regular expression for http://isbullsh.it/page/X URLs
        Rule(SgmlLinkExtractor(allow=[r'\d{4}/\d{2}/\w+']), callback='parse_blogpost')]
        # r'\d{4}/\d{2}/\w+' : regular expression for http://isbullsh.it/YYYY/MM/title URLs

    def parse_blogpost(self, response):
        ...

Our spider inherits from CrawlSpider, which “provides a convenient mechanism for following links by defining a set of rules”. More info here.

We then define two simple rules:

Follow links pointing to http://isbullsh.it/page/X
Extract information from pages defined by a URL of pattern http://isbullsh.it/YYYY/MM/title, using the callback method parse_blogpost.
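
To get a feel for which URLs each rule would pick up, the two regular expressions can be tried outside of Scrapy. This is only a sketch using the standard `re` module: the `classify` helper and the sample URLs are made up for illustration, and it assumes the link extractor searches for the pattern anywhere in the URL, with rules tried in the order they are declared.

```python
import re

# Same patterns as in the spider rules
PAGE_RE = re.compile(r'page/\d+')
POST_RE = re.compile(r'\d{4}/\d{2}/\w+')


def classify(url):
    """Return which rule a URL would fall under (hypothetical helper)."""
    if PAGE_RE.search(url):
        return 'page'      # followed, not parsed
    if POST_RE.search(url):
        return 'post'      # handed to parse_blogpost
    return 'ignored'


# Sample URLs invented for the example
for url in ['http://isbullsh.it/page/2',
            'http://isbullsh.it/2012/04/some-title',
            'http://isbullsh.it/about']:
    print(url, '->', classify(url))
```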

Extracting the data

To extract the title, author, etc, from the HTML code, we’ll use the scrapy.selector.HtmlXPathSelector object, which uses the libxml2 HTML parser. If you’re not familiar with this object, you should read the XPathSelector documentation.

We’ll now define the extraction logic in the parse_blogpost method (I’ll only define it for the title and tag(s), it’s pretty much always the same logic):

def parse_blogpost(self, response):
    hxs = HtmlXPathSelector(response)
    item = IsBullshitItem()
    # Extract title
    item['title'] = hxs.select('//header/h1/text()').extract() # XPath selector for title
    # Extract tag(s)
    item['tag'] = hxs.select("//header/div[@class='post-data']/p/a/text()").extract() # Xpath selector for tag(s)
    ...
    return item

Note: To be sure of the XPath selectors you define, I’d advise you to use Firebug, Firefox Inspect, or equivalent, to inspect the HTML code of a page, and then test the selector in a Scrapy shell. That only works if the data position is coherent throughout all the pages you crawl.
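
If you just want to sanity-check the shape of a path before firing up a Scrapy shell, the standard library can help. The sketch below uses `xml.etree.ElementTree`, which only supports a small subset of XPath (no `text()`, for instance) and needs well-formed markup, so it is an approximation and not a replacement for the real selector. The HTML snippet is invented for the example.

```python
import xml.etree.ElementTree as ET

# Hypothetical, simplified markup standing in for a blog post page
SAMPLE = """
<html><body>
  <header>
    <h1>My post title</h1>
    <div class="post-data"><p><a>python</a> <a>scraping</a></p></div>
  </header>
</body></html>
""".strip()

root = ET.fromstring(SAMPLE)

# Rough equivalents of the two selectors used in parse_blogpost
title = [h1.text for h1 in root.findall(".//header/h1")]
tags = [a.text for a in root.findall(".//header/div[@class='post-data']/p/a")]

print(title)
print(tags)
```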

Store the results in MongoDB

Each time the parse_blogpost method returns an item, we want it to be sent to a pipeline which will validate the data, and store everything in our Mongo collection.

First, we need to add a couple of things to settings.py:

ITEM_PIPELINES = ['isbullshit.pipelines.MongoDBPipeline',]

MONGODB_SERVER = "localhost"
MONGODB_PORT = 27017
MONGODB_DB = "isbullshit"
MONGODB_COLLECTION = "blogposts"

Now that we’ve defined our pipeline, our MongoDB database and collection, we’re just left with the pipeline implementation. We just want to be sure that we do not have any missing data (ex: a blogpost without a title, author, etc).

Here is our pipelines.py file :

import pymongo

from scrapy.exceptions import DropItem
from scrapy.conf import settings
from scrapy import log


class MongoDBPipeline(object):
    def __init__(self):
        connection = pymongo.Connection(settings['MONGODB_SERVER'], settings['MONGODB_PORT'])
        db = connection[settings['MONGODB_DB']]
        self.collection = db[settings['MONGODB_COLLECTION']]

    def process_item(self, item, spider):
        # Check every field declared on the item (not only the ones that were set).
        # Here we only check that the value is not empty,
        # but we could do any crazy validation we want
        for field in item.fields:
            if not item.get(field):
                raise DropItem("Missing %s of blogpost from %s" % (field, item.get('link')))
        self.collection.insert(dict(item))
        log.msg("Item written to MongoDB database %s/%s" %
                (settings['MONGODB_DB'], settings['MONGODB_COLLECTION']),
                level=log.DEBUG, spider=spider)
        return item
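
The validation idea itself does not depend on Scrapy or MongoDB. As a minimal sketch, assuming items behave like dictionaries and that the five fields declared in items.py are all required, a hypothetical `missing_fields` helper could look like this:

```python
# Field names taken from IsBullshitItem; treating all of them as
# required is an assumption made for this example.
REQUIRED_FIELDS = ('title', 'author', 'tag', 'date', 'link')


def missing_fields(item, required=REQUIRED_FIELDS):
    """Return the names of required fields that are absent or empty."""
    return [name for name in required if not item.get(name)]


# Invented item: the author list came back empty from the selector
item = {
    'title': ['Crawl a website with scrapy'],
    'author': [],
    'tag': ['python'],
    'date': ['2012-04-23'],
    'link': 'http://isbullsh.it/2012/04/some-title',
}
print(missing_fields(item))
```

An item would then be dropped as soon as this list is not empty.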

Release the spider!

Now, all we have to do is change directory to the root of our project and execute

$ scrapy crawl isbullshit

The spider will then follow all links pointing to a blogpost, retrieve the post title, author name, date, etc, validate the extracted data, and store all that in a MongoDB collection if validation went well.

Pretty neat, hm?

Conclusion

This case is pretty simplistic: all URLs follow a similar pattern and all links are hard-coded in the HTML: there is no JS involved. In the case where the links you want to reach are generated by JS, you’d probably want to check out Selenium. You could make the spider more complex by adding new rules, or more complicated regular expressions, but I just wanted to demo how Scrapy works, without getting into crazy regex explanations.

Also, be aware that sometimes, there’s a thin line between playing with web-scraping and getting into trouble.

Finally, when toying with web-crawling, keep in mind that you might just flood the server with requests, which can sometimes get you IP-blocked :)

The entire code of this project is hosted on Github. Help yourselves!

Create a webcam manager using pyGTK and Gstreamer

2012-02-29T00:00:00+01:00

Introduction

I recently joined the Strongsteam project for a 6 month internship. Our main goal is to provide some "artificial intelligence and data mining APIs to let you pull interesting information out of images, video and audio." We will be giving a presentation at Pycon 2012, on the 9th of March, during the Startup Row weekend. For this occasion, I had to implement a desktop GUI that displays a webcam video stream and captures snapshots, with the following constraints:

GUI written with wxPython or pyGTK
the webcam stream must be integrated in the wxPython/pyGTK window
the webcam must not be handled with the OpenCV python module (the installation can be painful on Mac OS X)
the snapshots default format and resolution must be JPG and 640x480px

How to handle the webcam ?

My initial research led me to consider two different solutions:

using PyGame, a set of python modules adding functionality on top of the SDL library
using Gstreamer, a pipeline-based multimedia framework allowing "to create a variety of media-handling components, including simple audio playback, audio and video playback, recording, streaming and editing" (quote: wikipedia article). Gstreamer is used by a bunch of multimedia applications, like Cheese, Amarok, Pitivi, ...

I first turned to PyGame, because of the simplicity of the snapshot operation: all we have to do is call the pygame.camera.Camera.get_image() function. However, integrating the PyGame surface into a pyGTK interface turned out to be pretty complicated. I found a couple of StackOverflow posts stating that even though this integration was possible, it was not advised: erratic behaviours have apparently been observed across different operating systems.

I thus considered Gstreamer, and quickly found this encouraging project. This code made it possible to start and stop a webcam video stream embedded in a pyGTK interface: I was definitely in the right place!

Why doesn't it work with my webcam ?

If you experience problems testing the project introduced in the previous section (black screen, first run successful and following runs leading to a black screen, ...), check whether your webcam is UVC (USB Video Class) compliant under Linux. To do that, type

$ lsusb

in a terminal and locate the line describing your webcam.

My laptop integrated webcam was described as Bus 001 Device 003: ID 05ca:1814 Ricoh Co., Ltd HD Webcam. The reference 05ca:1814 doesn't appear on the UVC website. That could explain why I experienced so many problems with it (it appears that Ricoh webcams are poorly UVC compliant).
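
If you need to do this check for several machines, the vendor and product IDs can be pulled out of the `lsusb` output with a small regular expression. This is only a sketch: the `parse_lsusb_line` helper is hypothetical, and it assumes every line follows the same layout as the one shown above.

```python
import re

# Layout assumed from the line quoted above:
# "Bus 001 Device 003: ID 05ca:1814 Ricoh Co., Ltd HD Webcam"
LSUSB_RE = re.compile(
    r'^Bus (\d+) Device (\d+): ID ([0-9a-fA-F]{4}):([0-9a-fA-F]{4})\s*(.*)$'
)


def parse_lsusb_line(line):
    """Return a dict describing the device, or None if the line does not match."""
    match = LSUSB_RE.match(line.strip())
    if match is None:
        return None
    bus, device, vendor_id, product_id, description = match.groups()
    return {
        'bus': bus,
        'device': device,
        'vendor_id': vendor_id,
        'product_id': product_id,
        'description': description,
    }


webcam = parse_lsusb_line('Bus 001 Device 003: ID 05ca:1814 Ricoh Co., Ltd HD Webcam')
print(webcam['vendor_id'] + ':' + webcam['product_id'])
```

The resulting `vendor_id:product_id` pair is the reference to look up in the list of supported UVC devices.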

I hence bought a Logitech QuickCam Pro 9000, known for being well supported. Everything ran smoothly with this one.

How to use Gstreamer ?

If you don't know how to use Gstreamer, I'd advise you to have a look at these pages:

The main idea is to construct a pipeline, by connecting various data sources, sinks and processing blocks (bins) in a data flow graph.

In our case, we are going to use the following pipeline to display the webcam stream:

v4l2src ! video/x-raw-yuv,width=640,height=480,framerate=30/1 ! xvimagesink

v4l2src : Video for Linux input : your webcam (the default device is /dev/video0, but if you are using an external webcam, use v4l2src device=/dev/video1)
video/x-raw-yuv : video colorspace specific to webcam
width=640,height=480 : your webcam resolution (check that it is compatible with your webcam)
framerate=30/1 : number of frames per second
xvimagesink : video sink

Let's see how to do that in Python:

def create_video_pipeline(self):
    """Set up the video pipeline and the communication bus between the video stream and gtk DrawingArea """
    video_pipeline = 'v4l2src device=/dev/video1 ! video/x-raw-yuv,width=640,height=480,framerate=30/1 ! xvimagesink'
    self.video_player = gst.parse_launch(video_pipeline) # create pipeline
    self.video_player.set_state(gst.STATE_PLAYING)       # start video stream

    bus = self.video_player.get_bus()
    bus.add_signal_watch()
    bus.connect("message", self.on_message)
    bus.enable_sync_message_emission()
    bus.connect("sync-message::element", self.on_sync_message)

def on_message(self, bus, message):
    """ Gst message bus. Closes the pipeline in case of error or end of stream message """
    t = message.type
    if t == gst.MESSAGE_EOS:
        print "MESSAGE EOS"
        self.video_player.set_state(gst.STATE_NULL)
    elif t == gst.MESSAGE_ERROR:
        print "MESSAGE ERROR"
        err, debug = message.parse_error()
        print "Error: %s" % err, debug
        self.video_player.set_state(gst.STATE_NULL)

def on_sync_message(self, bus, message):
    """ Set up the Webcam <--> GUI messages bus """
    if message.structure is None:
        return
    message_name = message.structure.get_name()
    if message_name == "prepare-xwindow-id":
        # Assign the viewport
        imagesink = message.src
        imagesink.set_property("force-aspect-ratio", True)
        # Sending video stream to gtk DrawingArea
        imagesink.set_xwindow_id(self.movie_window.window.xid)
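
Since the pipeline description handed to gst.parse_launch is a plain string, the device, resolution and framerate do not have to be hard-coded. Here is a minimal sketch of a hypothetical `build_video_pipeline` helper (not part of the original project) that assembles the same description from parameters:

```python
def build_video_pipeline(device='/dev/video0', width=640, height=480, fps=30):
    """Return a pipeline description string for the webcam stream.

    The element names and caps are the ones used in this article;
    the function itself is only an illustration.
    """
    return (
        'v4l2src device={device} ! '
        'video/x-raw-yuv,width={width},height={height},framerate={fps}/1 ! '
        'xvimagesink'
    ).format(device=device, width=width, height=height, fps=fps)


print(build_video_pipeline(device='/dev/video1'))
```

The returned string can then be passed to gst.parse_launch in place of the literal used in create_video_pipeline.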

Now, we have a live video stream displayed into a pyGTK interface, but still no way of capturing a snapshot.

How do we capture a snapshot ?

I encountered many StackOverflow open questions about this part, but no satisfactory answer...

At first, I wanted to use Gstreamer for that too, but I couldn't find any way to dynamically modify the pipeline to add a frame extraction, jpg encoding and a filesink (to save the snapshot). I thus tried this ugly hack : when the 'take snapshot' button is clicked

stop the video stream
start the following pipeline: v4l2src device=/dev/video1 ! video/x-raw-yuv,width=640,height=480,framerate=30/1 ! ffmpegcolorspace ! video/x-raw-rgb,framerate=1/1 ! ffmpegcolorspace ! jpegenc snapshot=true ! filesink location=snap.jpeg, which will extract a single frame, encode it to jpg and save it to a file.
stop this image pipeline
re-start the video stream

That was of course ugly, and resulted in a ~2s flicker when taking the snapshot... Back to square one.

I'll spare you the suspense: the right solution is to use the gtk.DrawingArea.window.get_colormap() method, as shown here:

def take_snapshot(self):
    """ Capture a snapshot from DrawingArea and save it into a image file """
    drawable = self.movie_window.window
    # self.movie_window is of type gtk.DrawingArea()
    colormap = drawable.get_colormap()
    pixbuf = gtk.gdk.Pixbuf(gtk.gdk.COLORSPACE_RGB, 0, 8, *drawable.get_size())
    pixbuf = pixbuf.get_from_drawable(drawable, colormap, 0,0,0,0, *drawable.get_size())
    pixbuf = pixbuf.scale_simple(self.W, self.H, gtk.gdk.INTERP_HYPER) # resize
    # We resize from actual window size to wanted resolution
    # gtk.gdk.INTERP_HYPER is the slowest and highest quality reconstruction function
    # More info here : http://developer.gnome.org/pygtk/stable/class-gdkpixbuf.html#method-gdkpixbuf--scale-simple
    filename = 'snap.jpg'
    filepath = os.path.relpath(filename)  # requires `import os` at module level
    pixbuf.save(filepath, self.snap_format)

This snippet does the following operations:

extract the last frame from the gtk.DrawingArea
encode it to RGB
resize it to 640x480px
save it to snap.jpg

And that's done, without even a teeny-tiny flicker! Yay! We now have a perfectly functional snapshot operation.

Project source code & Git repository

All the code can be found on my GitHub.

How to randomly generate a Monty Python parody

2011-11-16T00:00:00+01:00

If you have always wanted to write texts in the style of Monty Python, I have what you need! In this post, I am going to show you mathematical techniques to analyse a text, in order to randomly generate look-alike texts.

Introduction to basic concepts

First essential question: what is a text?

From a mathematical point of view, a text of length n simply is the concatenation of n symbols, all taken from a finite alphabet A. In our context, the alphabet is generally composed of all lowercase and uppercase letters, punctuation signs, etc.

In a real-life situation, the succession of symbols is not random, but depends on the previous symbols. Indeed, if the last 3 symbols are " ", "t" and "h", it is highly probable that the next one will be "e", because the word "the" is fairly common.

The whole problem can thus be reduced to obtaining a transition probability matrix between strings of fixed length and all symbols of the alphabet.

Example : Let's assume that the three last symbols are " ", "t", and "h", and that the probability of the next symbol being "e" (written $p("e" / " th")$ ) is 0.6, an "a" is 0.3 and "u" is 0.1. We would then obtain a line of the matrix of transition probability between " th" and all alphabet symbols:

" th" —> a: 0.3, b: 0, c: 0, ..., e: 0.6, ..., u: 0.1, ...

The probability $p("e" / " th")$ is called a conditional probability.

Markov chain of order $k$

We are going to model our data text (here, the "Monty Python and the Holy Grail" script) with a Markov chain of order $k$. This barbaric name refers to:

"a mathematical system that undergoes transitions from one state to another (from a finite or countable number of possible states) in a chain-like manner -- Source : Wikipedia"

That means that the following state is conditioned by the $k$ previous ones.

If we deal with a Markov chain of order 3, the probability of occurrence of the next symbol will only depend on the 3 previous symbols. From previous tests, I can say that $k=10$ is a good place to start. (More on that later)

Text Alphabet

We've just fixed the value of k, which was the first step of the process. Now, we need to create a list of all encountered symbols (ie: the alphabet).

First, we read the data file, and join all the lines in a single string.

with open('../data/monty.txt') as f:
    datafile_lines = ' '.join(f.readlines())

Then, we create the alphabet list:

def alphabet(datafile_lines):
    """
    Returns all used characters in a given text
    """
    alph = []
    for letter in datafile_lines:
        if letter not in alph:
            alph.append(letter)
    return sorted(alph)

Finding all existing k-tuples in the source text

Now, we need to identify all distinct strings of length $k=10$ in the text.

This can seem a bit tedious, but list comprehensions and sets will do a lovely job.

# -- split text into chunks of length k, one starting at each position
# (stopping k characters before the end, so that every chunk has length k)
ak_chunks = [datafile_lines[i:i+k] for i in range(len(datafile_lines) - k + 1)]

# -- Extract unique values from list
ak_chunks = list(set(ak_chunks)) #set: reduce to unique values

Empirical probabilities of transition

Now comes the hard work. So far, we have

a text,
its alphabet,
a HUGE list of all distinct strings of length $k=10$ contained in the text

What we then need is a way to calculate the empirical probability of transition between each string of length 10 and symbols of the alphabet ("empirical" in the way that these probabilities will only apply to the text we study).

Let's formalize the problem a bit:

$a^k$ : string of length $k$ (here, 10)
$b$ : symbol located after $a^k$
$n_{a^k}$ : number of times that the string $a^k$ is encountered in the text
$n_{b/a^k}$ : number of times that the string $a^k$ is followed by the symbol $b$

We can now express the empirical probability $p(b/a^k) = n_{b/a^k} / n_{a^k}$ (number of times that the string $a^k$ is followed by the symbol $b$, divided by the number of times that the string $a^k$ is encountered in the text).

Example : if our text is ABCABDABC, $a^k = AB$ and $b = C$:

$n_{AB} = 3$
$n_{C/AB} = 2$
$p(C/AB) = 2/3 \approx 0.667$
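
The numbers of this small example can be checked in a couple of lines with `str.count`. This is just a sketch, and the `transition_probability` name is made up for the occasion. One caveat worth keeping in mind: `str.count` counts non-overlapping occurrences, so a string that overlaps with itself (like "AA" in "AAAA") would be under-counted.

```python
def transition_probability(text, ak, b):
    """Empirical probability that the string ak is followed by the symbol b."""
    n_ak = text.count(ak)          # occurrences of ak
    n_b_ak = text.count(ak + b)    # occurrences of ak followed by b
    return n_b_ak / n_ak if n_ak else 0.0


text = "ABCABDABC"
print(text.count("AB"))                          # occurrences of AB
print(text.count("ABC"))                         # occurrences of AB followed by C
print(transition_probability(text, "AB", "C"))   # ratio of the two
```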

Let's write all that in Python:

def conditional_empirical_proba(chain, ak, symbol, n_ak): # p(b/a^k)
    """
    Returns the proportion of symbols after the ak string (contained
    in chain string and of length k) which are equal to the value
    of given parameter 'symbol'
    Ex: conditional_empirical_proba('ABCABD', 'AB', 'C', 2) -> 0.5
    """
    nb_ak = n_b_ak(chain, ak, symbol)
    if n_ak != 0:
        return float(nb_ak)/n_ak
    else:
        return 0

def n_b_ak(chain, ak, symbol): # n_(b/a^k)
    """
    Given a string chain, returns the number of
    times that a given symbol is found
    right after a string ak inside the chain
    """
    return chain.count(ak+symbol)

def n_ak(chain, ak): # n_(a^k)
    """
    Given a string chain and a string ak, returns
    the number of times ak is found in chain
    """
    return chain.count(ak)

Now, the only remaining thing to do is to calculate the empirical conditional probability for each k-tuple and for each symbol.

A few remarks are necessary:

We will only store empirical conditional probabilities > 0 (more on that later)
We will store cumulative empirical conditional probabilities (more on that later)
The matrix will be created with a dictionary of dictionaries

import pickle

alpha = alphabet(datafile_lines)

# Initialization of matrix
prob = {}
for ak in ak_chunks:
    # New matrix line
    prob[ak] = {}

    # number of times ak is found in the text
    nak = n_ak(datafile_lines, ak)

    # -- calculate p(b/a^k) for each symbol of alphabet
    pbak_cumul = 0
    for symb in alpha:
        pbak = conditional_empirical_proba(datafile_lines, ak, symb, nak)

        # cumulative probabilities
        pbak_cumul += pbak

        # if succession ak+symb is encountered in text, add probability to matrix
        if pbak != 0.0: # Very important, if pbak = 0.0, the combination ak+symb will not be randomly generated
            prob[ak][symb] = pbak_cumul

# pickle needs the file to be opened in binary mode
with open('../results/distribs/distrib_k%d.txt' % (k), 'wb') as proba_file:
    pickle.dump(prob, proba_file)

Random text generation

Close your eyes for a second, and think about what we just did. We calculated empirical transition probabilities between all existing strings of length 10 and all symbols of the alphabet, and stored the non-nil cumulative probabilities in a matrix. (The non-nil part has two main advantages: it implies less storage cost, and we only store combinations that occurred in the text. This way, random generation becomes really easy!)

It is now extremely easy to generate a text using these cumulative probabilities! Let's consider a quick example.

Example : $a^k = AB$, $p(A/AB)=0.2$, $p(B/AB)=0.5$, $p(C/AB)=0.3$. We then store these cumulative values in the matrix:

$p(A/AB)=0.2$
$p(B/AB)=0.7$
$p(C/AB)=1$

That way, we only have to pick a random float between 0 and 1 using a uniform distribution to match this float with a symbol. random(0,1) = 0.678 --> symbol = B
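
The lookup described above can be sketched with the standard `bisect` module: given cumulative values sorted in increasing order, we want the first symbol whose cumulative probability is greater than the random float. The `pick_symbol` function and the list-of-pairs layout are assumptions made for this example, not the data structure used in the rest of the article.

```python
import bisect
import random


def pick_symbol(row, p=None):
    """Return the first symbol whose cumulative probability exceeds p.

    `row` is a list of (symbol, cumulative_probability) pairs, sorted by
    increasing cumulative probability.
    """
    if p is None:
        p = random.random()  # uniform float in [0, 1)
    thresholds = [cumul for _, cumul in row]
    index = bisect.bisect_right(thresholds, p)
    # Guard against rounding errors if the last value is slightly below 1
    index = min(index, len(row) - 1)
    return row[index][0]


row = [("A", 0.2), ("B", 0.7), ("C", 1.0)]
print(pick_symbol(row, 0.678))
```

With the values of the example, 0.678 falls between 0.2 and 0.7, so the symbol picked is B.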

For this technique to work, the first $k=10$ symbols of the generated text must directly come from the original text (and hence will be contained in the matrix). This will give us a valid initial condition.

Let's now generate the text :

def random_text(size, k):
    """
    Given a result size and an integer k,
    returns a randomly generated text using
    probability distributions of markov chains
    of order k dumped in ../results/distribs/distrib_kX.txt
    files
    """
    # (requires `import pickle` and `import random` at module level)
    # -- Initial string
    with open('../data/monty.txt', 'r') as f:
        initial_string = ' '.join(f.readlines())[:k]
    out = initial_string

    # -- Import probability distribution (pickle needs binary mode)
    try:
        p = open('../results/distribs/distrib_k%d.txt' % (k), 'rb')
    except IOError as err:
        print(err)
        exit(2)

    distrib_matrix = pickle.load(p)
    p.close()

    # -- Generate text following probability distribution
    kuple = initial_string
    for x in range(size):
        p = random.uniform(0, 1)

        # read distribution specific to k-tuple string
        dist = distrib_matrix[kuple]

        # walk the symbols by increasing cumulative probability, and stop
        # at the first one whose cumulative probability exceeds p
        for symbol, cumul in sorted(dist.items(), key=lambda kv: kv[1]):
            if cumul > p:
                break

        out += symbol
        kuple = kuple[1:] + symbol # update k-tuple

    return out

Done ! Now, you only have to call the function random_text(len_text, 10) and BOOM !

Example of generated text with $k = 10$

"KING ARTHUR: Will you ask your master that we have been charged by God with a sacred quest. If he will give us food and shelter for the week.
ARTHUR: Will you ask your master if he wants to join my court at Camelot?!
SOLDIER #1: You're using coconuts!
ARTHUR: Ohh.
BEDEVERE: Uh, but you are wounded!
GALAHAD: What are you doing in England?
FRENCH GUARDS: [whispering] Forgive me that' and 'I'm not worth"

What if we change $k$ ?

k can be interpreted as the amount of context you take into account to calculate the probability of occurrence of a symbol. We chose $k = 10$, because a context of 10 symbols allows the program to generate a text with apparent sense (limited by the randomness of the process, and by the fact that THIS IS MONTY FREAKING PYTHON).

The more context you add, the more alike the generated and original texts will be, up to a point where they will be identical.
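
To experiment with different values of $k$ without going through the pickle files, here is a compact sketch of the whole process. It swaps the cumulative probability matrix for raw follower counts (a `Counter` per context) and lets `random.choices` do the weighted draw, so it is a variation on the technique and not the code used to produce the examples in this post. The `build_model` and `generate` names are made up.

```python
import random
from collections import Counter, defaultdict


def build_model(text, k):
    """Map each string of length k to a Counter of the symbols that follow it."""
    model = defaultdict(Counter)
    for i in range(len(text) - k):
        model[text[i:i + k]][text[i + k]] += 1
    return model


def generate(model, seed, size):
    """Generate up to `size` symbols after `seed` (seed length = k, with k >= 1)."""
    k = len(seed)
    out = seed
    for _ in range(size):
        followers = model.get(out[-k:])
        if not followers:
            break  # context never seen with a follower: nothing to draw from
        symbols = list(followers)
        weights = list(followers.values())
        out += random.choices(symbols, weights=weights)[0]
    return out


# Toy text in which every context of length 2 has a single possible follower
text = "abcdefg"
model = build_model(text, 2)
print(generate(model, "ab", 10))
```

In this toy text every context has exactly one follower, so the generated text is a copy of the original: the extreme case of the generated and original texts becoming identical.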

If you decrease k, you can find an interesting case where you still generate real words, but where the sentences make no sense.

Example, for $k=5$:

"KING ARTHUR: Yes!
VILLAGER #3: A bit.
VILLAGER #1: You saw saw saw it, did you could
separate, and master that!
ARTHUR: Will you on Thursday.
CUSTOMER: What do you can you think kill your every
good people. It's one.)
OTHER FRENCH GUARDS: [whispering]"

If you decrease $k$ even more, you will only generate rubbish.

Conclusion

We have seen a pretty simple text analysis technique which allows us to randomly generate a text, based on statistical analysis of the data text. This technique is based on the fact that the probability of occurrence of a letter depends on its local "past".

By playing with the value of the "past length", you can generate text that is more or less similar to the original, and that makes more or less "sense".

This simple technique does not use the nltk python module, or a set of texts to generate "theoretical" rules on a language. It is purely empirical.

All source code is available on GitHub.

EDIT: A nice comment from Reddit:

"This approach was first proposed by Claude Shannon in his landmark paper "A Mathematical Theory of Communication"… in 1948. Gotta love how people keep reinventing the same things over and over again. But this time, in Python!"
