Skip to the content.

Docker Images and Volumes

Great! So you have now looked at docker container run, played with Docker containers, and ran your first web application inside a container. In this section, you will learn about Docker Images and Volumes.

Tasks:

Task 1: Docker Images

In this section, we dive into Docker images. You will build your own image, use that image to run an application locally, and finally, push the newly create images to Docker Cloud.

The Docker documentation gives a great explanation on how storage works with Docker images and containers, but here’s the highlights.

A Docker image is built up from a series of layers. Each layer represents an instruction in the image’s Dockerfile. Each layer except the very last one is read-only. Consider the following Dockerfile:

    FROM ubuntu:15.04
    COPY . /app
    RUN make /app
    CMD python /app/app.py

This Dockerfile contains four commands, each of which creates a layer. The FROM statement starts out by creating a layer from the ubuntu:15.04 image. The COPY command adds some files from your Docker client’s current directory. The RUN command builds your application using the make command. Finally, the last layer specifies what command to run within the container.

Multiple Containers can use the same Image. Each container has its own writable container layer, and all changes are stored in this container layer, multiple containers can share access to the same underlying image and yet have their own data state. The diagram below shows multiple containers sharing the same Ubuntu 15.04 image.

The following exercises will help to illustrate those concepts in practice.

Let’s start by looking at layers and how files written to a container are managed by something called copy on write.

Docker images are the basis of containers. In the previous example, you pulled the dockersamples/static-site image from the registry and asked the Docker client to run a container based on that image. To see the list of images that are available locally on your system, run the docker images command.

$ docker images
REPOSITORY                  TAG                 IMAGE ID            CREATED             SIZE
dockersamples/static-site   latest              92a386b6e686        2 hours ago        190.5 MB
nginx                       latest              af4b3d7d5401        3 hours ago        190.5 MB
python                      2.7                 1c32174fd534        14 hours ago        676.8 MB
postgres                    9.4                 88d845ac7a88        14 hours ago        263.6 MB
containous/traefik          latest              27b4e0c6b2fd        4 days ago          20.75 MB
node                        0.10                42426a5cba5f        6 days ago          633.7 MB
redis                       latest              4f5f397d4b7c        7 days ago          177.5 MB
mongo                       latest              467eb21035a8        7 days ago          309.7 MB
alpine                      3.3                 70c557e50ed6        8 days ago          4.794 MB
java                        7                   21f6ce84e43c        8 days ago          587.7 MB

Above is a list of images that we’ve pulled from the Docker registry and images I created myself (we’ll shortly see how). You will have a different list of images on your machine. The TAG refers to a particular snapshot of the image and the ID is the corresponding unique identifier or hash for that image.

For simplicity, you can think of an image functions similarly to a git repository - images can be committed with changes and have multiple versions. When you do not provide a specific version number, the client defaults to latest.

  1. Pull a specific version of ubuntu image as follows:

    $ docker image pull ubuntu:12.04
    

    Note If you do not specify the version number of the image then, as mentioned, the Docker client will default to a version named latest.

  2. So for example, the docker image pull command given below will always pull the latest tag of an image. The example below pulls ubuntu:latest by default.

    $ docker image pull ubuntu
    

To get a new Docker image you can either get it from a registry (such as the Docker Hub) or create your own. There are hundreds of thousands of images available on Docker Hub. You can also search for images directly from the command line using docker search.

An important distinction with regard to images is between base images and child images.

Another key concept is the idea of official images and user images. (Both of which can be base images or child images.)

Task 2: Layers and Copy on Write

  1. Pull the Debian:Buster image

    $ docker image pull ubuntu:jammy
    jammy: Pulling from library/ubuntu
    952b15bbc7fb: Pull complete
    Digest: sha256:ac58ff7fe25edc58bdf0067ca99df00014dbd032e2246d30a722fa348fd799a5
    Status: Downloaded newer image for ubuntu:jammy
    docker.io/library/ubuntu:jammy
    
  2. Pull a MariaDB image

    $ docker image pull mariadb:11
    11: Pulling from library/mariadb
    6c7698a779f6: Already exists
    c3beef926275: Pull complete
    dd40ffbb6cb3: Pull complete
    31691bc52e3b: Pull complete
    0b4de91620aa: Pull complete
    1ecbfd4a00bd: Pull complete
    91656c5c74a8: Pull complete
    fbc99aa6f426: Pull complete
    Digest: sha256:b85481f8f2a65c10dec198e562a751676e926da83018e5590d00be86e5c9f635
    Status: Downloaded newer image for mariadb:11
    docker.io/library/mariadb:11
    

    What do you notice about the output from the Docker pull request for MySQL?

    The first layer pulled says:

    6c7698a779f6: Already exists

    Notice that the layer id (6c7698a779f6) is the same for the first layer of the MySQl image and the only layer in the Debian:Jessie image. And because we already had pulled that layer when we pulled the Debian image, we didn’t have to pull it again.

    So, what does that tell us about the MySQL image? Since each layer is created by a line in the image’s Dockerfile, we know that the MySQL image is based on the Debian:Jessie base image. We can confirm this by looking at the Dockerfile for the Docker Hub.

    The first line in the the Dockerfile is: FROM ubuntu:jammy This will import that layer into the MySQL image.

    So layers are created by the Dockerfile and are shared between images. When you start a container, a writeable layer is added to the base image.

    Next you will create a file in our container, and see how that’s represented on the host file system.

  3. Start a Debian container, shell into it.

    $ docker run --tty --interactive --name debian debian:stretch-slim bash
    root@e09203d84deb:/#
    
  4. Create a file and then list out the directory to make sure it’s there:

    root@e09203d84deb:/# touch test-file
    root@e09203d84deb:/# ls
    bin  boot  dev	etc  home  lib	lib64  media  mnt  opt	proc  root  run  sbin  srv  sys  test-file  tmp  usr  var
    

    We can see test-file exists in the root of the containers file system.

    What has happened is that when a new file was written to the disk, the Docker storage driver placed that file in it’s own layer. This is called copy on write - as soon as a change is detected the change is copied into the writeable layer. That layers is represented by a directory on the host file system. All of this is managed by the Docker storage driver.

  5. Exit the container but leave it running by pressing ctrl-p and then ctrl-q

    Our Docker host utilizes OverlayFS with the overlay2 storage driver.

    OverlayFS layers two directories on a single Linux host and presents them as a single directory. These directories are called layers and the unification process is referred to as a union mount. OverlayFS refers to the lower directory as lowerdir and the upper directory a upperdir. “Upper” and “Lower” refer to when the layer was added to the image. In our example the writeable layer is the most “upper” layer. The unified view is exposed through its own directory called merged.

  6. Stop the container

    $ docker container stop debian
    
  7. Ensure that your debian container still exists

    $ docker container ls --all
    CONTAINER ID        IMAGE               COMMAND             CREATED             STATUS           PORTS               NAMES
    674d7abf10c6        debian:stretch-slim "bash"              36 minutes ago      Exited (0) 2 minutes ago                       debian
    
  8. Start the Debian container again

    $ docker container start debian
    
  9. Attach to the Debian container hit enter twice after completing the command

    $ docker container attach debian
    

    Because the container still exists, the files are still available on your file system. At this point the file we created previously still exists.

    However, if we remove the container, the directories on the host file system will be removed, and your changes will be gone.

  10. Remove the container and list the directory contents

    $ docker container rm debian
    debian
    

    The files that were created are now gone and the container now reverts back to the base image which it was created from if we start it again.

Task 3: Understanding Docker Volumes

Docker volumes are directories on the host file system that are not managed by the storage driver. Since they are not managed by the storage drive they offer a couple of important benefits.

Volumes can be anonymous or named. Anonymous volumes have no way to be explicitly referenced. They are almost exclusively used for performance reasons as you cannot persist data effectively with anonymous volumes. Named volumes can be explicitly referenced so they can be used to persist data and increase performance.

The next sections will cover both anonymous and named volumes.

Special Note: These next sections were adapted from Arun Gupta’s excellent tutorial(Outdated) on persisting data with MySQL.

Anonymous Volumes

Take a look at the MySQL Dockerfile you will find the following line:

VOLUME /var/lib/mysql

This line sets up an anonymous volume in order to increase database performance by avoiding sending a bunch of writes through the Docker storage driver.

Note: An anonymous volume is a volume that hasn’t been explicitly named. This means that it’s extremely difficult to use the volume later with a new container. Named volumes solve that problem, and will be covered later in this section.

  1. Start a MySQL container

    $ docker run --name mysqldb -e MYSQL_USER=mysql -e MYSQL_PASSWORD=mysql -e MYSQL_DATABASE=sample -e MYSQL_ROOT_PASSWORD=supersecret -d mysql
    acf185dc16e274b2f332266a1bfc6d1df7d7b4f780e6a7ec6716b40cafa5b3c3
    

    When we start the container the anonymous volume is created:

  2. Use Docker inspect to view the details of the anonymous volume

    $ docker inspect -f "in the  container  is mapped to " mysqldb
    

    This command will return: in the /mysqldb container /var/lib/mysql is mapped to /var/lib/docker/volumes/cd79b3301df29d13a068d624467d6080354b81e34d794b615e6e93dd61f89628/_data

    As mentioned anonymous volumes will not persist data between containers, they are almost always used to increase performance.

  3. Shell into your running MySQL container and log into MySQL

    $ docker exec --tty --interactive mysqldb bash
    
    root@132f4b3ec0dc:/# mysql --user=mysql --password=mysql
    mysql: [Warning] Using a password on the command line interface can be insecure.
    Welcome to the MySQL monitor.  Commands end with ; or \g.
    Your MySQL connection id is 3
    Server version: 5.7.19 MySQL Community Server (GPL)
    
    Copyright (c) 2000, 2017, Oracle and/or its affiliates. All rights reserved.
    
    Oracle is a registered trademark of Oracle Corporation and/or its
    affiliates. Other names may be trademarks of their respective
    owners.
    
    Type 'help;' or '\h' for help. Type '\c' to clear the current input statement.
    
  4. Create a new table

    mysql> show databases;
    +--------------------+
    | Database           |
    +--------------------+
    | information_schema |
    | sample             |
    +--------------------+
    2 rows in set (0.00 sec)
    
    mysql> connect sample;
    Connection id:    4
    Current database: sample
    
    mysql> show tables;
    Empty set (0.00 sec)
    
    mysql> create table user(name varchar(50));
    Query OK, 0 rows affected (0.01 sec)
    
    mysql> show tables;
    +------------------+
    | Tables_in_sample |
    +------------------+
    | user             |
    +------------------+
    1 row in set (0.00 sec)
    
  5. Exit MySQL and the MySQL container.

    mysql> exit
    Bye
    
    root@132f4b3ec0dc:/# exit
    exit
    
  6. Stop the container and restart it

    $ docker stop mysqldb
    mysqldb
    
    $ docker start mysqldb
    mysqldb
    
  7. Shell back into the running container and log into MySQL

    $ docker exec --interactive --tty mysqldb bash
    
    root@132f4b3ec0dc:/# mysql --user=mysql --password=mysql
    mysql: [Warning] Using a password on the command line interface can be insecure.
    Welcome to the MySQL monitor.  Commands end with ; or \g.
    Your MySQL connection id is 3
    Server version: 5.7.19 MySQL Community Server (GPL)
    
    Copyright (c) 2000, 2017, Oracle and/or its affiliates. All rights reserved.
    
    Oracle is a registered trademark of Oracle Corporation and/or its
    affiliates. Other names may be trademarks of their respective
    owners.
    
    Type 'help;' or '\h' for help. Type '\c' to clear the current input statement.
    
  8. Ensure the table created previously table still exists

    mysql> connect sample;
    Reading table information for completion of table and column names
    You can turn off this feature to get a quicker startup with -A
    
    Connection id:    4
    Current database: sample
    
    myslq> show tables;
    +------------------+
    | Tables_in_sample |
    +------------------+
    | user             |
    +------------------+
    1 row in set (0.00 sec)
    
  9. Exit MySQL and the MySQL container.

    mysql> exit
    Bye
    
    root@132f4b3ec0dc:/# exit
    exit
    

    The table persisted across container restarts, which is to be expected. In fact, it would have done this whether or not we had actually used a volume as shown in the previous section.

  10. Let’s look at the volume again

    $ docker inspect -f "in the  container  is mapped to " mysqldb
    in the /mysqldb container /var/lib/mysql is mapped to /var/lib/docker/volumes/cd79b3301df29d13a068d624467d6080354b81e34d794b615e6e93dd61f89628/_data
    

    We do see the volume was not affected by the container restart either.

    Where people often get confused is in expecting that the anonymous volume can be used to persist data BETWEEN containers.

    To examine that delete the old container, create a new one with the same command, and check to see if the table exists.

  11. Remove the current MySQL container

    $ docker container rm --force mysqldb
    mysqldb
    
  12. Start a new container with the same command that was used before

    $ docker run --name mysqldb -e MYSQL_USER=mysql -e MYSQL_PASSWORD=mysql -e MYSQL_DATABASE=sample -e MYSQL_ROOT_PASSWORD=supersecret -d mysql
    eb15eb4ecd26d7814a8da3bb27cee1a23304fab1961358dd904db37c061d3798
    
  13. List out the volume details for the new container

    $ docker inspect -f "in the  container  is mapped to " mysqldb
    in the /mysqldb container /var/lib/mysql is mapped to /var/lib/docker/volumes/e0ffdc6b4e0cfc6e795b83cece06b5b807e6af1b52c9d0b787e38a48e159404a/_data
    

    Notice this directory is different than before.

  14. Shell back into the running container and log into MySQL

    $ docker exec --interactive --tty mysqldb bash
    
    root@132f4b3ec0dc:/# mysql --user=mysql --password=mysql
    mysql: [Warning] Using a password on the command line interface can be insecure.
    Welcome to the MySQL monitor.  Commands end with ; or \g.
    Your MySQL connection id is 3
    Server version: 5.7.19 MySQL Community Server (GPL)
    
    Copyright (c) 2000, 2017, Oracle and/or its affiliates. All rights reserved.
    
    Oracle is a registered trademark of Oracle Corporation and/or its
    affiliates. Other names may be trademarks of their respective
    owners.
    
    Type 'help;' or '\h' for help. Type '\c' to clear the current input statement.
    
  15. Check to see if table created previously table still exists

    mysql> connect sample;
    Connection id:    4
    Current database: sample
    
    mysql> show tables;
    Empty set (0.00 sec)
    
  16. Exit MySQL and the MySQL container.

    mysql> exit
    Bye
    
    root@132f4b3ec0dc:/# exit
    exit
    
  17. Remove the container

    docker container rm --force mysqldb
    mysqldb
    

So while a volume was used to store the new table in the original container, because it wasn’t a named volume the data could not be persisted between containers.

To achieve persistence a named volume should be used.

Named Volumes

A named volume (as the name implies) is a volume that’s been explicitly named and can easily be referenced.

A named volume can be created on the command line, in a docker-compose file, and when you start a new container. They CANNOT be created as part of the image’s dockerfile.

  1. Start a MySQL container with a named volume (mydbdata)

    $ docker run --name mysqldb \
    -e MYSQL_USER=mysql \
    -e MYSQL_PASSWORD=mysql \
    -e MYSQL_DATABASE=sample \
    -e MYSQL_ROOT_PASSWORD=supersecret \
    --detach \
    --mount type=volume,source=mydbdata,target=/var/lib/mysql \
    mysql
    

    Because the newly created volume is empty, Docker will copy over whatever existed in the container at /var/lib/mysql when the container starts.

    Docker volumes are primitives just like images and containers. As such, they can be listed and removed in the same way.

  2. List the volumes on the Docker host

    $ docker volume ls
    DRIVER              VOLUME NAME
    local               55c322b9c4a644a5284ccb5e4d7b6b466a0534e26d57c9ef4221637d39cf9a88
    local               cc44059d23e0a914d4390ea860fd35b2acdaa480e83c025fb381da187b652a66
    local               e0ffdc6b4e0cfc6e795b83cece06b5b807e6af1b52c9d0b787e38a48e159404a
    local               mydbdata
    
  3. Inspect the volume

    $ docker inspect mydbdata
    [
        {
            "CreatedAt": "2017-10-13T19:55:10Z",
            "Driver": "local",
            "Labels": null,
            "Mountpoint": "/var/lib/docker/volumes/mydbdata/_data",
            "Name": "mydbdata",
            "Options": {},
            "Scope": "local"
        }
    ]
    

    Any data written to /var/lib/mysql in the container will be rerouted to /var/lib/docker/volumes/mydbdata/_data instead.

  4. Shell into your running MySQL container and log into MySQL

    $ docker exec --tty --interactive mysqldb bash
    
    root@132f4b3ec0dc:/# mysql --user=mysql --password=mysql
    mysql: [Warning] Using a password on the command line interface can be insecure.
    Welcome to the MySQL monitor.  Commands end with ; or \g.
    Your MySQL connection id is 3
    Server version: 5.7.19 MySQL Community Server (GPL)
    
    Copyright (c) 2000, 2017, Oracle and/or its affiliates. All rights reserved.
    
    Oracle is a registered trademark of Oracle Corporation and/or its
    affiliates. Other names may be trademarks of their respective
    owners.
    
    Type 'help;' or '\h' for help. Type '\c' to clear the current input statement.
    
  5. Create a new table

    mysql> connect sample;
    Connection id:    4
    Current database: sample
    
    mysql> show tables;
    Empty set (0.00 sec)
    
    mysql> create table user(name varchar(50));
    Query OK, 0 rows affected (0.01 sec)
    
    mysql> show tables;
    +------------------+
    | Tables_in_sample |
    +------------------+
    | user             |
    +------------------+
    1 row in set (0.00 sec)
    
  6. Exit MySQL and the MySQL container.

    mysql> exit
    Bye
    
    root@132f4b3ec0dc:/# exit
    exit
    
  7. Remove the MySQL container

    $ docker container rm --force mysqldb
    

    Because the MySQL was writing out to a named volume, we can start a new container with the same data.

    When the container starts it will not overwrite existing data in a volume. So the data created in the previous steps will be left intact and mounted into the new container.

  8. Start a new MySQL container

    $ docker run --name new_mysqldb \
    -e MYSQL_USER=mysql \
    -e MYSQL_PASSWORD=mysql \
    -e MYSQL_DATABASE=sample \
    -e MYSQL_ROOT_PASSWORD=supersecret \
    --detach \
    --mount type=volume,source=mydbdata,target=/var/lib/mysql \
    mysql
    
  9. Shell into your running MySQL container and log into MySQL

    $ docker exec --tty --interactive new_mysqldb bash
    
    root@132f4b3ec0dc:/# mysql --user=mysql --password=mysql
    mysql: [Warning] Using a password on the command line interface can be insecure.
    Welcome to the MySQL monitor.  Commands end with ; or \g.
    Your MySQL connection id is 3
    Server version: 5.7.19 MySQL Community Server (GPL)
    
    Copyright (c) 2000, 2017, Oracle and/or its affiliates. All rights reserved.
    
    Oracle is a registered trademark of Oracle Corporation and/or its
    affiliates. Other names may be trademarks of their respective
    owners.
    
    Type 'help;' or '\h' for help. Type '\c' to clear the current input statement.
    
  10. Check to see if the previously created table exists in your new container.

    mysql> connect sample;
    Reading table information for completion of table and column names
    You can turn off this feature to get a quicker startup with -A
    
    Connection id:    4
    Current database: sample
    
    mysql> show tables;
    +------------------+
    | Tables_in_sample |
    +------------------+
    | user             |
    +------------------+
    1 row in set (0.00 sec)
    

    The data will exist until the volume is explicitly deleted.

  11. Exit MySQL and the MySQL container.

    mysql> exit
    Bye
    
    root@132f4b3ec0dc:/# exit
    exit
    
  12. Remove the new MySQL container and volume

    $ docker container rm --force new_mysqldb
    new_mysqldb
    
    $ docker volume rm mydbdata
    mydbdata
    

    If a new container was started with the previous command, it would create a new empty volume.

Let’s Take a Break then continue to WebApps Part 2

For the next step in the tutorial, head over to Webapps with Docker - Part Two