Article Categories

Selected Reading

Best practices for writing a Dockerfile

Docker Operating System Open Source

A Dockerfile is a text document containing instructions to build a container image. It allows developers to create reproducible execution environments and automate the deployment process. Writing an efficient Dockerfile is crucial for project performance, especially when deploying at scale.

Docker images follow a layered architecture where each instruction creates a new layer. This design enables image reuse, efficient disk storage utilization, and caching during the build process. Understanding this layered structure is key to optimizing your Dockerfile.

Order Instructions by Change Frequency

Place least frequently changing instructions first in your Dockerfile. When a line changes and its cache becomes invalid, all subsequent layers must be rebuilt. This principle significantly impacts build performance.

Poor Example

FROM python:3

WORKDIR /usr/src/myapp

# Application files change frequently
COPY . .

# System packages change rarely
RUN apt-get -y update
RUN apt-get -y install vim

EXPOSE 8887
CMD ["python3", "./file.py"]

Optimized Example

FROM python:3

WORKDIR /usr/src/myapp

# Install system dependencies first (rarely change)
RUN apt-get -y update
RUN apt-get -y install vim

# Copy application files last (frequently change)
COPY . .

EXPOSE 8887
CMD ["python3", "./file.py"]

Leverage Build Cache Effectively

Docker checks for existing image layers in cache before creating new ones. Each instruction execution looks for cached layers first. Using the --no-cache flag forces Docker to rebuild all layers, which should be avoided unless necessary.

Install Only Required Packages

Unnecessary packages increase build time and image size. Use a requirements.txt file to track dependencies precisely:

COPY requirements.txt .
RUN pip3 install -r requirements.txt

Use .dockerignore Files

Similar to .gitignore, a .dockerignore file excludes unnecessary files and directories from the build context, improving build performance and reducing image size.

Write Specific COPY Commands

Copy only required files to avoid cache invalidation when unrelated files change:

# Instead of copying everything
COPY . .

# Copy specific files
COPY requirements.txt .
COPY src/ ./src/
COPY config/ ./config/

Chain Related RUN Commands

Each RUN instruction creates a new layer. Chain related commands to reduce layers and improve caching:

Before

RUN apt-get -y update
RUN apt-get -y install vim

After

RUN apt-get update \
    && apt-get -y install vim

Minimize Package Dependencies

Remove unnecessary package dependencies using the --no-install-recommends flag:

RUN apt-get update \
    && apt-get install -y --no-install-recommends \
        python3 \
        python3-pip \
    && apt-get clean \
    && rm -rf /var/lib/apt/lists/*

Best Practices Summary

Practice	Benefit	Impact
Order by change frequency	Better cache utilization	Faster builds
Chain RUN commands	Fewer layers	Smaller images
Use .dockerignore	Smaller build context	Faster transfers
Specific COPY statements	Avoid cache busting	Better performance
Minimal dependencies	Smaller attack surface	Security & size

Conclusion

Efficient Dockerfile writing focuses on leveraging Docker's layered caching system and minimizing unnecessary components. By ordering instructions properly, chaining commands, and copying only required files, you can significantly improve build performance and create optimized container images for production deployment.

Raunak Jain

Updated on: 2026-03-17T09:01:38+05:30

481 Views

Previous Next