Inline cache

The inline cache storage backend is the simplest way to get an external cache and is easy to get started with if you’re already building and pushing an image. However, it doesn’t scale to multi-stage builds as well as the other cache backends do. It also doesn’t offer separation between your output artifacts and your cache output. This means that if you’re using a particularly complex build flow, or not exporting your images directly to a registry, then you may want to consider the registry cache.

Synopsis

$ docker buildx build --push -t <registry>/<image> \
  --cache-to type=inline \
  --cache-from type=registry,ref=<registry>/<image> .

No additional parameters are supported for the inline cache.

To export cache using inline storage, pass type=inline to the --cache-to option:

$ docker buildx build --push -t <registry>/<image> \
  --cache-to type=inline .

Alternatively, you can export the inline cache by setting the build argument BUILDKIT_INLINE_CACHE=1, instead of using the --cache-to flag:

$ docker buildx build --push -t <registry>/<image> \
  --build-arg BUILDKIT_INLINE_CACHE=1 .

To import the resulting cache on a future build, pass type=registry to --cache-from, which lets you extract the cache from inside a Docker image in the specified registry:

$ docker buildx build --push -t <registry>/<image> \
  --cache-from type=registry,ref=<registry>/<image> .

Further reading

For an introduction to caching, see Optimizing builds with cache.

For more information on the inline cache backend, see the BuildKit README.

Local cache

The local cache store is a simple cache option that stores your cache as files in a directory on your filesystem, using an OCI image layout for the underlying directory structure. Local cache is a good choice if you’re just testing, or if you want the flexibility to self-manage a shared storage solution.

Note

This cache storage backend requires using a different driver than the default docker driver - see more information on selecting a driver here. To create a new driver (which can act as a simple drop-in replacement):

$ docker buildx create --use --driver=docker-container

Synopsis

$ docker buildx build --push -t <registry>/<image> \
  --cache-to type=local,dest=path/to/local/dir[,parameters...] \
  --cache-from type=local,src=path/to/local/dir .

The following table describes the available CSV parameters that you can pass to --cache-to and --cache-from.

| Name              | Option     | Type                | Default | Description                                                     |
|-------------------|------------|---------------------|---------|-----------------------------------------------------------------|
| src               | cache-from | String              |         | Path of the local directory where cache gets imported from.    |
| digest            | cache-from | String              |         | Digest of manifest to import, see cache versioning.            |
| dest              | cache-to   | String              |         | Path of the local directory where cache gets exported to.      |
| mode              | cache-to   | min, max            | min     | Cache layers to export, see cache mode.                        |
| oci-mediatypes    | cache-to   | true, false         | true    | Use OCI media types in exported manifests, see OCI media types.|
| compression       | cache-to   | gzip, estargz, zstd | gzip    | Compression type, see cache compression.                       |
| compression-level | cache-to   | 0..22               |         | Compression level, see cache compression.                      |
| force-compression | cache-to   | true, false         | false   | Forcibly apply compression, see cache compression.             |

If the src cache doesn’t exist, then the cache import step will fail, but the build will continue.
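
For example, a hedged sketch that combines several of these parameters; mode=max and zstd compression are illustrative choices here, not defaults:

$ docker buildx build --push -t <registry>/<image> \
  --cache-to type=local,dest=path/to/local/dir,mode=max,compression=zstd \
  --cache-from type=local,src=path/to/local/dir .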

Cache versioning

This section describes how versioning works for caches on a local filesystem, and how you can use the digest parameter to use older versions of cache.

If you inspect the cache directory manually, you can see the resulting OCI image layout:

$ ls cache
blobs  index.json  ingest
$ cat cache/index.json | jq
{
  "schemaVersion": 2,
  "manifests": [
    {
      "mediaType": "application/vnd.oci.image.index.v1+json",
      "digest": "sha256:6982c70595cb91769f61cd1e064cf5f41d5357387bab6b18c0164c5f98c1f707",
      "size": 1560,
      "annotations": {
        "org.opencontainers.image.ref.name": "latest"
      }
    }
  ]
}

Like other cache types, local cache gets replaced on export, by replacing the contents of the index.json file. However, previous caches will still be available in the blobs directory. These old caches are addressable by digest, and kept indefinitely. Therefore, the size of the local cache will continue to grow (see moby/buildkit#1896 for more information).

When importing cache using --cache-from, you can specify the digest parameter to force loading an older version of the cache, for example:

$ docker buildx build --push -t <registry>/<image> \
  --cache-to type=local,dest=path/to/local/dir \
  --cache-from type=local,src=path/to/local/dir,digest=sha256:6982c70595cb91769f61cd1e064cf5f41d5357387bab6b18c0164c5f98c1f707 .
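
If you want to record the digest of the cache currently referenced by index.json (for example, before a later export replaces it), you can read it with jq. This is a minimal sketch, assuming jq is available on the machine that hosts the cache directory:

$ jq -r '.manifests[0].digest' path/to/local/dir/index.json
sha256:6982c70595cb91769f61cd1e064cf5f41d5357387bab6b18c0164c5f98c1f707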

Further reading

For an introduction to caching, see Optimizing builds with cache.

For more information on the local cache backend, see the BuildKit README.

Registry cache

The registry cache storage can be thought of as an extension to the inline cache. Unlike the inline cache, the registry cache is entirely separate from the image, which allows for more flexible usage - a registry-backed cache can do everything that the inline cache can do, and more:

  • It allows separating the cache from the resulting image artifacts, so you can distribute your final image without the cache inside.
  • It can efficiently cache multi-stage builds in max mode, instead of only the final stage.
  • It works with other exporters for more flexibility, instead of only the image exporter.

Note

This cache storage backend requires using a different driver than the default docker driver - see more information on selecting a driver here. To create a new driver (which can act as a simple drop-in replacement):

$ docker buildx create --use --driver=docker-container

Synopsis

Unlike the simpler inline cache, the registry cache supports several configuration parameters:

$ docker buildx build --push -t <registry>/<image> \
  --cache-to type=registry,ref=<registry>/<cache-image>[,parameters...] \
  --cache-from type=registry,ref=<registry>/<cache-image> .

The following table describes the available CSV parameters that you can pass to --cache-to and --cache-from.

| Name              | Option               | Type                | Default | Description                                                     |
|-------------------|----------------------|---------------------|---------|-----------------------------------------------------------------|
| ref               | cache-to, cache-from | String              |         | Full name of the cache image to import or export.              |
| mode              | cache-to             | min, max            | min     | Cache layers to export, see cache mode.                        |
| oci-mediatypes    | cache-to             | true, false         | true    | Use OCI media types in exported manifests, see OCI media types.|
| compression       | cache-to             | gzip, estargz, zstd | gzip    | Compression type, see cache compression.                       |
| compression-level | cache-to             | 0..22               |         | Compression level, see cache compression.                      |
| force-compression | cache-to             | true, false         | false   | Forcibly apply compression, see cache compression.             |

You can choose any valid value for ref, as long as it’s not the same as the target location that you push your image to. You might choose different tags (e.g. foo/bar:latest and foo/bar:build-cache), separate image names (e.g. foo/bar and foo/bar-cache), or even different repositories (e.g. docker.io/foo/bar and ghcr.io/foo/bar). It’s up to you to decide the strategy that you want to use for separating your image from your cache images.
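
For example, a sketch of the tag-based approach (the foo/bar names are illustrative), exporting the full build graph with mode=max:

$ docker buildx build --push -t docker.io/foo/bar:latest \
  --cache-to type=registry,ref=docker.io/foo/bar:build-cache,mode=max \
  --cache-from type=registry,ref=docker.io/foo/bar:build-cache .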

If the --cache-from target doesn’t exist, then the cache import step will fail, but the build will continue.

Further reading

For an introduction to caching, see Optimizing builds with cache.

For more information on the registry cache backend, see the BuildKit README.

Amazon S3 cache

Warning

This cache backend is unreleased. You can use it today by using the moby/buildkit:master image in your Buildx driver.

The s3 cache storage uploads your resulting build cache to the Amazon S3 file storage service, into a specified bucket.

Note

This cache storage backend requires using a different driver than the default docker driver - see more information on selecting a driver here. To create a new driver (which can act as a simple drop-in replacement):

$ docker buildx create --use --driver=docker-container

Synopsis

$ docker buildx build --push -t <user>/<image> \
  --cache-to type=s3,region=<region>,bucket=<bucket>,name=<cache-image>[,parameters...] \
  --cache-from type=s3,region=<region>,bucket=<bucket>,name=<cache-image> .

The following table describes the available CSV parameters that you can pass to --cache-to and --cache-from.

| Name              | Option               | Type     | Default | Description                             |
|-------------------|----------------------|----------|---------|-----------------------------------------|
| region            | cache-to, cache-from | String   |         | Geographic location.                    |
| bucket            | cache-to, cache-from | String   |         | Name of the S3 bucket used for caching. |
| name              | cache-to, cache-from | String   |         | Name of the cache image.                |
| access_key_id     | cache-to, cache-from | String   |         | See authentication.                     |
| secret_access_key | cache-to, cache-from | String   |         | See authentication.                     |
| session_token     | cache-to, cache-from | String   |         | See authentication.                     |
| mode              | cache-to             | min, max | min     | Cache layers to export, see cache mode. |

Authentication

access_key_id, secret_access_key, and session_token, if left unspecified, are read from environment variables on the BuildKit server, following the scheme for the AWS Go SDK. The environment variables are read from the server, not the Buildx client.
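
If the BuildKit server doesn’t have AWS credentials in its environment, you can pass them explicitly as CSV parameters instead. This is a sketch with placeholder values; keep in mind that credentials passed on the command line may end up in shell history or CI logs:

$ docker buildx build --push -t <user>/<image> \
  --cache-to type=s3,region=<region>,bucket=<bucket>,name=<cache-image>,access_key_id=<key-id>,secret_access_key=<secret-key> \
  --cache-from type=s3,region=<region>,bucket=<bucket>,name=<cache-image>,access_key_id=<key-id>,secret_access_key=<secret-key> .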

Further reading

For an introduction to caching, see Optimizing builds with cache.

For more information on the s3 cache backend, see the BuildKit README.

Garbage collection

While the docker builder prune and docker buildx prune commands run once, on demand, garbage collection runs periodically and follows an ordered list of prune policies.

Garbage collection runs in the BuildKit daemon. The daemon clears the build cache when the cache size becomes too big, or when the cache age expires. The following sections describe how you can configure both the size and age parameters by defining garbage collection policies.
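
To see what garbage collection has to work with, or to reclaim space immediately rather than waiting for the periodic run, you can use the buildx cache commands directly. A brief sketch; flag names may vary slightly across buildx versions, and the 10GB threshold is just an example:

$ docker buildx du --verbose                 # inspect the contents of the build cache
$ docker buildx prune --keep-storage=10GB    # prune now, keeping at most 10GB of cache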

Configuration

Depending on the driver used by your builder instance, the garbage collection will use a different configuration file.

If you’re using the docker driver, garbage collection can be configured in the Docker daemon configuration file:

{
  "builder": {
    "gc": {
      "enabled": true,
      "defaultKeepStorage": "10GB",
      "policy": [
          {"keepStorage": "10GB", "filter": ["unused-for=2200h"]},
          {"keepStorage": "50GB", "filter": ["unused-for=3300h"]},
          {"keepStorage": "100GB", "all": true}
      ]
    }
  }
}

For other drivers, garbage collection can be configured using the BuildKit configuration file:

[worker.oci]
  gc = true
  gckeepstorage = 10000
  [[worker.oci.gcpolicy]]
    keepBytes = 512000000
    keepDuration = 172800
    filters = [ "type==source.local", "type==exec.cachemount", "type==source.git.checkout"]
  [[worker.oci.gcpolicy]]
    all = true
    keepBytes = 1024000000

Default policies

Default garbage collection policies are applied to all builders if not already set:

GC Policy rule#0:
        All:            false
        Filters:        type==source.local,type==exec.cachemount,type==source.git.checkout
        Keep Duration:  48h0m0s
        Keep Bytes:     512MB
GC Policy rule#1:
        All:            false
        Keep Duration:  1440h0m0s
        Keep Bytes:     26GB
GC Policy rule#2:
        All:            false
        Keep Bytes:     26GB
GC Policy rule#3:
        All:            true
        Keep Bytes:     26GB
  • rule#0 : if the build cache uses more than 512MB, delete the most easily reproducible data after it hasn’t been used for 2 days.
  • rule#1 : remove any data not used for 60 days.
  • rule#2 : keep the unshared build cache under cap.
  • rule#3 : if the previous policies were insufficient, start deleting internal data to keep the build cache under cap.

Note

“Keep bytes” defaults to 10% of the size of the disk. If the disk size cannot be determined, it defaults to 2GB.

Optimizing builds with cache management

You will likely find yourself rebuilding the same Docker image over and over again, whether it’s for the next release of your software or locally during development. Because building images is a common task, Docker provides several tools that speed up builds.

The most important feature for improving build speeds is Docker’s build cache.

How does the build cache work?

Understanding Docker’s build cache helps you write better Dockerfiles that result in faster builds.

Have a look at the following example, which shows a simple Dockerfile for a program written in C.

# syntax=docker/dockerfile:1
FROM ubuntu:latest

RUN apt-get update && apt-get install -y build-essential
COPY main.c Makefile /src/
WORKDIR /src/
RUN make build

Each instruction in this Dockerfile translates (roughly) to a layer in your final image. You can think of image layers as a stack, with each layer adding more content on top of the layers that came before it:

[Figure: image layer diagram showing the above commands chained together one after the other]

Whenever a layer changes, that layer will need to be rebuilt. For example, suppose you make a change to your program in the main.c file. After this change, the COPY command will have to run again in order for those changes to appear in the image. In other words, Docker will invalidate the cache for this layer.

[Figure: image layer diagram, now with the link between COPY and WORKDIR marked as invalid]

If a layer changes, all other layers that come after it are also affected. When the layer with the COPY command gets invalidated, all layers that follow will need to run again, too:

[Figure: image layer diagram, now with all links after COPY marked as invalid]

And that’s the Docker build cache in a nutshell. Once a layer changes, then all downstream layers need to be rebuilt as well. Even if they wouldn’t build anything differently, they still need to re-run.

Note

Suppose you have a RUN apt-get update && apt-get upgrade -y step in your Dockerfile to upgrade all the software packages in your Debian-based image to the latest version.

This doesn’t mean that the images you build are always up to date. Rebuilding the image on the same host one week later will still get you the same packages as before. The only way to force a rebuild is by making sure that a layer before it has changed, or by clearing the build cache using docker builder prune .
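
A common workaround, not taken from the docs above, is to thread a build argument through the Dockerfile and change its value whenever you want to force that layer to rebuild. A hedged sketch, where CACHE_BUST is a hypothetical argument name:

# syntax=docker/dockerfile:1
FROM ubuntu:latest

# Hypothetical cache-busting argument: because the RUN below references it,
# changing its value invalidates that layer and every layer after it.
ARG CACHE_BUST=initial
RUN echo "cache bust: ${CACHE_BUST}" && apt-get update && apt-get upgrade -y

Passing a new value on each build, for example docker build --build-arg CACHE_BUST=$(date +%s) ., forces the upgrade step to re-run.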

How can I use the cache efficiently?

Now that you understand how the cache works, you can begin to use the cache to your advantage. While the cache will automatically work on any docker build that you run, you can often refactor your Dockerfile to get even better performance. These optimizations can save precious seconds (or even minutes) off of your builds.

Order your layers

Putting the commands in your Dockerfile into a logical order is a great place to start. Because a change causes a rebuild for steps that follow, try to make expensive steps appear near the beginning of the Dockerfile. Steps that change often should appear near the end of the Dockerfile, to avoid triggering rebuilds of layers that haven’t changed.

Consider the following example. A Dockerfile snippet that runs a JavaScript build from the source files in the current directory:

# syntax=docker/dockerfile:1
FROM node
WORKDIR /app
COPY . .          # Copy over all files in the current directory
RUN npm install   # Install dependencies
RUN npm run build # Run build

This Dockerfile is rather inefficient. Updating any file causes a reinstall of all dependencies every time you build the Docker image, even if the dependencies didn’t change since last time!

Instead, the COPY command can be split in two. First, copy over the package management files (in this case, package.json and yarn.lock ). Then, install the dependencies. Finally, copy over the project source code, which is subject to frequent change.

# syntax=docker/dockerfile:1
FROM node
WORKDIR /app
COPY package.json yarn.lock ./   # Copy package management files
RUN npm install                  # Install dependencies
COPY . .                         # Copy over project files
RUN npm run build                # Run build

By installing dependencies in earlier layers of the Dockerfile, there is no need to rebuild those layers when a project file has changed.

Keep layers small

One of the best things you can do to speed up image building is to just put less stuff into your build. Fewer parts means the cache stays smaller, but also that there are fewer things that could be out of date and need rebuilding.

To get started, here are a few tips and tricks:

Don’t include unnecessary files

Be considerate of what files you add to the image.

Running a command like COPY . /src will COPY your entire build context into the image. If you’ve got logs, package manager artifacts, or even previous build results in your current directory, those will also be copied over. This could make your image larger than it needs to be, especially as those files are usually not useful.

Avoid adding unnecessary files to your builds by explicitly stating the files or directories you intend to copy over. For example, you might only want to add a Makefile and your src directory to the image filesystem. In that case, consider adding this to your Dockerfile:

COPY ./src ./Makefile /src/

As opposed to this:

COPY . /src

You can also create a .dockerignore file, and use that to specify which files and directories to exclude from the build context.
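
For example, a minimal .dockerignore; the entries are illustrative and should be adapted to your project:

# .dockerignore: exclude these from the build context
.git
node_modules
*.log
dist/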

Use your package manager wisely

Most Docker image builds involve using a package manager to help install software into the image. Debian has apt , Alpine has apk , Python has pip , NodeJS has npm , and so on.

When installing packages, be considerate. Make sure to only install the packages that you need. If you’re not going to use them, don’t install them. Remember that this might be a different list for your local development environment and your production environment. You can use multi-stage builds to split these up efficiently.
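
For Debian-based images, a common way to keep the install small (a sketch; curl and ca-certificates are just example packages) is to skip recommended packages and clean up the apt lists in the same layer:

RUN apt-get update && \
    apt-get install -y --no-install-recommends curl ca-certificates && \
    rm -rf /var/lib/apt/lists/*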

Use the dedicated RUN cache

The RUN command supports a specialized cache, which you can use when you need a more fine-grained cache between runs. For example, when installing packages, you don’t always need to fetch all of your packages from the internet each time. You only need the ones that have changed.

To solve this problem, you can use RUN --mount type=cache . For example, for your Debian-based image you might use the following:

RUN \
    --mount=type=cache,target=/var/cache/apt \
    apt-get update && apt-get install -y git

Using the explicit cache with the --mount flag keeps the contents of the target directory preserved between builds. When this layer needs to be rebuilt, it’ll reuse the apt cache in /var/cache/apt.
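
Note that the Debian and Ubuntu base images ship an apt hook (/etc/apt/apt.conf.d/docker-clean) that deletes downloaded packages after each install, which largely defeats the cache mount. A hedged variant of the example above that removes the hook and locks the shared cache so concurrent builds don’t clash:

RUN rm -f /etc/apt/apt.conf.d/docker-clean
RUN \
    --mount=type=cache,target=/var/cache/apt,sharing=locked \
    apt-get update && apt-get install -y git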

Minimize the number of layers

Keeping your layers small is a good first step, and the logical next step is to reduce the number of layers that you have. Fewer layers mean that you have less to rebuild when something in your Dockerfile changes, so your builds complete faster.

The following sections outline some tips you can use to keep the number of layers to a minimum.

Use an appropriate base image

Docker provides over 170 pre-built official images for almost every common development scenario. For example, if you’re building a Java web server, use a dedicated image such as openjdk . Even when there’s not an official image for what you might want, Docker provides images from verified publishers and open source partners that can help you on your way. The Docker community often produces third-party images to use as well.

Using official images saves you time and ensures you stay up to date and secure by default.

Use multi-stage builds

Multi-stage builds let you split up your Dockerfile into multiple distinct stages. Each stage completes a step in the build process, and you can bridge the different stages to create your final image at the end. The Docker builder will work out dependencies between the stages and run them using the most efficient strategy. This even allows you to run multiple builds concurrently.

Multi-stage builds use two or more FROM commands. The following example illustrates building a simple web server that serves HTML from your docs directory in Git:

# syntax=docker/dockerfile:1

# stage 1
FROM alpine as git
RUN apk add git

# stage 2
FROM git as fetch
WORKDIR /repo
RUN git clone https://github.com/your/repository.git .

# stage 3
FROM nginx as site
COPY --from=fetch /repo/docs/ /usr/share/nginx/html

This build has 3 stages: git, fetch, and site. In this example, git is the base for the fetch stage. The site stage uses the COPY --from flag to copy the data from the docs/ directory into the Nginx server directory.

Each stage has only a few instructions, and when possible, Docker will run these stages in parallel. Only the instructions in the site stage will end up as layers in the final image. The entire git history doesn’t get embedded into the final result, which helps keep the image small and secure.

Combine commands wherever possible

Most Dockerfile commands, and RUN commands in particular, can often be joined together. For example, instead of using RUN like this:

RUN echo "the first command"
RUN echo "the second command"

It’s possible to run both of these commands inside a single RUN, which means that they will share the same cache. This is achievable using the && shell operator to run one command after another:

RUN echo "the first command" && echo "the second command"
# or to split to multiple lines
RUN echo "the first command" && \
    echo "the second command"

Another shell feature that allows you to simplify and concatenate commands in a neat way is the heredoc. It enables you to create multi-line scripts with good readability:

RUN <<EOF
set -e
echo "the first command"
echo "the second command"
EOF

(Note the set -e command to exit immediately after any command fails, instead of continuing.)

Other resources

For more information on using cache to do efficient builds, see:

  • Garbage collection
  • Cache storage backends