<h2 id="content"><a href="#content" aria-hidden="true" tabindex="-1"><span class="icon icon-link"></span></a>Content</h2>
<ul>
<li><a href="#introduction">Introduction</a></li>
<li><a href="#breaking-down-the-problems">Breaking down the problems</a>
<ul>
<li><a href="#s3-the-default-root-object">S3 the default root object</a></li>
<li><a href="#nested-folders">Nested folders</a></li>
</ul>
</li>
<li><a href="#s3-folders">S3 Folders</a></li>
<li><a href="#remapping-the-s3-object-path">Remapping the S3 object path</a></li>
<li><a href="#conclusion">Conclusion</a></li>
</ul>
<h2 id="introduction"><a href="#introduction" aria-hidden="true" tabindex="-1"><span class="icon icon-link"></span></a>Introduction</h2>
<p>When I first tried this out... I was surprised that this does not work out of the box.</p>
<p>After some research, I found out that this is another one of those AWS gotchas.</p>
<p>Well, not really a gotcha but a small misunderstanding of how S3 actually works.</p>
<p>When working with Amazon S3 and Cloudfront, you can specify a root object.</p>
<p>For example, for a static site, you may choose to use <code class="language-">index.html
</code> at the root folder.</p>
<p>However, what happens when you have a multi-page site — such as when you are using a framework like Astro ?</p>
<div style="margin: 2em auto;">
  <div style="display:flex;justify-content:center;">
    <img src="/images/multi-page-astro-asset-outputs.png" alt="Illustrations of the Astro build outputs" style="width:80%" />
  </div>
  <div style="display:flex;justify-content:center;">
    <blockquote>Illustrations of the Astro build outputs</blockquote>
  </div>
</div>
<p>When I was working on the Astro technical series, I came across this problem.</p>
<p>So, this article will highlight the problem, the why and also propose a solution to fix this!</p>
<h2 id="breaking-down-the-problems"><a href="#breaking-down-the-problems" aria-hidden="true" tabindex="-1"><span class="icon icon-link"></span></a>Breaking down the problems</h2>
<p>Let’s first break down the problem.</p>
<h3 id="s3-the-default-root-object"><a href="#s3-the-default-root-object" aria-hidden="true" tabindex="-1"><span class="icon icon-link"></span></a>S3 the default root object</h3>
<p>When you setup S3 with Cloudfront, there is an option to provide a default root object.</p>
<p>My understanding is that this option basically will default to fetching a particular S3 object when you request for the root path (<code class="language-">/
</code>).</p>
<p>Let’s say for example, we set the default root object as <code class="language-">index.html
</code>.</p>
<p>In our example of a site built using Astro, Cloudfront will automatically map the root path to the default S3 object.</p>
<p>It will notice the root path (<code class="language-">/
</code>) and direct that request to fetch the <code class="language-">index.html
</code> object in the S3 bucket.</p>
<div style="margin: 2em auto;">
  <div style="display:flex;justify-content:center;">
    <img src="/images/multi-page-default-index-s3-bucket.png" alt="Illustration of fetching the default root object" style="width:80%" />
  </div>
  <div style="display:flex;justify-content:center;">
    <blockquote>Illustration of fetching the default root object</blockquote>
  </div>
</div>
<p>Well, what happens when we have multiple pages ?</p>
<h3 id="nested-folders"><a href="#nested-folders" aria-hidden="true" tabindex="-1"><span class="icon icon-link"></span></a>Nested folders</h3>
<p>In multi-page sites, it’s common to have nested folder structures.</p>
<p>However, the problem with the S3 and Cloudfront setup, is that, when you try to request a folder, it doesn’t know what to do.</p>
<div style="margin: 2em auto;">
  <div style="display:flex;justify-content:center;">
    <img src="/images/multi-page-request-about-failure.png" alt="Illustration of failure when requesting a folder in S3" style="width:80%" />
  </div>
  <div style="display:flex;justify-content:center;">
    <blockquote>Illustration of failure when requesting a folder in S3</blockquote>
  </div>
</div>
<p>This is actually not S3’s fault. It was designed that way.</p>
<p>To better understand why this fails, we need to revisit the following: <strong>What does a folder in S3 really mean ?</strong></p>
<h2 id="s3-folders"><a href="#s3-folders" aria-hidden="true" tabindex="-1"><span class="icon icon-link"></span></a>S3 Folders</h2>
<p>Amazon S3 as a blob storage may give the impression that it has a hierarchical structure but it technically does not.</p>
<p>In reality, it has a flat structure, similar to a Key-Value store.</p>
<p>What it does behind the scenes is that it’ll organize your files in a way that it creates this hierarchy like a filesystem.</p>
<p>Amazon S3 stores these blobs or objects based on “prefixes”, and these keys will act like namespaces used to group objects together.</p>
<p><strong>For example:</strong></p>
<ul>
<li>
<p>"assets/image1.jpeg"</p>
</li>
<li>
<p>"assets/image2.jpeg"</p>
</li>
</ul>
<p>In our example, both of these files share the prefix of "assets", the folder is used as a key prefix to group these files together.</p>
<div style="margin: 2em auto;">
  <div style="display:flex;justify-content:center;">
    <img src="/images/multi-page-s3-folder-prefix.png" alt="Illustration of how S3 uses folder prefixes" style="width:80%" />
  </div>
  <div style="display:flex;justify-content:center;">
    <blockquote>Illustration of how S3 uses folder prefixes</blockquote>
  </div>
</div>
<blockquote class="common">
<p><b>⚠️  Important:</b></p>
<p>You only need to perform this “remapping” if you are using Cloudfront with Origin Access Identity (OAI) to access S3.</p>
<p>This is because when Cloudfront + OAI tries to access S3, it is using the S3 REST API endpoint (<code class="language-">s3:getObject
</code>) to access the resources.</p>
<p>This approach does not support redirection to a default index, hence, it would return with an error.</p>
<p>On the other hand, if you are just using S3 with the website hosting (ie <code class="language-">&lt;bucket-name&gt;.s3-website-&lt;AWS-region&gt;.amazonaws.com
</code>), this redirection will happen by default.</p>
<p>So, requesting for <code class="language-">/about
</code> will redirect paths to the <code class="language-">/about/index.html
</code>.</p>
<p>A subtle distinction between the two, nevertheless, it is important to point out.</p>
</blockquote>
<p><strong>How does this relate to multi-page support ?</strong></p>
<p>This means that when we request for a folder, we are essentially making a get request (<code class="language-">s3 get-bucket
</code>) for a prefix.</p>
<p>This leads to an error because S3 doesn’t know what to do in this case because technically this is not a S3 object.</p>
<p>Knowing all of this, what can we do about it in our multi-page site ?</p>
<h2 id="remapping-the-s3-object-path"><a href="#remapping-the-s3-object-path" aria-hidden="true" tabindex="-1"><span class="icon icon-link"></span></a>Remapping the S3 object path</h2>
<p>One way to solve this problem is by remapping the S3 object path.</p>
<p>We can leverage the power of edge functions on Cloudfront to do this.</p>
<p>This can done through the use of the viewer request function on cloudfront, which is a type function that will run before a request comes into cloudfront.</p>
<p>That way, if any request does not have an extension or a filename, we can direct it to a S3 object of our choice.</p>
<p><strong>In our case, we can do the following:</strong></p>
<ul>
<li>
<p>Remap all request uri with no filename to default to <code class="language-">index.html
</code> object (<code class="language-">/*
</code> -> <code class="language-">/*/index.html
</code>)</p>
</li>
<li>
<p>Remap all request uri with no extensions to default to <code class="language-">index.html
</code> object (<code class="language-">/*
</code> -> <code class="language-">/*/index.html
</code>)</p>
</li>
</ul>
<p>This should fulfill all our needs for the multi-page site!</p>
<p>After adding this change, when we make a request for <code class="language-">/about
</code> it would be remap the uri to <code class="language-">/about/index.html
</code>.</p>
<div style="margin: 2em auto;">
  <div style="display:flex;justify-content:center;">
    <img src="/images/multi-page-cloudfront-function-url-remapping.png" alt="Illustration of remapping the url" style="width:80%" />
  </div>
  <div style="display:flex;justify-content:center;">
    <blockquote>Illustration of remapping the url</blockquote>
  </div>
</div>
<h2 id="conclusion"><a href="#conclusion" aria-hidden="true" tabindex="-1"><span class="icon icon-link"></span></a>Conclusion</h2>
<p>We covered quite a bit in this article, let’s do a quick recap.</p>
<p><strong>Takeaways:</strong></p>
<ul>
<li>
<p>Think of Amazon S3’s in a way you’d think about a Key-Value stores</p>
</li>
<li>
<p>Use folders in Amazon S3 as a way to group related objects together to create an hierarchy (like a filesystem)</p>
</li>
<li>
<p>A solution to get around requesting a S3 folder is to remap the paths to a default object (ie <code class="language-">index.html
</code>)</p>
</li>
</ul>
<p>Coming up... we’ll cover how to set this up in our infrastructure using terraform!</p>
<p>And... that’s all for now, stay tuned for more!</p>
<p>I hope you enjoyed this guide and you learned something new!</p>
<p>If you did, please do share this article with a friend or co-worker 🙏❤️ (Thanks!)</p>
<p>Want to get hands on ?</p>
<p>Check out my tutorial on this here 👉 <a href="https://www.jerrychang.ca/writing/amazon-s3-cloudfront-multi-page-support-tutorial" target="_blank" rel="nofollow noopener noreferrer">Astro: Adding multi page support with Amazon S3 + Cloudfront (Tutorial)</a></p>
<h3 id="helpful-references"><a href="#helpful-references" aria-hidden="true" tabindex="-1"><span class="icon icon-link"></span></a>Helpful References</h3>
<ul>
<li><a href="https://docs.astro.build/en/guides/deploy/aws/" target="_blank" rel="nofollow noopener noreferrer">Astro: Deploy to AWS</a></li>
<li><a href="https://docs.aws.amazon.com/AmazonS3/latest/userguide/using-folders.html" target="_blank" rel="nofollow noopener noreferrer">Amazon S3 Developer Docs: Using folders</a></li>
<li><a href="https://aws.amazon.com/blogs/networking-and-content-delivery/implementing-default-directory-indexes-in-amazon-s3-backed-amazon-cloudfront-origins-using-cloudfront-functions/" target="_blank" rel="nofollow noopener noreferrer">AWS: Implementing Default Directory Indexes in Amazon S3-backed Amazon CloudFront Origins Using CloudFront Functions</a></li>
<li><a href="https://aws.amazon.com/blogs/networking-and-content-delivery/implementing-default-directory-indexes-in-amazon-s3-backed-amazon-cloudfront-origins-using-lambdaedge/" target="_blank" rel="nofollow noopener noreferrer">AWS: Implementing Default Directory Indexes in Amazon S3-backed Amazon CloudFront Origins Using Lambda@Edge</a></li>
<li><a href="https://docs.aws.amazon.com/AmazonS3/latest/userguide/IndexDocumentSupport.html" target="_blank" rel="nofollow noopener noreferrer">Amazon S3: Hosting a static website, Configuring an index document</a></li>
</ul>


Surprisingly this does not work by default

Jerry Chang

Go Back

Amazon S3 + Cloudfront: Multi-page suport

Enjoy the content ?