Browser Rendering

Control and interact with a headless browser instance in your Nuxt application using Puppeteer.

Getting Started

Enable browser rendering in your Nuxt project by enabling the hub.browser option:

nuxt.config.ts
export default defineNuxtConfig({
  hub: {
    browser: true
  },
})

Lastly, install the required dependencies by running the following command:

Terminal
npx ni @cloudflare/puppeteer puppeteer
ni will automatically detect the package manager you are using and install the dependencies.

Usage

In your server API routes, you can use the hubBrowser function to get a Puppeteer browser instance:

const { page, browser } = await hubBrowser()

In production, the instance will be from @cloudflare/puppeteer which is a fork of Puppeteer with version specialized for working within Cloudflare workers.

NuxtHub will automatically close the page instance when the response is sent as well as closing or disconnecting the browser instance when needed.

Use Cases

Here are some use cases for using a headless browser like Puppeteer in your Nuxt application:

  • Web scraping: Extract data from websites, especially those with dynamic content that requires JavaScript execution.
  • Generating PDFs or screenshots: Create snapshots or PDF versions of web pages.
  • Performance monitoring: Measure load times, resource usage, and other performance metrics of web applications.
  • Automating interactions or testing: Simulating user actions on websites for tasks like form filling, clicking buttons, or navigating through multi-step processes.

Limits

Browser rendering is only available on the Workers Paid plan for now.

To improve the performance in production, NuxtHub will reuse browser sessions. This means that the browser will stay open after each request (for 60 seconds), a new request will reuse the same browser session if available or open a new one.

The Cloudflare limits are:

  • 2 new browsers per minute per Cloudflare account
  • 2 concurrent browser sessions per account
  • a browser instance gets killed if no activity is detected for 60 seconds (idle timeout)

You can extend the idle timeout by giving the keepAlive option when creating the browser instance:

// keep the browser instance alive for 120 seconds
const { page, browser } = await hubBrowser({ keepAlive: 120 })

The maximum idle timeout is 600 seconds (10 minutes).

Once NuxtHub supports Durable Objects, you will be able to create a single browser instance that will stay open for a long time, and you will be able to reuse it across requests.

Screenshot Capture

Taking a screenshot of a website is a common use case for a headless browser. Let's create an API route to capture a screenshot of a website:

server/api/screenshot.ts
import { z } from 'zod'

export default eventHandler(async (event) => {
  // Get the URL and theme from the query parameters
  const { url, theme } = await getValidatedQuery(event, z.object({
    url: z.string().url(),
    theme: z.enum(['light', 'dark']).optional().default('light')
  }).parse)

  // Get a browser session and open a new page
  const { page } = await hubBrowser()

  // Set the viewport to full HD & set the color-scheme
  await page.setViewport({ width: 1920, height: 1080 })
  await page.emulateMediaFeatures([{
    name: 'prefers-color-scheme',
    value: theme
  }])

  // Go to the URL and wait for the page to load
  await page.goto(url, { waitUntil: 'domcontentloaded' })

  // Return the screenshot as response
  setHeader(event, 'content-type', 'image/jpeg')
  return page.screenshot()
})

On the application side, we can create a simple form to call our API endpoint:

pages/capture.vue
<script setup>
const url = ref('https://hub.nuxt.com')
const image = ref('')
const theme = ref('light')
const loading = ref(false)

async function capture {
  if (loading.value) return
  loading.value = true
  const blob = await $fetch('/api/browser/capture', {
    query: {
      url: url.value,
      theme: theme.value
    }
  })
  image.value = URL.createObjectURL(blob)
  loading.value = false
}
</script>

<template>
  <form @submit.prevent="capture">
    <input v-model="url" type="url" />
    <select v-model="theme">
      <option value="light">Light</option>
      <option value="dark">Dark</option>
    </select>
    <button type="submit" :disabled="loading">
      {{ loading ? 'Capturing...' : 'Capture' }}
    </button>
    <img v-if="image && !loading" :src="image" style="aspect-ratio: 16/9;" />
  </form>
</template>

That's it! You can now capture screenshots of websites using Puppeteer in your Nuxt application.

Storing the screenshots

You can store the screenshots in the Blob storage:

const screenshot = await page.screenshot()

// Upload the screenshot to the Blob storage
const filename = `screenshots/${url.value.replace(/[^a-zA-Z0-9]/g, '-')}.jpg`
const blob = await hubBlob().put(filename, screenshot)
Learn more about the Blob storage.

Metadata Extraction

Another common use case is to extract metadata from a website.

server/api/metadata.ts
import { z } from 'zod'

export default eventHandler(async (event) => {
  // Get the URL from the query parameters
  const { url } = await getValidatedQuery(event, z.object({
    url: z.string().url()
  }).parse)

  // Get a browser instance and navigate to the url
  const { page } = await hubBrowser()
  await page.goto(url, { waitUntil: 'networkidle0' })

  // Extract metadata from the page
  const metadata = await page.evaluate(() => {
    const getMetaContent = (name) => {
      const element = document.querySelector(`meta[name="${name}"], meta[property="${name}"]`)
      return element ? element.getAttribute('content') : null
    }

    return {
      title: document.title,
      description: getMetaContent('description') || getMetaContent('og:description'),
      favicon: document.querySelector('link[rel="shortcut icon"]')?.href
      || document.querySelector('link[rel="icon"]')?.href,
      ogImage: getMetaContent('og:image'),
      origin: document.location.origin
    }
  })

  return metadata
})

Visiting /api/metadata?url=https://cloudflare.com will return the metadata of the website:

{
  "title": "Connect, Protect and Build Everywhere | Cloudflare",
  "description": "Make employees, applications and networks faster and more secure everywhere, while reducing complexity and cost.",
  "favicon": "https://www.cloudflare.com/favicon.ico",
  "ogImage": "https://cf-assets.www.cloudflare.com/slt3lc6tev37/2FNnxFZOBEha1W2MhF44EN/e9438de558c983ccce8129ddc20e1b8b/CF_MetaImage_1200x628.png",
  "origin": "https://www.cloudflare.com"
}

To store the metadata of a website, you can use the Key Value Storage.

Or directly leverage Caching on this API route:

server/api/metadata.ts
export default cachedEventHandler(async (event) => {
  // ...
}, {
  maxAge: 60 * 60 * 24 * 7, // 1 week
  swr: true,
  // Use the URL as key to invalidate the cache when the URL changes
  // We use btoa to transform the URL to a base64 string
  getKey: (event) => btoa(getQuery(event).url),
})