Skip to main content

Google PageSpeed Insights

This page guides you through the process of setting up the Google PageSpeed Insights source connector.

Sync overview

Prerequisites

Set up the Google PageSpeed Insights source connector

  1. Log into your Airbyte Cloud or Airbyte Open Source account.
  2. Click Sources and then click + New source.
  3. On the Set up the source page, select Google PageSpeed Insights from the Source type dropdown.
  4. Enter a name for your source.
  5. For API Key, enter your Google PageSpeed API Key.
  6. For URLs to analyse, enter one or many URLs you want to create PageSpeed Insights for. Example: https://www.google.com.
  7. For Analyses Strategies, enter either "desktop", "mobile" or both to define which Analyses strategy to use.
  8. For Lighthouse Categories, select one or many of the provided options. Categories are also called "audits" in some of the Google Lighthouse documentation.
  9. Click Set up source.

IMPORTANT: As of 2022-12-13, the PageSpeed Insights API - as well as this Airbyte Connector - allow to specify a URL with prefix "origin:" - like origin:https://www.google.com. This results in condensed, aggregated reports about the specified origin - see this FAQ. However: This option is not specified in any official documentation anymore, therefore it might be deprecated anytime soon!

Supported sync modes

The Google PageSpeed Insights source connector supports the following sync modes:

  • Full Refresh

Supported Streams

The Google PageSpeed Insights source connector supports the following stream:

  • pagespeed: Full pagespeed report of the selected URLs, lighthouse categories and analyses strategies.

Features

FeatureSupported?(Yes/No)Notes
Full Refresh SyncYes
Incremental SyncNo

Performance considerations

When using the connector without an API key, Google utilizes an undocumented, but strict rate limit - which also depends on how many global requests are currently sent to the PageSpeed API. The connector will retry, using an exponential backoff interval.

If the connector is used with an API key, Google allows for 25.000 queries per day and 240 queries per minute. Therefore, under normal usage, the connector should not trigger any rate limits. Create an issue if you see any rate limit issues that are not automatically retried successfully.

Reference

Config fields reference

Field
Type
Property name
array<string>
categories
array<string>
strategies
array<string>
urls
string
api_key

Changelog

Expand to review
VersionDatePull RequestSubject
0.1.72024-06-0639272[autopull] Upgrade base image to v1.2.2
0.1.62024-05-2138147Make compatable with builder
0.1.52024-04-1937171Updating to 0.80.0 CDK
0.1.42024-04-1837171Manage dependencies with Poetry.
0.1.32024-04-1537171Base image migration: remove Dockerfile and use the python-connector-base image
0.1.22024-04-1237171schema descriptions
0.1.12023-05-25#22287🐛 Fix URL pattern regex
0.1.02022-11-26#19813🎉 New Source: Google PageSpeed Insights [low-code CDK]