Google Sheets

Version 0.0.2

📘

Please read the Google Sheet API Docs for more information.

Set up


Source

To get set up with the Google Sheets connector, you will only need your Google login.

Extract

You will need your spreadsheet ID and to supply a range of data that you want to ingest. You can get the spreadsheet ID from the URL of the spreadsheet you are looking to ingest. See the example below:

https://docs.google.com/spreadsheets/d/<spreadsheet_id>/edit#gid=0

URL - https://docs.google.com/spreadsheets/d/1VNdo9JyNFj5Bdg4v5AlutIEzzW3_0OYShRjhktFOvdQ/edit#gid=0

Spreadsheet ID - 1VNdo9JyNFj5Bdg4v5AlutIEzzW3_0OYShRjhktFOvdQ

By default we will ingest the data from the entire sheet, however, you can limit your range using the range text field.

Ensure that the Google Sheet is structured with header titles in the first row, followed by the corresponding data in the rows below. Deviating from this format may result in errors during the data extraction process.

'Sheet' Report:

How to select data from a different tab/sheet within the spreadsheet - To do this you will need to pass the tab/sheet name before the range and followed by an !.

Here are two examples:

  • Data from a tab/sheet called 'Sheet1' and range 'A:D' - Sheet1!A:D
  • Data from a tab/sheet called 'My New Sheet' and range 'A4:D16' - My New Sheet!A4:D16

'All Sheet Tabs' Report:

This report automatically combines data from all tabs within a Google Sheet into a single table. The tab name is included as a column labeled 'SHEET_TAB_TITLE'. To avoid errors during data extraction, ensure that the header titles are the same across all tabs.

Features


FeatureNotes
API reliabilityReliable & Fast

Report details


⬇️ Reports
Sheet
All Sheet Tabs

Limitations


  • As mentioned before, ensure that the Google Sheet is structured with header titles in the first row, followed by the corresponding data in the rows below.
  • If any of your column headers surpasses the maximum character threshold for column headers in your warehouse the column headers will be truncated. This threshold is 255 and 157 for Snowflake and Redshift respectively.
  • Duplicate column names should also be avoided as these are not effectively handled by SQL queries in the warehouse. If there are duplicate columns in the source table, it is recommended to update them e.g. 'columnName' and 'columnName' (add an '' at the end).
  • The order of columns from the Google Sheet is not persisted in the warehouse.