By Taylor Mcgrath

Github to Snowflake

Integrates with:

The Github kit uses the Github API to connect and load the data into your data warehouse.

This Kit includes…

Please note that the incremental load is only available for Repository Issues and Repository Commits, as only these 2 endpoints support a start date and end date in the query. For these 2 entities, the S2T river loads the data using Upsert-Merge method. For other entities, tables are overwritten.

Configuring this Kit for use

Before using this kit, you need to get your Github access token: https://developer.Github.com/how-to-authenticate-with-personal-access-tokens

Environment Variables

In this Kit, there are 4 environment variables that are used to make for dynamic use of target configuration. 

  • {db_github} is used in all Target configurations and queries as the target database name.

  • {schema_github} is used in all Target configurations and queries as the target schema name.

  • {github_owner} is the github organization or owner of the repository, and this value is used to configure the target table name configuration

  • {github_repo} is used to configure the target table name configuration

  • {Github_Alert_Group} is used as the alert email address(es) for when the orchestration and source to target rivers fail (optional).

Go to the Variables menu on the left side navigation bar and create variables for db_github, schema_github, github_owner and github_repo. Fill in the values for the Snowflake Database/Schema you'd like the data to land.

Connections

Kits are imported with ‘blank’ source and target connections. In order to use the template, you have two options:

  • Swap out blank connections with existing connections

  • Add credentials to the blank connections that come with the imported template.

This kit requires a connection with the key "token" and its value

Have questions about this Kit or want more endpoints?

Send an email to helpme@rivery.io with the title “Help – Github to Snowflake Kit”

Minimize the firefighting.
Maximize ROI on pipelines.

icon icon