How to mine Google BigQuery

Have more questions? Submit a request

Once you have crawled assets from Google BigQuery you can mine its query history to construct lineage.

To mine lineage from Google BigQuery, complete the following steps.

Select the miner

To select the Google BigQuery miner:

  1. In the top right of any screen, navigate to New and then click New Workflow.
  2. From the filters along the top, click Miner.
  3. From the list of packages, select BigQuery Miner and click on Setup Workflow.

Configure the miner

To configure the Google BigQuery miner:

  1. For Connection, select the connection to mine. (To select a connection, the crawler must have already run.)
  2. For Miner extraction method select Query History.
  3. For Start time choose the earliest date from which to mine query history.
    πŸ’ͺ Did you know? The miner restricts you to only querying the past two weeks of query history. If you need to query more history, for example in an initial load, consider using the S3 miner first. After the initial load you can modify the miner's configuration to use query history extraction.
  4. (Optional) By default, the miner runs in the US region. To run in another region, for Region, select Custom and then enter the region under Custom BigQuery Region.

Run the miner

To run the Google BigQuery miner, after completing the steps above:

  • To run the miner once, immediately, at the bottom of the screen click the Run button.
  • To schedule the miner to run hourly, daily, weekly or monthly, at the bottom of the screen click the Schedule & Run button.

Once the miner has completed running, you will see lineage for Google BigQuery assets that were created in Google BigQuery between the start time and when the miner ran! πŸŽ‰

Related articles

Was this article helpful?
1 out of 1 found this helpful