I have a use case in which we have a few tables in BigQuery. Now I want to implement an index on one of the columns in the BigQuery table. But I am not finding enough documentation to do that. I found a few blogs and posts mentioning BigQuery doesn't support indexes. Please help me find a blog or post which can help me in implementing index on BigQuery. Thanks in advance.
相关问题
- How to export crash-free users from firebase?
- What are the advantages ManageIQ has over OpenStac
- How can my C# program behave differently depending
- Updating a value in an ARRAY in a BigQuery table
- Import json data with null values
相关文章
- BigQuery - Concatenate multiple rows into a single
- Where do you get Google Bigquery usage info (mainl
- How do I configure Google BigQuery command line to
- How can I make integration tests with google cloud
- How to get gcloud auth activate-service-account pe
-
Error in Google BigQuery
- CLI “bq load” - how to use non-printable character
- How to capitalize a string?
You can create indexes in bigquery table using Clustering order parameter available in advanced options while creating table.This clustering option is only available for Partitioned tables. Follow the below link for additional details: link to google documentation
The close you can get for "index" in BigQuery is Partitioned Tables. Currently it only supports partition by date though.
Besides partitioning one could as well use multiple tables, eg each with a day's amount of data. BigQuery can query a maximum of 1000 tables at once, so that should cover most cases and let's you keep costs constant.
2019 update: Check out how clusters improve your querying times and data scanned:
As stated in the comments this question is associated with "how would BigQuery deal with my data if it was a 100 times larger". When dealing with traditional databases an index is the right solution, but BigQuery is different: As data size grows, BigQuery adds more servers to the mix - keeping performance almost constant.
In other words, as your data grows you should expect costs to increase linearly, with performance staying almost constant. No indexes needed. And this is one of the big reasons why people choose BigQuery for their analytical workloads.
(It all depends on your specific use case of course, please test these assertions and report back!)