Fully managed, Serverless (no-ops), powers fast SQL queries. Essential for analytics, cloud-based data warehouse.
BigQuery is a fully managed, serverless (no-ops) data warehouse that powers fast SQL queries. It is essential for analytics and serving as a cloud-based data warehouse.
Load, copy, export, re-cluster, delete/metadata operations are Free.
Views let you share query results without giving access to underlying source data.
BigQuery doesn't enforce primary and foreign key constraints in the same way as RDBMS, but they are highly useful for query optimization.
Divides large tables into smaller, more manageable segments.
_PARTITIONTIME.table_name$partition_id).Sort data within partitions based on one or more columns.
An in-memory analysis service that accelerates BigQuery queries for business intelligence (BI) and data visualization tools like Looker Studio and Looker.
SELECT * EXCEPT.APPROX_COUNT_DISTINCT for acceptable accuracy results faster.Built-in data exchange platform for secure & scalable data sharing across organizations. Supports a Data Mesh architecture.
Analyzing sensitive data with multiple parties while ensuring privacy and security without directly sharing the underlying data itself.
Query structured & unstructured data across analytics & AI engines with built-in governance.
Granting an IAM role to an entity at different levels: Organization, Folder, Project, Dataset, Table, View.
Tags: Key-value pairs to conditionally apply IAM to tables & datasets.
INFORMATION_SCHEMA.JOBS_*, Operational health dashboard, or Audit logs (Jobs API).Security feature that defines a security perimeter around resources to mitigate the risk of data exfiltration.
CDC tracks changes from transactional databases to BigQuery in near-real time.
EXPLAIN statements to identify bottlenecks in Query Plans.AS OF SYSTEM TIME, going back 7 days.ORDER BY clause.Q1. What is the main difference between Partitioning and Clustering in BigQuery?
Q2. How long are cached query results maintained in BigQuery?
Q3. How do you flatten nested or repeated fields in a BigQuery table for querying?
Q4. What is the best approach to handle data skew when querying in BigQuery?
Q5. Can you query data residing on AWS S3 or Azure directly using BigQuery?