Google BigQuery

Google is the flagship subsidiary and company that led to the creation of Alphabet, which is the third largest tech giant globally, with a market cap of $1.4 trillion and $40 billion net income in 2020. Google generates most of its revenues from its context-aware advertisement business through Google search, Google Ads and YouTube. As Google is all about data, the company has unmatched expertise and success in the management and monetization of data, and also in research and development of all the methodology and technology that is required to manage data at truly enormous scale. Google is constantly expanding its global footprint. It currently has over 140,000 employees and more than 150 offices and data centers around the world.

Google Cloud Platform is Google’s cloud offering for enterprises of all sizes. It is built on the same strong technical foundation and suite of tools Google uses internally for its core services such as Google Search, Google Ads, Google Mail and YouTube. Google Cloud Platform was introduced in 2008 and is a constantly growing suite of IaaS, PaaS and SaaS solutions, which are suitable for any kind or size of use case.

As data is very much the DNA of Google, all its data, data management, analytics, machine learning and AI solutions have a very high reputation in the developer and data engineering community, as well as with many companies focused on delivering modern, cloud-native, data-driven business solutions and services.

Google Cloud Platform is said to be the third biggest player in the cloud services business, behind Amazon AWS and Microsoft Azure. It does not yet offer such a broad and deep portfolio of cloud services as Amazon AWS and Microsoft Azure, but what it does offer is recognized as best-in-class technology, open, very developer-friendly and highly scalable out of the box. Google BigQuery is no exception here.

Google BigQuery was first introduced in 2011. It is a serverless data warehouse PaaS solution offering data analysis and machine learning capabilities up to petabyte scale, but also perfectly complements small data analytics use cases. Technically, BigQuery is built on Dremel, a Google-developed distributed column-based data management system, both for batch and interactive querying of very large datasets. The fact that Google uses BigQuery extensively throughout its own solutions and service portfolio can be taken as proof that it works well, and at any scale. A good example is Google Ads, Google’s online advertising platform and main source of revenue ($146 billion in 2020). Google Ads users can easily access all their data through Google BigQuery in real time. Every click made by customers is instantly (with very low latency) available for analysis in Google BigQuery. As such capabilities are essential for digital business and have even become business-critical when used for fully automated real-time campaign management, Google BigQuery and the entire Google Data Platform ecosystem is designed to cope with very large and demanding use cases (e.g., Spotify). This also makes Google BigQuery a contender in the cloud-based enterprise data warehouse market and especially the analytics market. Direct integrations to end-user tools such as Tableau, Power BI, Qlik and many others (even Excel) are available. In addition, most modern data processing, analytics and AI tools support direct access to data from Google BigQuery, and APIs are provided for all major programming languages to build data-driven solutions.

User & Use Cases

As already indicated, Google BigQuery is what it is: a raw but powerful cloud-based data management and analytics platform. Therefore, the top three tasks listed in the chart above are very much in line with its intended use. 90 percent of survey respondents use it for data warehousing/BI, 60 percent deploy it as a query engine and 45 percent use it for data lake purposes. It should be noted that Google BigQuery itself is not really a data lake solution as its focus is on structured data in columnar storage formats for fast access and querying. However, in the overall Google Smart Analytics ecosystem, there are other components such as DataProc (Hadoop, Spark-based data processing) and Data Flow that do the job.

Interestingly, Google BigQuery usage is evenly spread across small, medium and large companies. The reason for this is quite simple in that its use cases vary from straightforward Google Ads scenarios to complex processing of petabytes of data as Spotify does.

In addition to its core capabilities, Google BigQuery has also become increasingly valuable as a back end for machine learning and AI use cases, as implied by the 23 percent of users who already use Google BigQuery for these purposes. This is also the direction in which Google is pushing its platform, making Google Smart Analytics a universal data-driven platform for any analytical use case.

Current use

n=31

Total number of users per company

n=31

Total number of administrators per company

n=28

Company size (number of employees)

n=31

Want to see the whole picture?

BARC’s Vendor Performance Summary contains an overview of The Data Management Survey results based on feedback from Google BigQuery users, accompanied by expert analyst commentary.

Contact us to purchase the Vendor Performance Summary

Google BigQuery

Peer Groups Business Software Generalists (data management), Data Warehouse Technologies
VendorGoogle
Number of responses31
ProductGoogle BigQuery
Offices150+
Employees140,000
CustomersNot disclosed
Revenues (2020)$75 billion
Websitewww.cloud.google.com/bigquery