Securely connecting sophisticated AI agents to vast enterprise data warehouses has traditionally been a significant hurdle, often demanding complex custom integrations and prolonged development cycles that slow innovation. A newly released, fully managed remote Model Context Protocol (MCP) server for Google services addresses this challenge directly by giving AI agents a secure, direct pathway to analyze data within BigQuery. The managed architecture removes the operational overhead of server maintenance and configuration, letting development teams channel their efforts toward creating intelligent, data-driven agents. Beyond accelerating agent development, the approach standardizes communication through a defined protocol, so LLM-powered applications can seamlessly access analytics data via a structured set of tools. For organizations requiring greater flexibility and control, MCP server support for BigQuery is also available through the open-source MCP Toolbox for Databases, which offers a more customizable deployment option. As of January 2026, the fully managed remote BigQuery MCP server is available in preview, a major step toward simplifying generative AI application development.
1. Initial Setup and Preparation
The foundation of a successful AI agent deployment begins with meticulous project setup and permission configuration within the cloud environment. Before any development can commence, it is essential to create a new Google Cloud Project, or select an existing one, with active billing enabled. This project serves as the central container for all resources, from datasets to authentication credentials. Following project setup, the next critical step involves assigning precise user roles so that the principle of least privilege is maintained. The user account must be granted roles such as roles/bigquery.user for executing queries, roles/bigquery.dataViewer for accessing data, roles/mcp.toolUser for utilizing MCP tools, and roles/serviceusage.serviceUsageAdmin for enabling the necessary APIs. These roles collectively ensure that the agent operates within a secure and well-defined scope. Additional OAuth-related roles, including roles/iam.oauthClientViewer and roles/oauthconfig.editor, are also required to manage the authentication flow, which is crucial for securely connecting the agent to the data source without exposing sensitive credentials directly in the application code.
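As a concrete illustration, the roles listed above can be granted from the terminal. This is a minimal sketch assuming project-level grants, with PROJECT_ID and USER_EMAIL as placeholders for real values:

```bash
# Grant the BigQuery, MCP, and OAuth-related roles to the developer account.
for ROLE in roles/bigquery.user \
            roles/bigquery.dataViewer \
            roles/mcp.toolUser \
            roles/serviceusage.serviceUsageAdmin \
            roles/iam.oauthClientViewer \
            roles/oauthconfig.editor; do
  gcloud projects add-iam-policy-binding PROJECT_ID \
      --member="user:USER_EMAIL" --role="$ROLE"
done
```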
With the project and permissions established, the focus shifts to preparing the local development environment and activating the necessary service APIs. This walkthrough assumes a macOS or Linux terminal with the gcloud Command Line Interface (CLI) installed and configured. To authorize the Agent Development Kit (ADK) to access BigQuery resources on behalf of the user, an authentication command must be executed in the shell; it initiates a flow that securely links the local development environment to the authenticated Google Cloud account. Once authenticated, the next step is to enable the BigQuery and MCP APIs for the designated project by running a gcloud services enable command, which activates the endpoints required for data querying and agent-tool communication. This activation is a one-time setup step per project that formally grants the project the ability to process BigQuery requests and communicate through the Model Context Protocol, paving the way for the agent’s core functionality to be implemented and tested against live services.
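In practice, these two steps look roughly like the following; the exact service identifier for the MCP API is an assumption here and should be confirmed against the current documentation:

```bash
# Link the local environment to the authenticated Google Cloud account;
# the ADK picks up these application-default credentials.
gcloud auth application-default login

# Activate the required APIs. The MCP service name below is a
# placeholder -- substitute the identifier given in the docs.
gcloud services enable bigquery.googleapis.com
gcloud services enable MCP_SERVICE_NAME.googleapis.com
```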
2. Data Loading and Authentication Configuration
To provide the AI agent with data to analyze, a sample dataset must first be loaded into BigQuery. For demonstration purposes, the cymbal_pets dataset serves as an excellent example, containing representative tables and schemas for the agent to interact with. Loading this dataset is a straightforward process executed via a command that pulls the dataset from a public storage bucket directly into the designated project. This step is vital because it populates the environment with tangible data, allowing for realistic testing of the agent’s ability to discover schemas, query tables, and synthesize answers. Following data loading, the next critical task is to configure the authentication mechanism. Google OAuth is the chosen method for establishing a secure connection to the BigQuery MCP server. This involves creating an OAuth Client ID within the Google Cloud console’s APIs & Services section. For local development, selecting the “Desktop app” application type is appropriate, which generates a unique Client ID and Client Secret that must be copied and stored securely for later use in the application’s environment configuration.
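A hedged sketch of such a load, with a hypothetical bucket path and table name (the real tutorial supplies the exact source URI), might look like this:

```bash
# Create the dataset, then load one illustrative table from a public
# bucket. The bucket path and table name below are placeholders.
bq mk --dataset cymbal_pets
bq load --source_format=CSV --autodetect \
    cymbal_pets.products \
    gs://EXAMPLE_BUCKET/cymbal_pets/products.csv
```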
The configuration of the OAuth client requires careful attention to the specific hosting environment to ensure authorized and secure communication. While a “Desktop app” type is suitable for local machine development, a different approach is necessary when using a hosted environment like Google Cloud Shell. In such cases, a “Web application” OAuth Client ID must be created instead. This type requires specifying authorized origins and redirect URIs to prevent unauthorized access. For a Cloud Shell session, these URIs are dynamically generated and must be correctly configured in the OAuth client settings to match the temporary domain of the shell instance. This distinction is crucial for maintaining security and ensuring the OAuth flow completes successfully. Once the client is created, a final API key must be generated for Gemini. This key, created from the API Keys page in the console, grants the ADK access to the Gemini model, which powers the agent’s natural language understanding and generation capabilities. This key acts as the bridge between the structured data tools of BigQuery and the advanced reasoning abilities of the large language model.
3. Agent Development and Deployment
With all prerequisites in place, the process of building and running the AI agent application can begin. The first step involves installing the Agent Development Kit (ADK) and initializing a new agent project by following the official Python Quickstart documentation. Once the ADK is installed, a new agent specifically for the BigQuery integration, named cymbal_pets_analyst for this example, is created using the ADK’s project initialization command. This command scaffolds a new directory with the necessary files and structure for an agent application. The next crucial step is to configure the environment variables by editing the .env file within the newly created project folder. This file must be populated with the specific values obtained earlier, including the Project ID, the OAuth Client ID and Secret, and the Gemini API Key. Properly setting these variables is essential for the agent to authenticate correctly and connect to the right cloud resources and services. This separation of configuration from code is a security best practice that prevents sensitive credentials from being hardcoded into the application’s source.
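For example, scaffolding the agent and populating its .env might look like the following; the environment-variable names are illustrative and should be checked against the quickstart:

```bash
# Install the ADK, then scaffold the agent project.
pip install google-adk
adk create cymbal_pets_analyst
```

```
# cymbal_pets_analyst/.env -- illustrative variable names (placeholders)
GOOGLE_CLOUD_PROJECT=your-project-id
GOOGLE_API_KEY=your-gemini-api-key
OAUTH_CLIENT_ID=your-oauth-client-id
OAUTH_CLIENT_SECRET=your-oauth-client-secret
```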
After configuring the environment, the core logic of the agent is implemented by modifying the agent.py file. The default content of this file is replaced with Python code designed to integrate with the remote BigQuery MCP server. This code defines the agent’s tools, handles the OAuth authentication flow, and manages the communication between the user’s prompts, the language model, and the BigQuery data backend. Once the code is in place, the ADK application is ready to be launched by running a command from the parent directory containing the agent’s project folder. The ADK starts a local web server, typically accessible at http://127.0.0.1:8000/. Navigating to this address in a web browser reveals a user interface where the newly created agent can be selected. Upon the first interaction that requires data access, the agent initiates the OAuth flow, prompting the user to grant the necessary permissions to access their BigQuery data. After authorization, the agent becomes a personal data analyst, capable of answering complex questions about the cymbal_pets dataset, from listing tables and schemas to executing sophisticated analytical queries.
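A minimal sketch of what agent.py might contain is shown below. Treat it as a shape rather than a drop-in implementation: the endpoint URL is a placeholder, the SseServerParams connection class reflects one ADK release and may be named differently in others, and the OAuth client wiring from the .env file is elided.

```python
# agent.py -- illustrative ADK agent wired to a remote MCP server.
# The endpoint URL and the connection-parameter class are assumptions;
# take the real values from the BigQuery MCP server documentation.
from google.adk.agents import Agent
from google.adk.tools.mcp_tool.mcp_toolset import MCPToolset, SseServerParams

# Hypothetical placeholder for the managed BigQuery MCP endpoint.
BIGQUERY_MCP_URL = "https://EXAMPLE-bigquery-mcp-endpoint/mcp"

root_agent = Agent(
    model="gemini-2.0-flash",
    name="cymbal_pets_analyst",
    instruction=(
        "You are a data analyst for the cymbal_pets dataset. Use the "
        "BigQuery tools to list tables, inspect schemas, and run SQL "
        "queries, then summarize the results for the user."
    ),
    tools=[
        # OAuth credentials from the .env file are passed here in the
        # real integration; that wiring is omitted in this sketch.
        MCPToolset(connection_params=SseServerParams(url=BIGQUERY_MCP_URL)),
    ],
)
```

With the file in place, the development UI is launched from the directory that contains the agent folder:

```bash
adk web
```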
4. Alternate Integration with Gemini CLI
Beyond the web-based interface provided by the ADK, developers and data analysts can also interact with the BigQuery MCP server directly through the Gemini Command Line Interface (CLI). This integration offers a powerful, terminal-based workflow for those who prefer command-line interactions for scripting, automation, or quick data exploration. To enable this, the Gemini CLI’s configuration file, typically located at ~/.gemini/settings.json, must be updated. A specific JSON block defining the BigQuery MCP server connection details is added under the mcpServers field. If a configuration already exists, this new server entry is merged into the existing structure. This one-time setup registers the BigQuery MCP server as a recognized tool provider for the Gemini CLI, making its capabilities available for subsequent sessions. This method streamlines the process of connecting to enterprise data, abstracting the underlying API calls into a simplified and interactive command-line experience that can be easily integrated into existing development workflows and scripts.
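The added entry might look roughly like the following, where the endpoint is a placeholder and the field names should be verified against the current Gemini CLI documentation:

```json
{
  "mcpServers": {
    "bigquery": {
      "httpUrl": "https://EXAMPLE-bigquery-mcp-endpoint/mcp"
    }
  }
}
```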
Once the settings.json file has been configured, the final steps are to authenticate and launch the CLI. Authentication is handled seamlessly by running the standard gcloud auth application-default login command in the terminal. This command refreshes or establishes the necessary credentials, ensuring that the Gemini CLI has the required permissions to interact with Google Cloud services on the user’s behalf. With authentication complete, the Gemini CLI can be started. It will now be equipped with the tools and context provided by the BigQuery MCP server, allowing the user to pose natural language questions directly in the terminal. The CLI can then query datasets, retrieve table information, and perform data analysis by leveraging the connected BigQuery instance. This provides a powerful alternative for developers who need to quickly integrate data analytics into their scripts or prefer a text-based interface, demonstrating the versatility of the MCP standard in supporting diverse development environments and user preferences while maintaining a consistent and secure connection to enterprise data.
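Concretely, assuming the CLI is installed under its default gemini command, the final two steps are:

```bash
# Refresh application-default credentials, then start the CLI.
gcloud auth application-default login
gemini
```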
Advancing AI with Standardized Data Access
The integration of the BigQuery MCP server with leading agent development frameworks represents a significant simplification in the creation of data-aware generative AI applications. By establishing a single, standard protocol, developers can build intelligent data agents compatible with a wide range of IDEs and development kits. This standardized approach streamlines the development workflow and fosters a more interoperable ecosystem for AI tools. The ability to connect directly to BigQuery using either the fully managed service or a more flexible open-source toolkit provides options that cater to different organizational needs for control and convenience. As these tools are adopted, it becomes clear that adherence to established AI security and safety guidelines is paramount, especially when deploying agents into production environments or allowing them to interact with sensitive, real-world data. The successful implementation of these agents ultimately depends on a combination of powerful technology and responsible development practices.
