What Data Does Google Analytics Collect?

Cody Schneider8 min read

Google Analytics gathers a treasure trove of information about how people find and interact with your website, but it often feels like a black box. Understanding exactly what data it's collecting is the first step to turning those numbers into meaningful business decisions. This article will break down the specific types of data Google Analytics collects, explaining how it tracks user behavior and what the common reports actually mean.

GraphedGraphed

Still Building Reports Manually?

Watch how growth teams are getting answers in seconds — not days.

Watch Graphed demo video

How Google Analytics Actually Collects Data

Before looking at what data is collected, it helps to understand how it's collected. When you install Google Analytics on your site, you add a small piece of JavaScript code called the Google tag (gtag.js). When a user visits your website, this code executes in their browser.

Here’s the simple version of what happens next:

  1. The code drops a first-party cookie in the user's browser. This cookie helps Google Analytics identify them as a unique user and stitch together their actions over time.
  2. As the user navigates your site - clicking links, watching videos, filling out forms - the tracking code sends "hits" or packages of data back to Google's servers.
  3. These hits contain all the juicy details about the user, their session, and their actions, which Google then processes and organizes into your reports.

With that context, let's explore the categories of data you'll find inside your account.

User & Session Data: Who, Where, and How

These data types help you understand who your audience is and how they arrived on your website. This is the foundation for analyzing visitor behavior and marketing effectiveness.

1. Acquisition Data: How Users Found You

Without knowing which channels are driving traffic, you can't optimize your marketing efforts. Acquisition reports answer the critical question: "Where are my visitors coming from?"

  • Source: The specific place a user came from, like "google," "bing," "facebook.com," or the name of an email newsletter.
  • Medium: The category of the source. For example, "organic" for search engine traffic, "cpc" (cost-per-click) for paid ads, "referral" for links from other websites, or "email."
  • Campaign: Tracks specific marketing campaigns. If you use UTM parameters in your links (e.g., for a "summer_sale" campaign), that information will show up here.
  • Default Channel Grouping: Google’s high-level bucketing of your traffic, combining source and medium into easy-to-understand categories like Organic Search, Direct, Paid Social, and Referral.

For example, if someone clicks a paid search ad on Google, their traffic might be attributed to google / cpc. If they click a link in your email newsletter, it might be newsletter / email. This data is essential for calculating your return on investment for different marketing activities.

GraphedGraphed

Still Building Reports Manually?

Watch how growth teams are getting answers in seconds — not days.

Watch Graphed demo video

2. Demographic and Geographic Data: Who and Where Your Users Are

Understanding the characteristics of your audience helps you create better content and target your ads more effectively. Analytics collects:

  • Geographic Data: Detects a user's geographical location using their IP address (though GA4 anonymizes IPs for privacy). You can see data by Continent, Country, Region, and City. This is fantastic for identifying new market opportunities or tailoring content to specific regions.
  • Demographic Data: Includes Age and Gender. It's important to know this data is not collected from everyone. It's sampled from users who are signed into their Google accounts and have ads personalization enabled. You must enable a feature called Google Signals in your GA4 property to collect this.
  • Interest Data: Also collected via Google Signals, this data categorizes users based on their browsing and search history within Google's ecosystem. You get insights into their "Affinity Categories" (e.g., "Technology/Technophiles") and "In-Market Segments" (e.g., "Software/Business & Productivity Software").

3. Technology Data: What Devices Your Users Are On

Optimizing your user experience requires knowing the technology your audience uses. Google Analytics tracks:

  • Device Category: Whether the user is on a Desktop, Mobile, or Tablet. This is critical for prioritizing design and development efforts. If 80% of your users are on mobile, your site had better be mobile-perfect.
  • Browser: Chrome, Safari, Firefox, Edge, etc. This can help developers identify and fix browser-specific compatibility issues.
  • Operating System: Windows, macOS, iOS, Android, etc.

Event & Conversion Data: What Your Users Are Doing

This is where Google Analytics 4 really shines. Unlike its predecessor (Universal Analytics), which was centered around "sessions" and "pageviews," GA4's data model is built entirely around events. An event is any specific interaction a user has with your website.

Events provide a much more flexible and accurate way to measure what truly matters to your business, whether it's a purchase, a video view, or a PDF download.

The Four Types of Events in GA4

Events in GA4 fall into four distinct categories:

1. Automatically Collected Events

As soon as you install the Google tag, it starts collecting a list of a dozen essential events without any extra work. Some of the most common ones include:

  • session_start: Fired when a user begins a new session.
  • first_visit: Tracks when a user visits your site for the very first time.
  • page_view: Fired each time a new page loads. This replaces the old "pageview" metric from Universal Analytics.
  • user_engagement: Fires periodically when your site is in the foreground, helping GA4 calculate engagement metrics.

2. Enhanced Measurement Events

This is a powerful feature you can enable with a single click in your GA4 settings. Once activated, Google Analytics will automatically start tracking more complex interactions that used to require custom coding:

  • Scrolls: Tracks when a user scrolls 90% of the way down a page.
  • Outbound clicks: Captures when a user clicks a link that takes them to a different domain.
  • Site search: Logs the search terms users input into your site's search bar.
  • Video engagement: Automatically tracks when users start, progress through, and complete embedded YouTube videos on your site.
  • File downloads: Fired when a user clicks a link to a common file type (like a PDF, DOCX, or CSV).

3. Recommended Events

Google provides a list of recommended events with standardized names and parameters for common business scenarios across industries like E-commerce, Travel, and Gaming. Using these standardized names helps Google better understand your data and makes your reports more consistent. Examples include:

  • add_to_cart
  • begin_checkout
  • purchase
  • generate_lead
  • sign_up

4. Custom Events

If there's a specific interaction vital to your business that isn't covered by the other categories, you can create a custom event. This gives you complete freedom to track anything you want. For example, a SaaS company might create a custom event called plan_comparison_view to see how many users visit their pricing comparison page.

GraphedGraphed

Still Building Reports Manually?

Watch how growth teams are getting answers in seconds — not days.

Watch Graphed demo video

From Events to Conversions

In GA4, a Conversion is simply an event that you have marked as being important to your business. Any event you collect - automatic, enhanced, recommended, or custom - can be toggled "on" as a conversion in the settings. This tells Google Analytics to track it as a key performance indicator (KPI). Events like purchase, form_submit, or demo_request are prime candidates for conversions.

Dimensions vs. Metrics: The Building Blocks of Your Reports

Every single report in Google Analytics is made up of two types of data: Dimensions and Metrics. Understanding the difference is fundamental to building any meaningful report.

  • Dimensions are the attributes of your data. They are typically words, not numbers, and they describe the people, places, and things you are measuring. Dimensions answer questions starting with "What" or "Who." Examples: Country, Device Category, Source / Medium, Page title.
  • Metrics are the quantitative measurements. They are the actual numbers - the counts or sums that provide scale to your dimensions. Metrics answer questions starting with "How many" or "How much." Examples: Users, Sessions, Engagement rate, Conversions, Total revenue.

Think of it like a spreadsheet. The dimensions are the column headers for text labels (like "City"), and the metrics are the column headers for numbers (like "Population"). You use a metric to measure a dimension. For instance, you could analyze the metric Users across the dimension Country to see how many users visited from each location.

GraphedGraphed

Still Building Reports Manually?

Watch how growth teams are getting answers in seconds — not days.

Watch Graphed demo video

What Google Analytics Does NOT Collect (On Purpose)

For all the data Google Analytics gathers, there's one thing it is strictly forbidden from collecting: Personally Identifiable Information (PII). Google's terms of service prohibit sending any data that could be used to directly identify an individual to their servers.

This includes things like:

  • Full names
  • Email addresses
  • Mailing addresses
  • Phone numbers
  • Social security numbers or other national ID numbers

Accidentally collecting PII (for instance, by capturing an email in a page URL after a form submission) is a serious violation that can lead to Google terminating your account. Additionally, in response to privacy regulations like GDPR and CCPA, GA4 has introduced features like IP anonymization (which is on by default) to further protect user privacy.

Final Thoughts

Google Analytics collects an incredibly rich dataset covering user demographics, acquisition channels, device technology, and specific on-site interactions through its event-based model. By organizing this information into dimensions and metrics, you can start to build a clear picture of how your website drives business goals.

Understanding what data is available is the first step, but transforming it into actionable insights without spending hours wrestling with reports is the true challenge. At our company, we built Graphed because we know that most teams are drowning in GA4 data but starving for answers. We connect directly to your Google Analytics account, allowing you to ask questions in plain English - like "Which marketing channels brought in the most new users last month?" or "Show me a chart of user engagement by device" - and get instant dashboards and reports, so you can spend less time fumbling through complex interfaces and more time acting on the insights.

Related Articles