How to Extract Data from PDF to Google Sheets

Cody Schneider8 min read

Getting useful data trapped inside a PDF file and into a spreadsheet can feel like a genuine headache. Whether it's a vendor invoice, a monthly performance printout, or a product catalog, that valuable tabular data is often stuck in a format designed for viewing, not for analysis. This guide will walk you through several practical methods to extract data from a PDF and move it into Google Sheets, from the super simple to the more advanced.

GraphedGraphed

Still Building Reports Manually?

Watch how growth teams are getting answers in seconds — not days.

Watch Graphed demo video

First, Assess the PDF: Is the Data Text or an Image?

Before you try any method, you need to understand what kind of PDF you're working with. This single step will save you a ton of frustration. There are two main types:

  • Text-Based (or "True") PDFs: These are files generated directly from a program like Excel, Word, or reporting software. The text and numbers inside are actual data that your computer can recognize.
  • Image-Based (or Scanned) PDFs: These are essentially photographs of a paper document. To a computer, the tables and text are just pixels in a single large image, not individual characters or numbers.

Here's a quick test to tell the difference: Open the PDF and try to click and drag your cursor to highlight a single line of text or a number in a table. If you can cleanly highlight the text, you have a text-based PDF and will have an easier time. If your cursor either selects the entire page as an image or you can't highlight anything at all, you're dealing with a scanned PDF and will need a tool with Optical Character Recognition (OCR) to "read" the image.

GraphedGraphed

Still Building Reports Manually?

Watch how growth teams are getting answers in seconds — not days.

Watch Graphed demo video

Method 1: The Simple (But Often Messy) Copy and Paste

For text-based PDFs with clean, simple tables, the most direct approach is often worth a try first. It doesn’t always work perfectly, but when it does, it's the fastest way to get your data transferred.

Step-by-Step Instructions:

  1. Open the PDF: Use your default PDF viewer (like Adobe Acrobat Reader or Preview for Mac) or open it directly in a browser like Chrome.
  2. Highlight the Table: Carefully click and drag your cursor over the entire table you want to extract.
  3. Copy the Data: Press Ctrl+C (on Windows) or Cmd+C (on Mac) to copy the highlighted data to your clipboard.
  4. Paste into Google Sheets: Open a new Google Sheet. Click on cell A1 and press Ctrl+V (or Cmd+V) to paste. Often, a better option is to right-click and choose Paste special > Paste values only (or use the shortcut Ctrl+Shift+V / Cmd+Shift+V). This avoids bringing over any strange formatting from the PDF.

Tips for a Better Paste:

You might notice the data doesn't line up perfectly. Sometimes all the data from a row gets squished into a single column. If this happens, don't despair! Google Sheets has a built-in tool to fix this: Text to Columns.

  • Highlight the column containing your jumbled data.
  • Go to the menu and click Data > Split text to columns.
  • A small box will appear asking for a separator. Google Sheets is usually smart enough to "Detect automatically," but you can also choose "Space" or "Comma" if you see a clear pattern in how the data is grouped. Your single column of data should now magically split into separate, organized columns.

This method is best for small, well-structured tables without merged cells or multi-line-height rows, which tend to confuse a simple copy/paste.

Method 2: Using Google Drive to Convert PDFs (Built-in OCR)

If you're dealing with a scanned, image-based PDF, or if the copy-paste method resulted in a completely garbled mess, Google's ecosystem has a powerful, free tool to help: Google Drive's built-in OCR capability.

Step-by-Step Instructions:

  1. Upload to Google Drive: Drag and drop your PDF file into your Google Drive account.
  2. Open with Google Docs: Once uploaded, find the PDF in your Drive. Right-click on the file, navigate to Open with > Google Docs.
  3. Let Google Work its Magic: This is the key part. Google Drive will take a minute or two to process the file. It performs OCR on the content and attempts to convert everything into an editable Google Doc.
  4. Find Your Table: A new Google Doc will open in a new tab. You'll typically see an image of the original PDF page at the top, and below it, you'll find the computer-readable text that Google extracted. Scroll down to find your table.
  5. Copy and Paste from Docs to Sheets: Highlight the table within the Google Doc, copy it (Ctrl+C / Cmd+C), then paste it into your desired Google Sheet. The formatting should be much cleaner now that it's been processed through a structured format like Google Docs.

Know its Limitations:

Google's OCR is surprisingly good for a free tool, but it's not flawless. Be prepared to do some manual cleanup. Common errors include mistaking "O" for "0", "1" for "l", or misinterpreting complex table borders. Always double-check your numbers before running any analysis.

GraphedGraphed

Still Building Reports Manually?

Watch how growth teams are getting answers in seconds — not days.

Watch Graphed demo video

Method 3: For Reliable Results - Using a PDF to CSV Converter

When you need higher accuracy or you're dealing with very large, complex tables, a dedicated third-party tool is often the best choice. There are many online tools (just search for "PDF to CSV converter") that are specifically designed for this type of extraction. The general workflow is the same for most of them.

This approach gives you a clean CSV (Comma-Separated Values) file, which is a universally compatible format for spreadsheets.

General Workflow:

  1. Find a Converter Tool: Choose a reputable online PDF converter. Be mindful of privacy and avoid uploading highly sensitive documents to free, unknown web services.
  2. Upload Your PDF: Follow the on-screen instructions to upload your file.
  3. Select Output Format: Choose CSV or Excel (.xlsx) as your output format.
  4. Download the Converted File: Once processed, download the resulting CSV file to your computer.
  5. Import into Google Sheets: Open a Google Sheet, and in the menu, go to File > Import. Navigate to the "Upload" tab and select the CSV file you just downloaded. Google Sheets will give you options for how to handle the import, but the default settings usually work perfectly.

This method typically yields the most accurate results for table extraction, as these tools are purpose-built to recognize row and column structures. It requires an extra step, but it often saves significant time on data cleanup.

Method 4: For the Tech-Savvy - Google Apps Script

If you find yourself needing to extract data from similarly formatted PDFs repeatedly (e.g., weekly reports sent from a vendor), you can automate the entire process using Google Apps Script, which is a JavaScript-based coding platform built into the G Suite ecosystem.

This is definitely an advanced solution, but it is the most powerful and scalable. It allows you to write a custom script that can:

  • Access a specific PDF file from Google Drive.
  • Use built-in or third-party parsing libraries to read the PDF content.
  • Apply custom logic to find and extract the specific data you need.
  • Write that data directly into a specified location in your Google Sheet.
  • You can even set it up to run automatically on a schedule!

Diving into Apps Script is beyond the scope of this tutorial, but it’s an important option to be aware of if your simple one-off extraction task begins to turn into a repetitive weekly manual process. For those comfortable with coding, it's the ultimate way to set it and forget it.

GraphedGraphed

Still Building Reports Manually?

Watch how growth teams are getting answers in seconds — not days.

Watch Graphed demo video

Don't Forget About Data Cleanup!

No matter which method you use, it's very likely your data will need some polishing before it's ready for analysis. Here are a few essential Google Sheets functions and tools to help:

  • Find and Replace (Ctrl+H / Cmd+H): Perfect for removing unwanted characters like currency symbols ($), commas in numbers, or asterisks.
  • TRIM Function: Use =TRIM(A1) to remove any pesky leading or trailing spaces from a cell. This is crucial for making sure your text and numbers are clean.
  • Value Function: Sometimes numbers are pasted as text. =VALUE(A1) will convert a text-formatted number into a proper numerical value that you can use in calculations.
  • Data Formatting: Highlight your columns and use the Format > Number menu to correctly format your data as currency, percentages, or dates. This is essential for accurate calculations and charting.

Final Thoughts

Extracting data from PDFs into Google Sheets is a common but very solvable problem. By first identifying whether your file is text or image-based, you can choose the right tool for the job, whether it's a simple copy-paste, Google Drive's OCR, or a dedicated converter tool, to turn that static document into actionable data.

Manual data extraction from PDFs and other reporting channels is often just the beginning of a long reporting process. At Graphed we think you should be able to skip the manual work entirely. Rather than wrestling with files, you can connect your data sources like Google Analytics, Shopify, Facebook Ads, or even Google Sheets itself, and then just ask for the report you need in plain English. We designed it so your team can get real-time answers and live dashboards in seconds, not hours, letting you focus on the insights, not the mechanics of data wrangling.

Related Articles

How to Enable Data Analysis in Excel

Enable Excel's hidden data analysis tools with our step-by-step guide. Uncover trends, make forecasts, and turn raw numbers into actionable insights today!