How to Put PDF Data into Excel
Chances are you’ve faced this exact problem: you have a perfect table of data, but it's trapped inside a PDF file. You need to analyze, sort, or chart it in Microsoft Excel, but when you try to copy and paste, you just get a jumbled mess. This article will show you several reliable methods to move data from a PDF into Excel, from simple workarounds to powerful built-in tools.
Method 1: The Simple Copy and Paste (It’s Worth a Shot)
Before diving into more complex solutions, always try the simplest one first. It doesn’t always work, but when it does, it saves you a lot of time. This method works best with digitally-created PDFs that contain simple, clean tables.
How To Do It:
- Open your PDF file in a reader like Adobe Acrobat, or even in your web browser.
- Use your cursor to carefully highlight all the data within the table you want to copy.
- Press CTRL + C (or Command + C on a Mac) to copy the highlighted data.
- Open a blank worksheet in Excel.
- Select cell A1 and press CTRL + V (or Command + V on a Mac) to paste the data.
Why It Often Fails:
PDFs are not designed for data extraction. They prioritize preserving the visual layout, not the underlying data structure. When you copy-paste, you're likely to run into issues like:
- All Data in One Column: Your entire table might get squished into Column A, with each row of text in a single cell.
- Formatting Nightmares: You might get odd spacing, random line breaks, or other formatting artifacts that make the data unusable without extensive cleanup.
- Scanned Documents: If your PDF is a scan of a paper document, the "data" is actually just part of an image. Copying and pasting will only grab messy, unreadable characters, if anything at all.
If the paste result is clean, you’re done! If it’s a jumble, it’s time to move on to a more robust method.
Method 2: Using Microsoft Word as a Bridge to Excel
This surprising trick works much better than a direct copy-paste because Word is incredibly good at interpreting PDF structures and converting them into an editable format. You can use Word as an intermediary to properly format the table before moving it to Excel.
How To Do It:
- Find your PDF file in your computer’s folder.
- Right-click the PDF file.
- Select Open with and choose Microsoft Word. (If Word isn’t in the list, you might need to find it by clicking “Choose another app.”)
- Word will show a pop-up message: “Word will now convert your PDF to an editable Word document…” Click OK.
- Word will convert the file. This might take a minute for large files. Once it’s open, you should see your data in a proper, editable table within the Word document.
- Highlight the entire table in Word.
- Copy it (CTRL + C).
- Go back to your blank Excel sheet and paste it (CTRL + V).
In most cases, the data will now appear in Excel with the correct rows and columns intact, saving you a huge amount of manual cleanup.
When This Method Works Best:
The Word method is excellent for most computer-generated PDFs, including reports, invoices, and price lists. As with the direct copy-paste method, it will not work on PDFs that are scans or images of text. For those, you’ll need a different approach entirely.
Method 3: Excel’s Built-in “Get Data From PDF” Feature
For users with modern versions of Excel (Microsoft 365, Excel 2021, and some later versions of Excel 2016), you have a powerful tool built right into the application: Power Query. This is by far the most reliable and flexible method for importing data from native PDFs.
Step-by-Step Instructions:
1. Go to the Data Tab
Open a blank Excel workbook and navigate to the Data tab in the main ribbon.
2. Select Get Data > From File > From PDF
On the far left of the Data ribbon, click the Get Data button. A dropdown menu will appear. From there, select From File, and then choose From PDF.
3. Find and Import Your PDF
A file browser window will pop up. Locate the PDF on your computer that contains the data you want to import and click Import.
4. Use the Navigator Window
After a moment, the Navigator window will appear. This is Excel’s way of showing you all the digestible data it found in the PDF. On the left side, you'll see a list of available tables and pages it has identified. Click on each item in the list to see a preview of the data on the right. Find the table that matches what you need to import.
5. Load or Transform the Data
At the bottom of the Navigator window, you’ll see a few options. The two most important are Load and Transform Data.
- Load: Use this option if the data in the preview looks perfect and needs no changes. Clicking Load will instantly dump the data into a new worksheet in your Excel file, formatted as an official Excel Table.
- Transform Data: Use this option if the data is messy and needs to be cleaned up before being added to your sheet. This is where the true power lies. Clicking it opens the Power Query Editor.
A Quick Look at the Power Query Editor
The Power Query Editor is a data transformation tool where you can clean, shape, and reorganize your data. It’s perfect for tackling common issues like:
- Unwanted Columns: You can simply right-click a column header and select "Remove."
- Merged Data: If you have "City, State, Zip" all in one column, you can use the "Split Column" tool to separate them.
- Incorrect Data Types: You can change a column from text to a number or a date.
- Blank Rows or Errors: You can easily filter these out.
Each change you make is recorded as a "step" on the right-hand panel. Once finished, click the Close & Load button at the top-left, and the data is perfectly moved to the Excel sheet.
Method 4: Dealing with Scanned and Image-Based PDFs
What if your PDF is essentially a picture of a document? If you try any of the methods above, they will fail because there is no text data to extract - only pixels. To solve this, you need to use a technology called Optical Character Recognition (OCR).
What is OCR?
OCR software scans an image, identifies characters that look like text, and converts them into actual, machine-readable text that you can edit and copy. Many tools can perform OCR.
Your Options for OCR Conversion:
- Online PDF-to-Excel Converters: There are plenty of free websites where you can upload a PDF, have it run through an OCR process, and then download an Excel file. Search "PDF to Excel Converter OCR" to find them. Be cautious: do not upload documents containing sensitive or private information to free online tools.
- Adobe Acrobat Pro: The paid version of Adobe Acrobat has a powerful, built-in OCR feature. You can open your scanned PDF and use the “Export PDF” tool to save it directly as an Excel spreadsheet. It’s one of the most accurate options available.
- Microsoft OneNote: Believe it or not, OneNote has a decent OCR tool. You can insert your PDF as a printout, right-click the image, and select "Copy Text from this Page of the Printout." Then, you can paste that text into Excel for further cleanup. The formatting won't be perfect, but it's a great option if you already have Microsoft Office.
After using an OCR tool, the data will still likely require some manual cleanup in Excel, but it will save you from having to retype everything by hand.
Final Thoughts
Getting data from a PDF into Excel has gotten much easier over the years. Gone are the days of frustrating, messy copy-pasting - as long as you use the right tool for the job. For most situations, leveraging Excel's own amazing Power Query 'Get Data from PDF' feature is your best option, giving you the control to clean and shape your data before it even hits your spreadsheet.
Extracting data is often just the first step in a long, manual process of analysis and reporting. At Graphed, we built a way to skip the tedium of copying, pasting, and building charts by hand. We directly connect to your core business platforms - like Google Analytics, Shopify, QuickBooks, and CRMs - to automate your reporting. Instead of fighting with PDFs or wrestling with pivot tables, you can just ask questions in plain English and instantly get the dashboards and answers you need. If you're ready to move beyond manual data work, give Graphed a try.
Related Articles
What SEO Tools Work with Google Analytics?
Discover which SEO tools integrate seamlessly with Google Analytics to provide a comprehensive view of your site's performance. Optimize your SEO strategy now!
Looker Studio vs Metabase: Which BI Tool Actually Fits Your Team?
Looker Studio and Metabase both help you turn raw data into dashboards, but they take completely different approaches. This guide breaks down where each tool fits, what they are good at, and which one matches your actual workflow.
How to Create a Photo Album in Meta Business Suite
How to create a photo album in Meta Business Suite — step-by-step guide to organizing Facebook and Instagram photos into albums for your business page.