Excel Tips

Improve your PDF data handling with Power Query

Ever struggled to extract crucial financial data from PDF files into Excel? 

As accountants and finance professionals, this can be a regular occurrence.  

To help you, this episode explores Microsoft Excel’s Power Query feature. You’ll uncover how to seamlessly import data from system-generated PDFs like invoices and statements. 

From tackling tricky formatting issues to cleaning up messy rows, learn through real-world examples using an Australian Tax Office (ATO) PDF as a case study.  

Additionally, you’ll gain smart strategies for handling regularly updated PDFs to automate your workflows.  

For fast access, use these timestamps:

  • 0:17 - Introduction to Importing Data from PDFs using Excel's Power Query feature.
  • 0:37 - Types of PDFs and Their Reliability
  • 1:05 - Use Cases for Importing PDF data
  • 3:17 - Accessing Power Query and Importing a PDF
  • 4:15 - Tables vs. Pages in PDFs
  • 5:40 - The Complexity of Importing PDF Data
  • 6:25 - Removing Repeated Headers
  • 7:05 - Addressing Errors and Dealing with Extra Rows
  • 9:20 - Creating a Filter Column
  • 9:45 - Reversing Rows Again
  • 10:05 - PDF File Names
  • 11:20 - Reusing Power Queries
  • 11:42 - Conclusion

If you’re ready to save time and streamline your data handling processes, this episode is a must-listen. 

Host: Neale Blackwood CPA has more than 20 years of experience as a Microsoft Excel educator. He is the author of more than 200 INTHEBLACK articles as well as a book, Advanced Excel Reporting for Management Accountants.

CPA Australia publishes four podcasts, providing commentary and thought leadership across business, finance, and accounting:  

  • With Interest
  • INTHEBLACK 
  • INTHEBLACK Out Loud
  • Excel Tips 

Search for them in your podcast platform.  

You can email the podcast team at podcasts@cpaaustralia.com.au