Skip to content

SyncfusionExamples/Extract-data-from-PDF-document

Repository files navigation

Extract useful data from PDF document using C#

The Syncfusion .NET PDF library allows users to extract various types of useful data from a PDF document, such as text, images, attachments, and form data. This repository contains examples to extract various types of data from PDF documents using C#.

Sample name Description
Extract text from PDF Extract all text from a PDF document.
Extract text from specific page Extract all text from a specific page of PDF document.
Extract text from predefined bounds Extract text from the specified bounds in the PDF document.
Layout based text extraction Extract text from the given PDF page based on the layout of the PDF document.
Extract text from scanned PDF The Syncfusion .NET OCR library is used to extract text from the scanned PDFs and images.
Extract images Extract images from an existing PDF document.
Find text Find text from an existing PDF document.
Find text using regex Extract the required text data from the PDF document using regex (regular expression).
Extract PDF document information Extract document information from an existing PDF document.
Extract attachment Extract attachments such as Word document, Excel document, text file, images and more from an existing PDF document.
Extract filled PDF form data Export the filled PDF form data and save it as XML file.

How to run the examples

  • Download this project to a location in your disk.
  • Open the solution file using Visual Studio.
  • Rebuild the solution to install the required NuGet package.
  • Run the application.

Resources

Support and feedback

License

This is a commercial product and requires a paid license for possession or use. Syncfusion’s licensed software, including this component, is subject to the terms and conditions of Syncfusion's EULA. You can purchase a licnense here or start a free 30-day trial here.

About Syncfusion

Founded in 2001 and headquartered in Research Triangle Park, N.C., Syncfusion has more than 26,000+ customers and more than 1 million users, including large financial institutions, Fortune 500 companies, and global IT consultancies.

Today, we provide 1600+ components and frameworks for web (Blazor, ASP.NET Core, ASP.NET MVC, ASP.NET WebForms, JavaScript, Angular, React, Vue, and Flutter), mobile (Xamarin, Flutter, UWP, and JavaScript), and desktop development (WinForms, WPF, WinUI(Preview), Flutter and UWP). We provide ready-to-deploy enterprise software for dashboards, reports, data integration, and big data processing. Many customers have saved millions in licensing fees by deploying our software.

About

This repository contains examples to extract various data from PDF documents in .NET apps using Syncfusion .NET PDF library.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages