geeksteev/SplitPdfByRegex

SplitPdfByRegex

Parses a PDF for a regex pattern and creates a new PDF for each distinct match.

Throughout this README I refer to a regular expression match as a "department code", because that is how I used the application.

Testing PDF - The PDF I used for testing was 200+ pages long and covered 90 departments. Each page contained information for exactly one department and carried that department's code in its header. A department could span multiple pages, but no page contained information for more than one department.

This application reads each page of a PDF and searches it for a department code. The first time it finds a given department code, it creates a new PDF for that department. If it finds multiple pages for the same department, it appends those pages to that department's PDF.
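The grouping step described above can be sketched roughly as follows. This is a minimal illustration in Python, not the application's actual code: the function name `group_pages_by_code`, the sample pattern `DEPT-\d{3}`, and the sample page texts are all hypothetical. It assumes the text of each page has already been extracted into a list of strings (a PDF library would be needed for that, and for writing the output files), and it assumes that a page with no match is simply skipped.

```python
import re


def group_pages_by_code(page_texts, pattern):
    """Group page indices by the first regex match found on each page.

    page_texts: list of strings, one per page (text already extracted).
    pattern:    regular expression identifying a department code.
    Returns a dict mapping each matched code to the list of 0-based
    page indices on which it was found, in order of first appearance.
    """
    regex = re.compile(pattern)
    groups = {}  # dicts keep insertion order in Python 3.7+
    for index, text in enumerate(page_texts):
        match = regex.search(text)
        if match is None:
            # Assumption for this sketch: pages without a code are skipped.
            continue
        code = match.group(0)
        # First sighting creates the group; later pages are appended to it.
        groups.setdefault(code, []).append(index)
    return groups


if __name__ == "__main__":
    # Hypothetical page texts standing in for extracted PDF pages.
    pages = [
        "DEPT-001 budget summary",
        "DEPT-001 budget summary, continued",
        "DEPT-002 staffing report",
        "page with no header",
        "DEPT-001 late addendum",
    ]
    for code, indices in group_pages_by_code(pages, r"DEPT-\d{3}").items():
        print(code, indices)
```

Each entry in the resulting mapping corresponds to one output PDF: the key would name the file, and the listed pages would be copied into it in order.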
