Skip to content

This repo contains some code I wrote in C# using Lucene .Net and some other libraries, in order to extract text from .pdf and .txt files using .NET, indexing and searching that text.

dirten/data-indexing-and-searching

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 

Repository files navigation

data-indexing-and-searching

textIndexing:contains the code that extracts the textual information of a .txt file and creates a Lucene Index.

pdfIndexing: is the same for .pdf files (text extraction & Lucene Index)

indexSearcher: is the module that performs the queries on any Lucene / Lucene.NET index.

About

This repo contains some code I wrote in C# using Lucene .Net and some other libraries, in order to extract text from .pdf and .txt files using .NET, indexing and searching that text.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • C# 100.0%