Skip to content

jordivicedo/NBoilerpipe

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

31 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

NBoilerpipe is a C# port of boilerpipe 1.2 (http://code.google.com/p/boilerpipe/) library. Most of the code is converted with the Sharpen tool (https://github.com/slluis/sharpen). The code uses the Sharpen libary (with modification) from NGit project (https://github.com/slluis/ngit) and HmtlAgilityPack (http://htmlagilitypack.codeplex.com/).

NBoilerpipe is only been tested with Mono.

Usage:

using NBoilerpipe.Extractors;
...
String html = GetHtmlText();
var text = ArticleExtractor.INSTANCE.GetText (html);
//var text = DefaultExtractor.INSTANCE.GetText (html);
...

About

Boilerpipe 1.2 port for .NET

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • C# 100.0%