Exemplos de ISpider.HtmlToTextAsync em C# (CSharp)

Linguagem de programação: C# (CSharp)

Classe / Tipo: ISpider

Método / Função: HtmlToTextAsync

Exemplos em hotexamples.com: 1

ISpider.HtmlToTextAsync em C# (CSharp) - 1 exemplos encontrados. Esses são os exemplos do mundo real mais bem avaliados de ISpider.HtmlToTextAsync em C# (CSharp) extraídos de projetos de código aberto. Você pode avaliar os exemplos para nos ajudar a melhorar a qualidade deles.

Métodos Frequentes

Exibir Ocultar

Exit(10)

Pause(7)

Log(7)

Contiune(6)

TurnRight(3)

TurnLeft(3)

MoveFront(3)

GetPosition(3)

RunAsync(2)

GetOrientation(2)

HasCompleted(1)

Start(1)

RemoveNodesFromDocument(1)

LoadPage(1)

HtmlToTextAsync(1)

AddCookie(1)

HandleTags(1)

HandleMedia(1)

Continue(1)

Grab(1)

GetType(1)

GetMedia(1)

GetHeadersOfSize(1)

Extract(1)

DownloadArticleByHeader(1)

Dispose(1)

Crawl(1)

HandleLinks(1)

Métodos Frequentes

Exit (10)

Pause (7)

Log (7)

Contiune (6)

TurnRight (3)

TurnLeft (3)

MoveFront (3)

GetPosition (3)

RunAsync (2)

GetOrientation (2)

Métodos Frequentes

HasCompleted (1)

Start (1)

RemoveNodesFromDocument (1)

LoadPage (1)

HtmlToTextAsync (1)

AddCookie (1)

HandleTags (1)

HandleMedia (1)

Continue (1)

Grab (1)

GetType (1)

GetMedia (1)

GetHeadersOfSize (1)

Extract (1)

DownloadArticleByHeader (1)

Dispose (1)

Crawl (1)

HandleLinks (1)

Métodos Frequentes

GetType (1)

GetMedia (1)

GetHeadersOfSize (1)

Extract (1)

DownloadArticleByHeader (1)

Dispose (1)

Crawl (1)

HandleLinks (1)

Exemplo n.º 1

0

Exibir arquivo

Arquivo: Extractor.cs Projeto: mimustafa/MediaSpin

public string ExtractBodyTextFromArticleDocument(HtmlDocument articleHtmlDocument) { RemoveHeadersFromDocument(articleHtmlDocument); RemoveLinksFromDocument(articleHtmlDocument); RemoveUnorderedListsFromDocument(articleHtmlDocument); RemoveScriptsFromDocument(articleHtmlDocument); if (articleHtmlDocument?.DocumentNode?.OuterHtml == null) { return(String.Empty); } var cleanedHtml = articleHtmlDocument.DocumentNode.OuterHtml; var htmlToTextConversion = _spider.HtmlToTextAsync(cleanedHtml); Task.WaitAll(htmlToTextConversion); if (htmlToTextConversion.IsCompletedSuccessfully) { var articleText = htmlToTextConversion.Result.Replace("\n", " "); var finalArticleText = RemoveNonBodyTextSentences(articleText); return(finalArticleText); } else { throw new Exception($"could not convert the following html to text {cleanedHtml}"); } }