Table of Contents

Class WikipediaHelper

Namespace
SunamoWikipedia
Assembly
SunamoWikipedia.dll

Provides methods for parsing Wikipedia HTML content including tables and lists.

public class WikipediaHelper
Inheritance
WikipediaHelper
Inherited Members
Extension Methods

Methods

HtmlEntitiesList(Func<List<string>, List<string>, string>)

Parses an HTML entities list from Wikipedia and returns formatted dictionary values.

public static string HtmlEntitiesList(Func<List<string>, List<string>, string> dictionaryValuesFormatter)

Parameters

dictionaryValuesFormatter Func<List<string>, List<string>, string>

A function that takes names and characters lists and returns a formatted string.

Returns

string

A formatted string of HTML entity dictionary values.

ParseList(string)

Parses a Wikipedia page containing a list structure and extracts all anchor text values.

public static List<string> ParseList(string html)

Parameters

html string

The HTML content to parse.

Returns

List<string>

A list of extracted text values from anchors.

ParseTable(string, params string[])

Parses HTML content and extracts Wikipedia tables that contain the specified columns.

public static List<HtmlTableParserWiki> ParseTable(string html, params string[] columnNames)

Parameters

html string

The HTML content to parse.

columnNames string[]

The column names that the table must contain.

Returns

List<HtmlTableParserWiki>

A list of parsed Wikipedia table objects.