Class WikipediaHelper
- Namespace
- SunamoWikipedia
- Assembly
- SunamoWikipedia.dll
Provides methods for parsing Wikipedia HTML content including tables and lists.
public class WikipediaHelper
- Inheritance
-
WikipediaHelper
- Inherited Members
- Extension Methods
Methods
HtmlEntitiesList(Func<List<string>, List<string>, string>)
Parses an HTML entities list from Wikipedia and returns formatted dictionary values.
public static string HtmlEntitiesList(Func<List<string>, List<string>, string> dictionaryValuesFormatter)
Parameters
dictionaryValuesFormatterFunc<List<string>, List<string>, string>A function that takes names and characters lists and returns a formatted string.
Returns
- string
A formatted string of HTML entity dictionary values.
ParseList(string)
Parses a Wikipedia page containing a list structure and extracts all anchor text values.
public static List<string> ParseList(string html)
Parameters
htmlstringThe HTML content to parse.
Returns
ParseTable(string, params string[])
Parses HTML content and extracts Wikipedia tables that contain the specified columns.
public static List<HtmlTableParserWiki> ParseTable(string html, params string[] columnNames)
Parameters
htmlstringThe HTML content to parse.
columnNamesstring[]The column names that the table must contain.
Returns
- List<HtmlTableParserWiki>
A list of parsed Wikipedia table objects.