Difference between revisions of "Linking, Searching, and Visualizing Entities for the Swedish Wikipedia"
(Infobox work) |
(Adding embed) |
||
Line 9: | Line 9: | ||
== Overview == | == Overview == | ||
In this paper, authors describe a new system to extract, index, search, and visualize entities on [[Wikipedia]]. To carry out the extraction, authors designed a high-performance entity linker and authors used a document model to store the resulting linguistic annotations. The entity linker ,HERD, extracts the mentions from text using a string matching Engine and links the mto entities with a combination of rules, PageRank, and feature vectors based on the [[Wikipedia categories]]. The document model, Docforia, consists of layers, where each layer is a sequence of ranges describing a specific annotation,here thee ntities. Authors evaluated HERD with the ERD’14 protocol (Carmel et al., 2014) and authors reached the competitive F1-score of 0.746 on the English development set. Authors applied HERD to the whole collection of Swedish articles of Wikipedia and authors used Lucene to index the layers and a search module to interactively retrieve articles and metadata given a title, a phrase, or a property. The user can then select an entity and visualize concordance in articles or paragraphs. A demonstration of the entity search and visualization is available for Swedish at this address: http://vilde.cs.lth.se:9001/sv-herd/. (Less) | In this paper, authors describe a new system to extract, index, search, and visualize entities on [[Wikipedia]]. To carry out the extraction, authors designed a high-performance entity linker and authors used a document model to store the resulting linguistic annotations. The entity linker ,HERD, extracts the mentions from text using a string matching Engine and links the mto entities with a combination of rules, PageRank, and feature vectors based on the [[Wikipedia categories]]. The document model, Docforia, consists of layers, where each layer is a sequence of ranges describing a specific annotation,here thee ntities. Authors evaluated HERD with the ERD’14 protocol (Carmel et al., 2014) and authors reached the competitive F1-score of 0.746 on the English development set. Authors applied HERD to the whole collection of Swedish articles of Wikipedia and authors used Lucene to index the layers and a search module to interactively retrieve articles and metadata given a title, a phrase, or a property. The user can then select an entity and visualize concordance in articles or paragraphs. A demonstration of the entity search and visualization is available for Swedish at this address: http://vilde.cs.lth.se:9001/sv-herd/. (Less) | ||
+ | |||
+ | == Embed == | ||
+ | === Wikipedia Quality === | ||
+ | <code> | ||
+ | <nowiki> | ||
+ | Södergren, Anton; Klang, Marcus; Nugues, Pierre. (2016). "[[Linking, Searching, and Visualizing Entities for the Swedish Wikipedia]]". | ||
+ | </nowiki> | ||
+ | </code> | ||
+ | |||
+ | === English Wikipedia === | ||
+ | <code> | ||
+ | <nowiki> | ||
+ | {{cite journal |last1=Södergren |first1=Anton |last2=Klang |first2=Marcus |last3=Nugues |first3=Pierre |title=Linking, Searching, and Visualizing Entities for the Swedish Wikipedia |date=2016 |url=https://wikipediaquality.com/wiki/Linking,_Searching,_and_Visualizing_Entities_for_the_Swedish_Wikipedia}} | ||
+ | </nowiki> | ||
+ | </code> | ||
+ | |||
+ | === HTML === | ||
+ | <code> | ||
+ | <nowiki> | ||
+ | Södergren, Anton; Klang, Marcus; Nugues, Pierre. (2016). &quot;<a href="https://wikipediaquality.com/wiki/Linking,_Searching,_and_Visualizing_Entities_for_the_Swedish_Wikipedia">Linking, Searching, and Visualizing Entities for the Swedish Wikipedia</a>&quot;. | ||
+ | </nowiki> | ||
+ | </code> |
Revision as of 08:50, 15 May 2020
Authors | Anton Södergren Marcus Klang Pierre Nugues |
---|---|
Publication date | 2016 |
Links | Original |
Linking, Searching, and Visualizing Entities for the Swedish Wikipedia - scientific work related to Wikipedia quality published in 2016, written by Anton Södergren, Marcus Klang and Pierre Nugues.
Overview
In this paper, authors describe a new system to extract, index, search, and visualize entities on Wikipedia. To carry out the extraction, authors designed a high-performance entity linker and authors used a document model to store the resulting linguistic annotations. The entity linker ,HERD, extracts the mentions from text using a string matching Engine and links the mto entities with a combination of rules, PageRank, and feature vectors based on the Wikipedia categories. The document model, Docforia, consists of layers, where each layer is a sequence of ranges describing a specific annotation,here thee ntities. Authors evaluated HERD with the ERD’14 protocol (Carmel et al., 2014) and authors reached the competitive F1-score of 0.746 on the English development set. Authors applied HERD to the whole collection of Swedish articles of Wikipedia and authors used Lucene to index the layers and a search module to interactively retrieve articles and metadata given a title, a phrase, or a property. The user can then select an entity and visualize concordance in articles or paragraphs. A demonstration of the entity search and visualization is available for Swedish at this address: http://vilde.cs.lth.se:9001/sv-herd/. (Less)
Embed
Wikipedia Quality
Södergren, Anton; Klang, Marcus; Nugues, Pierre. (2016). "[[Linking, Searching, and Visualizing Entities for the Swedish Wikipedia]]".
English Wikipedia
{{cite journal |last1=Södergren |first1=Anton |last2=Klang |first2=Marcus |last3=Nugues |first3=Pierre |title=Linking, Searching, and Visualizing Entities for the Swedish Wikipedia |date=2016 |url=https://wikipediaquality.com/wiki/Linking,_Searching,_and_Visualizing_Entities_for_the_Swedish_Wikipedia}}
HTML
Södergren, Anton; Klang, Marcus; Nugues, Pierre. (2016). "<a href="https://wikipediaquality.com/wiki/Linking,_Searching,_and_Visualizing_Entities_for_the_Swedish_Wikipedia">Linking, Searching, and Visualizing Entities for the Swedish Wikipedia</a>".