Finding Needles in an Encyclopedic Haystack: Detecting Classes Among Wikipedia Articles

From Wikipedia Quality
Revision as of 10:00, 1 March 2021 by Lindsey (talk | contribs) (+ categories)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search


Finding Needles in an Encyclopedic Haystack: Detecting Classes Among Wikipedia Articles
Authors
Marius Pasca
Publication date
2018
DOI
10.1145/3178876.3186025
Links
Original

Finding Needles in an Encyclopedic Haystack: Detecting Classes Among Wikipedia Articles - scientific work related to Wikipedia quality published in 2018, written by Marius Pasca.

Overview

A lightweight method distinguishes articles within Wikipedia that are classes (“Novel”, “Book”) from other articles (“Three Men in a Boat”, “Diary of a Pilgrimage”). It exploits clues available within the article text and within categories associated with articles in Wikipedia, while not requiring any linguistic preprocessing tools. Experimental results show that classes can be identified among Wikipedia articles in multiple languages, at aggregate precision and recall above 0.9 and 0.6 respectively.

Embed

Wikipedia Quality

Pasca, Marius. (2018). "[[Finding Needles in an Encyclopedic Haystack: Detecting Classes Among Wikipedia Articles]]". International World Wide Web Conferences Steering Committee. DOI: 10.1145/3178876.3186025.

English Wikipedia

{{cite journal |last1=Pasca |first1=Marius |title=Finding Needles in an Encyclopedic Haystack: Detecting Classes Among Wikipedia Articles |date=2018 |doi=10.1145/3178876.3186025 |url=https://wikipediaquality.com/wiki/Finding_Needles_in_an_Encyclopedic_Haystack:_Detecting_Classes_Among_Wikipedia_Articles |journal=International World Wide Web Conferences Steering Committee}}

HTML

Pasca, Marius. (2018). &quot;<a href="https://wikipediaquality.com/wiki/Finding_Needles_in_an_Encyclopedic_Haystack:_Detecting_Classes_Among_Wikipedia_Articles">Finding Needles in an Encyclopedic Haystack: Detecting Classes Among Wikipedia Articles</a>&quot;. International World Wide Web Conferences Steering Committee. DOI: 10.1145/3178876.3186025.