Finding Needles in an Encyclopedic Haystack: Detecting Classes Among Wikipedia Articles
Latest revision as of 11:00, 1 March 2021
Authors | Marius Pasca |
---|---|
Publication date | 2018 |
DOI | 10.1145/3178876.3186025 |
Links | Original |
Finding Needles in an Encyclopedic Haystack: Detecting Classes Among Wikipedia Articles is a scientific work related to Wikipedia quality, published in 2018 and written by Marius Pasca.
Overview
A lightweight method distinguishes articles within Wikipedia that are classes (“Novel”, “Book”) from other articles (“Three Men in a Boat”, “Diary of a Pilgrimage”). It exploits clues available in the article text and in the categories associated with articles in Wikipedia, and it requires no linguistic preprocessing tools. Experimental results show that classes can be identified among Wikipedia articles in multiple languages, at aggregate precision above 0.9 and recall above 0.6.
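To make the reported figures concrete, the following minimal sketch shows how precision and recall are conventionally computed for a class-detection task. It is not the paper's method or evaluation code: the function name, the toy gold set, and the toy predictions (including the extra title "Poem") are assumptions chosen for illustration only.

```python
def precision_recall(gold, predicted):
    """Return (precision, recall) for a set of articles predicted to be classes.

    gold      -- titles that truly are classes (hypothetical annotations)
    predicted -- titles that a detector labeled as classes (hypothetical output)
    """
    true_positives = len(gold & predicted)
    # Precision: fraction of predicted classes that are correct.
    precision = true_positives / len(predicted) if predicted else 0.0
    # Recall: fraction of true classes that were found.
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall


# Hypothetical toy data, not taken from the paper's experiments.
gold_classes = {"Novel", "Book", "Poem"}
predicted_classes = {"Novel", "Book"}

precision, recall = precision_recall(gold_classes, predicted_classes)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

In this toy case every predicted class is correct but one true class is missed, which mirrors the pattern of high precision and lower recall described above.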
Embed
Wikipedia Quality
Pasca, Marius. (2018). "[[Finding Needles in an Encyclopedic Haystack: Detecting Classes Among Wikipedia Articles]]". International World Wide Web Conferences Steering Committee. DOI: 10.1145/3178876.3186025.
English Wikipedia
{{cite journal |last1=Pasca |first1=Marius |title=Finding Needles in an Encyclopedic Haystack: Detecting Classes Among Wikipedia Articles |date=2018 |doi=10.1145/3178876.3186025 |url=https://wikipediaquality.com/wiki/Finding_Needles_in_an_Encyclopedic_Haystack:_Detecting_Classes_Among_Wikipedia_Articles |journal=International World Wide Web Conferences Steering Committee}}
HTML
Pasca, Marius. (2018). "<a href="https://wikipediaquality.com/wiki/Finding_Needles_in_an_Encyclopedic_Haystack:_Detecting_Classes_Among_Wikipedia_Articles">Finding Needles in an Encyclopedic Haystack: Detecting Classes Among Wikipedia Articles</a>". International World Wide Web Conferences Steering Committee. DOI: 10.1145/3178876.3186025.