Difference between revisions of "Categorizing Learning Objects based on Wikipedia as Substitute Corpus"

From Wikipedia Quality
(Adding infobox)
(Embed)
Line 10: Line 10:
 
== Overview ==
 
== Overview ==
 
As metadata is often not sufficiently provided by authors of Learning Resources, automatic metadata generation methods are used to create metadata afterwards. One kind of metadata is categorization, particularly the partition of Learning Resources into distinct subject [[categories]]. A disadvantage of state-of-the-art categorization methods is that they require corpora of sample Learning Resources. Unfortunately, large corpora of well-labeled Learning Resources are rare. This paper presents a new approach for the task of subject categorization of Learning Resources. Instead of using typical Learning Resources, the free encyclopedia [[Wikipedia]] is applied as training corpus. The approach presented in this paper is to apply the k-Nearest-Neighbors method for comparing a Learning Resource to Wikipedia articles. Different parameters have been evaluated regarding their impact on the categorization performance.
 
As metadata is often not sufficiently provided by authors of Learning Resources, automatic metadata generation methods are used to create metadata afterwards. One kind of metadata is categorization, particularly the partition of Learning Resources into distinct subject [[categories]]. A disadvantage of state-of-the-art categorization methods is that they require corpora of sample Learning Resources. Unfortunately, large corpora of well-labeled Learning Resources are rare. This paper presents a new approach for the task of subject categorization of Learning Resources. Instead of using typical Learning Resources, the free encyclopedia [[Wikipedia]] is applied as training corpus. The approach presented in this paper is to apply the k-Nearest-Neighbors method for comparing a Learning Resource to Wikipedia articles. Different parameters have been evaluated regarding their impact on the categorization performance.
 +
 + == Embed ==
 + === Wikipedia Quality ===
 + <code>
 + <nowiki>
 + Meyer, Marek; Rensing, Christoph; Steinmetz, Ralf. (2007). "[[Categorizing Learning Objects based on Wikipedia as Substitute Corpus]]".
 + </nowiki>
 + </code>
 +
 + === English Wikipedia ===
 + <code>
 + <nowiki>
 + {{cite journal |last1=Meyer |first1=Marek |last2=Rensing |first2=Christoph |last3=Steinmetz |first3=Ralf |title=Categorizing Learning Objects based on Wikipedia as Substitute Corpus |date=2007 |url=https://wikipediaquality.com/wiki/Categorizing_Learning_Objects_based_on_Wikipedia_as_Substitute_Corpus}}
 + </nowiki>
 + </code>
 +
 + === HTML ===
 + <code>
 + <nowiki>
 + Meyer, Marek; Rensing, Christoph; Steinmetz, Ralf. (2007). &amp;quot;<a href="https://wikipediaquality.com/wiki/Categorizing_Learning_Objects_based_on_Wikipedia_as_Substitute_Corpus">Categorizing Learning Objects based on Wikipedia as Substitute Corpus</a>&amp;quot;.
 + </nowiki>
 + </code>

Revision as of 13:08, 10 February 2021


Categorizing Learning Objects based on Wikipedia as Substitute Corpus
Authors: Marek Meyer, Christoph Rensing, Ralf Steinmetz
Publication date: 2007
Links: Original Preprint

Categorizing Learning Objects based on Wikipedia as Substitute Corpus is a scientific work related to Wikipedia quality, published in 2007 and written by Marek Meyer, Christoph Rensing and Ralf Steinmetz.

Overview

As metadata is often not sufficiently provided by authors of Learning Resources, automatic metadata generation methods are used to create metadata afterwards. One kind of metadata is categorization, particularly the partition of Learning Resources into distinct subject categories. A disadvantage of state-of-the-art categorization methods is that they require corpora of sample Learning Resources. Unfortunately, large corpora of well-labeled Learning Resources are rare. This paper presents a new approach for the task of subject categorization of Learning Resources. Instead of using typical Learning Resources, the free encyclopedia Wikipedia is applied as training corpus. The approach presented in this paper is to apply the k-Nearest-Neighbors method for comparing a Learning Resource to Wikipedia articles. Different parameters have been evaluated regarding their impact on the categorization performance.
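
The idea of comparing a Learning Resource to Wikipedia articles with k-Nearest-Neighbors can be illustrated with a minimal sketch. Everything specific in it is an assumption made for illustration and is not taken from the paper: the TF-IDF weighting, the cosine similarity, the similarity-weighted vote, the function names, and the tiny hand-written "articles" standing in for real Wikipedia text. The overview does not state which features, similarity measure or parameter values the authors evaluated.

```python
import math
import re
from collections import Counter

# Hypothetical stand-ins for labeled Wikipedia articles: (text, subject category).
TOY_ARTICLES = [
    ("A cell is the basic unit of life. Cells contain DNA and proteins.", "Biology"),
    ("Photosynthesis lets plants convert light into chemical energy in cells.", "Biology"),
    ("Evolution explains how species of plants and animals change over generations.", "Biology"),
    ("An algorithm is a finite sequence of instructions executed by a computer.", "Computer science"),
    ("A compiler translates source code of a program into machine code for a computer.", "Computer science"),
    ("A database stores data so that a computer program can query it.", "Computer science"),
]


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def build_idf(token_lists):
    """Smoothed inverse document frequency computed over the training articles."""
    n = len(token_lists)
    doc_freq = Counter(term for tokens in token_lists for term in set(tokens))
    return {term: math.log((1 + n) / (1 + df)) + 1.0 for term, df in doc_freq.items()}


def tfidf(tokens, idf):
    """Sparse TF-IDF vector as a dict; terms unseen in training are dropped."""
    counts = Counter(tokens)
    return {term: count * idf[term] for term, count in counts.items() if term in idf}


def cosine(a, b):
    """Cosine similarity of two sparse vectors; 0.0 if either vector is empty."""
    dot = sum(weight * b[term] for term, weight in a.items() if term in b)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def categorize(resource_text, articles, k=3):
    """Assign the category that collects the most similarity among the k nearest articles."""
    if not articles:
        raise ValueError("at least one labeled article is required")
    token_lists = [tokenize(text) for text, _ in articles]
    idf = build_idf(token_lists)
    vectors = [tfidf(tokens, idf) for tokens in token_lists]
    query = tfidf(tokenize(resource_text), idf)
    # Rank all articles by similarity to the Learning Resource, most similar first.
    ranked = sorted(
        ((cosine(query, vec), category) for vec, (_, category) in zip(vectors, articles)),
        key=lambda pair: pair[0],
        reverse=True,
    )
    # Similarity-weighted vote over the k nearest neighbours (an assumed voting rule).
    votes = Counter()
    for similarity, category in ranked[:k]:
        votes[category] += similarity
    return votes.most_common(1)[0][0]


if __name__ == "__main__":
    lesson = "Students write a computer program; a compiler turns its source code into machine code."
    print(categorize(lesson, TOY_ARTICLES, k=3))
```

In this sketch the number of neighbours k, the term weighting and the voting rule are the kind of parameters whose effect on categorization performance would need to be evaluated; the values chosen here are arbitrary.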

Embed

Wikipedia Quality

Meyer, Marek; Rensing, Christoph; Steinmetz, Ralf. (2007). "[[Categorizing Learning Objects based on Wikipedia as Substitute Corpus]]".

English Wikipedia

{{cite journal |last1=Meyer |first1=Marek |last2=Rensing |first2=Christoph |last3=Steinmetz |first3=Ralf |title=Categorizing Learning Objects based on Wikipedia as Substitute Corpus |date=2007 |url=https://wikipediaquality.com/wiki/Categorizing_Learning_Objects_based_on_Wikipedia_as_Substitute_Corpus}}

HTML

Meyer, Marek; Rensing, Christoph; Steinmetz, Ralf. (2007). &quot;<a href="https://wikipediaquality.com/wiki/Categorizing_Learning_Objects_based_on_Wikipedia_as_Substitute_Corpus">Categorizing Learning Objects based on Wikipedia as Substitute Corpus</a>&quot;.