Wikipedia based Semantic Related Chinese Words Exploring and Relatedness Computing

From Wikipedia Quality
Revision as of 19:59, 25 May 2019 by Olivia (talk | contribs) (Creating a new page - Wikipedia based Semantic Related Chinese Words Exploring and Relatedness Computing)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

Wikipedia based Semantic Related Chinese Words Exploring and Relatedness Computing - scientific work related to Wikipedia quality published in 2009, written by Zhong Yi-xin.

Overview

To find how to collect semantic related words and calculate semantic relatedness,an experiment is done to download about 50 thousand documents from the web site of Chinese Wikipedia and extract hyperlinks between lines which contains semantic information.By mining hyperlinked references in documents,about 400 thousand semantic related word pairs are collected.With more experiments on topic groups of related words,tightly related words are grouped into smaller sets with an average semantic relatedness calculated.Semantic relatedness is calculated using information of hyperlink positions and frequencies in documents.Comparing with the result by classic algorithms,the reliability of the new measures is analyzed.