Difference between revisions of "Annotated Search – Indexing, Searching and Ranking Within Annotated Wikipedia Information Boxes"

From Wikipedia Quality
Jump to: navigation, search
(Adding wikilinks)
(Infobox)
Line 1: Line 1:
 +
{{Infobox work
 +
| title = Annotated Search – Indexing, Searching and Ranking Within Annotated Wikipedia Information Boxes
 +
| date = 2011
 +
| authors = [[Simon Stenström]]
 +
| link = http://www.diva-portal.org/smash/record.jsf?pid=diva2:654219
 +
}}
 
'''Annotated Search – Indexing, Searching and Ranking Within Annotated Wikipedia Information Boxes''' - scientific work related to [[Wikipedia quality]] published in 2011, written by [[Simon Stenström]].
 
'''Annotated Search – Indexing, Searching and Ranking Within Annotated Wikipedia Information Boxes''' - scientific work related to [[Wikipedia quality]] published in 2011, written by [[Simon Stenström]].
  
 
== Overview ==
 
== Overview ==
 
The main focus of [[Natural Language Processing]] has been aimed to understanding texts better, but little work has been aimed toward finding good search results to a query, given annotated data. This is the problem Author have focused on.This thesis discuss both how to index annotated data, in which cases a search engine over annotated data offer better search results than a regular full text search engine, how the ranking function differ between annotated data and unstructured data search and how to evaluate a annotated search engine.I created a search engine over the semantically annotated [[Wikipedia]] information boxes and a baseline full-text search system over the same data. The thesis show that with some simple work, a annotated search engine can improve the performance with between 17 and 27 percent compared to the baseline even on a diverse data collection such as the Wikipedia information boxes.
 
The main focus of [[Natural Language Processing]] has been aimed to understanding texts better, but little work has been aimed toward finding good search results to a query, given annotated data. This is the problem Author have focused on.This thesis discuss both how to index annotated data, in which cases a search engine over annotated data offer better search results than a regular full text search engine, how the ranking function differ between annotated data and unstructured data search and how to evaluate a annotated search engine.I created a search engine over the semantically annotated [[Wikipedia]] information boxes and a baseline full-text search system over the same data. The thesis show that with some simple work, a annotated search engine can improve the performance with between 17 and 27 percent compared to the baseline even on a diverse data collection such as the Wikipedia information boxes.

Revision as of 23:00, 11 June 2019


Annotated Search – Indexing, Searching and Ranking Within Annotated Wikipedia Information Boxes
Authors
Simon Stenström
Publication date
2011
Links
Original

Annotated Search – Indexing, Searching and Ranking Within Annotated Wikipedia Information Boxes - scientific work related to Wikipedia quality published in 2011, written by Simon Stenström.

Overview

The main focus of Natural Language Processing has been aimed to understanding texts better, but little work has been aimed toward finding good search results to a query, given annotated data. This is the problem Author have focused on.This thesis discuss both how to index annotated data, in which cases a search engine over annotated data offer better search results than a regular full text search engine, how the ranking function differ between annotated data and unstructured data search and how to evaluate a annotated search engine.I created a search engine over the semantically annotated Wikipedia information boxes and a baseline full-text search system over the same data. The thesis show that with some simple work, a annotated search engine can improve the performance with between 17 and 27 percent compared to the baseline even on a diverse data collection such as the Wikipedia information boxes.