Annotated Search – Indexing, Searching and Ranking Within Annotated Wikipedia Information Boxes
Authors | Simon Stenström |
---|---|
Publication date | 2011 |
Links | Original |
Annotated Search – Indexing, Searching and Ranking Within Annotated Wikipedia Information Boxes is a scientific work related to Wikipedia quality, published in 2011 and written by Simon Stenström.
Overview
The main focus of Natural Language Processing has been on understanding texts better, but little work has addressed finding good search results for a query, given annotated data. This is the problem the author focuses on. The thesis discusses how to index annotated data, in which cases a search engine over annotated data offers better search results than a regular full-text search engine, how the ranking function differs between annotated data search and unstructured data search, and how to evaluate an annotated search engine. The author created a search engine over the semantically annotated Wikipedia information boxes and a baseline full-text search system over the same data. The thesis shows that, with some simple work, an annotated search engine can improve performance by between 17 and 27 percent compared to the baseline, even on a data collection as diverse as the Wikipedia information boxes.
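To make the distinction between full-text and annotated indexing concrete, the following is a minimal sketch and not the thesis's actual implementation. It assumes an information box can be modeled as a mapping from attribute names to text values. The toy data, the function names (`build_indexes`, `search_full_text`, `search_annotated`), and the whitespace tokenizer are all hypothetical choices made for illustration.

```python
from collections import defaultdict

# Hypothetical toy data: each document is an information box,
# modeled here as attribute -> value (an assumption, not the thesis's schema).
INFOBOXES = {
    "Stockholm": {"name": "Stockholm", "country": "Sweden"},
    "Sweden": {"name": "Sweden", "capital": "Stockholm"},
    "Oslo": {"name": "Oslo", "country": "Norway"},
}


def tokenize(text):
    """Lowercase and split on whitespace (deliberately simplistic)."""
    return text.lower().split()


def build_indexes(infoboxes):
    """Build a flat full-text index and an attribute-aware index."""
    full_text = defaultdict(set)   # term -> document ids
    annotated = defaultdict(set)   # (attribute, term) -> document ids
    for doc_id, box in infoboxes.items():
        for attribute, value in box.items():
            for term in tokenize(value):
                full_text[term].add(doc_id)
                annotated[(attribute, term)].add(doc_id)
    return full_text, annotated


def search_full_text(index, query):
    """Return documents containing every query term in any attribute."""
    postings = [index.get(term, set()) for term in tokenize(query)]
    return set.intersection(*postings) if postings else set()


def search_annotated(index, attribute, query):
    """Return documents containing every query term in the given attribute."""
    postings = [index.get((attribute, term), set()) for term in tokenize(query)]
    return set.intersection(*postings) if postings else set()


full_text, annotated = build_indexes(INFOBOXES)

# The full-text index cannot tell "is named Stockholm" from "has capital Stockholm".
print(search_full_text(full_text, "Stockholm"))
# The annotated index restricts the match to one attribute.
print(search_annotated(annotated, "capital", "Stockholm"))
```

In this sketch the full-text query matches both the Stockholm and the Sweden boxes, while the attribute-restricted query matches only the box whose `capital` value is Stockholm. Ranking and evaluation, which the thesis also covers, are left out.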
Embed
Wikipedia Quality
Stenström, Simon. (2011). "[[Annotated Search – Indexing, Searching and Ranking Within Annotated Wikipedia Information Boxes]]".
English Wikipedia
{{cite journal |last1=Stenström |first1=Simon |title=Annotated Search – Indexing, Searching and Ranking Within Annotated Wikipedia Information Boxes |date=2011 |url=https://wikipediaquality.com/wiki/Annotated_Search_–_Indexing,_Searching_and_Ranking_Within_Annotated_Wikipedia_Information_Boxes}}
HTML
Stenström, Simon. (2011). "<a href="https://wikipediaquality.com/wiki/Annotated_Search_–_Indexing,_Searching_and_Ranking_Within_Annotated_Wikipedia_Information_Boxes">Annotated Search – Indexing, Searching and Ranking Within Annotated Wikipedia Information Boxes</a>".