Wikipedia Driven Autonomous Label Assignment in Wrapper Induced Tables with Missing Column Names

From Wikipedia Quality
Jump to: navigation, search


Wikipedia Driven Autonomous Label Assignment in Wrapper Induced Tables with Missing Column Names
Authors
Mohammad Shafkat Amin
Anupam Bhattacharjee
Hasan M. Jamil
Publication date
2010
DOI
10.1145/1774088.1774445
Links
Original

Wikipedia Driven Autonomous Label Assignment in Wrapper Induced Tables with Missing Column Names - scientific work related to Wikipedia quality published in 2010, written by Mohammad Shafkat Amin, Anupam Bhattacharjee and Hasan M. Jamil.

Overview

As the volume of information available on the internet is growing exponentially, it is clear that most of this information will have to be processed and digested by computers to produce useful information for human consumption. Unfortunately, most web contents are currently designed for direct human consumption in which it is assumed that a human will decipher the information presented to him in some context and will be able to connect the missing dots, if any. In particular, information presented in some tabular form often does not accompany descriptive titles or column names similar to attribute names in tables. While such omissions are not really an issue for humans, it is truly hard to extract information in autonomous systems in which a machine is expected to understand the meaning of the table presented and extract the right information in the context of the query. It is even more difficult when the information needed is distributed across the globe and involve semantic heterogeneity. In this paper, goal is to address the issue of how to interpret tables with missing column names by developing a method for the assignment of attributes names in an arbitrary table extracted from the web in a fully autonomous manner. Authors propose a novel approach by leveraging Wikipedia for the first time for column name discovery for the purpose of table annotation. Authors show that this leads to an improved likelihood of capturing the context and interpretation of the table accurately and producing a semantically meaningful query response.

Embed

Wikipedia Quality

Amin, Mohammad Shafkat; Bhattacharjee, Anupam; Jamil, Hasan M.. (2010). "[[Wikipedia Driven Autonomous Label Assignment in Wrapper Induced Tables with Missing Column Names]]".DOI: 10.1145/1774088.1774445.

English Wikipedia

{{cite journal |last1=Amin |first1=Mohammad Shafkat |last2=Bhattacharjee |first2=Anupam |last3=Jamil |first3=Hasan M. |title=Wikipedia Driven Autonomous Label Assignment in Wrapper Induced Tables with Missing Column Names |date=2010 |doi=10.1145/1774088.1774445 |url=https://wikipediaquality.com/wiki/Wikipedia_Driven_Autonomous_Label_Assignment_in_Wrapper_Induced_Tables_with_Missing_Column_Names}}

HTML

Amin, Mohammad Shafkat; Bhattacharjee, Anupam; Jamil, Hasan M.. (2010). &quot;<a href="https://wikipediaquality.com/wiki/Wikipedia_Driven_Autonomous_Label_Assignment_in_Wrapper_Induced_Tables_with_Missing_Column_Names">Wikipedia Driven Autonomous Label Assignment in Wrapper Induced Tables with Missing Column Names</a>&quot;.DOI: 10.1145/1774088.1774445.