CFP last date
16 December 2024
Reseach Article

Automatic Extraction of Entity Alias from the Web

by Sumitra A. Jakhete, Shweta C. Dharmadhikari
International Journal of Applied Information Systems
Foundation of Computer Science (FCS), NY, USA
Volume 3 - Number 8
Year of Publication: 2012
Authors: Sumitra A. Jakhete, Shweta C. Dharmadhikari
10.5120/ijais12-450567

Sumitra A. Jakhete, Shweta C. Dharmadhikari . Automatic Extraction of Entity Alias from the Web. International Journal of Applied Information Systems. 3, 8 ( August 2012), 5-9. DOI=10.5120/ijais12-450567

@article{ 10.5120/ijais12-450567,
author = { Sumitra A. Jakhete, Shweta C. Dharmadhikari },
title = { Automatic Extraction of Entity Alias from the Web },
journal = { International Journal of Applied Information Systems },
issue_date = { August 2012 },
volume = { 3 },
number = { 8 },
month = { August },
year = { 2012 },
issn = { 2249-0868 },
pages = { 5-9 },
numpages = {9},
url = { https://www.ijais.org/archives/volume3/number8/249-0567/ },
doi = { 10.5120/ijais12-450567 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2023-07-05T10:46:06.458511+05:30
%A Sumitra A. Jakhete
%A Shweta C. Dharmadhikari
%T Automatic Extraction of Entity Alias from the Web
%J International Journal of Applied Information Systems
%@ 2249-0868
%V 3
%N 8
%P 5-9
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

An individual is known by more than one name on the web. Identifying the correct alias for an entity is playing a crucial role in the field of information retrieval, relation extraction, sentiment analysis, and entity name disambiguation as well as in biomedical fields. Traditional system provides the solution on solving lexical ambiguity, but it lagged on the problem of referential ambiguity. Through this paper we emphasis on referential ambiguity to extract correct alias for a given name. Given a name alias dataset retrieves lexical pattern from a web search engine. With the help of Lexical-pattern and using second level depth extract candidate aliases. As to identify correct alias from a list of aliases we used similarity measures as well as graph mining measures such as degree distribution and clustering coefficient. We integrate different word sore and calculate the final weight of each candidate alias. There by our method providing more promising result in terms achieving a statistically significant mean reciprocal rank (MRR) of 0. 611 and improves the precision and minimize the recall that than the previous baseline method.

References
  1. Danushka Bollegala, Yutaka Matsuo, and Mitsuru Ishizuka, Member, IEEE 2011 Automatic Discovery of Personal Name Aliases from the Web In IEEE Transaction on knowledge and data engineering, vol. 23, no. 6.
  2. Christian Borgelt 2009 Graph Mining: An Overview In Proc. 19th GMA/GI Workshop Computational Intelligence, Germany.
  3. D. Kavitha 2011 A Survey on Assorted Approaches to Graph Data Mining In International Journal of Computer Applications (0975 – 8887) Volume 14– No. 1.
  4. Deepayan Chakrabarti,Yiping Zhany, Christos Faloutsos 2004 R-MAT: A Recursive Model for Graph Mining In Proceedings of the 2004 SIAM International Conference on Data Mining.
  5. Danushka Bollegala, Yutaka Matsuo, and Mitsuru Ishizuka 2006 Extracting Key Phrases to Disambiguate personal names on the web In CICLing'06 Proceedings of the 7th international conference on Computational Linguistics and Intelligent Text Processing.
  6. Dmitri V. Kalashnikov Zhaoqi Chen Rabia Nuray-Turan Sharad Mehrotra Zheng Zhang 2009 Web People Search via Connection Analysis In IEEE International Conference on Data Engineering.
  7. T. Hokama and H. Kitagawa 2006 Extracting Mnemonic Names of People from the Web In Proc. Ninth Int'l Conf. Asian Digital Libraries (ICADL '06), pp. 121-130.
  8. C. Galvez and F. Moya-Anegon 2007 Approximate Personal Name- Matching through Finite-State Graphs In J. Am. Soc. for Information Science and Technology, vol. 58, page No. . 1-17.
  9. Md. Rafiqul Islam, Md. Rakibul Islam 2008 An Effective Term Weighting Method Using Random Walk Model for Text Classification In Proceedings of 11th International Conference on Computer and Information Technology (ICCIT 2008) 25-27, Khulna, Bangladesh.
  10. Michael Berry 2010 Text Mining Application and Theory, John Wiley and Sons Ltd.
  11. Soumen Chakrabarti 2003 Mining the web: Discover the web form hypertext data, ISBN 1558607544 Elsevier.
  12. G. Salton and M. McGill 1986 Introduction to Modern Information Retreival. McGraw-Hill Inc.
  13. A. Bagga and B. Baldwin, 1998 Entity-Based Cross-Document Coreferencing Using the Vector Space Model In Proc. Int'l Conf. Computational Linguistics (COLING '98), Page No. . 79-85.
  14. Sumitra Jakhete, Shweta Dharmadhikari 2012 Analysis of Anchor text based on Pattern Growth Graph Algorithm for Name Alias Detection System In CiiT International Journal of Data Mining and Knowledge Engineering. DOI: DMKE062012003.
  15. Sumitra J. , Shweta D. , Madhuri C. 2012 Name Alias Detection system using graph mining method In 2nd international conference on computer application, Pondicherry, Volume 5,Page No. . 125-127.
  16. Web search Engine such as www. googleapi. com //30-06-2012.
Index Terms

Computer Science
Information Sciences

Keywords

Graph Mining Text Mining Web Mining Web Text Analysis