Linguistics Faculty Publications

A Method for Automating the Extraction of Specialized Information from the Web

Ling Lin
Antonio Liotta
Andrew R. Hippisley, University of KentuckyFollow

Abstract

The World Wide Web can be viewed as a gigantic distributed database including millions of interconnected hosts some of which publish information via web servers or peer-to-peer systems. We present here a novel method for the extraction of semantically rich information from the web in a fully automated fashion. We illustrate our approach via a proof-of-concept application which scrutinizes millions of web pages looking for clues as to the trend of the Chinese stock market. We present the outcomes of a 210-day long study which indicates a strong correlation between the information retrieved by our prototype and the actual market behavior

Document Type

Book Chapter

Publication Date

2005

Language

English

Notes/Citation Information

This chapter and book are part of Lecture Notes in Computer Science Series, Volume 3801/2005.

Digital Object Identifier (DOI)

http://dx.doi.org/10.1007/11596448_72

Repository Citation

Lin, Ling; Liotta, Antonio; and Hippisley, Andrew R., "A Method for Automating the Extraction of Specialized Information from the Web" (2005). Linguistics Faculty Publications. 6.
https://uknowledge.uky.edu/lin_facpub/6

Link to Full Text

Contact Author

COinS

Linguistics Faculty Publications

A Method for Automating the Extraction of Specialized Information from the Web

Abstract

Document Type

Publication Date

Language

Notes/Citation Information

Digital Object Identifier (DOI)

Repository Citation

Search

Browse by Author

Author Corner

Connect

Linguistics Faculty Publications

A Method for Automating the Extraction of Specialized Information from the Web

Authors

Abstract

Document Type

Publication Date

Language

Notes/Citation Information

Digital Object Identifier (DOI)

Repository Citation

Share

Search

Browse by Author

Author Corner

Connect