Archived

This content is available here strictly for research, reference, and/or recordkeeping and as such it may not be fully accessible. If you work or study at University of Kentucky and would like to request an accessible version, please use the SensusAccess Document Converter.

A Method for Automating the Extraction of Specialized Information from the Web

Abstract

The World Wide Web can be viewed as a gigantic distributed database including millions of interconnected hosts some of which publish information via web servers or peer-to-peer systems. We present here a novel method for the extraction of semantically rich information from the web in a fully automated fashion. We illustrate our approach via a proof-of-concept application which scrutinizes millions of web pages looking for clues as to the trend of the Chinese stock market. We present the outcomes of a 210-day long study which indicates a strong correlation between the information retrieved by our prototype and the actual market behavior

Document Type

Book Chapter

Publication Date

2005

Language

English

Notes/Citation Information

This chapter and book are part of Lecture Notes in Computer Science Series, Volume 3801/2005.

Digital Object Identifier (DOI)

http://dx.doi.org/10.1007/11596448_72

Share

COinS