A Method for Automating the Extraction of Specialized Information from the Web
The World Wide Web can be viewed as a gigantic distributed database including millions of interconnected hosts some of which publish information via web servers or peer-to-peer systems. We present here a novel method for the extraction of semantically rich information from the web in a fully automated fashion. We illustrate our approach via a proof-of-concept application which scrutinizes millions of web pages looking for clues as to the trend of the Chinese stock market. We present the outcomes of a 210-day long study which indicates a strong correlation between the information retrieved by our prototype and the actual market behavior
Digital Object Identifier (DOI)
Citation / Publisher Attribution
Lin, Ling; Liotta, Antonio; and Hippisley, Andrew. 2005. "A Method for Automating the Extraction of Specialized Information from the Web." Lecture Notes in Computer Science, V. 3801, p 489-494. Computational Intelligence and Security. Yue Hao, Jiming Liu, Yuping Wang, Yiu-ming Cheung, Hujun Yin, Licheng Jiao, Jianfeng Ma and Yong-Chang Jiao (eds.) The original publication is available at www.springerlink.com.
Lin, Ling; Liotta, Antonio; and Hippisley, Andrew R., "A Method for Automating the Extraction of Specialized Information from the Web" (2005). Linguistics Faculty Publications. Paper 6.