Day 2

Archived

This content is available here for research, reference, and/or recordkeeping.

Harvesting and Parsing an HTML-based Newspaper

Eric Weig, University of KentuckyFollow

Start Date

8-11-2016 2:10 PM

Description

This article outlines one in-house model for archiving and providing access to HTML-based news in the Kentucky Digital Newspaper Program (KDNP) at the University of Kentucky (UK). To allow for search and retrieval of HTML-based news in the KDNP which already contains news content digitized from analog sources, the encapsulation of HTML content using XML encoded CDATA strings read by a prototype open-source PHP viewer is described.

Notes

The downloadable item is a presentation-based article published in the conference proceedings. It has a different title (Archiving and Accessing HTML-Based Newspapers Using XML and CDATA Strings) and its copyright information is as follows:

Download

Included in

Archival Science Commons, Journalism Studies Commons, Mass Communication Commons

COinS

Aug 11th, 2:10 PM

Harvesting and Parsing an HTML-based Newspaper

Day 2

Archived

Harvesting and Parsing an HTML-based Newspaper

Start Date

Description

Notes

Included in

Search

Browse by Author

Author Corner

Connect

Day 2

Archived

Harvesting and Parsing an HTML-­based Newspaper

Presenter Information

Start Date

Description

Notes

Included in

Share

Search

Browse by Author

Author Corner

Connect

Harvesting and Parsing an HTML-based Newspaper