Conference

Joint repairs for web wrappers

Abstract:

Automated web scraping is a popular means of acquiring data from the web. Scrapers (or wrappers) are derived from either manually or automatically annotated examples, often resulting in under- or over-segmented data, together with missing or spurious content. Automatic repair and maintenance of the extracted data is thus a necessary complement to automatic wrapper generation. Moreover, the extracted data is often the result of a long-term data acquisition effort and thus jointly repairing wrappe...

Publication status:
Published
Peer review status:
Peer reviewed
Version:
Accepted manuscript

Files:
Publisher copy:
10.1109/ICDE.2016.7498320

Authors


Department:
Oxford, MPLS, Computer Science
Role:
Author
Institution:
University of Oxford
Department:
Oxford, MPLS, Computer Science
Role:
Author
Publisher:
Institute of Electrical and Electronics Engineers
Publication date:
2016-06-05
Acceptance date:
2015-12-20
DOI:
URN:
uuid:84ccdc92-cf32-4ab4-a052-1d33ada23dbd
Source identifiers:
614894
Local pid:
pubs:614894
