Thesis

Reducing human effort in web data extraction

Abstract:

Human effort in large-scale web data extraction significantly affects both the flexibility of the extraction and its economic cost. Our work aims to reduce the human effort required by web data extraction tasks in three specific scenarios.

(I) The data demand is unclear, and the user has to guide wrapper induction through annotations. To minimize human effort in the annotation process, wrappers should be robust, i.e., immune to changes in the webpage, to avoid the wrapper r...

Authors


Division:
MPLS
Department:
Computer Science
Role:
Author

Contributors

Role:
Supervisor
Funder
Name:
Department of Computer Science, University of Oxford
Type of award:
DPhil
Level of award:
Doctoral
Awarding institution:
University of Oxford
Keywords:
Subjects:
UUID:
uuid:04bd39dd-bfec-4c07-91db-980fcbc745ba
Deposit date:
2018-09-24
