Published April 25, 2017 | Version v1
Journal article Open

MULTILEVEL DATA VERIFICATION ALERT SYSTEM FOR WRAPPER MAINTENANCE

Description

Unfortunately, websites are continuously evolving and structural changes happen with no forewarning, which usually results in wrappers working incorrectly. Thus, wrappers maintenance is necessary for detecting whether wrapper is extracting erroneous data. Wrappers are pieces of software used to extract data from websites and structure them for further application processing. The solution consists of using verification models to detect whether wrapper output is statistically similar to the output produced by the wrapper itself when it was successfully invoked in the past. Current proposals present some weaknesses, as the data used to build these models are supposed to be homogeneous, independent or representative enough, or following a single predefined mathematical model.         

Files

Files (695.9 kB)

Name Size Download all
md5:c6d67a3b9d7c6264263ddbe3b223431b
695.9 kB Download