Published October 5, 2020 | Version v1
Dataset Open

NwQM: A neural quality assessment framework for Wikipedia

Description

This record contains the datasets used to implement "NwQM: A neural quality assessment framework for Wikipedia".

"wikipages.csv" contains the text page content, talk page content, and the quality for each page.
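
Because the column names of "wikipages.csv" are not listed here, a first step is to inspect the header before loading the whole file. The following is a minimal sketch using only the Python standard library; the function name `peek_csv` is hypothetical, and the UTF-8 encoding and comma delimiter are assumptions rather than documented properties of the file.

```python
import csv
from itertools import islice


def peek_csv(path, n_rows=3):
    """Return (header, first n_rows data rows) without reading the whole file."""
    # Page and talk-page text can be very long, so raise the per-field limit
    # above the csv module's default of 131072 characters.
    csv.field_size_limit(2**31 - 1)
    # Assumption: the file is UTF-8 encoded and comma-delimited.
    with open(path, newline="", encoding="utf-8") as f:
        reader = csv.reader(f)
        header = next(reader, None)  # None if the file is empty
        rows = list(islice(reader, n_rows))
    return header, rows


# Example call (assumes the file has been downloaded to the working directory):
# header, rows = peek_csv("wikipages.csv")
# print(header)
```

Reading only a few rows keeps memory use small, which matters because the files in this record range from hundreds of megabytes to more than a gigabyte.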

"sample_wiki_images.zip" contains the screenshots for a sample of pages.

"finetuned_inceptionv3_model.h5" is the InceptionV3 model finetuned on Wikipedia.

"finetuned_inceptionv3_embeddings.json" contains the InceptionV3 embedding of each image, generated with finetuned_inceptionv3_model.h5.
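
The internal layout of the embeddings file is not described in this record, so it may help to check its top-level shape before relying on it. The sketch below is one way to do that with the standard library; `summarize` and `summarize_file` are hypothetical helper names, and nothing here assumes a particular key or vector format. Note that `json.load` reads the entire file into memory, which may require several gigabytes for a file of this size.

```python
import json


def summarize(obj, max_items=3):
    """Describe the top-level shape of a parsed JSON value."""
    if isinstance(obj, dict):
        return {
            "type": "dict",
            "size": len(obj),
            "sample_keys": list(obj)[:max_items],
        }
    if isinstance(obj, list):
        return {
            "type": "list",
            "size": len(obj),
            "first_item_type": type(obj[0]).__name__ if obj else None,
        }
    # Scalars (str, int, float, bool, None) have no inner structure to report.
    return {"type": type(obj).__name__}


def summarize_file(path):
    """Parse a JSON file fully and summarize its top-level value."""
    with open(path, "r", encoding="utf-8") as f:
        return summarize(json.load(f))


# Example call (assumes the file has been downloaded locally):
# print(summarize_file("finetuned_inceptionv3_embeddings.json"))
```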

Files

Files (4.1 GB)

Checksum                               Size
md5:2023e9be60d2a3dfec93801ab3e0aa27   1.3 GB
md5:c7ea4b7c385f5e3212c307c0ac5d863b   262.6 MB
md5:c737a8fac43ce000a17a44c827388b61   791.1 MB
md5:161d531be8e5dd219faf7a672799cff4   1.8 GB
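
The md5 checksums listed above can be used to confirm that a download completed without corruption. The following is a minimal sketch using Python's `hashlib`; the function names `md5_of_file` and `matches_listing` are hypothetical, and the listing does not say which checksum belongs to which file, so the pairing in the example call is illustrative only.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        # iter() with a sentinel stops when read() returns empty bytes (EOF).
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, listed):
    """Compare a file against a listed checksum such as 'md5:2023e9be...'."""
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected


# Example call (the file/checksum pairing here is an assumption):
# matches_listing("wikipages.csv", "md5:2023e9be60d2a3dfec93801ab3e0aa27")
```

Reading in chunks keeps memory use constant regardless of file size, which suits the multi-gigabyte files in this record.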