Published October 5, 2020
| Version v1
Dataset
Open
NwQM: A neural quality assessment framework for Wikipedia
Description
This contains the datasets we used for implementing "NwQM: A neural quality assessment framework for Wikipedia".
"wikipages.csv" contains the text page content, talk page content, and the quality for each page.
"sample_wiki_images.zip" contains the screenshots for a sample of pages.
"finetuned_inceptionv3_model.h5" is the InceptionV3 model finetuned on Wikipedia.
"finetuned_inceptionv3_embeddings.json" contains the finetuned InceptionV3 embeddings of each image generated from finetuned_inceptionv3_model.h5.