Published September 12, 2006 | Version v1
Dataset Open

Houvardas06 Author Identification: C50-Attribution

Description

This dataset contains 2500 texts from 50 different authors/candidates (C50). The ground truth can be found inside a json-file.

Files

houvardas06-authorship-attribution-dataset-c50-2015-10-20.zip

Files (8.6 MB)

Additional details

References

  • Houvardas, J. and E. Stamatatos, N-gram Feature Selection for Authorship Identification. In J. Euzenat, and J. Domingue (Eds.) Proc. of the 12th Int. Conf. on Artificial Intelligence: Methodology, Systems, Applications (AIMSA'06), LNCS 4183, pp. 77-86, 2006