Dataset Restricted Access

Kong15 Originality: Text Alignment

Kong, Leilei; Lu, Zhimao; Han, Yong; Qi, Haoliang; Han, Zhongyuan; Wang, Qibo; Hao, Zhenyuan; Zhang, Jing

Dublin Core Export

<?xml version='1.0' encoding='utf-8'?>
<oai_dc:dc xmlns:dc="" xmlns:oai_dc="" xmlns:xsi="" xsi:schemaLocation="">
  <dc:creator>Kong, Leilei</dc:creator>
  <dc:creator>Lu, Zhimao</dc:creator>
  <dc:creator>Han, Yong</dc:creator>
  <dc:creator>Qi, Haoliang</dc:creator>
  <dc:creator>Han, Zhongyuan</dc:creator>
  <dc:creator>Wang, Qibo</dc:creator>
  <dc:creator>Hao, Zhenyuan</dc:creator>
  <dc:creator>Zhang, Jing</dc:creator>
  <dc:description>We provide you with a training corpus that consists of pairs of documents, one of which may contain passages of text reused from the other. The reused text is subject to various kinds of (automatic) obfuscation to hide the fact it has been reused. Enclosed in the evaluation corpora, a file named pairs is found, which lists all pairs of suspicious documents and source documents.</dc:description>
  <dc:subject>reused text</dc:subject>
  <dc:subject>text alignment</dc:subject>
  <dc:title>Kong15 Originality: Text Alignment</dc:title>
All versions This version
Views 7272
Downloads 11
Data volume 787.1 kB787.1 kB
Unique views 6161
Unique downloads 11


Cite as