Published October 29, 2022 | Version v1.1.0.0
Dataset Open

chen-echo/FGNER-corpus: Geological fine-grained corpus

Authors/Creators

Description

  • A corpus for the identification of CHINESE geologically named entities based on three-part geological reports.
  • Contains twenty-one labels, the corresponding entities for the labels are listed in the below.
  • ROC.SEDI 沉积岩 ROC.META 变质岩 ROC.IG 岩浆岩
  • SMG.ROC 岩性地层 SMG.chrono 年代地层
  • MIN.native 自然元素矿物 MIN.sulASIM 硫化物及其类似化合物 MIN.halide 卤化物矿物 MIN.oxihydro 氧化物及其氢氧化物矿物 MIN.oxis 含氧盐矿物 MIN.ROC 岩石类非金属矿物
  • GCH.AR 太古宙 GCH.PT 元古宙 CGH.PH 显生宙
  • GST.fold 褶皱 GST.fault 断裂 GST.joint 节理 GST.contact 接触关系
  • GAC.ENDO 内力地质作用 GAC.EXO 外力地质作用
  • OT 其他

Files

FGNER_dev.txt

Files (1.9 MB)

Name Size Download all
md5:e52e5011d2d5c24a00ec61a69174e690
281.9 kB Preview Download
md5:5bcbf3f403dbe9973e41d9873405d74e
275.3 kB Preview Download
md5:38d6e2cf84cc78e9b26c22d4cacc8a5d
880.1 kB Preview Download
md5:2899a13c7c94b4d2330894572c7a1dc1
86.6 kB Preview Download
md5:59613f6d21ed3c2c3db681257e948367
81.1 kB Preview Download
md5:5176ff58165dec9a0b03a04d52c5d0fc
156.9 kB Preview Download
md5:443da3064838e6db2915fa3494a56620
159.8 kB Preview Download

Additional details