Published March 8, 2024 | Version 2.2
Dataset Open

Thai NER 2.2

  • 1. @PyThaiNLP

Description

This version is fixed wrong tag (DATA -> DATE) from Thai NER 2.1.

Dataset

Size

  • Train: 3,938 docs
  • Validation: 1,313 docs
  • Test: 1,313 Docs

Some data come from crowdsourcing between Dec 2018 - Nov 2019. https://github.com/wannaphong/thai-ner

Domain

  • News (It, politics, economy, social)
  • PR (KKU news)
  • general

Source

And more (the lists are lost.)

Tag

  • DATE - date
  • TIME - time
  • EMAIL - email
  • LEN - length
  • LOCATION - Location
  • ORGANIZATION - Company / Organization
  • PERSON - Person name
  • PHONE - phone number
  • TEMPERATURE - temperature
  • URL - URL
  • ZIP - Zip code
  • MONEY - the amount
  • LAW - legislation
  • PERCENT - PERCENT

Files

up2hub.ipynb

Files (4.3 MB)

Name Size Download all
md5:63bd0c8fbc1ee263ac239ba536a32102
859.3 kB Download
md5:e1e91a24f5f9f2e57352fdf7c29f9c66
2.6 MB Download
md5:c8d1c98bdca96be254b19c1089ea9075
8.4 kB Preview Download
md5:3e3a6c7794d30ca6fec025be67697e22
838.0 kB Download

Additional details

Related works