French Word Sense Disambiguation with Princeton WordNet Identifiers

Loïc Vial

doi:10.5281/zenodo.3549806

Published November 21, 2019 | Version 1.0

Dataset Open

French Word Sense Disambiguation with Princeton WordNet Identifiers

Loïc Vial¹

1. Univ. Grenoble Alpes

This is a dataset for the Word Sense Disambiguation of French using Princeton WordNet identifiers. It contains two training corpora : the SemCor and the WordNet Gloss Corpus, both automatically translated from their original English version, and with sense tags automatically aligned. It contains also a test corpus : the task 12 of SemEval 2013, originally sense annotated with BabelNet identifiers, converted into Princeton WordNet 3.0.

Files

semcor.fr.xml

Files (271.6 MB)

Name	Size	Download all
semcor.fr.xml md5:87f0a390ccfda959de51063aad08082d	86.0 MB	Preview Download
semeval2013task12.fr.xml md5:17530c037c658dd56eccb00bd8e66b7b	834.6 kB	Preview Download
wngt.fr.xml md5:8a8d772b920667ab375c2460c519c9f0	184.7 MB	Preview Download

927

Views

374

Downloads

Show more details

	All versions	This version
Views	927	924
Downloads	374	372
Data volume	58.1 GB	58.1 GB

More info on how stats are collected....

DOI

Resource type

Dataset

Publisher

Zenodo

Languages

French

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: November 21, 2019
Modified: January 24, 2020

French Word Sense Disambiguation with Princeton WordNet Identifiers

Authors/Creators

Description

Files

semcor.fr.xml

Files (271.6 MB)