WOLF (Wordnet Libre du Français, free French Wordnet)

I am trying to create an interface for WOLF (Wordnet Libre du Français, Free French Wordnet). The goal is to reproduce what AWNDatabaseManger does for Arabic Wordnet ( http://www.talp.upc.edu/index.php/technology/resources/multilingual-lexicons-and-machine-translation-resources/multilingual-lexicons/72-awn ), but for WOLF.

The problem I am facing is that I cannot find a data specification for WOLF ( http://alpage.inria.fr/~sagot/wolf-en.html ) or for WoNeF, another French translation of WordNet ( http://wonef.fr/ ).

For Arabic Wordnet, a detailed data specification is available at http://globalwordnet.org/arabic-wordnet/awn-data-spec/ . I am looking for the equivalent for WOLF or WoNeF.

Failing that, how can I map the fields of one file onto the other?

For example, a word and its relations in AWN look like this:

<item itemid="$ajarap_AlS~amog_n1AR" offset="111586059" lexfile="" name="شَجَرَة الصَّمْغ " type="synset" headword="" POS="n" source="" gloss="" authorshipid="80" />

<word wordid="$ajarap__1" value="شَجَرَة الصَّمْغ " synsetid="$ajarap_AlS~amog_n1AR" frequency="" corpus="" authorshipid="11461" />

<link type="has_hyponym" link1="$ajarap_AlS~amog_n1AR" link2="&gt;ukAlibotws_n1AR" authorshipid="35038" />

 <link type="has_hyponym" link1="$ajarap_n1AR" link2="$ajarap_AlS~amog_n1AR" authorshipid="35041" />

The word definition (item) and its relations (link) are separate elements, each with its own attributes.

In WOLF, by contrast, a word and its relations look like this:

<SYNSET>
    <ILR type="near_antonym">eng-30-00002098-a</ILR>
    <ILR type="be_in_state">eng-30-05200169-n</ILR>
    <ILR type="be_in_state">eng-30-05616246-n</ILR>
    <ILR type="eng_derivative">eng-30-05200169-n</ILR>
    <ILR type="eng_derivative">eng-30-05616246-n</ILR>
    <ID>eng-30-00001740-a</ID>
    <SYNONYM>
        <LITERAL lnote="2/2:fr.csbgen,fr.csen">comptable</LITERAL>
    </SYNONYM>
    <DEF>(usually followed by `to') having the necessary means or skill or know-how or authority to do something
    </DEF>
    <USAGE>able to swim</USAGE>
    <USAGE>she was able to program her computer</USAGE>
    <USAGE>we were at last able to buy a car</USAGE>
    <USAGE>able to get a grant for the project</USAGE>
    <BCS>3</BCS>
    <POS>a</POS>
</SYNSET>

I could assume that the AWN gloss attribute corresponds to the WOLF USAGE tag, and that the AWN POS attribute corresponds to the WOLF POS tag.

However, I do not want to rely on assumptions. I am looking for proper documentation that lets me map one format onto the other with certainty.
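To make the comparison concrete, here is a minimal sketch that reads trimmed copies of the two snippets above with Python's standard xml.etree.ElementTree and puts the fields side by side. The wrapping awn root element, the function names read_awn and read_wolf, and the dictionary keys are all hypothetical, chosen only for illustration. The sketch deliberately keeps the WOLF DEF and USAGE tags under their own names, since which of them (if either) matches the AWN gloss attribute is exactly what a specification would have to confirm.

```python
import xml.etree.ElementTree as ET

# Trimmed copies of the snippets quoted above. The <awn> root element is an
# assumption added here so that the fragment parses as one document.
AWN_XML = """<awn>
<item itemid="$ajarap_AlS~amog_n1AR" offset="111586059" name="شَجَرَة الصَّمْغ " type="synset" POS="n" gloss="" />
<word wordid="$ajarap__1" value="شَجَرَة الصَّمْغ " synsetid="$ajarap_AlS~amog_n1AR" />
<link type="has_hyponym" link1="$ajarap_AlS~amog_n1AR" link2="&gt;ukAlibotws_n1AR" />
</awn>"""

WOLF_XML = """<SYNSET>
    <ILR type="near_antonym">eng-30-00002098-a</ILR>
    <ILR type="be_in_state">eng-30-05200169-n</ILR>
    <ID>eng-30-00001740-a</ID>
    <SYNONYM><LITERAL lnote="2/2:fr.csbgen,fr.csen">comptable</LITERAL></SYNONYM>
    <DEF>having the necessary means or skill or know-how or authority to do something</DEF>
    <USAGE>able to swim</USAGE>
    <POS>a</POS>
</SYNSET>"""


def read_awn(xml_text):
    """Group AWN <item>, <word> and <link> elements by synset id."""
    root = ET.fromstring(xml_text)
    synsets = {}
    for item in root.iter("item"):
        synsets[item.get("itemid")] = {
            "pos": item.get("POS"),
            "gloss": item.get("gloss"),
            "words": [],
            "relations": [],
        }
    for word in root.iter("word"):
        entry = synsets.get(word.get("synsetid"))
        if entry is not None:
            entry["words"].append(word.get("value").strip())
    for link in root.iter("link"):
        # Relations are attached to the synset named in link1.
        entry = synsets.get(link.get("link1"))
        if entry is not None:
            entry["relations"].append((link.get("type"), link.get("link2")))
    return synsets


def read_wolf(xml_text):
    """Collect the children of each WOLF <SYNSET> element, keyed by <ID>."""
    root = ET.fromstring(xml_text)
    synsets = {}
    for node in root.iter("SYNSET"):  # iter() also yields the root itself
        synsets[node.findtext("ID")] = {
            "pos": node.findtext("POS"),
            "def": (node.findtext("DEF") or "").strip(),
            "usage": [u.text for u in node.iter("USAGE")],
            "words": [lit.text for lit in node.iter("LITERAL")],
            "relations": [(ilr.get("type"), ilr.text) for ilr in node.iter("ILR")],
        }
    return synsets


if __name__ == "__main__":
    print(read_awn(AWN_XML))
    print(read_wolf(WOLF_XML))
```

The structural difference shows up directly: AWN spreads one synset over three flat element types joined by id, while WOLF nests everything inside a single SYNSET element.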

Can someone point me to the required documents?


With NLTK in Python, you can look up French synsets, which draw on WOLF:

>>> from nltk.corpus import wordnet as wn
>>> [synset.lemma_names('fra') for synset in wn.synsets(u'chien', lang='fra')]
[[u'canis_familiaris', u'chien'], [u'aboyeur', u'chien', u'chienchien', u'clébard', u'toutou'], [u'chien', u'chien_de_chasse'], [u'chien'], [u'chien', u'clic', u'cliquer', u'cliquet'], [u'chien', u'franc', u'hot-dog'], [u'achille', u'chien', u'quignon', u'talon'], [u'chien'], [u'chien']]
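To tie an NLTK synset back to an entry in the WOLF file, the ID strings in the question (for example eng-30-00001740-a) can be read as a prefix, an eight-digit offset, and a part-of-speech letter. That reading is an assumption based only on the sample shown above, not on any WOLF specification, and the helper names make_wolf_id and parse_wolf_id are hypothetical. A minimal sketch:

```python
import re

# Assumed ID layout, inferred from the sample IDs in the question:
# "eng-30-" + 8-digit zero-padded offset + "-" + one part-of-speech letter.
WOLF_ID = re.compile(r"^eng-30-(\d{8})-([a-z])$")


def make_wolf_id(offset, pos):
    """Build a WOLF-style ID string from a numeric offset and a POS letter."""
    return "eng-30-{:08d}-{}".format(offset, pos)


def parse_wolf_id(wolf_id):
    """Split a WOLF-style ID string into (offset, pos)."""
    match = WOLF_ID.match(wolf_id)
    if match is None:
        raise ValueError("unexpected ID format: {!r}".format(wolf_id))
    return int(match.group(1)), match.group(2)


if __name__ == "__main__":
    print(make_wolf_id(1740, "a"))
    print(parse_wolf_id("eng-30-05200169-n"))
```

In NLTK, synset.offset() and synset.pos() supply the two arguments, and wn.synset_from_pos_and_offset(pos, offset) goes the other way. The part-of-speech letters may not line up exactly between the two resources (NLTK reports satellite adjectives as s and adverbs as r), so check them against the actual WOLF file.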

Source: https://habr.com/ru/post/1616856/

