User:Vuara/Categorization dictionary

From Wikibooks, open books for an open world

http://www.simstat.com/WordStat/WordNet.htm

WordNet based Categorization dictionary

Description: This categorization dictionary is derived from the WordNet® database. It provides a basic categorization of the nouns, verbs, adjectives and adverbs currently found in the WordNet 2.0 database into 44 syntactic categories and logical groupings. Four versions of this categorization dictionary are currently available:

Words only

The full version categorizes 109231 words into 44 WordNet lexical categories. More than 43,000 of those entries are assigned to more than one category.

The limited version consists of 65425 unambiguous words categorized into those same categories. Unambiguous words are defined as words that are assigned to only one WordNet lexical category.
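The difference between the full and the limited version can be illustrated with a minimal sketch. The mapping below is invented toy data, not an excerpt from the dictionary: the file format of the dictionary is not described here, and both the category labels (written in the style of WordNet lexicographer file names) and the category assignments are assumptions made for illustration only.

```python
# Hypothetical toy data: word -> set of lexical categories.
# Labels and assignments are assumptions, not taken from the real dictionary.
entries = {
    "abalone": {"noun.animal"},
    "diagnosis": {"noun.act"},
    "freeze": {"verb.weather", "verb.change"},
    "pour": {"verb.weather", "verb.motion", "verb.contact"},
}


def split_by_ambiguity(entries):
    """Split entries into (unambiguous, ambiguous) dictionaries.

    An entry is unambiguous when it belongs to exactly one category.
    """
    unambiguous = {w: c for w, c in entries.items() if len(c) == 1}
    ambiguous = {w: c for w, c in entries.items() if len(c) > 1}
    return unambiguous, ambiguous


unambiguous, ambiguous = split_by_ambiguity(entries)
print("limited version would keep:", sorted(unambiguous))
print("only in the full version:  ", sorted(ambiguous))
```

In this sketch the "limited" dictionary is simply the subset of the "full" one whose entries map to a single category.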

Words & Phrases

The full version categorizes 174268 words and phrases into 44 WordNet lexical categories. More than 47,000 of those entries are assigned to more than one category.

The limited version consists of 126869 unambiguous words and phrases categorized into those same categories. Unambiguous words and phrases are defined as entries that are assigned to only one WordNet lexical category.

Requirement: WordStat 4.0 (with the English lemmatization option enabled), or WordStat 3.1 if the text corpus to be analyzed has already been lemmatized.
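The lemmatization requirement matters because dictionary entries are matched against base forms of words. The following is a minimal sketch of dictionary-based category counting on text that has already been lemmatized. It is not WordStat's implementation; the function name `count_categories`, the toy dictionary, and the category labels are all hypothetical.

```python
from collections import Counter

# Hypothetical toy dictionary: lemma -> list of lexical categories.
# Entries and labels are assumptions for illustration only.
category_of = {
    "abalone": ["noun.animal"],
    "beer": ["noun.food"],
    "donate": ["verb.possession"],
    "freeze": ["verb.weather", "verb.change"],
}


def count_categories(lemmas, category_of):
    """Count how often each category is hit by a list of lemmas.

    A lemma listed under several categories contributes to each of them;
    lemmas missing from the dictionary are ignored.
    """
    counts = Counter()
    for lemma in lemmas:
        for category in category_of.get(lemma.lower(), []):
            counts[category] += 1
    return counts


# Already-lemmatized input, e.g. "donated" -> "donate", "froze" -> "freeze".
lemmas = ["they", "donate", "beer", "and", "freeze", "abalone"]
counts = count_categories(lemmas, category_of)
for category, n in sorted(counts.items()):
    print(category, n)
```

An inflected form such as "donated" would not match the entry "donate" in this sketch, which is why the input must be reduced to lemmas first.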

Note: WordNet® is an online lexical reference system developed by the Cognitive Science Laboratory at Princeton University under the direction of Professor George A. Miller (Principal Investigator). WordNet 1.7.1 Copyright © 2001 by Princeton University. All rights reserved.

Download categorization dictionary

The four dictionaries can be downloaded together (about 2 MB).

WordNet lexical categories and sample words

DIMENSION         EXAMPLES                                UNAMBIGUOUS  TOTAL   UNAMBIGUOUS      TOTAL
                                                          WORDS        WORDS   WORDS & PHRASES  WORDS & PHRASES

I. ADVERBS
  All             actually, deeply, fully                 3233         3782    4051             4667

II. ADJECTIVES
  All             (none)                                  13716        16917   14273            17553
  Pertainyms      abyssal, genetic, intimal               2817         4209    2837             4247
  Participial     kidnapped, pulled, sublimed             85           125     87               127

III. NOUNS
  Act             abolition, badminton, diagnosis         2934         6388    5527             9158
  Animal          abalone, bacteria, coyote               5037         5975    13127            14181
  Artifact        accelerator, aquarium, candlestick      5371         8650    12374            15864
  Attribute       adequacy, assertiveness, cadence        2258         3969    2804             4581
  Body            abdomen, bronchus, collagen             837          1371    2892             3447
  Cognition       activism, amnesia, covariance           1259         2787    2640             4225
  Communication   alexandrine, allusion, cantata          2070         4904    4877             7847
  Event           bonfire, conjuncture, diving            278          1239    620              1613
  Feeling         ambivalence, appetite, cynicism         259          690     309              744
  Food            appetizer, beer, borsch                 809          1708    2344             3504
  Group           army, bolshevism, Benelux               757          1653    2771             3757
  Location        Afghanistan, Alsace, Babylonia          1967         2746    3732             4588
  Motive          agromania, egomania, incentive          25           55      47               77
  Object          abyss, electron, granule                607          1203    1528             2156
  Person          adjuster, correspondent, creator        7748         10763   14836            17987
  Phenomenon      aftermath, blizzard, depolarization     187          483     646              960
  Plant           achillea, buxus, cultivar               4282         4984    16781            17701
  Possession      benefaction, coinage, fellowship        205          643     1015             1483
  Process         absorption, autoregulation, catalysis   435          831     703              1106
  Quantity        ampere, baud, carat                     615          1193    1261             1860
  Relation        causality, fatherhood, relevance        127          377     376              635
  Shape           azimuth, cuboid, parabola               93           381     229              524
  State           aberrance, affluence, homelessness      2031         3474    3957             5469
  Substance       Aldol, alkyd, cellulose                 1861         2583    3703             4469
  Time            bedtime, days, December                 423          767     1221             1592

IV. VERBS
  Body            bungle, disinfect, hibernate            154          759     282              960
  Change          amplify, blossom, crystallize           796          2233    1049             2702
  Cognition       amaze, brainstorm, diagnose             196          849     317              1053
  Communication   applaud, argue, circumstantiate         465          1816    711              2214
  Competition     capitulate, equalize, outrival          77           489     142              609
  Consumption     cater, gratify, quench                  56           305     111              398
  Contact         attach, bulldoze, collide               367          1993    650              2479
  Creation        annotate, confect, enact                128          732     196              873
  Emotion         agonize, condole, exacerbate            102          505     191              639
  Motion          clump, deflect, eject                   159          1177    441              1657
  Perception      cense, conceal, hallucinate             87           543     152              647
  Possession      allocate, donate, hospitalize           155          806     269              1024
  Social          abolish, celebrate, demote              248          1271    492              1669
  Stative         coexist, embody, incriminate            95           746     249              1014
  Weather         freeze, ignite, pour                    3            94      23               130
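
The counts in the table show how much of each category survives in the limited (unambiguous) versions. As a rough sketch, the share of unambiguous words per category can be computed from the "unambiguous words" and "total words" columns. Only a handful of rows are copied here, and the row labels and the function name `unambiguous_share` are chosen for this example.

```python
# (unambiguous words, total words) for a few rows of the table.
rows = {
    "Adverbs: All": (3233, 3782),
    "Nouns: Animal": (5037, 5975),
    "Nouns: Event": (278, 1239),
    "Verbs: Motion": (159, 1177),
    "Verbs: Weather": (3, 94),
}


def unambiguous_share(unambiguous, total):
    """Fraction of a category's words that belong to that category only."""
    return unambiguous / total


shares = {name: unambiguous_share(u, t) for name, (u, t) in rows.items()}

# List the categories from least to most unambiguous.
for name, share in sorted(shares.items(), key=lambda item: item[1]):
    print(f"{name:<16} {share:6.1%}")
```

Among these rows, the verb categories keep far fewer of their words in the limited version than adverbs or animal nouns do, so an analysis based on the limited dictionaries will cover verbs comparatively thinly.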