Skip to content

hunspell/mythes

Repository files navigation

MyThes is a simple thesaurus that uses a structured
text data file and an index file with binary search
to lookup words and phrases and return information
on part of speech, meanings, and synonyms

MyThes was originall written to provide a thesaurus
for the OpenOffice.org project 

The Main features of MyThes are:

1. written in C++ to make it easier to interface with 
   LibreOffice, OpenOffice, AbiWord, Pspell, etc

2. it is stateless, uses no static variables and
   should be completely reentrant with no ifdefs  

3. it compiles with -ansi and -pedantic and -Wall
   with no warnigns so it shouldbe quite portable

4. it uses a simple perl program to read the structured
   text file and create the index needed for binary
   searching

5. it is very simple with *lots* of comments.
   The main "smarts" are in the structure of the
   text file that makes up the thesaurus data

6. It comes with a ready-to-go structured thesaurus
   data file for en_US extracted from the WordNet-2.0 data.

   Please see WordNet_license.txt and WordNet_readme.txt
   for more information on the very useful project!

   See http://www.danielnaber.de/wn2ooo/ for utilities to
   regenerate an up to date English thesaurus from the most
   recent WordNet data.
   
7. The source code has a BSD license (and no advertising clause)


MyThes comes with a simple example program that looks up some words and returns
meanings and synonyms.

To build it simply do the following:

unzip mythes.zip
cd mythes
./configure
make

To run the example program:
./example th_en_US_new.idx th_en_US_new.dat checkme.lst

To run the example program with stemming and morphological generation:
e.g. to check mouse, mice, rodents, eats, eaten, ate, eating etc. words
./example morph.idx morph.dat morph.lst morph.aff morph.dic

NOTE: this is only an example and test environment for dictionary developers,
full English stemming and morphological generation needs an improved
English Hunspell dictionary.

László Németh <nemeth at OO.o>
Kevin Hendricks <[email protected]>