The Design of Software (CLOSED)

A public forum for discussing the design of software, from the user interface to the code architecture. Now closed.

The "Design of Software" discussion group has been merged with the main Joel on Software discussion group.

The archives will remain online indefinitely.

Where to download english dictionary in plain text?

I am looking for a plain text file containing plural - singular noun pairs.

Anyone know where to download?
Phil Gates Send private email
Tuesday, July 04, 2006
 
 
Have you tried the gutenburg project?

If there is a dictionary that's out of copyright, they may have it.
Mr. Analogy {Shrinkwrap µISV since 1995} Send private email
Tuesday, July 04, 2006
 
 
Ispell generates plurals algorithmically; the version I have installed for American English uses these rules for forming plurals:

flag *S:
    [^AEIOU]Y  >      -Y,IES          # As in imply > implies
    [AEIOU]Y    >      S              # As in convey > conveys
    [SXZH]      >      ES              # As in fix > fixes
    [^SXZHY]    >      S              # As in bat > bats

Special cases are in the main word list, but these cover the vast majority of words.

(The point being that for many applications, you may just need a list of singular nouns. For other applications, this suggestion will be worthless.)
Mark Jeffcoat Send private email
Tuesday, July 04, 2006
 
 
If you're simply looking for a word list, there are several included with almost every distribution of Linux.  Look in /usr/share/dict.
Steve Moyer Send private email
Wednesday, July 05, 2006
 
 
You might look up WordNet (a Princeton research project, IIRC). *Lots* of data about English-language words.

The Perl module Lingua::EN::Inflect will, given a list of singular English nouns, generate gramatically correct plurals (including classical vs. modern variants). Depending on what you're trying to accomplish, just examining its rule sets and exception cases may be helpful.
John C.
Thursday, July 06, 2006
 
 
Ah, of course neither of the above is plain text, so perhaps that's not helpful.
John C.
Thursday, July 06, 2006
 
 
The Jazzy project on sourceforge has text files full of English words:

http://sourceforge.net/projects/jazzy

Not sure about whether they include plurals.
MB Send private email
Friday, July 07, 2006
 
 
Try Moby Words, which is now open source.

Also, the LAMPwords dictionary, which is encoded into a graph structure for compact storage and lookup for Scrabble players on Palm handhelds, but it is also open source, and enables you to write a program to unpack the dictionary.

Of course, the quality of the dictionary depends on what you are using it for.  Competitive Scrabble play allows many archaic words that are best left out of a spelling checker.
David Jones Send private email
Friday, July 07, 2006
 
 
You might want to look at Wordnet: http://wordnet.princeton.edu/
Passer-by
Monday, July 10, 2006
 
 

This topic is archived. No further replies will be accepted.

Other recent topics Other recent topics
 
Powered by FogBugz