Skip to content

An Italian dictionary with more than 3 million words

License

Notifications You must be signed in to change notification settings

mircomacrelli/italian-dictionary

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

This repository contains a list of Italian words I use as a dictionary in various text editors like IntelliJ IDEA. The dictionary is stored as a compressed file because it contains 3.009.116 unique words that would otherwise use almost 43 megabytes of space.

The file is compressed first with incremental encoding and then with gzip. This is unusual, but is also very effective in this particular case. I was able to reduce the dictionary to less than 2% of its original size!

I've included the two Ruby scripts that I use to compress and expand the dictionary. So just use those to work with the compressed file.

About

An Italian dictionary with more than 3 million words

Topics

Resources

License

Stars

Watchers

Forks

Languages