EnglishDictHash  README.md

File README.md from the latest check-in


EnglishDictHash

A list of Murmur hashes of English words. Can be useful for quickly checking if a word exists in English.

Two files are generated: words.bin and words_lower.bin. Lower is the dictionary with all words lowercased.

Format (always big-endian):

  • 1st uint32: Murmur seed which is always 0x695e0677.
  • 2nd int32: Dictionary size which is always 235886.
  • All other uint32: Murmur hashes of words, sorted ascending.