integrating other spellchecking tools

francis . tyers Sun, 19 May 2013 01:28:13 -0700

Hi,

If I wanted to integrate a "spellchecker" (or wordlist) other than the DAWG 
one that is bundled with Tesseract, how might I go about it ?


In dict/dawg.cpp there is 

  /// Returns true if the given word is in the Dawg.
  bool word_in_dawg(const WERD_CHOICE &word) const;

But then I don't see any reference to it in the code outside of dict/ and 
it just seems to be used for constructing the Trie.

There is also:

cube/word_list_lang_model.h and cube/lang_model.h

  // Determines if a sequence of 32-bit chars is valid in this language 
model
  // starting from the root. If the eow_flag is ON, also checks for
  // a valid EndOfWord. If final_edge is not NULL, returns a pointer to the 
last
  // edge
  bool IsValidSequence(const char_32 *sequence, bool eow_flag,
                       LangModEdge **final_edge = NULL);

But I don't see it used outside of the language model code.

Would it be enough to implement this for a new language model ? 

Any thoughts or comments would be gratefully received,

Regards,

Fran

-- 
-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

--- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/groups/opt_out.

integrating other spellchecking tools

Reply via email to