Kuromoji - Japanese Morphological Analyzer




Japanese Tokenizer: Multi-Language Analysis in Solr

http://www.atilika.org/

http://mentaldetritus.blogspot.com/2013/03/custom-japanese-tokenization-in-solr-40.html

Solr 4.0 (really, it's been there since 3.6) has a new analysis module for handling Japanese, called Kuromoji.



The standalone Kuromoji jar can be tried from the command line:

java -cp kuromoji-0.7.7.jar org.atilika.kuromoji.TokenizerRunner

<fieldType name="text_ja" class="solr.TextField" positionIncrementGap="100" autoGeneratePhraseQueries="false">
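The opening tag above is all this post quotes of the field type. For orientation, here is a sketch of what a complete definition might look like, modeled on the text_ja field type in the example schema that ships with Solr 4.0. The choice of filters, their order, and the lang/stoptags_ja.txt and lang/stopwords_ja.txt paths are assumptions taken from that example configuration, not from this post, so adjust them to your own setup.

```xml
<!-- Sketch only: filter chain and file paths are assumed from the Solr 4.0 example schema -->
<fieldType name="text_ja" class="solr.TextField" positionIncrementGap="100" autoGeneratePhraseQueries="false">
  <analyzer>
    <!-- Kuromoji tokenizer; "search" mode also splits long compound nouns -->
    <tokenizer class="solr.JapaneseTokenizerFactory" mode="search"/>
    <!-- Reduce inflected verbs and adjectives to their dictionary form -->
    <filter class="solr.JapaneseBaseFormFilterFactory"/>
    <!-- Drop tokens whose part-of-speech tag is listed in the stoptags file -->
    <filter class="solr.JapanesePartOfSpeechStopFilterFactory" tags="lang/stoptags_ja.txt"/>
    <!-- Normalize full-width romaji and half-width kana -->
    <filter class="solr.CJKWidthFilterFactory"/>
    <!-- Remove common stop words -->
    <filter class="solr.StopFilterFactory" ignoreCase="true" words="lang/stopwords_ja.txt"/>
    <!-- Strip the trailing long-sound mark from longer katakana words -->
    <filter class="solr.JapaneseKatakanaStemFilterFactory" minimumLength="4"/>
    <!-- Lower-case romaji characters -->
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>
```

A field declared with type="text_ja" in schema.xml would then be analyzed with this chain at both index and query time.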


