tiny_segmenter 0.0.6
Ruby port of TinySegmenter.js for tokenizing Japanese text. Uses a Naive Bayes model that has been trained using the RWCP corpus and optimized using L1-norm regularization. The resultant model is quite compact, yet has a 95% accuracy rate.
Gemfile:
=
インストール:
=
バージョン履歴:
- 0.0.6 October 26, 2015 (16KB)
- 0.0.4 March 31, 2013 (16KB)
- 0.0.2 August 27, 2012 (14KB)
- 0.0.1 August 20, 2012 (11.5KB)