Refine by Language

Refine by Category

Tokenizers Projects

yzhang / rseg

A Chinese Word Segmentation(中文分词) routine in pure Ruby

Ruby     101   %d years ago

arbox / tokenizer

A simple tokenizer in Ruby for NLP tasks.

Ruby     35   3 months ago

veer66 / thailang4r

Thai language utility for Ruby

Ruby     21   %d years ago

markburns / mecab

MeCab ruby binding with gemspec

Ruby     18   %d years ago

parhamr / nlp-pure

Natural language processing algorithms implemented in pure Ruby with minimal dependencies

Ruby     16   2 months ago

6 / tiny_segmenter

Ruby port of TinySegmenter.js for tokenizing Japanese text

Ruby     10   %d years ago

mimosa / jieba-jruby

jieba-analysis(结巴分词) for jRuby

Ruby     4   %d years ago