Natural Language Processing Projects


fxsjy / jieba

结巴中文分词

Python     8272   11 days ago


nlp-compromise / compromise

natural language processing

JavaScript     6392   2 days ago


naturalnode / natural

general natural language facilities for node

JavaScript     6298   11 days ago


clips / pattern

Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.

Python     5350   3 months ago


nltk / nltk

NLTK Source

Python     4642   today


sloria / textblob

Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.

Python     3862   2 months ago


wooorm / franc

Natural language detection

JavaScript     2217   24 days ago


isnowfy / snownlp

Python library for processing Chinese text

Python     1891   10 days ago


louismullie / treat

Natural language processing framework for Ruby.

Ruby     1255   4 days ago


saffsd / langid.py

Stand-alone language identification system

Python     758   10 months ago


peterc / whatlanguage

A language detection library for Ruby that uses bloom filters for speed.

Ruby     613   %d years ago


jbrooksuk / node-summary

Node module that summarizes text using a naive summarization algorithm

JavaScript     548   24 days ago


threedaymonk / text

Collection of text algorithms. gem install text

Ruby     528   %d years ago


kmike / pymorphy2

Morphological analyzer / inflection engine for Russian and Ukrainian languages.

Python     455   5 months ago


angeloskath / php-nlp-tools

Natural Language Processing Tools in PHP

PHP     397   2 months ago


louismullie / stanford-core-nlp

Ruby bindings to the Stanford Core NLP tools (English, French, German).

Ruby     367   2 months ago


mimino666 / langdetect

Port of Google's language-detection library to Python.

Python     261   3 months ago


aurelian / ruby-stemmer

Expose libstemmer_c to Ruby

Ruby     228   8 months ago


ssoper / summarize

A Ruby C wrapper for Open Text Summarizer

Ruby     193   %d years ago


diasks2 / pragmatic_segmenter

Pragmatic Segmenter is a rule-based sentence boundary detection gem that works out-of-the-box across many languages.

Ruby     186   9 days ago


diasks2 / pragmatic_segmenter

Pragmatic Segmenter is a rule-based sentence boundary detection gem that works out-of-the-box across many languages.

Ruby     186   9 days ago


yohasebe / engtagger

English Part-of-Speech Tagger Library; a Ruby port of Lingua::EN::Tagger

Ruby     173   6 months ago


nieldlr / hanzi

HanziJS is a Chinese character and NLP module for Chinese language processing for Node.js

JavaScript     161   3 months ago


richtr / guesslanguage.js

A natural language detection library based on trigram statistical analysis for Node.js and the Web.

JavaScript     145   %d years ago


nyxtom / salient

Machine Learning, Natural Language Processing and Sentiment Analysis Toolkit for Node.js

JavaScript     141   11 months ago


atrilla / nlptools

Natural Language Processing Toolkit for PHP

PHP     130   %d years ago


duanhongyi / genius

a chinese segment base on crf

Python     129   7 months ago


mihaivalentin / lunr-languages

A collection of languages stemmers and stopwords for Lunr Javascript library

JavaScript     125   10 days ago


yzhang / rseg

A Chinese Word Segmentation(中文分词) routine in pure Ruby

Ruby     99   %d years ago


kimtaro / ve

A linguistic framework that's easy to use.

Ruby     99   %d years ago


abitdodgy / words_counted

A Ruby natural language processor.

Ruby     96   7 months ago


sublee / korean

A library for Korean morphology

Python     85   27 days ago


louismullie / open-nlp

Ruby bindings to the OpenNLP Java toolkit.

Ruby     84   %d years ago


lfcipriani / punkt-segmenter

Ruby port of the NLTK Punkt sentence segmentation algorithm

Ruby     81   %d years ago


jedp / porter-stemmer

Martin Porter's stemmer for node.js

JavaScript     79   %d years ago


fortnightlabs / snowball-js

javascript implementation of the popular snowball word stemming nlp algorithm

JavaScript     77   %d years ago


tiendung / ruby-nlp

Ruby Binding for Stanford Pos-Tagger and Name Entity Recognizer

Ruby     75   %d years ago


zencephalon / tactful_tokenizer

Accurate Bayesian sentence tokenizer in Ruby.

Ruby     71   %d years ago


kristopolous / porter-stemmer

A Javascript Implementation of the Porter Stemmer

JavaScript     68   %d years ago


ged / linkparser

A high-level interface to the CMU Link Grammar. (Github mirror)

Ruby     62   8 months ago


fangpenlin / loso

Another Chinese segmentation library.

Python     60   %d years ago


namd / pypln.backend

Pipeline for distributed Natural Language Processing, made in Python

Python     56   3 months ago


louismullie / scalpel

A fast and accurate rule-based sentence segmentation tool for Ruby.

Ruby     46   %d years ago


ealdent / uea-stemmer

Ruby port of UEALite Stemmer - a conservative stemmer for search and indexing

Ruby     41   %d years ago


deepfryed / ots

ruby bindings to open text summarizer

Ruby     36   %d years ago


arbox / tokenizer

A simple tokenizer in Ruby for NLP tasks.

Ruby     35   24 days ago


skroutz / turkish_stemmer

A simple Turkish stemming library

Ruby     32   1 months ago


veer66 / thailang4r

Thai language utility for Ruby

Ruby     20   %d years ago


mcfreely / epitome

A Lexrank implementation in ruby

Ruby     18   5 months ago


markburns / mecab

MeCab ruby binding with gemspec

Ruby     17   %d years ago


skroutz / greek_stemmer

A simple Greek stemming library

Ruby     17   17 days ago


parhamr / nlp-pure

Natural language processing algorithms implemented in pure Ruby with minimal dependencies

Ruby     15   8 days ago


mblongii / ruby-ner

Named entity recognition with Stanford NER and Ruby

Ruby     14   %d years ago


apohllo / srx-english

English sentence segmentation rules based on SRX standard.

Ruby     12   %d years ago


6 / tiny_segmenter

Ruby port of TinySegmenter.js for tokenizing Japanese text

Ruby     10   10 months ago


diasks2 / word_count_analyzer

Word Count Analyzer is a Ruby gem that analyzes a string for potential areas of the text that might cause word count discrepancies depending on the tool used. It also provides comprehensive configuration options so you can easily customize how different gray areas should be counted and find the right word count for your purposes.

Ruby     10   10 months ago


lefnord / rstt

another ruby wrapper for Stuttgarter Tree Tagger

Ruby     7   %d years ago


knife / nlp

NLP tools for Polish language

Ruby     5   %d years ago


mimosa / jieba-jruby

jieba-analysis(结巴分词) for jRuby

Ruby     4   %d years ago


thesp0nge / wc

Your Word Counter Gem

Ruby     3   %d years ago


atelierconvivialite / word_count

A word counter for String and Hash in Ruby

Ruby     2   %d years ago


lefnord / nlp_toolz

wrapper for basic nlp tools

Ruby     2   %d years ago


pks / zipf

Ruby NLP library

Ruby     2   %d years ago