Projects tagged ‘part_of_speech’

Natural Language Toolkit (NLTK)

Analyzed 11 months ago

NLTK — the Natural Language Toolkit — is a suite of open source Python modules, linguistic data and documentation for research and development in natural language processing, supporting dozens of NLP tasks, with distributions for Windows, Mac OSX and Linux.

234K lines of code

42 current contributors

almost 1 year since last commit

45 users on Open Hub

Activity Not Available

0 Reviews

I Use This

Mostly written in Python

Licenses: apache_2

Apertium

Analyzed 11 months ago

Apertium is an open-source machine translation platform, aimed at related-language pairs but expanded to deal with more divergent language pairs. The platform provides 1. a language-independent machine translation engine 2. tools to manage the linguistic data necessary to build a machine ... [More]

96.7K lines of code

0 current contributors

12 months since last commit

13 users on Open Hub

Activity Not Available

0 Reviews

I Use This

Mostly written in C++

Licenses: GNU_Free_..., gpl, gpl3_or_l...

Tags c++ computational-linguistics computational_linguistics linguistics machine_translation morphological_analysis natural_language nlp part_of_speech translation

Apache OpenNLP

A

Claimed by Apache Software Foundation Analyzed 11 months ago

Apache OpenNLP is a Java machine learning toolkit for natural language processing (NLP).

157K lines of code

8 current contributors

12 months since last commit

12 users on Open Hub

Activity Not Available

0 Reviews

I Use This

Mostly written in Java

Licenses: apache_2

Tags analysis apache chunker classifier computational_linguistics coreferenceresolution java machine_learning maxent natural_language_processing ner nlp 8 more...

LanguageTool

Analyzed 11 months ago

LanguageTool is an Open Source language checker for English, German, Polish, Dutch, and other languages. It's rule based, i.e. it will find errors for which a rule is defined in an XML configuration files. Rules for more complicated errors can be written in Java.

1.24M lines of code

37 current contributors

11 months since last commit

11 users on Open Hub

Activity Not Available

0 Reviews

I Use This

Mostly written in Java

Licenses: lgpl

Tags errors grammar libreoffice linguistics natural_language_processing nlp openoffice part_of_speech pattern_matching proofreading spellcheck spelling 1 more...

Treex - NLP Framework

T

Analyzed 11 months ago

Treex (formerly TectoMT) is a highly modular NLP software system implemented in Perl programming language under Linux. It is primarily aimed at Machine Translation, making use of the ideas and technology created during the Prague Dependency Treebank project. At the same time, it is also hoped to ... [More]

242K lines of code

4 current contributors

12 months since last commit

4 users on Open Hub

Activity Not Available

0 Reviews

I Use This

Mostly written in Perl

Licenses: artistic_gpl

Tags classifier computational_linguistics coreferenceresolution corpora grammar linguistics machine_learning natural_language natural_language_processing nlp parser part_of_speech 6 more...

RelEx Semantic Relationship Extractor

R

Analyzed 11 months ago

RelEx is an English-language semantic relationship extractor, built on the Carnegie-Mellon Link Grammar parser. It can identify dependency-grammar dependencies, such as subject, object, indirect object and many other relationships between words in a sentence. It can also provide part-of-speech ... [More]

11.8K lines of code

4 current contributors

about 1 year since last commit

2 users on Open Hub

Activity Not Available

0 Reviews

I Use This

Mostly written in Java

Licenses: apache_2

Tags ai anaphora artificial_intelligence computational_linguistics corpus_linguistics dependency dependency_grammar grammar hobbs java linguistics natural_language 8 more...

Link Grammar

L

Analyzed 11 months ago

The Link Grammar Parser is a syntactic parser of English, based on link grammar, an original theory of English syntax. Given a sentence, the system assigns to it a syntactic structure, which consists of a set of labeled links connecting pairs of words. The parser also produces a "constituent" (Penn ... [More]

77.2K lines of code

4 current contributors

over 1 year since last commit

1 users on Open Hub

Activity Not Available

0 Reviews

I Use This

Mostly written in C

Licenses: lgpl21_or...

Tags arabic biomedical cmu computational_linguistics english german grammar linguistics morphological_analysis morphology natural_language natural_language_processing 8 more...

opencorpora

O

Analyzed 11 months ago

An engine for creating and annotating textual corpora

38.6K lines of code

3 current contributors

over 1 year since last commit

1 users on Open Hub

Activity Not Available

0 Reviews

I Use This

Mostly written in PHP

Licenses: gpl

Tags computational_linguistics corpora corpus corpus_linguistics crowdsourcing disambiguation linguistics natural-language-processing natural_language_processing nlp part_of_speech russian 1 more...

spaCy

S

Analyzed 11 months ago

spaCy is a library for advanced natural language processing in Python and Cython. spaCy is built on the very latest research, but it isn't researchware. It was designed from day one to be used in real products. spaCy currently supports English, German, French and Spanish, as well as tokenization for ... [More]

123K lines of code

0 current contributors

12 months since last commit

1 users on Open Hub

Activity Not Available

0 Reviews

I Use This