Software Listing of Author : "The Stanford Natural Language Processing Group"
- Chinese Word Segmenter
- License: Shareware
- Price:


Chinese is standardly written without spaces between words (as are some other languages).
Chinese Word Segmenter will split Chinese text into a sequence of words, defined according to some word segmentation standard. It is a Java implementation of the CRF-based Chinese Word Segmenter.
- Publisher: The Stanford Natural Language Processing Group
- Date Released:
- Download
- Platform: WinOther
- Classifer
- License: Shareware
- Price:


Classifier is a machine learning tool that will take data items and place them into one of k classes. A probabilistic classifier, like this one, can also give a probability distribution over the class assignment for a data item.
This software is a Java implementation of a maximum entropy classifier. Maximum entropy models are otherwise known as conditional loglinear models, and are essentially equivalent to multiclass logistic regression models (though parameterized slightly differently, in a way that is advantageous with sparse explanatory feature vectors).
- Publisher: The Stanford Natural Language Processing Group
- Date Released:
- Download
- Platform: WinOther
- CoreNLP
- License: Shareware
- Price:


CoreNLP provides a set of natural language analysis tools which can take raw English language text input and give the base forms of words, their parts of speech, whether they are names of companies, people, to name a few.
It normalizes dates, times, and numeric quantities, mark up the structure of sentences in terms of phrases and word dependencies, indicate which noun phrases refer to the same entities. It provides the foundational building blocks for higher level text understanding applications.
The goal of this project is to enable people to quickly and painlessly get complete linguistic annotations of natural language texts. It is designed to be highly flexible and extensible, i.e., with a single option you can change which tools should be enabled and which should be disabled.
- Publisher: The Stanford Natural Language Processing Group
- Date Released:
- Download
- Platform: WinOther
- Part Of Speech Tagger
- License: Shareware
- Price:


Part Of Speech Tagger or POS Tagger is a piece of software that reads text in some language and assigns parts of speech to each word and other token.
Some of these assignments are: noun, verb, adjective, to name just a few, although generally computational applications use more fine-grained POS tags like 'noun-plural'.
- Publisher: The Stanford Natural Language Processing Group
- Date Released:
- Download
- Platform: Win7 x64, Windows 7, WinOther, WinVista
- Stanford Named Entity Recognizer
- License: Shareware
- Price:


Stanford Named Entity Recognizer, also known as Stanford NER, is an Open Source utility that offers a high-performance machine learning based named entity recognition system.
Named Entity Recognition (NER) labels sequences of words in a text. The software provides a general implementation of linear chain Conditional Random Field (CRF) sequence models, coupled with well-engineered feature extractors for Named Entity Recognition.
Included with the download are good 3 class (PERSON, ORGANIZATION, LOCATION) named entity recognizers for English and another pair of models trained on the CoNLL 2003 English training data.
Stanford Named Entity Recognizer is a cross-platform utility capable of running on any operating system that comes with Java support (Windows, Mac OS X, Linux).
- Publisher: The Stanford Natural Language Processing Group
- Date Released:
- Download
- Platform: WinOther
- Topic Modeling Toolbox
- License: Shareware
- Price:


Topic Modeling Toolbox or TMT brings topic modeling tool desigend for social scientists and others who wish to perform analysis on datasets that have a substantial textual component.
It allows you to import and manipulate text from cells in Excel and other spreadsheets or generate rich Excel-compatible outputs.
- Publisher: The Stanford Natural Language Processing Group
- Date Released:
- Download
- Platform: Win7 x64, Windows 7, WinOther, WinVista
- Tregex
- License: Shareware
- Price:


Tregex is a tool for matching patterns in trees, based on tree relationships and regular expression matches on nodes (the name is short for "tree regular expressions").
It contains essentially the same functionality as TGrep2 (which had a superset of the functionality of the original tgrep), plus several extremely useful relations for natural language trees, for example "A is the lexical head of B", and "A and B share a (hand-specified) variable substring" (useful for finding nodes coindexed with each other).
Because it does not create preprocessed indexed corpus files, it is however somewhat slower than TGrep2 when searching over large treebanks. As a Java application, it is platform independent, and can be used programmatically in Java software.
There is also both a graphical interface (also...
- Publisher: The Stanford Natural Language Processing Group
- Date Released:
- Download
- Platform: Win7 x64, Windows 7, WinOther, WinVista
