General
- NLTK: the Natural Language Toolkit
- WEKA: an easy-to-use toolkit for experimenting with different machine learning algorithms
- CoNLL shared task data: annotated data sets for a number of NLP tasks in a number of languages
Information Retrieval
Language Identification
Pre-processing (Sentence Splitters, Tokenisers, POS Taggers, Lemmatisers, Morphological Analysers)
Syntactic Analysis, Parsers
Text Mining / Information Extraction
Semantic Analysis
Webpages with further information on NLP resources
- Part I: Getting Started with NLTK
- Part II: Sentence Tokenize and Word Tokenize
- Part III: Part-Of-Speech Tagging and POS Tagger
- Part IV: Stemming and Lemmatization
- Part V: Using Stanford Text Analysis Tools in Python
- Part VI: Add Stanford Word Segmenter Interface for Python NLTK
- Part VII: A Preliminary Study on Text Classification
- Part VIII: Using External Maximum Entropy Modeling Libraries for Text Classification
- Part IX: From Text Classification to Sentiment Analysis
- Part X: Play With Word2Vec Models based on NLTK Corpus
Biomedical natural language processing
Sources and tools for various language models
- Word vectors: vector representations of words
- N-gram counts: counts of word sequence occurrences
- Language models: models of word sequence probabilities
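To make the last two items concrete, here is a minimal sketch in plain Python of how n-gram counts relate to a language model. The function names `ngram_counts` and `bigram_probability` and the toy sentence are made up for illustration and are not taken from any of the resources listed here; the probability is an unsmoothed maximum-likelihood estimate, which real language-model toolkits refine with smoothing.

```python
from collections import Counter


def ngram_counts(tokens, n):
    """Count every contiguous run of n tokens (hypothetical helper)."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def bigram_probability(tokens, prev, word):
    """Unsmoothed maximum-likelihood estimate of P(word | prev)."""
    bigrams = ngram_counts(tokens, 2)
    # Count `prev` only where it starts a bigram, so the estimates
    # for a given context sum to 1.
    context = sum(c for (first, _), c in bigrams.items() if first == prev)
    return bigrams[(prev, word)] / context if context else 0.0


# Toy corpus, assumed purely for illustration.
tokens = "the cat sat on the mat".split()
counts = ngram_counts(tokens, 2)
p = bigram_probability(tokens, "the", "cat")
```

In this toy corpus "the" is followed once by "cat" and once by "mat", so the estimate for "cat" after "the" is one half. An unseen context yields 0.0 here, which is exactly the gap that smoothing is meant to fill.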
Awesome Community-Curated NLP List
https://github.com/alvations/awesome-community-curated-nlp
Java or Python for Natural Language Processing
https://gxnotes.com/article/51044.html
A curated list of resources for NLP (Natural Language Processing) for Chinese
https://github.com/crownpku/awesome-chinese-nlp
1. Chinese NLP Toolkits
- General-purpose NLP toolkits
- Popular English or multilingual NLP toolkits
- Chinese Word Segmentation
- Information Extraction
- QA & Chatbot
3. Chinese NLP Organizations and Conferences
4. Learning Materials
Natural Language Processing Tools (http://www.phontron.com/nlptools.php)
- Dependency Parser
- Finite State Models
- General NLP Libraries
- Language Modeling
- Machine Learning
- Machine Translation Alignment
- Machine Translation Decoder
- Machine Translation Evaluation
- Morphological Analysis
- Phrase Structure Parsing
- Pronunciation Estimation
- Speech Recognition
Open-source NLP software
http://entopix.com/so-you-need-to-understand-language-data-open-source-nlp-software-can-help.html
Python NLTK Tools List for Natural Language Processing (NLP)
http://www.datasciencecentral.com/profiles/blogs/python-nlp-tools
Some of the most popular NLP/NLU platforms
- IBM's Watson Conversation Service
- Microsoft LUIS
- Google Natural Language API
- Wit.ai
- Api.ai
- Alexa Skills Kit
- Recast.AI
- Pat