PartsOfSpeech_Tagger

This is a part-of-speech (POS) tagger written in Python. It implements a hidden Markov model (HMM) and uses the Viterbi algorithm to find the most likely tag sequence for a sentence. I recommend training on 'training.txt' and testing on 'development.txt'. Using any other files will require some slight edits to the code. With these files I was able to achieve ~95% accuracy. The accuracy depends heavily on how large your training corpus is. I wasn't able to get my hands on the Penn Treebank corpus, but I have read that it is the best for POS tagging.
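As a rough illustration of the approach described above, here is a minimal sketch of an HMM tagger with Viterbi decoding. It is not the code in POS_HMM.py: the function names, the `START` marker, the toy corpus, and the add-one smoothing are all assumptions made for this example.

```python
import math
from collections import Counter, defaultdict

START = "<s>"  # hypothetical sentence-start marker (assumption)


def train(tagged_sentences):
    """Count tag transitions and word emissions from lists of (word, tag) pairs."""
    transitions = defaultdict(Counter)  # previous tag -> counts of next tag
    emissions = defaultdict(Counter)    # tag -> counts of (lowercased) words
    for sentence in tagged_sentences:
        prev = START
        for word, tag in sentence:
            transitions[prev][tag] += 1
            emissions[tag][word.lower()] += 1
            prev = tag
    return transitions, emissions


def log_prob(counter, key, n_outcomes, alpha=1.0):
    """Add-alpha smoothed log probability (the smoothing is an assumption)."""
    total = sum(counter.values())
    return math.log((counter[key] + alpha) / (total + alpha * n_outcomes))


def viterbi(words, transitions, emissions):
    """Return the most likely tag sequence for `words` under the HMM."""
    if not words:
        return []
    words = [w.lower() for w in words]
    tags = sorted(emissions)
    # +1 reserves probability mass for words never seen in training.
    vocab_size = len({w for c in emissions.values() for w in c}) + 1

    # best[i][t] = (log score of the best path ending in tag t at word i, previous tag)
    best = [{
        t: (log_prob(transitions[START], t, len(tags))
            + log_prob(emissions[t], words[0], vocab_size), None)
        for t in tags
    }]
    for i in range(1, len(words)):
        column = {}
        for t in tags:
            score, prev = max(
                (best[i - 1][p][0] + log_prob(transitions[p], t, len(tags)), p)
                for p in tags
            )
            column[t] = (score + log_prob(emissions[t], words[i], vocab_size), prev)
        best.append(column)

    # Follow the back-pointers from the best final tag.
    tag = max(tags, key=lambda t: best[-1][t][0])
    path = [tag]
    for i in range(len(words) - 1, 0, -1):
        tag = best[i][tag][1]
        path.append(tag)
    return path[::-1]


def accuracy(predicted, gold):
    """Fraction of positions where the predicted tag matches the gold tag."""
    return sum(p == g for p, g in zip(predicted, gold)) / len(gold)


# Toy corpus invented for this example; it is not taken from training.txt.
toy_corpus = [
    [("the", "DET"), ("dog", "NOUN"), ("barks", "VERB")],
    [("the", "DET"), ("cat", "NOUN"), ("sleeps", "VERB")],
    [("a", "DET"), ("dog", "NOUN"), ("sleeps", "VERB")],
]
toy_transitions, toy_emissions = train(toy_corpus)
toy_tags = viterbi(["the", "cat", "barks"], toy_transitions, toy_emissions)
print(toy_tags, accuracy(toy_tags, ["DET", "NOUN", "VERB"]))
```

Working in log space avoids numeric underflow on long sentences, and the smoothing keeps unseen words or tag pairs from zeroing out an entire path. A larger corpus gives better counts for both tables, which is why accuracy tracks corpus size.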

Files:
.DS_Store
POS_HMM.py
README.md
development.txt
training.txt

EthanBlackburn/PartsOfSpeech_Tagger