Content-Length: 262344 | pFad | http://github.com/fthbrmnby/Text-Preprocess/#start-of-content

1E GitHub - fthbrmnby/Text-Preprocess: 2016 - 2017 Graduation Project
Skip to content

fthbrmnby/Text-Preprocess

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

30 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Text-Preprocess

Text preprocessing pipeline for my graduation project. Pipeline includes sentence boundary detection, sentence tokenizer, stemmer, disambugiator and POS TAG. This pipeline uses Turkish NLP library zemberek-nlp by Ahmet A. Akın and Turkish Deasciifier for Java by Ahmet Alp Balkan.

Dataset

Type Number of Reviews
Positive 220,284
Negative 14,881

Requirements

  • JAVA 8
  • Maven

Releases

No releases published

Packages

No packages published

Languages









ApplySandwichStrip

pFad - (p)hone/(F)rame/(a)nonymizer/(d)eclutterfier!      Saves Data!


--- a PPN by Garber Painting Akron. With Image Size Reduction included!

Fetched URL: http://github.com/fthbrmnby/Text-Preprocess/#start-of-content

Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy